minio

Commit Graph

Author	SHA1	Message	Date
Mark Theunissen	9511056f44	fix: simplify error logged when logger target is unreachable (#20304 )	2024-08-22 02:43:48 -07:00
Sveinn	743ddb196a	Removing the audit log retry mechanism (#20259 )	2024-08-14 15:25:08 -07:00
Harshavardhana	e7a56f35b9	flatten out audit tags, do not send as free-form (#20256 ) move away from map[string]interface{} to map[string]string to simplify the audit, and also provide concise information. avoids large allocations under load(), reduces the amount of audit information generated, as the current implementation was a bit free-form. instead all datastructures must be flattened.	2024-08-13 15:22:04 -07:00
Klaus Post	d8f0e0ea6e	Simplify error logging on event send (#20246 ) Overly verbose, hard to read and can leak data. Print even as JSON and simplify target&error printing.	2024-08-12 08:55:28 -07:00
Harshavardhana	7fcb428622	do not print unexpected logs (#20083 )	2024-07-12 13:51:54 -07:00
Harshavardhana	a22ce4550c	protect workers and simplify use of atomics (#19982 ) without atomic load() it is possible that for a slow receiver we would get into a hot-loop, when logCh is full and there are many incoming callers. to avoid this as a workaround enable BATCH_SIZE greater than 100 to ensure that your slow receiver receives data in bulk to avoid being throttled in some manner. this PR however fixes the unprotected access to the current workers value.	2024-06-24 18:15:27 -07:00
Anis Eleuch	3ba857dfa1	race: Fix detected test race in the internal audit code (#19865 )	2024-06-03 08:44:50 -07:00
Harshavardhana	ba54b39c02	fix: crash when audit webhook queue_dir is not writable (#19854 ) This is regression introduced in #19275 refactor	2024-06-01 20:03:39 -07:00
Anis Eleuch	2a75225569	kafka: _MINIO_KAFKA_DEBUG to enable sarama debug messages (#19849 )	2024-06-01 08:02:59 -07:00
Aditya Manthramurthy	5f78691fcf	ldap: Add user DN attributes list config param (#19758 ) This change uses the updated ldap library in minio/pkg (bumped up to v3). A new config parameter is added for LDAP configuration to specify extra user attributes to load from the LDAP server and to store them as additional claims for the user. A test is added in sts_handlers.go that shows how to access the LDAP attributes as a claim. This is in preparation for adding SSH pubkey authentication to MinIO's SFTP integration.	2024-05-24 16:05:23 -07:00
Anis Eleuch	d0e0b81d8e	Fix race get/set system/audit targest to avoid race errors (#19790 )	2024-05-22 09:23:03 -07:00
Harshavardhana	1fd90c93ff	re-use StorageAPI while loading drive formats (#19770 ) Bonus: safe settings for deployment ID to avoid races	2024-05-19 01:06:49 -07:00
Harshavardhana	8ff70ea5a9	turn-off coloring if we have std{err,out} dumb terminals (#19667 )	2024-05-03 17:17:57 -07:00
Klaus Post	4a60a7794d	Use better gzip for log rotate (#19651 ) Should be 2x faster with same usage.	2024-05-02 04:38:40 -07:00
Harshavardhana	402a3ac719	support compression after rotation of logs (#19647 )	2024-05-01 15:38:07 -07:00
Harshavardhana	8c1bba681b	add logrotate support for MinIO logs (#19641 )	2024-05-01 10:57:52 -07:00
Anis Eleuch	95bf4a57b6	logging: Add subsystem to log API (#19002 ) Create new code paths for multiple subsystems in the code. This will make maintaing this easier later. Also introduce bugLogIf() for errors that should not happen in the first place.	2024-04-04 05:04:40 -07:00
Sveinn	ba46ee5dfa	Adding console targets back into systemtarget log slice (#19398 )	2024-04-02 15:56:14 -07:00
Sveinn	1fc4203c19	Webhook targets refactor and bug fixes (#19275 ) - old version was unable to retain messages during config reload - old version could not go from memory to disk during reload - new version can batch disk queue entries to single for to reduce I/O load - error logging has been improved, previous version would miss certain errors. - logic for spawning/despawning additional workers has been adjusted to trigger when half capacity is reached, instead of when the log queue becomes full. - old version would json marshall x2 and unmarshal 1x for every log item. Now we only do marshal x1 and then we GetRaw from the store and send it without having to re-marshal.	2024-03-25 09:44:20 -07:00
Anis Eleuch	b657ffa496	fix: Fix crash when logging events and anonymous is enabled (#19313 ) Events log does not have a stacktrace. So Trace is nil. Fix a crash in this case when an event is printed while anonymous logging is enabled.	2024-03-21 10:19:36 -07:00
Harshavardhana	233cc3905a	add batchSize support for webhook endpoints (#19214 ) configure batch size to send audit/logger events in batches instead of sending one event per connection. this is mainly to optimize the number of requests we make to webhook endpoint.	2024-03-07 12:17:46 -08:00
Harshavardhana	e91a4a414c	merge startHTTPLogger() many callers into a simpler pattern (#19211 ) simplify audit webhook worker model fixes couple of bugs like - ping(ctx) was creating a logger without updating number of workers leading to incorrect nWorkers scaling, causing an additional worker that is not tracked properly. - h.logCh <- entry could potentially hang for when the queue is full on heavily loaded systems.	2024-03-06 08:09:46 -08:00
Harshavardhana	74ccee6619	avoid too much auditing during decom/rebalance make it more robust (#19174 ) there can be a sudden spike in tiny allocations, due to too much auditing being done, also don't hang on the ``` h.logCh <- entry ``` after initializing workers if you do not have a way to dequeue for some reason.	2024-03-06 03:43:16 -08:00
Harshavardhana	53aa8f5650	use typos instead of codespell (#19088 )	2024-02-21 22:26:06 -08:00
Harshavardhana	cd419a35fe	simplify broker healthcheck by following kafka guidelines (#19082 ) fixes #19081	2024-02-20 00:16:35 -08:00
Anis Eleuch	68dde2359f	log: Add logger.Event to send to console and other logger targets (#19060 ) Add a new function logger.Event() to send the log to Console and http/kafka log webhooks. This will include some internal events such as disk healing and rebalance/decommissioning	2024-02-15 15:13:30 -08:00
Anis Eleuch	6fd63e920a	log: Use error log type instead of Application/MinIO type (#18930 ) * log: Use error log type instead of Application/MinIO type Also bump github.com/shirou/gopsutil version to address cross compilation issues. * Apply suggestions from code review Co-authored-by: Aditya Manthramurthy <donatello@users.noreply.github.com> --------- Co-authored-by: Anis Eleuch <anis@min.io> Co-authored-by: Harshavardhana <harsha@minio.io> Co-authored-by: Aditya Manthramurthy <donatello@users.noreply.github.com>	2024-02-01 16:13:57 -08:00
Harshavardhana	1d3bd02089	avoid close 'nil' panics if any (#18890 ) brings a generic implementation that prints a stack trace for 'nil' channel closes(), if not safely closes it.	2024-01-28 10:04:17 -08:00
Praveen raj Mani	c905d3fe21	fix: Re-use TCP connections for Kafka dials (#18860 ) Fixes #18857	2024-01-24 13:10:52 -08:00
Harshavardhana	dd2542e96c	add codespell action (#18818 ) Original work here, #18474, refixed and updated.	2024-01-17 23:03:17 -08:00
Anis Eleuch	8bd4f6568b	server-info: Avoid initializing audit/log http/kafka targets (#18703 ) This can cause unnecessary ServerInfo() call delay.	2023-12-22 10:25:08 -08:00
Klaus Post	51aa59a737	perf: websocket grid connectivity for all internode communication (#18461 ) This PR adds a WebSocket grid feature that allows servers to communicate via a single two-way connection. There are two request types: * Single requests, which are `[]byte => ([]byte, error)`. This is for efficient small roundtrips with small payloads. * Streaming requests which are `[]byte, chan []byte => chan []byte (and error)`, which allows for different combinations of full two-way streams with an initial payload. Only a single stream is created between two machines - and there is, as such, no server/client relation since both sides can initiate and handle requests. Which server initiates the request is decided deterministically on the server names. Requests are made through a mux client and server, which handles message passing, congestion, cancelation, timeouts, etc. If a connection is lost, all requests are canceled, and the calling server will try to reconnect. Registered handlers can operate directly on byte slices or use a higher-level generics abstraction. There is no versioning of handlers/clients, and incompatible changes should be handled by adding new handlers. The request path can be changed to a new one for any protocol changes. First, all servers create a "Manager." The manager must know its address as well as all remote addresses. This will manage all connections. To get a connection to any remote, ask the manager to provide it given the remote address using. ``` func (m Manager) Connection(host string) Connection ``` All serverside handlers must also be registered on the manager. This will make sure that all incoming requests are served. The number of in-flight requests and responses must also be given for streaming requests. The "Connection" returned manages the mux-clients. Requests issued to the connection will be sent to the remote. * `func (c Connection) Request(ctx context.Context, h HandlerID, req []byte) ([]byte, error)` performs a single request and returns the result. Any deadline provided on the request is forwarded to the server, and canceling the context will make the function return at once. `func (c Connection) NewStream(ctx context.Context, h HandlerID, payload []byte) (st Stream, err error)` will initiate a remote call and send the initial payload. ```Go // A Stream is a two-way stream. // All responses must be read by the caller. // If the call is canceled through the context, //The appropriate error will be returned. type Stream struct { // Responses from the remote server. // Channel will be closed after an error or when the remote closes. // All responses must be read by the caller until either an error is returned or the channel is closed. // Canceling the context will cause the context cancellation error to be returned. Responses <-chan Response // Requests sent to the server. // If the handler is defined with 0 incoming capacity this will be nil. // Channel must be closed to signal the end of the stream. // If the request context is canceled, the stream will no longer process requests. Requests chan<- []byte } type Response struct { Msg []byte Err error } ``` There are generic versions of the server/client handlers that allow the use of type safe implementations for data types that support msgpack marshal/unmarshal.	2023-11-20 17:09:35 -08:00
Anis Eleuch	12f570a307	audit: Try to send audit even if the status is offline (#18458 ) Currently, once the audit becomes offline, there is no code that tries to reconnect to the audit, at the same time Send() quickly returns with an error without really trying to send a message the audit endpoint; so the audit endpoint will never be online again. Fixing this behavior; the current downside is that we miss printing some logs when the audit becomes offline; however this information is available in prometheus Later, we can refactor internal/logger so the http endpoint can send errors to console target.	2023-11-17 10:40:28 -08:00
Anis Eleuch	6ef8e87492	Support case insensitive kafka SASL mechanism config values (#18398 )	2023-11-08 20:04:01 -08:00
Shubhendu	5b9656374c	Error if target went offline (#18221 ) If target went offline while MinIO was down, error once while trying to send message. If target goes offline during MinIO server running, it already comes through ping() call and errors out if target offline. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-10-12 06:13:57 -07:00
Praveen raj Mani	c27d0583d4	Send kafka notification messages in batches when queue_dir is enabled (#18164 ) Fixes #18124	2023-10-07 08:07:38 -07:00
Sveinn	603437e70f	Fix startup formatting (#18156 ) Percentages in root user names are used for formatting. Before: ``` S3-API: http://192.168.50.21:9000 http://172.31.96.1:9000 http://127.0.0.1:9000 RootUser: "U4B6Zi!b75DXSPm%!!(MISSING)a(MISSING)vZb" RootPass: "Q4#Q6y8G%!P(MISSING)x#npP4dudUobU#NBcGB7RMKV4ajYb" Console: http://192.168.50.21:51915 http://172.31.96.1:51915 http://127.0.0.1:51915 RootUser: "U4B6Zi!b75DXSPm%!!(MISSING)a(MISSING)vZb" RootPass: "Q4#Q6y8G%!P(MISSING)x#npP4dudUobU#NBcGB7RMKV4ajYb" Command-line: https://min.io/docs/minio/linux/reference/minio-mc.html#quickstart FORMAT: %117s MESSAGE: $ mc alias set myminio http://192.168.50.21:9000 "U4B6Zi!b75DXSPm%avZb" "Q4#Q6y8G%%Px#npP4dudUobU#NBcGB7RMKV4ajYb" $ mc alias set myminio http://192.168.50.21:9000 "U4B6Zi!b75DXSPm%!a(MISSING)vZb" "Q4#Q6y8G%Px#npP4dudUobU#NBcGB7RMKV4ajYb" ``` After: ``` Status: 1 Online, 0 Offline. S3-API: http://192.168.50.21:9000 http://172.31.96.1:9000 http://127.0.0.1:9000 RootUser: "U4B6Zi!b75DXSPm%avZb" RootPass: "Q4#Q6y8G%%Px#npP4dudUobU#NBcGB7RMKV4ajYb" Console: http://192.168.50.21:52421 http://172.31.96.1:52421 http://127.0.0.1:52421 RootUser: "U4B6Zi!b75DXSPm%avZb" RootPass: "Q4#Q6y8G%%Px#npP4dudUobU#NBcGB7RMKV4ajYb" Command-line: https://min.io/docs/minio/linux/reference/minio-mc.html#quickstart $ mc alias set myminio http://192.168.50.21:9000 "U4B6Zi!b75DXSPm%avZb" "Q4#Q6y8G%%Px#npP4dudUobU#NBcGB7RMKV4ajYb" ``` No need for special Windows case. `mc` works just fine.	2023-10-02 07:39:47 -06:00
Shubhendu	10d5dd3a67	fix: a regression with audit log sending (#18112 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-09-26 12:23:02 -07:00
Anis Eleuch	4eeb48f8e0	Return cached online/offline status for audit/http loggers (#18083 ) To avoid having delays in prometheus scrape and in 'mc admin info' command.	2023-09-21 16:58:24 -07:00
Harshavardhana	1472875670	fix: failed messages counting in audit_http metrics (#18075 ) all retries must not be counted as failed messages, a failed message is a single counter not for all retries, this PR fixes this. Also we do not need to retry 10-times, instead we should retry at max 3 times with some jitter to deliver the messages.	2023-09-21 11:24:56 -07:00
Aditya Manthramurthy	1c99fb106c	Update to minio/pkg/v2 (#17967 )	2023-09-04 12:57:37 -07:00
Anis Eleuch	6a8d8f34a5	kafka: Do not require key when sending a message (#17962 ) Keys are helpful to ensure the strict ordering of messages, however currently the code uses a random request id for every log, hence using the request-id as a Kafka key is not serve any purpose; This commit removes the usage of the key, to also fix the audit issue from internal subsystem that does not have a request ID.	2023-09-01 08:37:22 -07:00
Harshavardhana	07b1281046	add queue_dir to help message for logger/audit targets	2023-08-29 16:07:35 -07:00
Harshavardhana	adb8be069e	tune-kafka targets to ensure timeout triggers on hung brokers (#17898 ) hung brokers can cause slowness to the entire system when many callers are hung, leading to large goroutine build-up.	2023-08-22 20:26:35 -07:00
Harshavardhana	3a0125fa1f	remove unexpected logging from peer calls (#17888 ) also make sure RequestID is set for system logs	2023-08-21 14:25:24 -07:00
Harshavardhana	11dfc817f3	do not log client canceled events (#17838 )	2023-08-17 14:53:43 -07:00
Praveen raj Mani	0285df5a02	fix: prioritize audit_webhook and logger_webhook ENVs over the config KVS (#17783 )	2023-08-03 02:47:07 -07:00
Anis Eleuch	9c0e8cd15b	logger: Avoid slow calls in http logger Send() function (#17747 ) Send() is synchronous and can affect the latency of S3 requests when the logger buffer is full. Avoid checking if the HTTP target is online or not and increase the workers anyway since the buffer is already full. Also, avoid logs flooding when the audit target is down.	2023-07-29 12:49:18 -07:00
Aditya Manthramurthy	f3248a4b37	Redact all secrets from config viewing APIs (#17380 ) This change adds a `Secret` property to `HelpKV` to identify secrets like passwords and auth tokens that should not be revealed by the server in its configuration fetching APIs. Configuration reporting APIs now do not return secrets.	2023-06-23 07:45:27 -07:00
Aditya Manthramurthy	5a1612fe32	Bump up madmin-go and pkg deps (#17469 )	2023-06-19 17:53:08 -07:00

1 2 3

128 Commits