minio

mirror of https://github.com/minio/minio.git synced 2025-05-01 15:11:32 -04:00

Author	SHA1	Message	Date
Klaus Post	51aa59a737	perf: websocket grid connectivity for all internode communication (#18461 ) This PR adds a WebSocket grid feature that allows servers to communicate via a single two-way connection. There are two request types: * Single requests, which are `[]byte => ([]byte, error)`. This is for efficient small roundtrips with small payloads. * Streaming requests which are `[]byte, chan []byte => chan []byte (and error)`, which allows for different combinations of full two-way streams with an initial payload. Only a single stream is created between two machines - and there is, as such, no server/client relation since both sides can initiate and handle requests. Which server initiates the request is decided deterministically on the server names. Requests are made through a mux client and server, which handles message passing, congestion, cancelation, timeouts, etc. If a connection is lost, all requests are canceled, and the calling server will try to reconnect. Registered handlers can operate directly on byte slices or use a higher-level generics abstraction. There is no versioning of handlers/clients, and incompatible changes should be handled by adding new handlers. The request path can be changed to a new one for any protocol changes. First, all servers create a "Manager." The manager must know its address as well as all remote addresses. This will manage all connections. To get a connection to any remote, ask the manager to provide it given the remote address using. ``` func (m Manager) Connection(host string) Connection ``` All serverside handlers must also be registered on the manager. This will make sure that all incoming requests are served. The number of in-flight requests and responses must also be given for streaming requests. The "Connection" returned manages the mux-clients. Requests issued to the connection will be sent to the remote. * `func (c Connection) Request(ctx context.Context, h HandlerID, req []byte) ([]byte, error)` performs a single request and returns the result. Any deadline provided on the request is forwarded to the server, and canceling the context will make the function return at once. `func (c Connection) NewStream(ctx context.Context, h HandlerID, payload []byte) (st Stream, err error)` will initiate a remote call and send the initial payload. ```Go // A Stream is a two-way stream. // All responses must be read by the caller. // If the call is canceled through the context, //The appropriate error will be returned. type Stream struct { // Responses from the remote server. // Channel will be closed after an error or when the remote closes. // All responses must be read by the caller until either an error is returned or the channel is closed. // Canceling the context will cause the context cancellation error to be returned. Responses <-chan Response // Requests sent to the server. // If the handler is defined with 0 incoming capacity this will be nil. // Channel must be closed to signal the end of the stream. // If the request context is canceled, the stream will no longer process requests. Requests chan<- []byte } type Response struct { Msg []byte Err error } ``` There are generic versions of the server/client handlers that allow the use of type safe implementations for data types that support msgpack marshal/unmarshal.	2023-11-20 17:09:35 -08:00
jiuker	f56a182b71	fix: close http body when webhook send (#18487 )	2023-11-20 14:40:07 -08:00
jiuker	215ca58d6a	fix: close the http.Body when WebhookTarget isActive (#18467 )	2023-11-17 12:02:26 -08:00
Anis Eleuch	12f570a307	audit: Try to send audit even if the status is offline (#18458 ) Currently, once the audit becomes offline, there is no code that tries to reconnect to the audit, at the same time Send() quickly returns with an error without really trying to send a message the audit endpoint; so the audit endpoint will never be online again. Fixing this behavior; the current downside is that we miss printing some logs when the audit becomes offline; however this information is available in prometheus Later, we can refactor internal/logger so the http endpoint can send errors to console target.	2023-11-17 10:40:28 -08:00
Adrian Najera	96c2304ae8	allow MINIO_STS_DURATION to increase the IDP token expiration (#18396 ) Share link duration is based on the IDP token expiration, for the share link to last longer, you may now use MINIO_STS_DURATION environment variable.	2023-11-15 20:42:31 -08:00
Harshavardhana	91d8bddbd1	use sendfile/splice implementation to perform DMA (#18411 ) sendfile implementation to perform DMA on all platforms Go stdlib already supports sendfile/splice implementations for - Linux - Windows - *BSD - Solaris Along with this change however O_DIRECT for reads() must be removed as well since we need to use sendfile() implementation The main reason to add O_DIRECT for reads was to reduce the chances of page-cache causing OOMs for MinIO, however it would seem that avoiding buffer copies from user-space to kernel space this issue is not a problem anymore. There is no Go based memory allocation required, and neither the page-cache is referenced back to MinIO. This page- cache reference is fully owned by kernel at this point, this essentially should solve the problem of page-cache build up. With this now we also support SG - when NIC supports Scatter/Gather https://en.wikipedia.org/wiki/Gather/scatter_(vector_addressing)	2023-11-10 10:10:14 -08:00
Anis Eleuch	6ef8e87492	Support case insensitive kafka SASL mechanism config values (#18398 )	2023-11-08 20:04:01 -08:00
Harshavardhana	754f7a8a39	replace io.Discard usage to fix some NUMA copy() latencies (#18394 ) replace io.Discard usage to fix NUMA copy() latencies On NUMA systems copying from 8K buffer allocated via io.Discard leads to large latency build-up for every ``` copy(new8kbuf, largebuf) ``` can in-cur upto 1ms worth of latencies on NUMA systems due to memory sharding across NUMA nodes.	2023-11-06 14:26:08 -08:00
Adrian Najera	06f59ad631	fix: expiration time for share link when using OpenID (#18297 )	2023-10-30 10:21:34 -07:00
jiuker	dbc2368a7b	fix: parse the subsys env error (#18319 )	2023-10-26 08:12:57 -07:00
Klaus Post	74253e1ddc	Fix BackendInfo() race (#18305 ) `GetParityForSC` has a value receiver, so Config is copied before the lock is obtained. Make it pointer receiver. Fixes: ``` WARNING: DATA RACE Read at 0x0000079cdd10 by goroutine 190: github.com/minio/minio/cmd.(erasureServerPools).BackendInfo() github.com/minio/minio/cmd/erasure-server-pool.go:579 +0x6f github.com/minio/minio/cmd.(erasureServerPools).LocalStorageInfo() github.com/minio/minio/cmd/erasure-server-pool.go:614 +0x3c6 github.com/minio/minio/cmd.(peerRESTServer).LocalStorageInfoHandler() github.com/minio/minio/cmd/peer-rest-server.go:347 +0x4ea github.com/minio/minio/cmd.(peerRESTServer).LocalStorageInfoHandler-fm() ... WARNING: DATA RACE Read at 0x0000079cdd10 by goroutine 190: github.com/minio/minio/cmd.(erasureServerPools).BackendInfo() github.com/minio/minio/cmd/erasure-server-pool.go:579 +0x6f github.com/minio/minio/cmd.(erasureServerPools).LocalStorageInfo() github.com/minio/minio/cmd/erasure-server-pool.go:614 +0x3c6 github.com/minio/minio/cmd.(peerRESTServer).LocalStorageInfoHandler() github.com/minio/minio/cmd/peer-rest-server.go:347 +0x4ea github.com/minio/minio/cmd.(peerRESTServer).LocalStorageInfoHandler-fm() ```	2023-10-24 08:15:41 -07:00
Krishnan Parthasarathi	8cd80fec8c	Add unit test for lifecycle.FilterRules (#18284 )	2023-10-19 21:33:28 -07:00
Klaus Post	e37508fb8f	fix: linter errors in Windows specific code (#18276 )	2023-10-18 11:08:15 -07:00
Krishnan Parthasarathi	557df666fd	Don't skip rules with ExpiredObjectDeleteMarker (#18256 )	2023-10-16 22:46:46 -07:00
Aditya Manthramurthy	28a2d1eb3d	Allow OpenID ARN resource ID to start with a `-` (#18255 )	2023-10-16 13:50:51 -07:00
Klaus Post	128256e3ab	Add event counters (#18232 ) Export metric for global events sent and skipped for the lifetime of the server.	2023-10-12 15:39:22 -07:00
Shubhendu	5b9656374c	Error if target went offline (#18221 ) If target went offline while MinIO was down, error once while trying to send message. If target goes offline during MinIO server running, it already comes through ping() call and errors out if target offline. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-10-12 06:13:57 -07:00
Harshavardhana	6829ae5b13	completely remove drive caching layer from gateway days (#18217 ) This has already been deprecated for close to a year now.	2023-10-11 21:18:17 -07:00
Praveen raj Mani	c27d0583d4	Send kafka notification messages in batches when queue_dir is enabled (#18164 ) Fixes #18124	2023-10-07 08:07:38 -07:00
Sveinn	603437e70f	Fix startup formatting (#18156 ) Percentages in root user names are used for formatting. Before: ``` S3-API: http://192.168.50.21:9000 http://172.31.96.1:9000 http://127.0.0.1:9000 RootUser: "U4B6Zi!b75DXSPm%!!(MISSING)a(MISSING)vZb" RootPass: "Q4#Q6y8G%!P(MISSING)x#npP4dudUobU#NBcGB7RMKV4ajYb" Console: http://192.168.50.21:51915 http://172.31.96.1:51915 http://127.0.0.1:51915 RootUser: "U4B6Zi!b75DXSPm%!!(MISSING)a(MISSING)vZb" RootPass: "Q4#Q6y8G%!P(MISSING)x#npP4dudUobU#NBcGB7RMKV4ajYb" Command-line: https://min.io/docs/minio/linux/reference/minio-mc.html#quickstart FORMAT: %117s MESSAGE: $ mc alias set myminio http://192.168.50.21:9000 "U4B6Zi!b75DXSPm%avZb" "Q4#Q6y8G%%Px#npP4dudUobU#NBcGB7RMKV4ajYb" $ mc alias set myminio http://192.168.50.21:9000 "U4B6Zi!b75DXSPm%!a(MISSING)vZb" "Q4#Q6y8G%Px#npP4dudUobU#NBcGB7RMKV4ajYb" ``` After: ``` Status: 1 Online, 0 Offline. S3-API: http://192.168.50.21:9000 http://172.31.96.1:9000 http://127.0.0.1:9000 RootUser: "U4B6Zi!b75DXSPm%avZb" RootPass: "Q4#Q6y8G%%Px#npP4dudUobU#NBcGB7RMKV4ajYb" Console: http://192.168.50.21:52421 http://172.31.96.1:52421 http://127.0.0.1:52421 RootUser: "U4B6Zi!b75DXSPm%avZb" RootPass: "Q4#Q6y8G%%Px#npP4dudUobU#NBcGB7RMKV4ajYb" Command-line: https://min.io/docs/minio/linux/reference/minio-mc.html#quickstart $ mc alias set myminio http://192.168.50.21:9000 "U4B6Zi!b75DXSPm%avZb" "Q4#Q6y8G%%Px#npP4dudUobU#NBcGB7RMKV4ajYb" ``` No need for special Windows case. `mc` works just fine.	2023-10-02 07:39:47 -06:00
Shubhendu	10d5dd3a67	fix: a regression with audit log sending (#18112 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-09-26 12:23:02 -07:00
Harshavardhana	d9f1df01eb	return an error in CopyAligned upon premature EOF (#18110 ) add a unit-test to capture this corner case	2023-09-26 11:20:06 -07:00
Anis Eleuch	4eeb48f8e0	Return cached online/offline status for audit/http loggers (#18083 ) To avoid having delays in prometheus scrape and in 'mc admin info' command.	2023-09-21 16:58:24 -07:00
Harshavardhana	1472875670	fix: failed messages counting in audit_http metrics (#18075 ) all retries must not be counted as failed messages, a failed message is a single counter not for all retries, this PR fixes this. Also we do not need to retry 10-times, instead we should retry at max 3 times with some jitter to deliver the messages.	2023-09-21 11:24:56 -07:00
Harshavardhana	2add57cfed	apply healing per object at 1024 cycles (#18050 ) - we already have MRF for most recent failures - we trigger healing during HEAD/GET operation These are enough, also change the default max wait from 5sec to 1sec for default scanner speed.	2023-09-19 09:24:22 -07:00
jiuker	9947c01c8e	feat: SSE-KMS use uuid instead of read all data to md5. (#17958 )	2023-09-18 10:00:54 -07:00
Anis Eleuch	419e5baf16	fix: webhook notify endpoint with standard ports (#18016 )	2023-09-14 20:10:44 -07:00
Alex	dc48cd841a	Added MINIO_PROMETHEUS_AUTH_TOKEN env support (#18028 ) Signed-off-by: Benjamin Perez <benjamin@bexsoft.net>	2023-09-14 17:28:21 -07:00
Aditya Manthramurthy	cbc0ef459b	Fix policy package import name (#18031 ) We do not need to rename the import of minio/pkg/v2/policy as iampolicy any more.	2023-09-14 14:50:16 -07:00
Harshavardhana	32890342ce	introduce MINIO_BROWSER_REDIRECT env to enable/disable auto-redirect (#18025 )	2023-09-13 18:43:57 -07:00
Anis Eleuch	41de53996b	heal: calculate the number of workers based on NRRequests (#17945 )	2023-09-11 14:48:54 -07:00
Harshavardhana	9878031cfd	fix: change DISK_ to DRIVE_ for some drive related envs (#18005 )	2023-09-11 12:19:22 -07:00
Harshavardhana	e3fbcaeb72	allow scanner key cycle to be empty (#18001 ) configs from 2020 server throws an error due to deprecation of the keys however an attempt is made to parse them, we should have chosen existing defaults - this PR fixes that.	2023-09-09 08:53:32 -07:00
Anis Eleuch	b9269151a4	fix: drive rotational calculation status for partitions (#17986 ) Fix drive rotational calculation status If a MinIO drive path is mounted to a partition and not a real disk, getting the rotational status would fail because Linux does not expose that status to partition; In other words, /sys/block/drive-partition-name/queue/rotational does not exist; To fix the issue, the code will search for the rotational status of the disk that hosts the partition, and this can be calculated from the real path of /sys/class/block/<drive-partition-name>	2023-09-06 12:37:57 -07:00
Harshavardhana	5b114b43f7	refactor bandwidth throttling for replication target (#17980 ) This refactor is to allow using the bandwidth throttling for other purposes.	2023-09-05 20:21:59 -07:00
Aditya Manthramurthy	1c99fb106c	Update to minio/pkg/v2 (#17967 )	2023-09-04 12:57:37 -07:00
Anis Eleuch	6a8d8f34a5	kafka: Do not require key when sending a message (#17962 ) Keys are helpful to ensure the strict ordering of messages, however currently the code uses a random request id for every log, hence using the request-id as a Kafka key is not serve any purpose; This commit removes the usage of the key, to also fix the audit issue from internal subsystem that does not have a request ID.	2023-09-01 08:37:22 -07:00
Harshavardhana	0d1fbef751	fix: a possible crash in event target Close() (#17948 ) these are possible crashes when the configured target is still in init() state and never finished - however a delete config was initiated.	2023-08-30 07:27:45 -07:00
Harshavardhana	07b1281046	add queue_dir to help message for logger/audit targets	2023-08-29 16:07:35 -07:00
Harshavardhana	adb8be069e	tune-kafka targets to ensure timeout triggers on hung brokers (#17898 ) hung brokers can cause slowness to the entire system when many callers are hung, leading to large goroutine build-up.	2023-08-22 20:26:35 -07:00
Harshavardhana	3a0125fa1f	remove unexpected logging from peer calls (#17888 ) also make sure RequestID is set for system logs	2023-08-21 14:25:24 -07:00
Daniel Valdivia	328cb0a076	Pass environment variable to control session length to console (#17885 ) Signed-off-by: Daniel Valdivia <18384552+dvaldivia@users.noreply.github.com>	2023-08-21 11:55:43 -07:00
jiuker	fa2a8d7209	fix: drain the req.body into io.Discard correctly (#17881 )	2023-08-21 01:09:07 -07:00
Andreas Auernhammer	8f8f8854f0	update `minio/kes-go` dep to v0.2.0 (#17850 ) This commit updates the minio/kes-go dependency to v0.2.0 and updates the existing code to work with the new KES APIs. The `SetPolicy` handler got removed since it may not get implemented by KES at all and could not have been used in the past since stateless KES is read-only w.r.t. policies and identities. Signed-off-by: Andreas Auernhammer <hi@aead.dev>	2023-08-19 07:37:53 -07:00
Harshavardhana	bc7c0d8624	if object is a delete marker it must skip tags filter in ILM (#17861 )	2023-08-18 09:36:23 -07:00
Harshavardhana	11dfc817f3	do not log client canceled events (#17838 )	2023-08-17 14:53:43 -07:00
Harshavardhana	8a9b886011	update grafana dashboard with disk -> drive rename (#17857 )	2023-08-15 16:04:20 -07:00
Klaus Post	406ea4f281	Fix distributed listing not able to resume (#17855 ) Two fields in lifecycles made GOB encoding consistently fail with `gob: type lifecycle.Prefix has no exported fields`. This meant that in distributed systems listings would never be able to continue and would restart on every call. Fix issues and be sure to log these errors at least once per bucket. We may see some connectivity errors here, but we shouldn't hide them.	2023-08-15 07:45:25 -07:00
Harshavardhana	c45bc32d98	skip disks under scanning when healing disks (#17822 ) Bonus: - avoid calling DiskInfo() calls when missing blocks instead heal the object using MRF operation. - change the max_sleep to 250ms beyond that we will not stop healing.	2023-08-09 12:51:47 -07:00
Praveen raj Mani	0285df5a02	fix: prioritize audit_webhook and logger_webhook ENVs over the config KVS (#17783 )	2023-08-03 02:47:07 -07:00

1 2 3 4 5 ...

631 Commits