minio

mirror of https://github.com/minio/minio.git synced 2025-11-29 13:28:17 -05:00

Author	SHA1	Message	Date
Klaus Post	51aa59a737	perf: websocket grid connectivity for all internode communication (#18461)	2023-11-20 17:09:35 -08:00

This PR adds a WebSocket grid feature that allows servers to communicate via a single two-way connection.

There are two request types:

* Single requests, which are `[]byte => ([]byte, error)`. This is for efficient small roundtrips with small payloads.
* Streaming requests, which are `[]byte, chan []byte => chan []byte (and error)`, which allows for different combinations of full two-way streams with an initial payload.

Only a single stream is created between two machines - and there is, as such, no server/client relation since both sides can initiate and handle requests. Which server initiates the request is decided deterministically on the server names.

Requests are made through a mux client and server, which handles message passing, congestion, cancelation, timeouts, etc.

If a connection is lost, all requests are canceled, and the calling server will try to reconnect. Registered handlers can operate directly on byte slices or use a higher-level generics abstraction.

There is no versioning of handlers/clients, and incompatible changes should be handled by adding new handlers.

The request path can be changed to a new one for any protocol changes.

First, all servers create a "Manager." The manager must know its address as well as all remote addresses. This will manage all connections. To get a connection to any remote, ask the manager to provide it given the remote address using:

```
func (m Manager) Connection(host string) Connection
```

All serverside handlers must also be registered on the manager. This will make sure that all incoming requests are served. The number of in-flight requests and responses must also be given for streaming requests.

The "Connection" returned manages the mux-clients. Requests issued to the connection will be sent to the remote.

* `func (c Connection) Request(ctx context.Context, h HandlerID, req []byte) ([]byte, error)` performs a single request and returns the result. Any deadline provided on the request is forwarded to the server, and canceling the context will make the function return at once.
* `func (c Connection) NewStream(ctx context.Context, h HandlerID, payload []byte) (st Stream, err error)` will initiate a remote call and send the initial payload.

```Go
// A Stream is a two-way stream.
// All responses must be read by the caller.
// If the call is canceled through the context,
// the appropriate error will be returned.
type Stream struct {
	// Responses from the remote server.
	// Channel will be closed after an error or when the remote closes.
	// All responses must be read by the caller until either an error is returned or the channel is closed.
	// Canceling the context will cause the context cancellation error to be returned.
	Responses <-chan Response

	// Requests sent to the server.
	// If the handler is defined with 0 incoming capacity this will be nil.
	// Channel must be closed to signal the end of the stream.
	// If the request context is canceled, the stream will no longer process requests.
	Requests chan<- []byte
}

type Response struct {
	Msg []byte
	Err error
}
```

There are generic versions of the server/client handlers that allow the use of type safe implementations for data types that support msgpack marshal/unmarshal.

Harshavardhana	91d8bddbd1	use sendfile/splice implementation to perform DMA (#18411) sendfile implementation to perform DMA on all platforms. Go stdlib already supports sendfile/splice implementations for Linux, Windows, *BSD, and Solaris. Along with this change, however, O_DIRECT for reads must be removed as well, since we need to use the sendfile() implementation. The main reason to add O_DIRECT for reads was to reduce the chances of page-cache causing OOMs for MinIO; however, it would seem that by avoiding buffer copies from user-space to kernel space this issue is not a problem anymore. There is no Go-based memory allocation required, and the page-cache is not referenced back to MinIO either. This page-cache reference is fully owned by the kernel at this point, which essentially should solve the problem of page-cache build-up. With this we now also support SG when the NIC supports Scatter/Gather: https://en.wikipedia.org/wiki/Gather/scatter_(vector_addressing)	2023-11-10 10:10:14 -08:00
Harshavardhana	754f7a8a39	replace io.Discard usage to fix some NUMA copy() latencies (#18394) On NUMA systems, copying from the 8K buffer allocated via io.Discard leads to large latency build-up: every `copy(new8kbuf, largebuf)` can incur up to 1ms worth of latency due to memory sharding across NUMA nodes.	2023-11-06 14:26:08 -08:00
Aditya Manthramurthy	1c99fb106c	Update to minio/pkg/v2 (#17967)	2023-09-04 12:57:37 -07:00
Harshavardhana	114fab4c70	export cluster health as prometheus metrics (#17741)	2023-07-28 01:16:53 -07:00
Harshavardhana	2d1cda2061	fix: do not os.Exit(1) while writing goroutines during shutdown (#17640) Also add jitter to the shutdown poll, to check whether the shutdown sequence can finish before 500ms; this reduces the overall time taken during a "restart" of the service. Provides a speedup for `mc admin service restart` during active I/O, and also ensures that systemd doesn't treat the returned 'error' as a failure; certain systemd configurations can cause it to 'auto-restart' the process by itself, which can interfere with `mc admin service restart`. Restarting the service is now noticeably snappier.	2023-07-12 07:18:30 -07:00
Poorna	fb49aead9b	replication: add validation API (#17520) To check if replication is set up properly on a bucket.	2023-07-10 20:09:20 -07:00
Harshavardhana	28a01f0320	update missing license header in files (#17603)	2023-07-08 10:42:05 -07:00
Harshavardhana	e37c4efc6e	fix: upon DNS refresh() failure use previous values (#17561) In MinIO's case, DNS refresh() can safely reuse the previous values on bare-metal setups, since bare-metal arrangements do not commonly change DNS. This PR simplifies that: we only ever need DNS caching on bare-metal setups. - On containerized setups, do not enable DNS caching at all, as it may have adverse effects on the overall effectiveness of k8s DNS systems. k8s DNS systems are dynamic and expect applications to avoid managing DNS caching themselves; instead they provide cleaner container-native caching implementations that must be used. - update IsDocker() detection, including podman runtime - move to minio/dnscache fork for a simpler package	2023-07-03 12:30:51 -07:00
Harshavardhana	7f782983ca	fix: for FTP server driver allow implicit trust of TLS (#17541) fixes #17535	2023-06-30 08:04:13 -07:00
Anis Eleuch	0f0dcf0c5e	tar: Avoid storing snowball extraction header in extract objects (#17389)	2023-06-12 09:42:06 -07:00
Anis Eleuch	bb24346e04	listen: Only error out if not able to bind any interface (#17353)	2023-06-12 09:09:28 -07:00
Klaus Post	6e38d0f3ab	Add more bootstrap info in debug mode (#17362)	2023-06-08 08:39:47 -07:00
Anis Eleuch	eba378e4a1	vrf: Fix testing for loopback coming from the address (#17372)	2023-06-07 09:53:05 -07:00
Krishnan Parthasarathi	62df731006	Add updatedAt for GetBucketLifecycleConfig (#17271)	2023-05-24 22:52:39 -07:00
Harshavardhana	5569acd95c	disallow EC:0 if not set during server startup (#17141)	2023-05-04 14:44:30 -07:00
Harshavardhana	9571b0825e	add configurable VRF interface and user-timeout (#17108)	2023-05-03 14:12:25 -07:00
Anis Eleuch	31b5acc245	tcp: Increase user timeout to 10 minutes (#17087)	2023-04-26 17:48:31 -07:00
Harshavardhana	a5835cecbf	fix: regression in counting total requests (#17024)	2023-04-12 14:37:19 -07:00
Anis Eleuch	c259a8ea38	Set tcp user timeout to clean sockets with data in the buffer (#16887)	2023-03-24 08:10:58 -07:00
Harshavardhana	901887e6bf	feat: add lambda transformation functions target (#16507)	2023-03-07 08:12:41 -08:00
Harshavardhana	5c98223c89	add correct HostId instead of deploymentId for error responses (#16686)	2023-02-22 15:41:09 +05:30
Harshavardhana	65c104a589	add x-amz-id-2 to indicate the node that received the request (#16474)	2023-01-25 09:14:10 -08:00
Aditya Manthramurthy	698862ec5d	Fix transports/timeouts related regressions (#16427)	2023-01-18 10:06:38 +05:30
Allan Roger Reid	9815dac48f	fix: allow bind on ipv6 loopback failures (#16388)	2023-01-11 08:47:39 +05:30
Anis Elleuch	1c85652cff	lint: Fix in darwin environment (#16368)	2023-01-05 10:12:01 -08:00
Aditya Manthramurthy	2d60bf8c50	Refactor HTTP transports (#16222)	2022-12-12 20:31:21 -08:00
Harshavardhana	419f351df3	avoid logging gzipped body in trace output (#16172)	2022-12-05 13:21:27 -08:00
Anis Elleuch	1f1dcdce65	move HTTP recorder to an internal library (#16128)	2022-11-28 10:20:27 -08:00
Shireesh Anjal	5246e3be84	Send health diagnostics data as part of callhome (#16006)	2022-11-15 13:53:05 -08:00
Harshavardhana	944c62daf4	skip flaky tests on windows OS (#16015)	2022-11-07 00:11:21 -08:00
Poorna	e4e90b53c1	fix: delete-marker replication check properly (#15923)	2022-10-21 14:45:06 -07:00
Klaus Post	bd3dfad8b9	Add concurrent Snowball extraction + options (#15836)	2022-10-18 13:50:21 -07:00
Poorna	0e3c92c027	attempt delete marker replication after object is replicated (#15857) Ensure delete marker replication success, especially since the recent optimizations to heal on HEAD, LIST and GET can force replication attempts on delete marker before underlying object version could have synced.	2022-10-13 17:45:23 -07:00
Klaus Post	ff12080ff5	Remove deprecated io/ioutil (#15707)	2022-09-19 11:05:16 -07:00
Klaus Post	a9f1ad7924	Add extended checksum support (#15433)	2022-08-29 16:57:16 -07:00
Klaus Post	3795b2c8ba	Add compression scheme to header (#15395) For easier debugging. We still do not return compressed size for security reasons.	2022-07-24 07:15:49 -07:00
Harshavardhana	5792be71fa	fix: add timeouts to avoid goroutine leaks in net/http (#14995) The following code can reproduce an unending goroutine build-up while keeping connections established, because the client never closes them: https://gist.github.com/harshavardhana/2d00e6f909054d2d2524c71485ad02e1 Without this PR, all MinIO deployments are exposed to denial-of-service attacks that can make the entire service unavailable. We bring in two timeouts at this stage to control such goroutine build-ups: IdleTimeout (to kill off idle connections) and ReadHeaderTimeout (to kill off connections that are too slow). This change also brings two hidden options to make any additional relevant changes if desired in some setups.	2022-05-30 06:24:51 -07:00
Harshavardhana	5a9a898ba2	allow forcibly creating metadata on buckets (#14820) introduce x-minio-force-create environment variable to force-create a bucket and its metadata as required; it is useful in some situations when bucket metadata needs recovery.	2022-04-27 04:44:07 -07:00
Shireesh Anjal	25144fedd5	Send deployment id and minio version in http header (#14378)	2022-02-23 13:36:01 -08:00
Harshavardhana	1a56ebea70	cleanup dsync tests and remove net/rpc references (#14118)	2022-01-18 12:44:38 -08:00
Harshavardhana	f527c708f2	run gofumpt cleanup across code-base (#14015)	2022-01-02 09:15:06 -08:00
Harshavardhana	e49c184595	add configurable 'shutdown-timeout' for HTTP server (#13771) fixes #12317	2021-11-29 09:06:56 -08:00
Harshavardhana	fb268add7a	do not flush if Write() failed (#13597) - Go might reset the internal http.ResponseWriter() to `nil` after a Write() failure if the goroutine has returned; do not flush() in such scenarios, and avoid spurious flushes since returning handlers always flush. - fix some racy tests with the console - avoid ticker leaks in certain situations	2021-11-18 17:19:58 -08:00
Harshavardhana	db84bb9bd3	avoid atomics for self contained reader/writers (#13531) readers/writers are not concurrent in handlers and are self-contained, so there is no need to use atomics on them. Avoids unnecessary contention where it's not required.	2021-10-28 17:03:00 -07:00
Harshavardhana	087dc13965	fix: server in shutdown should return 503 instead of 403 (#13496) In various situations where the client is retrying a request, a server going through shutdown might incorrectly send 403, which is a non-retriable error. This PR allows clients, when they retry, to go to another healthy pod or server in a distributed cluster, assuming it is a properly load-balanced setup.	2021-10-22 01:30:27 -07:00
Anis Elleuch	d7b7040408	tls: Avoid 3DES cipher (#13459) 3DES is enabled by default in Golang; this commit uses tls.CipherSuites(), which returns all ciphers excluding those with security issues, such as 3DES.	2021-10-18 08:39:15 -07:00
Harshavardhana	9ea45399ce	fix: enable AssumeRoleWithCertificate API only when asked (#13410) This is a breaking change, but we need to do this to avoid issues discussed in #13409 based on discussions from #13371. fixes #13371 fixes #13409	2021-10-11 14:23:51 -07:00
Harshavardhana	acc9645249	allow more socket listeners per instance for multi-core setups (#13385)	2021-10-08 16:58:24 -07:00
Harshavardhana	3d5750f31c	update and use rs/dnscache implementation instead of custom (#13348) Additionally optimize for IP-only setups: avoid unnecessary lookups if the Dial addr is an IP. Allow support for multiple listeners on the same socket; this is mainly meant for future purposes.	2021-10-05 10:13:04 -07:00
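
The `Stream` contract quoted in commit 51aa59a737 (read every response until an error arrives or the channel closes, and close `Requests` to end the stream) can be illustrated with a minimal sketch. It redeclares the two types from the commit message locally and drives them with an in-process echo goroutine; the `echoStream`, `drain`, and `roundTrip` helpers are hypothetical stand-ins, not part of the grid package.

```go
package main

import "fmt"

// Response and Stream mirror the shapes quoted in the commit message.
type Response struct {
	Msg []byte
	Err error
}

type Stream struct {
	Responses <-chan Response
	Requests  chan<- []byte
}

// echoStream is a hypothetical stand-in for a remote handler: it echoes
// every request back as a response and closes Responses once the caller
// closes Requests.
func echoStream() Stream {
	reqs := make(chan []byte, 1)
	resps := make(chan Response, 1)
	go func() {
		defer close(resps)
		for msg := range reqs {
			resps <- Response{Msg: msg}
		}
	}()
	return Stream{Responses: resps, Requests: reqs}
}

// drain reads responses until an error is seen or the channel closes,
// as the Stream contract requires.
func drain(st Stream) ([]string, error) {
	var out []string
	for r := range st.Responses {
		if r.Err != nil {
			return out, r.Err
		}
		out = append(out, string(r.Msg))
	}
	return out, nil
}

// roundTrip sends the given messages, closes Requests to signal the end of
// the stream, and collects all responses.
func roundTrip(msgs ...string) ([]string, error) {
	st := echoStream()
	go func() {
		defer close(st.Requests)
		for _, m := range msgs {
			st.Requests <- []byte(m)
		}
	}()
	return drain(st)
}

func main() {
	got, err := roundTrip("a", "b", "c")
	fmt.Println(got, err)
}
```

Sending from a separate goroutine while the caller drains `Responses` keeps both directions moving, which matters when the channels have small buffers.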
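
Commit 5792be71fa names two `net/http` server timeouts, `IdleTimeout` and `ReadHeaderTimeout`. A minimal sketch of setting them on a standard `http.Server` follows; the `newServer` helper and the durations are illustrative assumptions, not the values or the wiring MinIO uses.

```go
package main

import (
	"fmt"
	"net/http"
	"time"
)

// newServer returns an http.Server with the two timeouts named in the
// commit. The durations are placeholders chosen for illustration only.
func newServer(addr string, h http.Handler) *http.Server {
	return &http.Server{
		Addr:    addr,
		Handler: h,
		// Close keep-alive connections that stay idle for too long.
		IdleTimeout: 30 * time.Second,
		// Drop connections that are too slow to send request headers.
		ReadHeaderTimeout: 5 * time.Second,
	}
}

func main() {
	srv := newServer("127.0.0.1:0", http.NotFoundHandler())
	fmt.Println(srv.IdleTimeout, srv.ReadHeaderTimeout)
}
```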
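
The behaviour described in commit 087dc13965 (answer 503 rather than 403 while shutting down, so clients retry elsewhere) might look roughly like the middleware below. The `shuttingDown` flag, `rejectDuringShutdown`, and `statusFor` are hypothetical names for this sketch and do not reflect MinIO's actual handler chain.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
	"sync/atomic"
)

// shuttingDown is a hypothetical process-wide flag set when shutdown begins.
var shuttingDown atomic.Bool

// rejectDuringShutdown wraps next and answers 503 once shutdown has started,
// so that well-behaved clients treat the failure as retriable.
func rejectDuringShutdown(next http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		if shuttingDown.Load() {
			http.Error(w, "server is shutting down", http.StatusServiceUnavailable)
			return
		}
		next.ServeHTTP(w, r)
	})
}

// statusFor issues one in-memory request and returns the status code.
func statusFor(down bool) int {
	shuttingDown.Store(down)
	ok := http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		w.WriteHeader(http.StatusOK)
	})
	rec := httptest.NewRecorder()
	req := httptest.NewRequest(http.MethodGet, "/", nil)
	rejectDuringShutdown(ok).ServeHTTP(rec, req)
	return rec.Code
}

func main() {
	fmt.Println(statusFor(false), statusFor(true))
}
```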
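
Commit e37c4efc6e describes reusing previously resolved values when a DNS refresh fails. The sketch below shows that idea in isolation; `cachedResolver` and its injected `lookup` function are hypothetical and are not the minio/dnscache API.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// cachedResolver is a hypothetical cache; lookup is injected so the sketch
// does not depend on real DNS.
type cachedResolver struct {
	mu     sync.Mutex
	lookup func(host string) ([]string, error)
	cache  map[string][]string
}

func newCachedResolver(lookup func(string) ([]string, error)) *cachedResolver {
	return &cachedResolver{lookup: lookup, cache: make(map[string][]string)}
}

// Refresh re-resolves host. On failure it keeps and returns the previous
// addresses, if there are any, instead of dropping the entry.
func (r *cachedResolver) Refresh(host string) ([]string, error) {
	addrs, err := r.lookup(host)
	r.mu.Lock()
	defer r.mu.Unlock()
	if err != nil {
		if prev, ok := r.cache[host]; ok {
			return prev, nil
		}
		return nil, err
	}
	r.cache[host] = addrs
	return addrs, nil
}

// demo resolves once successfully, then simulates a failing refresh.
func demo() ([]string, error) {
	fail := false
	r := newCachedResolver(func(string) ([]string, error) {
		if fail {
			return nil, errors.New("lookup failed")
		}
		return []string{"10.0.0.1"}, nil
	})
	if _, err := r.Refresh("node1"); err != nil {
		return nil, err
	}
	fail = true
	return r.Refresh("node1")
}

func main() {
	fmt.Println(demo())
}
```

A refresh that fails before any successful lookup still returns the error, since there is no earlier value to fall back on.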

60 Commits