minio

mirror of https://github.com/minio/minio.git synced 2025-07-18 13:12:46 -04:00

Author	SHA1	Message	Date
Klaus Post	51aa59a737	perf: websocket grid connectivity for all internode communication (#18461 ) This PR adds a WebSocket grid feature that allows servers to communicate via a single two-way connection. There are two request types: * Single requests, which are `[]byte => ([]byte, error)`. This is for efficient small roundtrips with small payloads. * Streaming requests which are `[]byte, chan []byte => chan []byte (and error)`, which allows for different combinations of full two-way streams with an initial payload. Only a single stream is created between two machines - and there is, as such, no server/client relation since both sides can initiate and handle requests. Which server initiates the request is decided deterministically on the server names. Requests are made through a mux client and server, which handles message passing, congestion, cancelation, timeouts, etc. If a connection is lost, all requests are canceled, and the calling server will try to reconnect. Registered handlers can operate directly on byte slices or use a higher-level generics abstraction. There is no versioning of handlers/clients, and incompatible changes should be handled by adding new handlers. The request path can be changed to a new one for any protocol changes. First, all servers create a "Manager." The manager must know its address as well as all remote addresses. This will manage all connections. To get a connection to any remote, ask the manager to provide it given the remote address using. ``` func (m Manager) Connection(host string) Connection ``` All serverside handlers must also be registered on the manager. This will make sure that all incoming requests are served. The number of in-flight requests and responses must also be given for streaming requests. The "Connection" returned manages the mux-clients. Requests issued to the connection will be sent to the remote. * `func (c Connection) Request(ctx context.Context, h HandlerID, req []byte) ([]byte, error)` performs a single request and returns the result. Any deadline provided on the request is forwarded to the server, and canceling the context will make the function return at once. `func (c Connection) NewStream(ctx context.Context, h HandlerID, payload []byte) (st Stream, err error)` will initiate a remote call and send the initial payload. ```Go // A Stream is a two-way stream. // All responses must be read by the caller. // If the call is canceled through the context, //The appropriate error will be returned. type Stream struct { // Responses from the remote server. // Channel will be closed after an error or when the remote closes. // All responses must be read by the caller until either an error is returned or the channel is closed. // Canceling the context will cause the context cancellation error to be returned. Responses <-chan Response // Requests sent to the server. // If the handler is defined with 0 incoming capacity this will be nil. // Channel must be closed to signal the end of the stream. // If the request context is canceled, the stream will no longer process requests. Requests chan<- []byte } type Response struct { Msg []byte Err error } ``` There are generic versions of the server/client handlers that allow the use of type safe implementations for data types that support msgpack marshal/unmarshal.	2023-11-20 17:09:35 -08:00
Minio Trusted	8bedb419a9	Update yaml files to latest version RELEASE.2023-11-20T22-40-07Z	2023-11-21 00:54:16 +00:00
jiuker	f56a182b71	fix: close http body when webhook send (#18487 ) RELEASE.2023-11-20T22-40-07Z	2023-11-20 14:40:07 -08:00
Shubhendu	317b40ef90	Fixed broken docs link (#18486 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-11-20 12:04:49 -08:00
Shubhendu	e938ece492	Added guidelines for setting prometheus alerts (#18479 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-11-19 10:16:08 -08:00
Anis Eleuch	02331a612c	batch-repl: Replicate missing metadata and standard headers (#18484 ) - Replicate Expires when the source is local or remote - Replicate metadata when the source is remote	2023-11-18 19:12:44 -08:00
Anis Eleuch	8317557f70	decom: Fix listing quorum to be equal to deletion quorum (#18476 ) With an odd number of drives per erasure set setup, the write/quorum is the half + 1; however the decommissioning listing will still list those objects and does not consider those as stale. Fix it by using (N+1)/2 formula. Co-authored-by: Anis Elleuch <anis@min.io>	2023-11-17 21:09:09 -08:00
Anis Eleuch	1bb7a2a295	Immediate transition ILM to avoid quick deferring to the scanner (#18475 ) Immediate transition use case and is mostly used to fill warm backend with a lot of data when a new deployment is created Currently, if the transition queue is complete, the transition will be deferred to the scanner; change this behavior by blocking the PUT request until the transition queue has a new place for a transition task.	2023-11-17 16:16:46 -08:00
jiuker	215ca58d6a	fix: close the http.Body when WebhookTarget isActive (#18467 )	2023-11-17 12:02:26 -08:00
Anis Eleuch	12f570a307	audit: Try to send audit even if the status is offline (#18458 ) Currently, once the audit becomes offline, there is no code that tries to reconnect to the audit, at the same time Send() quickly returns with an error without really trying to send a message the audit endpoint; so the audit endpoint will never be online again. Fixing this behavior; the current downside is that we miss printing some logs when the audit becomes offline; however this information is available in prometheus Later, we can refactor internal/logger so the http endpoint can send errors to console target.	2023-11-17 10:40:28 -08:00
Shubhendu	e4b619ce1a	Added graph for Erasure Set Tolerance value (#18472 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-11-17 10:38:15 -08:00
Harshavardhana	0a286153bb	remove checking for BucketInfo() peer call for every PUT() (#18464 ) we already validate if the bucket doesn't exist in RenameData() which can handle this cleanly, instead of making a network call and returning errors.	2023-11-17 05:29:50 -08:00
Anis Eleuch	22d59e757d	Remove stale data in HEAD/GET object (#18460 ) Currently if the object does not exist in quorum disks of an erasure set, the dangling code is never called because the returned error will be errFileNotFound or errFileVersionNotFound; With this commit, when errFileNotFound or errFileVersionNotFound is returning when trying to calculate the quorum of a given object, the code checks if a disk returned nil, which means a stale object exists in that disk, that will trigger deleteIfDangling() function	2023-11-16 08:39:53 -08:00
Andreas Auernhammer	0daa2dbf59	health: split liveness and readiness handler (#18457 ) This commit splits the liveness and readiness handler into two separate handlers. In K8S, a liveness probe is used to determine whether the pod is in "live" state and functioning at all. In contrast, the readiness probe is used to determine whether the pod is ready to serve requests. A failing liveness probe causes pod restarts while a failing readiness probe causes k8s to stop routing traffic to the pod. Hence, a liveness probe should be as robust as possible while a readiness probe should be used to load balancing. Ref: https://kubernetes.io/docs/tasks/configure-pod-container/configure-liveness-readiness-startup-probes/ Signed-off-by: Andreas Auernhammer <github@aead.dev>	2023-11-16 01:51:27 -08:00
Adrian Najera	96c2304ae8	allow MINIO_STS_DURATION to increase the IDP token expiration (#18396 ) Share link duration is based on the IDP token expiration, for the share link to last longer, you may now use MINIO_STS_DURATION environment variable.	2023-11-15 20:42:31 -08:00
Minio Trusted	343dd2f491	Update yaml files to latest version RELEASE.2023-11-15T20-43-25Z	2023-11-16 01:32:11 +00:00
Praveen raj Mani	38f35463b7	Load bucket configs during the metadata refresh (#18449 ) This patch takes care of loading the bucket configs of failed buckets during the periodic refresh. This makes sure the event notifiers and remote bucket targets are properly initialized. RELEASE.2023-11-15T20-43-25Z	2023-11-15 12:43:25 -08:00
Harshavardhana	5573986e8e	fix: relax free inode check for single drive deployments (#18437 ) users might use MinIO on NFS, GPFS that provide dynamic inodes and may not even have a concept of free inodes. to allow users to use MinIO on top of GPFS relax the free inode check.	2023-11-14 09:31:16 -08:00
Sveinn	f3367a1b20	Adding error handling for network errors in the SFTP layer (#18442 )	2023-11-14 09:31:00 -08:00
Sveinn	a3c2f7b0e8	small fix for the PR template (#18443 )	2023-11-14 09:29:11 -08:00
Sveinn	8fbec30998	Adding a missing return to fix SFTP Rmdir message (#18438 )	2023-11-14 09:26:46 -08:00
Harshavardhana	a7466eeb0e	fix: ignore dperf on unformatted/unavailable/unmounted drives (#18435 )	2023-11-13 22:32:08 -08:00
Harshavardhana	8b1e819bf3	fix: make sure to purge all the completed in resume() (#18429 ) currently previously completed jobs would re-run even if they are completed, causing incorrect behavior.	2023-11-13 08:15:00 -08:00
Anis Eleuch	fe63664164	prom: Add drive failure tolerance per erasure set (#18424 )	2023-11-13 00:59:48 -08:00
Minio Trusted	4598827dcb	Update yaml files to latest version RELEASE.2023-11-11T08-14-41Z	2023-11-11 18:01:27 +00:00
Sveinn	9afdb05bf4	fix: file consistency issue on SFTP upload (#18422 ) * creating a byte buffer for SFTP file segments * Adding an error condition for when there are remaining segments in the queue * Simplification of the queue using a map RELEASE.2023-11-11T08-14-41Z	2023-11-11 00:14:41 -08:00
Krishnan Parthasarathi	9569a85cee	Avoid allocs for MRF on-disk header (#18425 )	2023-11-10 19:54:46 -08:00
Harshavardhana	54721b7c7b	fix: batch replication from source allow out of band deletes (#18423 ) it is possible that ILM or Deletes got triggered on batch of objects that we are attempting to batch replicate, ignore this scenario as valid behavior.	2023-11-10 16:12:35 -08:00
Harshavardhana	91d8bddbd1	use sendfile/splice implementation to perform DMA (#18411 ) sendfile implementation to perform DMA on all platforms Go stdlib already supports sendfile/splice implementations for - Linux - Windows - *BSD - Solaris Along with this change however O_DIRECT for reads() must be removed as well since we need to use sendfile() implementation The main reason to add O_DIRECT for reads was to reduce the chances of page-cache causing OOMs for MinIO, however it would seem that avoiding buffer copies from user-space to kernel space this issue is not a problem anymore. There is no Go based memory allocation required, and neither the page-cache is referenced back to MinIO. This page- cache reference is fully owned by kernel at this point, this essentially should solve the problem of page-cache build up. With this now we also support SG - when NIC supports Scatter/Gather https://en.wikipedia.org/wiki/Gather/scatter_(vector_addressing)	2023-11-10 10:10:14 -08:00
Harshavardhana	80adc87a14	converge WARM tier object name to hash of deployment+bucket (#18410 ) this is to ensure that we can converge and save IOPs when hot-tier accesses MinIO.	2023-11-10 02:15:13 -08:00
Taran Pelkey	117ad1b65b	Loosen requirements to detach policies for LDAP (#18419 )	2023-11-09 14:44:43 -08:00
Klaus Post	2229509362	fix: leaking offline disks in MarkOffline() thread (#18414 ) `monitorAndConnectEndpoints` will continue to attempt to reconnect offline disks. Since disks were never closed, a `MarkOffline` would continue to try to check these disks forever. Close previous disks.	2023-11-09 09:33:32 -08:00
Anis Eleuch	6ef8e87492	Support case insensitive kafka SASL mechanism config values (#18398 )	2023-11-08 20:04:01 -08:00
Krishnan Parthasarathi	0a25083fdb	Tiered objects require ns locks unlike inlined (#18409 )	2023-11-08 20:00:02 -08:00
Sveinn	15137d0327	refactor SFTP to use the new minio/pkg implementation (#18406 )	2023-11-08 09:47:05 -08:00
Poorna	8c9974bc0f	site replication: avoid propagating bucket b/w settings (#18399 ) replication mode and bucket bandwidth are one-way and should not be propagated to peer cluster. Regression from #18062	2023-11-08 00:40:25 -08:00
jiuker	079b6c2b50	fix: add err when all bucket resync failed (#18401 )	2023-11-08 00:40:08 -08:00
Minio Trusted	0924b34a17	Update yaml files to latest version RELEASE.2023-11-06T22-26-08Z	2023-11-08 08:04:46 +00:00
Harshavardhana	754f7a8a39	replace io.Discard usage to fix some NUMA copy() latencies (#18394 ) replace io.Discard usage to fix NUMA copy() latencies On NUMA systems copying from 8K buffer allocated via io.Discard leads to large latency build-up for every ``` copy(new8kbuf, largebuf) ``` can in-cur upto 1ms worth of latencies on NUMA systems due to memory sharding across NUMA nodes. RELEASE.2023-11-06T22-26-08Z	2023-11-06 14:26:08 -08:00
Harshavardhana	64bafe1dfe	skip speedtest bucket from site-replication (#18393 )	2023-11-06 11:52:33 -08:00
jiuker	c3e456e7e6	fix: no resyncid when site-replication cancel (#18392 )	2023-11-06 01:53:31 -08:00
Harshavardhana	57aaeafd2f	update dperf to include NUMA fixes (#18391 )	2023-11-04 20:16:14 -07:00
Harshavardhana	3c2e1a87e2	fix: support dropping privileges with arbitrary users (#18386 ) fixes #18380	2023-11-03 14:18:18 -07:00
vicmunoz	da95a2d13f	fix: object versions metric help (#18388 )	2023-11-03 11:43:52 -07:00
Shireesh Anjal	cc5e05fdeb	Do not anonymize hostnames by default (#18387 ) Anonymize them only if the parameter `anonymize` is set to `strict	2023-11-03 10:09:33 -07:00
Harshavardhana	a79c390cca	update console v0.41.0 (#18385 ) Signed-off-by: Harshavardhana <harsha@minio.io>	2023-11-02 18:47:09 -07:00
jiuker	8a56af439c	fix: siteReplicationSys.startResync return no buckets return if error (#18374 )	2023-11-02 16:00:03 -07:00
Shireesh Anjal	f6e581ce54	Capture network device info in health report (#18381 )	2023-11-02 09:49:49 -07:00
Minio Trusted	8953f88780	Update yaml files to latest version RELEASE.2023-11-01T18-37-25Z	2023-11-01 21:57:51 +00:00
Harshavardhana	4b4a98d5e5	add support for older CPU via a new container image (#18370 ) fixes #18365 RELEASE.2023-11-01T18-37-25Z	2023-11-01 11:37:25 -07:00

1 2 3 4 5 ...

11262 Commits