minio

mirror of https://github.com/minio/minio.git synced 2025-12-02 06:07:51 -05:00

Author	SHA1	Message	Date
Klaus Post	d4b391de1b	Add PutObject Ring Buffer (#19605 ) Replace the `io.Pipe` from streamingBitrotWriter -> CreateFile with a fixed size ring buffer. This will add an output buffer for encoded shards to be written to disk - potentially via RPC. This will remove blocking when `(*streamingBitrotWriter).Write` is called, and it writes hashes and data. With current settings, the write looks like this: ``` Outbound ┌───────────────────┐ ┌────────────────┐ ┌───────────────┐ ┌────────────────┐ │ │ Parr. │ │ (http body) │ │ │ │ │ Bitrot Hash │ Write │ Pipe │ Read │ HTTP buffer │ Write (syscall) │ TCP Buffer │ │ Erasure Shard │ ──────────► │ (unbuffered) │ ────────────► │ (64K Max) │ ───────────────────► │ (4MB) │ │ │ │ │ │ (io.Copy) │ │ │ └───────────────────┘ └────────────────┘ └───────────────┘ └────────────────┘ ``` We write a Hash (32 bytes). Since the pipe is unbuffered, it will block until the 32 bytes have been delivered to the TCP buffer, and the next Read hits the Pipe. Then we write the shard data. This will typically be bigger than 64KB, so it will block until two blocks have been read from the pipe. When we insert a ring buffer: ``` Outbound ┌───────────────────┐ ┌────────────────┐ ┌───────────────┐ ┌────────────────┐ │ │ │ │ (http body) │ │ │ │ │ Bitrot Hash │ Write │ Ring Buffer │ Read │ HTTP buffer │ Write (syscall) │ TCP Buffer │ │ Erasure Shard │ ──────────► │ (2MB) │ ────────────► │ (64K Max) │ ───────────────────► │ (4MB) │ │ │ │ │ │ (io.Copy) │ │ │ └───────────────────┘ └────────────────┘ └───────────────┘ └────────────────┘ ``` The hash+shard will fit within the ring buffer, so writes will not block - but will complete after a memcopy. Reads can fill the 64KB buffer if there is data for it. If the network is congested, the ring buffer will become filled, and all syscalls will be on full buffers. Only when the ring buffer is filled will erasure coding start blocking. Since there is always "space" to write output data, we remove the parallel writing since we are always writing to memory now, and the goroutine synchronization overhead probably not worth taking. If the output were blocked in the existing, we would still wait for it to unblock in parallel write, so it would make no difference there - except now the ring buffer smoothes out the load. There are some micro-optimizations we could look at later. The biggest is that, in most cases, we could encode directly to the ring buffer - if we are not at a boundary. Also, "force filling" the Read requests (i.e., blocking until a full read can be completed) could be investigated and maybe allow concurrent memory on read and write.	2024-05-14 17:11:04 -07:00
Shubhendu	de4d3dac00	Added tests for IAM policies for bucket operations (#19734 ) * Added tests for bucket access policies Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io> * move to correct category of tests Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io> --------- Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-05-14 08:43:07 -07:00
Olli Janatuinen	534e7161df	SFTP: Correctly inform client about unsupported commands (#19735 )	2024-05-14 03:29:30 -07:00
Harshavardhana	9b219cd646	fix: return quorum based error, temporary failures must be ignored (#19732 )	2024-05-14 03:29:17 -07:00
Shireesh Anjal	3bab4822f3	Add logger webhook metrics in metrics-v3 (#19515 ) endpoint: /minio/metrics/v3/cluster/webhook metrics: - failed_messages (counter) - online (gauge) - queue_length (gauge) - total_messages (counter)	2024-05-14 00:27:33 -07:00
coderwander	3c5f2d8916	fix some typo in struct name comments (#19513 ) Signed-off-by: coderwander <770732124@qq.com>	2024-05-14 00:26:50 -07:00
Shireesh Anjal	5808190398	Add more metrics to v3/cluster/erasure-set (#19714 ) Metrics being added: - read_tolerance: No of drive failures that can be tolerated without disrupting read operations - write_tolerance: No of drive failures that can be tolerated without disrupting write operations - read_health: Health of the erasure set in a pool for read operations (1=healthy, 0=unhealthy) - write_health: Health of the erasure set in a pool for write operations (1=healthy, 0=unhealthy)	2024-05-14 00:25:56 -07:00
Shireesh Anjal	b2a82248b1	Move /system/go to /debug/go (#19707 )	2024-05-14 00:25:37 -07:00
dependabot[bot]	4e5fcca8b9	build(deps): bump golang.org/x/net (#23 ) Bumps the go_modules group with 1 update in the /docs/debugging/s3-verify directory: [golang.org/x/net](https://github.com/golang/net). Updates `golang.org/x/net` from 0.24.0 to 0.25.0 - [Commits](https://github.com/golang/net/compare/v0.24.0...v0.25.0) --- updated-dependencies: - dependency-name: golang.org/x/net dependency-type: indirect dependency-group: go_modules ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-13 10:59:52 -07:00
Klaus Post	c36eaedb93	Re-add "Fix incorrect merging of slash-suffixed objects (#19729 ) Adds regression test for #19699 Failures are a bit luck based, since it requires objects to be placed on different sets. However this generates a failure prior to #19699 * Revert "Revert "Fix incorrect merging of slash-suffixed objects (#19699)"" This reverts commit `f30417d9a8`. * Don't override when suffix doesn't match. Instead rely on quorum for each.	2024-05-13 09:30:24 -07:00
Poorna	7752b03add	optimize max-keys=2 listing for spark workloads (#19725 ) to return results appropriately for versioned buckets, especially when underlying prefixes have been deleted	2024-05-13 07:57:42 -07:00
jiuker	01bfc78535	Optimization: reuse hashedSecret when LookupConfig (#19724 )	2024-05-12 22:52:27 -07:00
Shireesh Anjal	074d70112d	Consolidate drive health related metrics into single metric (#19706 ) Instead of having "online" and "healing" as two metrics, replace with a single metric "health" which can have following values: 0 = offline 1 = healthy 2 = healing	2024-05-12 10:23:50 -07:00
Harshavardhana	e8d14c0d90	verify preconditions during CompleteMultipart (#19713 ) Bonus: hold the write lock properly to apply optimistic concurrency during NewMultipartUpload()	2024-05-10 17:31:22 -07:00
Shireesh Anjal	60d7e8143a	Move /cluster/audit to /audit (#19708 ) As the audit metrics are server level and not overall cluster level.	2024-05-10 07:50:39 -07:00
Klaus Post	9667a170de	Add usage cache cleanup and lower forced top compaction (#19719 ) Lower forced compaction to 250K entries. If there is more than 250K entries on the top level force compact it and log an error.	2024-05-10 07:49:50 -07:00
Shubhendu	abae30f9e1	Added decom test with KES using sse-s3 and sse-kms (#19695 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-05-10 01:24:14 -07:00
Minio Trusted	f9311bc9d1	Update yaml files to latest version RELEASE.2024-05-10T01-41-38Z	2024-05-10 02:00:49 +00:00
Harshavardhana	b598402738	fix: unexpected credentials missing while passing RELEASE.2024-05-10T01-41-38Z	2024-05-09 18:41:38 -07:00
Harshavardhana	bd026b913f	remove references for MINIO_SERVER_URL	2024-05-09 17:22:36 -07:00
Harshavardhana	72ff69d9bb	add log-prefix name for specifying custom log-name (#19712 )	2024-05-09 14:29:37 -07:00
Harshavardhana	f30417d9a8	Revert "Fix incorrect merging of slash-suffixed objects (#19699 )" This reverts commit `2f7a10ab31`.	2024-05-09 12:32:05 -07:00
jiuker	47a4ad3cd7	fix: truncate Expiration to second when Add ServiceAccount (#19674 ) Truncate Expiration at the second when Add ServiceAccount	2024-05-09 11:08:04 -07:00
Klaus Post	2f7a10ab31	Fix incorrect merging of slash-suffixed objects (#19699 ) If two objects share everything but one object has a slash prefix, those would be merged in listings, with secondary properties used for a tiebreak. Example: An object with the key `prefix/obj` would be merged with an object named `prefix/obj/`. While this violates the [no object can be a prefix of another](https://min.io/docs/minio/linux/operations/concepts/thresholds.html#conflicting-objects), let's resolve these. If we have an object with 'name' and a directory named 'name/' discard the directory only - but allow objects of 'name' and 'name/' (xldir) to be uniquely returned. Regression from #15772	2024-05-09 11:05:45 -07:00
Harshavardhana	b534dc69ab	deprecate unexpected healing failed counters (#19705 ) simplify this to avoid verbose metrics, and make room for valid metrics to be reported for alerting etc.	2024-05-09 11:04:41 -07:00
Harshavardhana	7b7d2ea7d4	pass around correct endpoint while registering remote storage (#19710 )	2024-05-09 11:03:54 -07:00
Aditya Manthramurthy	e00de1c302	ldap-import: Add additional logs (#19691 ) These logs are being added to provide better debugging of LDAP normalization on IAM import.	2024-05-09 10:52:53 -07:00
Harshavardhana	3549e583a6	results must be a single channel to avoid overwriting `healing.bin` (#19702 )	2024-05-09 10:15:03 -07:00
Andi	f5e3eedf34	chore: use errors.New to replace fmt.Errorf with no parameters (#19568 ) Signed-off-by: ChengenH <hce19970702@gmail.com>	2024-05-09 01:44:07 -07:00
Harshavardhana	519dbfebf6	upgrade to go1.22.x	2024-05-09 01:36:00 -07:00
Harshavardhana	9a267f9270	allow caller context during reloads() to cancel (#19687 ) canceled callers might linger around longer, can potentially overwhelm the system. Instead provider a caller context and canceled callers don't hold on to them. Bonus: we have no reason to cache errors, we should never cache errors otherwise we can potentially have quorum errors creeping in unexpectedly. We should let the cache when invalidating hit the actual resources instead.	2024-05-08 17:51:34 -07:00
Anis Eleuch	67bd71b7a5	grid: Fix a window of a disconnected node not marked as offline (#19703 ) LastPong is saved as nanoseconds after a connection or reconnection but saved as seconds when receiving a pong message. The code deciding if a pong is too old can be skewed since it assumes LastPong is only in seconds.	2024-05-08 17:50:13 -07:00
Klaus Post	ec49fff583	Accept multipart checksums with part count (#19680 ) Accept multipart uploads where the combined checksum provides the expected part count. It seems this was added by AWS to make the API more consistent, even if the data is entirely superfluous on multiple levels. Improves AWS S3 compatibility.	2024-05-08 09:18:34 -07:00
Andreas Auernhammer	8b660e18f2	kms: add support for MinKMS and remove some unused/broken code (#19368 ) This commit adds support for MinKMS. Now, there are three KMS implementations in `internal/kms`: Builtin, MinIO KES and MinIO KMS. Adding another KMS integration required some cleanup. In particular: - Various KMS APIs that haven't been and are not used have been removed. A lot of the code was broken anyway. - Metrics are now monitored by the `kms.KMS` itself. For basic metrics this is simpler than collecting metrics for external servers. In particular, each KES server returns its own metrics and no cluster-level view. - The builtin KMS now uses the same en/decryption implemented by MinKMS and KES. It still supports decryption of the previous ciphertext format. It's backwards compatible. - Data encryption keys now include a master key version since MinKMS supports multiple versions (~4 billion in total and 10000 concurrent) per key name. Signed-off-by: Andreas Auernhammer <github@aead.dev>	2024-05-07 16:55:37 -07:00
Harshavardhana	981497799a	return appropriate error upon reaching maxClients() (#19669 )	2024-05-07 13:41:56 -07:00
Minio Trusted	b9bdc17465	Update yaml files to latest version RELEASE.2024-05-07T06-41-25Z	2024-05-07 16:59:52 +00:00
Olli Janatuinen	b413ff9fdb	Support user certificate based authentication on SFTP (#19650 ) RELEASE.2024-05-07T06-41-25Z	2024-05-06 23:41:25 -07:00
Harshavardhana	6a15580817	fix: collect quorum errors for deletePrefix() (#19685 ) do not return error for single drive being offline.	2024-05-06 22:44:46 -07:00
Cesar N	39633a5581	Set Console Redirect URL env variable (#19683 )	2024-05-06 19:47:59 -07:00
Alex	1e83f15e2f	Update Console version to v1.4.0 (#19684 )	2024-05-06 19:47:37 -07:00
Harshavardhana	888d2bb1d8	support ETag value to be '' (#19682 ) This supports '' as per behavior to comply with AWS S3 behavior for - 'If-Match: ' - 'If-None-Match: '	2024-05-06 17:08:42 -07:00
Klaus Post	847ee5ac45	Make WalkDir return errors (#19677 ) If used, 'opts.Marker` will cause many missed entries since results are returned unsorted, and pools are serialized. Switch to fully concurrent listing and merging across pools to return sorted entries.	2024-05-06 13:27:52 -07:00
jiuker	9a9a49aa84	fix: Ignore AWSAccessKeyId check for SignV2 policy condition (#19673 )	2024-05-06 03:52:41 -07:00
Harshavardhana	a03ca80269	support 'mc support perf object' with root login disabled (#19672 ) It is expected that whoever is using the credentials which has the proper set of permissions must be able to run. `mc support perf object` While the root login is disabled.	2024-05-06 02:45:10 -07:00
Harshavardhana	523bd769f1	add support for specific error response for InvalidRange (#19668 ) fixes #19648 AWS S3 returns the actual object size as part of XML response for InvalidRange error, this is used apparently by SDKs to retry the request without the range.	2024-05-05 09:56:21 -07:00
Harshavardhana	8ff70ea5a9	turn-off coloring if we have std{err,out} dumb terminals (#19667 )	2024-05-03 17:17:57 -07:00
Harshavardhana	da3e7747ca	avoid using 10MiB EC buffers in maxAPI calculations (#19665 ) max requests per node is more conservative in its value causing premature serialization of the calls, avoid it for newer deployments.	2024-05-03 13:08:20 -07:00
Klaus Post	4afb59e63f	fix: walk missing entries with opts.Marker set (#19661 ) 'opts.Marker` is causing many missed entries if used since results are returned unsorted. Also since pools are serialized. Switch to do fully concurrent listing and merging across pools to return sorted entries. Returning errors on listings is impossible with the current API, so document that. Return an error at once if no drives are found instead of just returning an empty listing and no error.	2024-05-03 10:26:51 -07:00
Harshavardhana	1526e7ece3	extend server config.yaml to support per pool set drive count (#19663 ) This is to support deployments migrating from a multi-pooled wider stripe to lower stripe. MINIO_STORAGE_CLASS_STANDARD is still expected to be same for all pools. So you can satisfy adding custom drive count based pools by adjusting the storage class value. ``` version: v2 address: ':9000' rootUser: 'minioadmin' rootPassword: 'minioadmin' console-address: ':9001' pools: # Specify the nodes and drives with pools - args: - 'node{11...14}.example.net/data{1...4}' - args: - 'node{15...18}.example.net/data{1...4}' - args: - 'node{19...22}.example.net/data{1...4}' - args: - 'node{23...34}.example.net/data{1...10}' set-drive-count: 6 ```	2024-05-03 08:54:03 -07:00
Krishnan Parthasarathi	6c07bfee8a	With retention, skip actions expiring all versions (#19657 ) ILM actions due to ExpiredObjectDeleteAllVersions and DelMarkerExpiration are ignored when object locking is enabled on a bucket. Note: This applies to object versions which may not have retention configured on them. This applies to all object versions in this bucket, including those created before the retention config was applied.	2024-05-03 04:18:58 -07:00

... 3 4 5 6 7 ...

12105 Commits