minio

mirror of https://github.com/minio/minio.git synced 2024-12-25 22:55:54 -05:00

Author	SHA1	Message	Date
Shireesh Anjal	5808190398	Add more metrics to v3/cluster/erasure-set (#19714 ) Metrics being added: - read_tolerance: No of drive failures that can be tolerated without disrupting read operations - write_tolerance: No of drive failures that can be tolerated without disrupting write operations - read_health: Health of the erasure set in a pool for read operations (1=healthy, 0=unhealthy) - write_health: Health of the erasure set in a pool for write operations (1=healthy, 0=unhealthy)	2024-05-14 00:25:56 -07:00
Shireesh Anjal	b2a82248b1	Move /system/go to /debug/go (#19707 )	2024-05-14 00:25:37 -07:00
Klaus Post	c36eaedb93	Re-add "Fix incorrect merging of slash-suffixed objects (#19729 ) Adds regression test for #19699 Failures are a bit luck based, since it requires objects to be placed on different sets. However this generates a failure prior to #19699 * Revert "Revert "Fix incorrect merging of slash-suffixed objects (#19699)"" This reverts commit `f30417d9a8`. * Don't override when suffix doesn't match. Instead rely on quorum for each.	2024-05-13 09:30:24 -07:00
Poorna	7752b03add	optimize max-keys=2 listing for spark workloads (#19725 ) to return results appropriately for versioned buckets, especially when underlying prefixes have been deleted	2024-05-13 07:57:42 -07:00
Shireesh Anjal	074d70112d	Consolidate drive health related metrics into single metric (#19706 ) Instead of having "online" and "healing" as two metrics, replace with a single metric "health" which can have following values: 0 = offline 1 = healthy 2 = healing	2024-05-12 10:23:50 -07:00
Harshavardhana	e8d14c0d90	verify preconditions during CompleteMultipart (#19713 ) Bonus: hold the write lock properly to apply optimistic concurrency during NewMultipartUpload()	2024-05-10 17:31:22 -07:00
Shireesh Anjal	60d7e8143a	Move /cluster/audit to /audit (#19708 ) As the audit metrics are server level and not overall cluster level.	2024-05-10 07:50:39 -07:00
Klaus Post	9667a170de	Add usage cache cleanup and lower forced top compaction (#19719 ) Lower forced compaction to 250K entries. If there is more than 250K entries on the top level force compact it and log an error.	2024-05-10 07:49:50 -07:00
Harshavardhana	b598402738	fix: unexpected credentials missing while passing	2024-05-09 18:41:38 -07:00
Harshavardhana	72ff69d9bb	add log-prefix name for specifying custom log-name (#19712 )	2024-05-09 14:29:37 -07:00
Harshavardhana	f30417d9a8	Revert "Fix incorrect merging of slash-suffixed objects (#19699 )" This reverts commit `2f7a10ab31`.	2024-05-09 12:32:05 -07:00
jiuker	47a4ad3cd7	fix: truncate Expiration to second when Add ServiceAccount (#19674 ) Truncate Expiration at the second when Add ServiceAccount	2024-05-09 11:08:04 -07:00
Klaus Post	2f7a10ab31	Fix incorrect merging of slash-suffixed objects (#19699 ) If two objects share everything but one object has a slash prefix, those would be merged in listings, with secondary properties used for a tiebreak. Example: An object with the key `prefix/obj` would be merged with an object named `prefix/obj/`. While this violates the [no object can be a prefix of another](https://min.io/docs/minio/linux/operations/concepts/thresholds.html#conflicting-objects), let's resolve these. If we have an object with 'name' and a directory named 'name/' discard the directory only - but allow objects of 'name' and 'name/' (xldir) to be uniquely returned. Regression from #15772	2024-05-09 11:05:45 -07:00
Harshavardhana	b534dc69ab	deprecate unexpected healing failed counters (#19705 ) simplify this to avoid verbose metrics, and make room for valid metrics to be reported for alerting etc.	2024-05-09 11:04:41 -07:00
Harshavardhana	7b7d2ea7d4	pass around correct endpoint while registering remote storage (#19710 )	2024-05-09 11:03:54 -07:00
Aditya Manthramurthy	e00de1c302	ldap-import: Add additional logs (#19691 ) These logs are being added to provide better debugging of LDAP normalization on IAM import.	2024-05-09 10:52:53 -07:00
Harshavardhana	3549e583a6	results must be a single channel to avoid overwriting `healing.bin` (#19702 )	2024-05-09 10:15:03 -07:00
Andi	f5e3eedf34	chore: use errors.New to replace fmt.Errorf with no parameters (#19568 ) Signed-off-by: ChengenH <hce19970702@gmail.com>	2024-05-09 01:44:07 -07:00
Harshavardhana	9a267f9270	allow caller context during reloads() to cancel (#19687 ) canceled callers might linger around longer, can potentially overwhelm the system. Instead provide a caller context and canceled callers don't hold on to them. Bonus: we have no reason to cache errors, we should never cache errors otherwise we can potentially have quorum errors creeping in unexpectedly. We should let the cache when invalidating hit the actual resources instead.	2024-05-08 17:51:34 -07:00
Klaus Post	ec49fff583	Accept multipart checksums with part count (#19680 ) Accept multipart uploads where the combined checksum provides the expected part count. It seems this was added by AWS to make the API more consistent, even if the data is entirely superfluous on multiple levels. Improves AWS S3 compatibility.	2024-05-08 09:18:34 -07:00
Andreas Auernhammer	8b660e18f2	kms: add support for MinKMS and remove some unused/broken code (#19368 ) This commit adds support for MinKMS. Now, there are three KMS implementations in `internal/kms`: Builtin, MinIO KES and MinIO KMS. Adding another KMS integration required some cleanup. In particular: - Various KMS APIs that haven't been and are not used have been removed. A lot of the code was broken anyway. - Metrics are now monitored by the `kms.KMS` itself. For basic metrics this is simpler than collecting metrics for external servers. In particular, each KES server returns its own metrics and no cluster-level view. - The builtin KMS now uses the same en/decryption implemented by MinKMS and KES. It still supports decryption of the previous ciphertext format. It's backwards compatible. - Data encryption keys now include a master key version since MinKMS supports multiple versions (~4 billion in total and 10000 concurrent) per key name. Signed-off-by: Andreas Auernhammer <github@aead.dev>	2024-05-07 16:55:37 -07:00
Harshavardhana	981497799a	return appropriate error upon reaching maxClients() (#19669 )	2024-05-07 13:41:56 -07:00
Olli Janatuinen	b413ff9fdb	Support user certificate based authentication on SFTP (#19650 )	2024-05-06 23:41:25 -07:00
Harshavardhana	6a15580817	fix: collect quorum errors for deletePrefix() (#19685 ) do not return error for single drive being offline.	2024-05-06 22:44:46 -07:00
Cesar N	39633a5581	Set Console Redirect URL env variable (#19683 )	2024-05-06 19:47:59 -07:00
Harshavardhana	888d2bb1d8	support ETag value to be '' (#19682 ) This supports '' as per behavior to comply with AWS S3 behavior for - 'If-Match: ' - 'If-None-Match: '	2024-05-06 17:08:42 -07:00
Klaus Post	847ee5ac45	Make WalkDir return errors (#19677 ) If used, `opts.Marker` will cause many missed entries since results are returned unsorted, and pools are serialized. Switch to fully concurrent listing and merging across pools to return sorted entries.	2024-05-06 13:27:52 -07:00
jiuker	9a9a49aa84	fix: Ignore AWSAccessKeyId check for SignV2 policy condition (#19673 )	2024-05-06 03:52:41 -07:00
Harshavardhana	a03ca80269	support 'mc support perf object' with root login disabled (#19672 ) It is expected that whoever is using credentials with the proper set of permissions must be able to run `mc support perf object` while the root login is disabled.	2024-05-06 02:45:10 -07:00
Harshavardhana	523bd769f1	add support for specific error response for InvalidRange (#19668 ) fixes #19648 AWS S3 returns the actual object size as part of XML response for InvalidRange error, this is used apparently by SDKs to retry the request without the range.	2024-05-05 09:56:21 -07:00
Harshavardhana	8ff70ea5a9	turn-off coloring if we have std{err,out} dumb terminals (#19667 )	2024-05-03 17:17:57 -07:00
Harshavardhana	da3e7747ca	avoid using 10MiB EC buffers in maxAPI calculations (#19665 ) max requests per node is more conservative in its value causing premature serialization of the calls, avoid it for newer deployments.	2024-05-03 13:08:20 -07:00
Klaus Post	4afb59e63f	fix: walk missing entries with opts.Marker set (#19661 ) `opts.Marker` is causing many missed entries if used, since results are returned unsorted and pools are serialized. Switch to do fully concurrent listing and merging across pools to return sorted entries. Returning errors on listings is impossible with the current API, so document that. Return an error at once if no drives are found instead of just returning an empty listing and no error.	2024-05-03 10:26:51 -07:00
Harshavardhana	1526e7ece3	extend server config.yaml to support per pool set drive count (#19663 ) This is to support deployments migrating from a multi-pooled wider stripe to lower stripe. MINIO_STORAGE_CLASS_STANDARD is still expected to be same for all pools. So you can satisfy adding custom drive count based pools by adjusting the storage class value. ``` version: v2 address: ':9000' rootUser: 'minioadmin' rootPassword: 'minioadmin' console-address: ':9001' pools: # Specify the nodes and drives with pools - args: - 'node{11...14}.example.net/data{1...4}' - args: - 'node{15...18}.example.net/data{1...4}' - args: - 'node{19...22}.example.net/data{1...4}' - args: - 'node{23...34}.example.net/data{1...10}' set-drive-count: 6 ```	2024-05-03 08:54:03 -07:00
Krishnan Parthasarathi	6c07bfee8a	With retention, skip actions expiring all versions (#19657 ) ILM actions due to ExpiredObjectDeleteAllVersions and DelMarkerExpiration are ignored when object locking is enabled on a bucket. Note: This applies to object versions which may not have retention configured on them. This applies to all object versions in this bucket, including those created before the retention config was applied.	2024-05-03 04:18:58 -07:00
Poorna	446c760820	replication: Avoid proxying if requested object is a deletemarker (#19656 ) Fixes: #19654	2024-05-02 13:15:54 -07:00
Shireesh Anjal	04f92f1291	Change endpoint format for per-bucket metrics (#19655 ) Per-bucket metrics endpoints always start with /bucket and the bucket name is appended to the path. e.g. if the collector path is /bucket/api, the endpoint for the bucket "mybucket" would be /minio/metrics/v3/bucket/api/mybucket Change the existing bucket api endpoint accordingly from /api/bucket to /bucket/api	2024-05-02 10:37:57 -07:00
Bala FA	e5b16adb1c	Add cluster IAM metrics in metrics-v3 (#19595 ) Signed-off-by: Bala.FA <bala@minio.io>	2024-05-02 01:20:42 -07:00
Harshavardhana	402a3ac719	support compression after rotation of logs (#19647 )	2024-05-01 15:38:07 -07:00
Aditya Manthramurthy	f3d61c51fc	fix: Filter out cust. AssumeRole `Token` for audit (#19646 ) The `Token` parameter is a sensitive value that should not be output in the Audit log for STS AssumeRoleWithCustomToken API. Bonus: Add a simple tool that echoes audit logs to the console.	2024-05-01 14:31:13 -07:00
Klaus Post	0cde17ae5d	Return listing when exceeding min disk errors (#19644 ) When listing, with drives returning `errFileNotFound`, `errVolumeNotFound`, or `errUnformattedDisk`, we could get below `minDisks` drives being left. This would result in a quorum never being reachable for any object. Therefore, the listing would continue, but no results would ever be produced. Include `fnf` in the mindisk check since it is incremented on these errors. This will stop listing when minDisks are left. Allow `opts.minDisks` to not return errVolumeNotFound or errFileNotFound and return that. That will allow for good results even if disks return something else. We switch `errUnformattedDisk` to a regular error. If we have enough of those, we should just fail.	2024-05-01 10:59:08 -07:00
Harshavardhana	8c1bba681b	add logrotate support for MinIO logs (#19641 )	2024-05-01 10:57:52 -07:00
Klaus Post	dbfb5e797b	Wait one minute after startup to restart decommissioning (#19645 ) Typically not all drives are connected, so we delay 3 minutes before resuming. This greatly reduces risk of starting to list unconnected drives, or drives we risk being disconnected soon. This delay is not applied when starting with an admin call.	2024-05-01 08:18:21 -07:00
Harshavardhana	08ff702434	enhance ListSVCs() API to return more info to avoid InfoSvc() (#19642 ) ConsoleUI like applications rely on combination of ListServiceAccounts() and InfoServiceAccount() to populate UI elements, however individually these calls can be slow causing the entire UI to load sluggishly.	2024-05-01 05:41:13 -07:00
Klaus Post	0e2148264a	Fix --sftp "mac-algos=..." overwrites cipher algorithms (#19643 ) Setting MAC algorithms overwrites cipher algorithms. Followup to #19636	2024-05-01 04:07:40 -07:00
Krishnan Parthasarathi	7926401cbd	ilm: Handle DeleteAllVersions action differently for DEL markers (#19481 ) i.e., this rule element doesn't apply to DEL markers. This is a breaking change to how ExpiredObjectDeleteAllVersions functions today. This is necessary to avoid the following highly probable footgun scenario in the future. Scenario: The user uses tags-based filtering to select an object's time to live(TTL). The application sometimes deletes objects, too, making its latest version a DEL marker. The previous implementation skipped tag-based filters if the newest version was DEL marker, voiding the tag-based TTL. The user is surprised to find objects that have expired sooner than expected. * Add DelMarkerExpiration action This ILM action removes all versions of an object if its latest version is a DEL marker. ```xml <DelMarkerObjectExpiration> <Days> 10 </Days> </DelMarkerObjectExpiration> ``` 1. Applies only to objects whose, • The latest version is a DEL marker. • satisfies the number of days criteria 2. Deletes all versions of this object 3. Associated rule can't have tag-based filtering Includes, - New bucket event type for deletion due to DelMarkerExpiration	2024-04-30 18:11:10 -07:00
Harshavardhana	8161411c5d	fix: a crash in RemoveReplication target (#19640 ) calling a remote target remove with a perfectly well constructed ARN can lead to a crash for a bucket with no replication configured. This PR fixes, and adds a crash check for ImportMetadata as well.	2024-04-30 18:09:56 -07:00
Klaus Post	f64dea2aac	Allow custom SFTP algorithm selection (#19636 ) Algorithms are comma separated. Note that valid values do not in all cases represent default values. `--sftp=pub-key-algos=...` specifies the supported client public key authentication algorithms. Note that this doesn't include certificate types since those use the underlying algorithm. This list is sent to the client if it supports the server-sig-algs extension. Order is irrelevant. Valid values ``` ssh-ed25519 sk-ssh-ed25519@openssh.com sk-ecdsa-sha2-nistp256@openssh.com ecdsa-sha2-nistp256 ecdsa-sha2-nistp384 ecdsa-sha2-nistp521 rsa-sha2-256 rsa-sha2-512 ssh-rsa ssh-dss ``` `--sftp=kex-algos=...` specifies the supported key-exchange algorithms in preference order. Valid values: ``` curve25519-sha256 curve25519-sha256@libssh.org ecdh-sha2-nistp256 ecdh-sha2-nistp384 ecdh-sha2-nistp521 diffie-hellman-group14-sha256 diffie-hellman-group16-sha512 diffie-hellman-group14-sha1 diffie-hellman-group1-sha1 ``` `--sftp=cipher-algos=...` specifies the allowed cipher algorithms. If unspecified then a sensible default is used. Valid values: ``` aes128-ctr aes192-ctr aes256-ctr aes128-gcm@openssh.com aes256-gcm@openssh.com chacha20-poly1305@openssh.com arcfour256 arcfour128 arcfour aes128-cbc 3des-cbc ``` `--sftp=mac-algos=...` specifies a default set of MAC algorithms in preference order. This is based on RFC 4253, section 6.4, but with hmac-md5 variants removed because they have reached the end of their useful life. Valid values: ``` hmac-sha2-256-etm@openssh.com hmac-sha2-512-etm@openssh.com hmac-sha2-256 hmac-sha2-512 hmac-sha1 hmac-sha1-96 ```	2024-04-30 08:15:45 -07:00
Shubhendu	6579304d8c	Suppress metrics with zero values (#19638 ) This would reduce the size of data in response of metrics listing. While graphing we can default these metrics with a zero value if not found. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-04-30 08:05:22 -07:00
Klaus Post	3cf8a7c888	Always unfreeze when connection dies (#19634 ) Unfreeze as soon as the incoming connection is terminated and don't wait for everything to complete. We don't want to keep the services frozen if something becomes stuck.	2024-04-29 10:39:04 -07:00
Harshavardhana	a372c6a377	a bunch of fixes for error handling (#19627 ) - handle errFileCorrupt properly - micro-optimization of sending done() response quicker to close the goroutine. - fix logger.Event() usage in a couple of places - handle the rest of the client to return a different error other than lastErr() when the client is closed.	2024-04-28 10:53:50 -07:00
Poorna	9e95703efc	iam reload policy mapping of STS users properly (#19626 )	2024-04-27 03:04:10 -07:00
Anis Eleuch	d8e05aca81	heal/list: Fix rare incomplete listing with flaky internode connections (#19625 ) listPathRaw() counts errDiskNotFound as a valid error to indicate a listing stream end. However, storage.WalkDir() is allowed to return errDiskNotFound anytime since grid.ErrDisconnected is converted to errDiskNotFound. This affects fresh disk healing and should affect S3 listing as well.	2024-04-26 12:52:52 -07:00
Praveen raj Mani	410a1ac040	Handle failures in pool rebalancing (#19623 )	2024-04-26 12:29:28 -07:00
Shireesh Anjal	4caa3422bd	Add process metrics in `metrics-v3` (#19612 ) endpoint: /minio/metrics/v3/system/process metrics: - locks_read_total - locks_write_total - cpu_total_seconds - go_routine_total - io_rchar_bytes - io_read_bytes - io_wchar_bytes - io_write_bytes - start_time_seconds - uptime_seconds - file_descriptor_limit_total - file_descriptor_open_total - syscall_read_total - syscall_write_total - resident_memory_bytes - virtual_memory_bytes - virtual_memory_max_bytes Since the standard process collector implements only a subset of these metrics, remove it and implement our own custom process collector that captures all the process metrics we need.	2024-04-26 09:07:23 -07:00
Anis Eleuch	135874ebdc	heal: Avoid marking a bucket as done when remote drives are offline (#19587 )	2024-04-25 23:32:14 -07:00
Poorna	e7aa26dc29	fix: allow DeleteObject unversioned objects with insufficient read quorum (#19581 ) Since the object is being permanently deleted, the lack of read quorum should not matter as long as sufficient disks are online to complete the deletion with parity requirements. If several pools have the same object with insufficient read quorum, attempt to delete object from all the pools where it exists	2024-04-25 17:31:12 -07:00
Harshavardhana	c54ffde568	add metrics ioerror counter for alerts on I/O errors (#19618 )	2024-04-25 15:01:31 -07:00
Anis Eleuch	9a3c992d7a	heal: Fix regression in healing a new fresh drive (#19615 )	2024-04-25 14:55:41 -07:00
Aditya Manthramurthy	0c855638de	fix: LDAP init. issue when LDAP server is down (#19619 ) At server startup, LDAP configuration is validated against the LDAP server. If the LDAP server is down at that point, we need to cleanly disable LDAP configuration. Previously, LDAP would remain configured but error out in strange ways because initialization did not complete without errors.	2024-04-25 14:28:16 -07:00
Ramon de Klein	4c0acba62d	Fixes an internal error while force-deleting a bucket (#19614 )	2024-04-25 09:27:27 -07:00
Aditya Manthramurthy	62c3cdee75	fix: IAM LDAP access key import bug (#19608 ) When importing access keys (i.e. service accounts) for LDAP accounts, we are requiring groups to exist under one of the configured group base DNs. This is not correct. This change fixes this by only checking for existence and storing the normalized form of the group DN - we do not return an error if the group is not under a base DN. Test is updated to illustrate an import failure that would happen without this change.	2024-04-25 08:50:16 -07:00
Aditya Manthramurthy	3212d0c8cd	fix: IAM import for LDAP should replace mappings (#19607 ) Existing IAM import logic for LDAP creates new mappings when the normalized form of the mapping key differs from the existing mapping key in storage. This change effectively replaces the existing mapping key by first deleting it and then recreating with the normalized form of the mapping key. For e.g. if an older deployment had a policy mapped to a user DN - `UID=alice1,OU=people,OU=hwengg,DC=min,DC=io` instead of adding a mapping for the normalized form - `uid=alice1,ou=people,ou=hwengg,dc=min,dc=io` we should replace the existing mapping. This ensures that duplicate mappings won't remain after the import. Some additional cleanup cases are also covered. If there are multiple mappings for the name normalized key such as: `UID=alice1,OU=people,OU=hwengg,DC=min,DC=io` `uid=alice1,ou=people,ou=hwengg,DC=min,DC=io` `uid=alice1,ou=people,ou=hwengg,dc=min,dc=io` we check if the list of policies mapped to all these keys are exactly the same, and if so remove all of them and create a single mapping with the normalized key. However, if the policies mapped to such keys differ, the import operation returns an error as the server cannot automatically pick the "right" list of policies to map.	2024-04-25 08:49:53 -07:00
Harshavardhana	1d03bea965	support preserving renameData() on inlined content during overwrites (#19609 ) extending #19548 to inlined-data as well.	2024-04-24 18:14:08 -07:00
jiuker	df93ff92ba	fix: site-replication will reset group status when add user (#19594 )	2024-04-24 08:54:24 -07:00
Shireesh Anjal	77d5331e85	Fix few wrongly defined metric types (#19586 ) `minio_cluster_webhook_queue_length` was wrongly defined as `counter` whereas it should be `gauge` Following were wrongly defined as `gauge` when they should actually be `counter`: - minio_bucket_replication_sent_bytes - minio_bucket_replication_received_bytes - minio_bucket_replication_total_failed_bytes - minio_bucket_replication_total_failed_count	2024-04-23 23:19:40 -07:00
Bala FA	14cdadfb56	Add cluster notification metrics in metrics-v3 (#19533 ) Signed-off-by: Bala.FA <bala@minio.io>	2024-04-23 21:10:35 -07:00
Harshavardhana	f3a52cc195	simplify listener implementation setup customizations in right place (#19589 )	2024-04-23 21:08:47 -07:00
Aditya Manthramurthy	7640cd24c9	fix: avoid some IAM import errors if LDAP enabled (#19591 ) When LDAP is enabled, previously we were: - rejecting creation of users and groups via the IAM import functionality - throwing a `not a valid DN` error when non-LDAP group mappings are present This change allows for these cases as we need to support situations where the MinIO server contains users, groups and policy mappings created before LDAP was enabled.	2024-04-23 18:23:08 -07:00
Shireesh Anjal	f7b665347e	Add system CPU metrics to metrics-v3 (#19560 ) endpoint: /minio/metrics/v3/system/cpu metrics: - minio_system_cpu_avg_idle - minio_system_cpu_avg_iowait - minio_system_cpu_load - minio_system_cpu_load_perc - minio_system_cpu_nice - minio_system_cpu_steal - minio_system_cpu_system - minio_system_cpu_user	2024-04-23 16:56:12 -07:00
Harshavardhana	9693c382a8	make renameData() more defensive during overwrites (#19548 ) instead upon any error in renameData(), we still preserve the existing dataDir in some form for recoverability in strange situations such as out of disk space type errors. Bonus: avoid running list and heal() instead allow versions disparity to return the actual versions, uuid to heal. Currently limit this to 100 versions and lesser disparate objects. an undo now reverts back the xl.meta from xl.meta.bkp during overwrites on such flaky setups. Bonus: Save N depth syscalls via skipping the parents upon overwrites and versioned updates. Flaky setup examples are stretch clusters with regular packet drops etc, we need to add some defensive code around to avoid dangling objects.	2024-04-23 10:15:52 -07:00
jiuker	ee1047bd52	fix: can't get total disksize for `decom status` (#19585 )	2024-04-23 04:33:28 -07:00
Seiya	5ea5ab162b	Remove leading zero strings in return value of (*xlMetaV2)getDataDirs() (#19567 ) remove leading zero strings in return value of getDataDirs()	2024-04-22 22:07:37 -07:00
Klaus Post	b5a09ff96b	Fix RenameData data race (#19579 ) RenameData could start operating on inline data after timing out and the call returned due to WithDeadline. This could cause a buffer to write to the inline data being written. Since no writes are in `RenameData` and the call is canceled, this doesn't present a corruption issue. But a race is a race and should be fixed. Copy inline data to a fresh buffer.	2024-04-22 22:07:19 -07:00
Harshavardhana	95c65f4e8f	do not panic on rebalance during server restarts (#19563 ) This PR makes a feasible approach to handle all the scenarios that we must face to avoid returning "panic." Instead, we must return "errServerNotInitialized" when a bucketMetadataSys.Get() is called, allowing the caller to retry their operation and wait. Bonus fix the way data-usage-cache stores the object. Instead of storing usage-cache.bin with the bucket as `.minio.sys/buckets`, the `buckets` must be relative to the bucket `.minio.sys` as part of the object name. Otherwise, there is no way to decommission entries at `.minio.sys/buckets` and their final erasure set positions. A bucket must never have a `/` in it. Adds code to read() from existing data-usage.bin upon upgrade.	2024-04-22 10:49:30 -07:00
Harshavardhana	6bfff7532e	re-use transport and set stronger backwards compatible Ciphers (#19565 ) This PR fixes a few things - FIPS support was missing for remote transports, so MinIO could end up using non-FIPS Ciphers in FIPS mode - Avoids too many transports, they all do the same thing; to make connection pooling work properly re-use them. - globalTCPOptions must be set before setting transport to make sure the client conn deadlines are honored properly. - GCS warm tier must re-use our transport - Re-enable trailing headers support.	2024-04-21 04:43:18 -07:00
Harshavardhana	1aa8896ad6	Revert "cleanup: Simplify usage of MinIOSourceProxyRequest (#19553 )" This reverts commit `928c0181bf`. This change was not correct, reverting. We track 3 states with the ProxyRequest header - if replication process wants to know if object is already replicated with a HEAD, it shouldn't proxy back - Poorna	2024-04-20 02:05:54 -07:00
Krishnan Parthasarathi	3e32ceb39f	Disable trailing header support for MinIO tiers (#19561 ) AWS S3 trailing header support was recently enabled on the warm tier client connection to MinIO type remote tiers. With this enabled, we are seeing the following error message at http transport layer. > Unsolicited response received on idle HTTP channel starting with "HTTP/1.1 400 Bad Request\r\nContent-Type: text/plain; charset=utf-8\r\nConnection: close\r\n\r\n400 Bad Request"; err=<nil> This is an interim fix until we identify the root cause for this behaviour in the minio-go client package.	2024-04-19 19:32:25 -07:00
jiuker	9205434ed3	fix: ignore signaturev2 for policy header check (#19551 )	2024-04-19 09:45:54 -07:00
Harshavardhana	cd50e9b4bc	make LRU cache global for internode tokens (#19555 )	2024-04-19 09:45:14 -07:00
Klaus Post	ec816f3840	Reduce parallelReader allocs (#19558 )	2024-04-19 09:44:59 -07:00
Klaus Post	5f774951b1	Store object EC in metadata header (#19534 ) Keep the EC in header, so it can be retrieved easily for dynamic quorum calculations. To not force a full metadata decode on every read the value will be 0/0 for data written in previous versions. Size is expected to increase by 2 bytes per version, since all valid values can be represented with 1 byte each. Example: ``` λ xl-meta xl.meta { "Versions": [ { "Header": { "EcM": 4, "EcN": 8, "Flags": 6, "ModTime": "2024-04-17T11:46:25.325613+02:00", "Signature": "0a409875", "Type": 1, "VersionID": "8e03504e11234957b2727bc53eda0d55" }, ... ``` Not used for operations yet.	2024-04-19 09:43:43 -07:00
Harshavardhana	72f5cb577e	optimize ftp/sftp upload() implementations to avoid CPU load (#19552 )	2024-04-19 05:23:42 -07:00
Robert Lützner	928c0181bf	cleanup: Simplify usage of MinIOSourceProxyRequest (#19553 ) This replaces a convoluted condition that ultimately evaluated to "is this HTTP header present in the request or not?"	2024-04-19 05:23:31 -07:00
Harshavardhana	03767d26da	fix: get rid of large buffers (#19549 ) these lead to run-away usage of memory beyond what Go's GC can handle, we have to re-visit this differently, remove this for now.	2024-04-19 04:26:59 -07:00
Sveinn	108e6f92d4	updating tests to use new mc --enc flags (#19508 )	2024-04-19 01:43:09 -07:00
Harshavardhana	d653a59fc0	fix: flaky getHostIP test	2024-04-18 19:09:56 -07:00
Aditya Manthramurthy	98f7821eb3	fix: ldap: avoid unnecessary import errors (#19547 ) Follow up for #19528 If there are multiple existing DN mappings for the same normalized DN, if they all have the same policy mapping value, we pick one of them instead of returning an import error.	2024-04-18 12:09:19 -07:00
Aditya Manthramurthy	ae46ce9937	ldap: Normalize DNs when importing (#19528 ) This is a change to IAM export/import functionality. For LDAP enabled setups, it performs additional validations: - for policy mappings on LDAP users and groups, it ensures that the corresponding user or group DN exists and if so uses a normalized form of these DNs for storage - for access keys (service accounts), it updates (i.e. validates existence and normalizes) the internally stored parent user DN and group DNs. This allows for a migration path for setups in which LDAP mappings have been stored in previous versions of the server, where the name of the mapping file stored on drives is not in a normalized form. An administrator needs to execute: `mc admin iam export ALIAS` followed by `mc admin iam import ALIAS /path/to/export/file` The validations are more strict and return errors when multiple mappings are found for the same user/group DN. This is to ensure the mappings stored by the server are unambiguous and to reduce the potential for confusion. Bonus bug fix: IAM export of access keys (service accounts) did not export key name, description and expiration. This is fixed in this change too.	2024-04-18 08:15:02 -07:00
Anis Eleuch	dfc112c06b	list: Fix rare listing continuation freeze (#19524 ) Reading the list metacache is not protected by a lock; the code retries when it fails to read the metacache object, however, it forgot to re-read the metacache object from the drives, which is necessary, especially if the metacache object is inlined. This commit will ensure that we always re-read the metacache object from the drives when it is retrying.	2024-04-17 21:42:11 -07:00
Shireesh Anjal	ca5fab8656	Add cluster audit metrics in metrics-v3 (#19514 ) endpoint: /minio/metrics/v3/cluster/audit metrics: - failed_messages (counter) - total_messages (counter) - target_queue_length (gauge)	2024-04-17 02:18:02 -07:00
Shireesh Anjal	6df76ca73c	Add system memory metrics in v3 (#19486 ) Following memory metrics will be added under /system/memory - available - buffers - cache - free - shared - total - used - used_perc	2024-04-16 22:10:25 -07:00
Harshavardhana	f65dd3e5a2	reload from drive tier-config when in-memory cache is not found (#19527 ) avoid probing tier target while reloading() tier config	2024-04-16 22:09:58 -07:00
Harshavardhana	a8d601b64a	allow detaching any non-normalized DN (#19525 )	2024-04-16 17:36:43 -07:00
Klaus Post	e2709ea129	ftp: Return current time for prefixes/directories (#19519 )	2024-04-16 17:35:55 -07:00
Allan Roger Reid	740ec80819	At server init, use the correct context when creating the KMS Master Key (#19526 )	2024-04-16 17:34:45 -07:00
Allan Roger Reid	7c1f9667d1	Use GetDuration() helper for MINIO_KMS_KEY_CACHE_INTERVAL as time.Duration (#19512 ) Bonus: Use default duration of 10 seconds if invalid input < time.Second is specified	2024-04-16 08:43:39 -07:00
Klaus Post	9246990496	fix: ListObjectVersions returning duplicates when resuming with null version id (#19518 ) When resuming a versioned listing where `version-id-marker=null`, the `null` object would always be returned, causing duplicate entries to be returned. Add check against empty version	2024-04-16 08:41:27 -07:00
Harshavardhana	cb06aee5ac	convert multipart-cleanup from a blocking unlink() to a rename to trash (#19495 ) unlinking() at two different locations on a disk when there are lots to purge, this can lead to huge IOwaits, instead rely on rename() to .trash to avoid running multiple unlinks() in parallel.	2024-04-15 03:02:39 -07:00
Shubhendu	1c70e9ed1b	ILM expiry replication status only if enabled (#19503 ) Report ILM expiry replication status only if at least one site has the feature enabled. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-04-15 02:40:39 -07:00
jiuker	f3d6a2dd37	code clean for dynamicSleeper (#19499 )	2024-04-15 02:40:19 -07:00
Harshavardhana	d1c58fc2eb	remove older deploymentID fix behavior to speed up startup (#19497 ) Since mid-2018 there have been no deployments without a deployment-id, so it is time to put this code to rest. This PR removes the old code as it is no longer valuable; on setups with 1000's of drives these are all quite expensive operations.	2024-04-15 01:25:46 -07:00
Allan Roger Reid	b8f05b1471	Keep an up-to-date copy of the KMS master key (#19492 )	2024-04-15 00:42:50 -07:00
Klaus Post	e7baf78ee8	fix: list operations resuming when hitting different node (#19494 ) The rest of the peer clients were not consistent across nodes. So, meta cache requests would not go to the same server if a continuation happens on a different node.	2024-04-12 11:13:36 -07:00
Harshavardhana	7e3166475d	simplify common functions in replication (#19480 )	2024-04-11 17:27:32 -07:00
Klaus Post	5206c0e883	Inspect: Add error if no results (#19476 ) When no results match or another error occurs, add an error to the stream. Keep the "inspect-input.txt" as the only thing in the zip for reference. Example: ``` λ mc support inspect --airgap myminio/testbucket/fjghfjh/** mc: Using public key from C:\Users\klaus\mc\support_public.pem File data successfully downloaded as inspect-data.enc λ inspect inspect-data.enc Using private key from support_private.pem output written to inspect-data.zip 2024/04/11 14:10:51 next stream: GetRawData: No files matched the given pattern λ unzip -l inspect-data.zip Archive: inspect-data.zip Length Date Time Name --------- ---------- ----- ---- 222 2024-04-11 14:10 inspect-input.txt --------- ------- 222 1 file λ ``` Modifies inspect to read until end of stream to report the error. Bonus: Add legacy commandline params	2024-04-11 14:22:47 -07:00
Harshavardhana	41ec038523	remove permission denied error for being drive error (#19478 )	2024-04-11 14:22:15 -07:00
Shireesh Anjal	08d3d06a06	Add drive metrics in metrics-v3 (#19452 ) Add following metrics: - used_inodes - total_inodes - healing - online - reads_per_sec - reads_kb_per_sec - reads_await - writes_per_sec - writes_kb_per_sec - writes_await - perc_util To be able to calculate the `per_sec` values, we capture the IOStats-related data in the beginning (along with the time at which they were captured), and compare them against the current values subsequently. This is because dividing by "time since server uptime." doesn't work in k8s environments.	2024-04-11 10:46:34 -07:00
Harshavardhana	074febd9e1	remove SetDiskLoc() rely on the endpoint values instead (#19475 ) the disk location never changes in the lifetime of a MinIO cluster, even if it did validate this close to the disk instead at the higher layer. Return appropriate errors indicating an invalid drive, so that the drive is not recognized as part of a valid drive.	2024-04-11 10:45:28 -07:00
Poorna	ffa91f9794	fix CopyObject with replace overwriting inline status (#19468 ) Fixes #19450 - internal inline-data header can get overwritten during copy with replace before this fix.	2024-04-10 23:42:51 -07:00
Harshavardhana	0c31e61343	allow protection from invalid config values (#19460 ) we have had numerous reports on some config values not having default values, causing features misbehaving and not having default values set properly. This PR tries to address all these concerns once and for all. Each new sub-system that gets added - must check for invalid keys - must have default values set - must not "return err" when being saved into a global state() instead collate as part of other subsystem errors allow other sub-systems to independently initialize.	2024-04-10 18:10:30 -07:00
Harshavardhana	9b926f7dbe	avoid busy loops in bad path component (#19466 ) use it in places where we are looking for such bad path components.	2024-04-10 18:08:52 -07:00
Harshavardhana	35d8728990	handle missing LDAP normalization in SetPolicy() API (#19465 )	2024-04-10 15:37:42 -07:00
Allan Roger Reid	f7ed9a75ba	Allow specifying the local server with env variable _MINIO_SERVER_LOCAL (#19453 ) * Allow specifying the local server, with env variable _MINIO_SERVER_LOCAL, in systems where the hostname cannot be resolved to local IP * Limit scope of the _MINIO_SERVER_LOCAL solution to only containerized implementations	2024-04-10 09:34:59 -07:00
jiuker	ed64e91f06	fix: noHost for collectLocalMetric (#19457 )	2024-04-10 09:28:08 -07:00
jiuker	a481825ae1	fix: unknown contentType for ArchiveFileHandler (#19451 )	2024-04-09 03:41:25 -07:00
Harshavardhana	7bb0f32332	make if-none-match PUT/POST RFC compliant (#19448 ) fixes #19442	2024-04-09 01:17:49 -07:00
Anis Eleuch	c6f8dc431e	Add a warning when the total size of an object versions exceeds 1 TiB (#19435 )	2024-04-08 10:45:03 -07:00
Anis Eleuch	787c44c39d	batch-repl: Do not allow both source/target to be remote (#19434 ) Return an error when the user specifies endpoints for both source and target. This can generate many types of errors as the code considers a deployment remote if its endpoint is specified.	2024-04-08 07:11:38 -07:00
Anis Eleuch	f06fee0364	heal: Add more per disk healing result in the audit (#19427 ) HealObject() does not return an error in some cases, for example, when an object is successfully reconstructed on one disk but fails on other disks; another case is when a disk that does not have the object is temporarily disconnected. Add the after-heal drives result to the audit output for better analysis.	2024-04-08 02:26:14 -07:00
Harshavardhana	c957e0d426	fix: increase the tiering part size to 128MiB (#19424 ) also introduce 8MiB buffer to read from for bigger parts	2024-04-08 02:22:27 -07:00
Harshavardhana	04101d472f	fix: add fallbackDisks for disk healing (#19425 )	2024-04-08 02:22:13 -07:00
Minio Trusted	51fc145161	Update yaml files to latest version RELEASE.2024-04-06T05-26-02Z	2024-04-06 06:44:30 +00:00
Taran Pelkey	9d63bb1b41	Added new API errors for LDAP (#19415 ) * change internal errors to named errors * Change names	2024-04-05 22:26:02 -07:00
Aditya Manthramurthy	8ff2a7a2b9	fix: IAM import/export: remove sts group handling (#19422 ) There are no separate STS group mappings to be handled. Also add tests for basic import/export sanity.	2024-04-05 20:13:35 -07:00
Harshavardhana	91f91d8f47	fix: a regression in IAM policy reload routine() (#19421 ) all policy reloading is broken since last release since `48deccdc40` fixes #19417	2024-04-05 14:26:41 -07:00
Harshavardhana	a207bd6790	turn-off Nlink readdir() optimization for NFS/CIFS (#19420 ) fixes #19418 fixes #19416	2024-04-05 08:17:08 -07:00
Harshavardhana	96d226c0b1	remove frivolous log about abort-multipart failure in replication (#19413 )	2024-04-05 04:39:55 -07:00
Krishnan Parthasarathi	a86d98826d	Set object's original modTime when being restored (#19414 ) Set object's modTime when being restored. "Restored" here refers to making a temporary local copy in the hot tier for a tiered object using the RestoreObject API.	2024-04-05 04:39:31 -07:00
Harshavardhana	1bb670ecba	use new generics based LRU from hashicorp (#19409 ) we have been using an LRU cache for internode auth tokens; migrate to using a typed implementation and also do not cache auth tokens when there is an error.	2024-04-04 11:58:48 -07:00
Aditya Manthramurthy	c9e9a8e2b9	fix: ldap: use validated base DNs (#19406 ) This fixes a regression from #19358 which prevents policy mappings created in the latest release from being displayed in policy entity listing APIs. This is due to the possibility that the base DNs in the LDAP config are not in a normalized form and #19358 introduced normalization of mapping keys (user DNs and group DNs). When listing, we check if the policy mappings are on entities that parse as valid DNs that are descendants of the base DNs in the config. Test added that demonstrates a failure without this fix.	2024-04-04 11:36:18 -07:00
jiuker	272367ccd2	feat: add memlimit flags for setMaxResources (#19400 )	2024-04-04 05:06:57 -07:00
Anis Eleuch	95bf4a57b6	logging: Add subsystem to log API (#19002 ) Create new code paths for multiple subsystems in the code. This will make maintaining this easier later. Also introduce bugLogIf() for errors that should not happen in the first place.	2024-04-04 05:04:40 -07:00
Andreas Auernhammer	faeb2b7e79	use `GenerateKey` as more reliable KMS health-check (#19404 ) This commit replaces the `KMS.Stat` API call with a `KMS.GenerateKey` call. This approach is more reliable since data key generation also works when the KMS backend is unavailable (temp. offline), but KES has cached the key. Ref: KES offline caching. With this change, it is less likely that MinIO readiness checks fail in cases where the KMS backend is offline. Signed-off-by: Andreas Auernhammer <github@aead.dev>	2024-04-03 14:13:20 -07:00
Anis Eleuch	97ce11cb6b	Avoid using a nil transport when the config is not initialized (#19405 ) Make sure to pass a nil pointer as a Transport to minio-go when the API config is not initialized, this will make sure that we do not pass an interface with a known type but a nil value. This will also fix the update of the API remote_transport_deadline configuration without requiring the cluster restart.	2024-04-03 11:27:05 -07:00
Harshavardhana	4f660a8eb7	fix: missing metrics for healed objects (#19392 ) all healed successful objects via queueHealTask in a non-blocking heal weren't being reported correctly, this PR fixes this comprehensively.	2024-04-01 23:48:36 -07:00
Praveen raj Mani	ae4fb1b72e	Prioritize the bucket configs first during the decommissioning (#19393 )	2024-04-01 23:48:26 -07:00
Klaus Post	b435806d91	Reduce big message RPC allocations (#19390 ) Use `ODirectPoolSmall` buffers for inline data in PutObject. Add a separate call for inline data that will fetch a buffer for the inline data before unmarshal.	2024-04-01 16:42:09 -07:00
Klaus Post	3d6194e93c	Remove empty replication stats (#19385 ) When sending final stats upstream also trim empty ReplicationStats.	2024-03-29 11:57:52 -07:00
Harshavardhana	feb9d8480b	add auditing for healing objects (#19379 )	2024-03-28 16:46:19 -07:00
Aditya Manthramurthy	48deccdc40	fix: sts accounts map refresh and fewer list calls (#19376 ) This fixes a bug where STS Accounts map accumulates accounts in memory and never removes expired accounts and the STS Policy mappings were not being refreshed. The STS purge routine now runs with every IAM credentials load instead of every 4th time. The listing of IAM files is now cached on every IAM load operation to prevent re-listing for STS accounts purging/reload. Additionally this change makes each server pick a time for IAM loading that is randomly distributed from a 10 minute interval - this is to prevent server from thundering while performing the IAM load. On average, IAM loading will happen between every 5-15min after the previous IAM load operation completes.	2024-03-28 16:43:50 -07:00
Kaan Kabalak	3f72439b8a	Suppress error log for force-deleting object in locked bucket (#19378 )	2024-03-28 14:37:42 -07:00
Shubhendu	468a9fae83	Enable replication of SSE-C objects (#19107 ) If site replication enabled across sites, replicate the SSE-C objects as well. These objects could be read from target sites using the same client encryption keys. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-03-28 10:44:56 -07:00
Klaus Post	aa0eec16ab	Remove empty replication stats when sending update (#19375 ) When sending update and there is no replication stats - remove the struct. Will remove an unneeded alloc on the receiver.	2024-03-28 10:13:07 -07:00
jiuker	8222a640ac	fix: slice append lose the data for NSScanner (#19373 )	2024-03-28 08:13:36 -07:00
Aditya Manthramurthy	7e45d84ace	ldap: improve normalization of DN values (#19358 ) Instead of relying on user input values, we use the DN value returned by the LDAP server. This handles cases like when a mapping is set on a DN value `uid=svc.algorithm,OU=swengg,DC=min,DC=io` with a user input value (with unicode variation) of `uid=svc﹒algorithm,OU=swengg,DC=min,DC=io`. The LDAP server on lookup of this DN returns the normalized value where the unicode dot character `SMALL FULL STOP` (in the user input), gets replaced with regular full stop.	2024-03-27 23:45:26 -07:00
Harshavardhana	139a606f0a	use bigger partSize per part for tiering to MinIO (#19361 ) Bonus: remove persistent md5sum calculation, turn-off sha256 as well. Instead we always enable crc32c which is enough for payload verification also support for trailing headers checksum.	2024-03-27 23:45:08 -07:00
Harshavardhana	289223b6de	expire ILM all versions verify quorum on action (#19359 )	2024-03-27 23:44:52 -07:00
Harshavardhana	c61dd16a1e	fix: avoid fan-out DeletePrefix calls for batch-expire and ILM (#19365 )	2024-03-27 20:18:15 -07:00
Harshavardhana	3e38fa54a5	set max versions to be IntMax to avoid premature failures (#19360 ) let users/customers set relevant values make default value to be non-applicable.	2024-03-27 18:08:07 -07:00
jiuker	4a02189ba0	feat: add env to choose which node to start decom (#19310 ) add a temporary env _MINIO_DECOM_ENDPOINT to choose the node to start decom from, in situations when first node first pool is not available.	2024-03-27 16:18:40 -07:00
jiuker	ec3a3bb10d	fix: Remove unnecessary loops for searchParent (#19353 )	2024-03-27 08:12:14 -07:00
Harshavardhana	364d3a0ac9	fix: new staticcheck and linter issues reported (#19340 )	2024-03-27 08:10:40 -07:00
Poorna	8bce123bba	fix: precondition check for multipart with existing object replication (#19349 )	2024-03-26 15:10:45 -07:00
Harshavardhana	0a56dbde2f	allow configuring inline shard size value (#19336 )	2024-03-26 15:06:19 -07:00
Klaus Post	7ff4164d65	Fix races in IAM cache lazy loading (#19346 ) Fix races in IAM cache Fixes #19344 On the top level we only grab a read lock, but we write to the cache if we manage to fetch it. `a03dac41eb/cmd/iam-store.go (L446)` is also flipped to what it should be AFAICT. Change the internal cache structure to a concurrency safe implementation. Bonus: Also switch grid implementation.	2024-03-26 11:12:57 -07:00
Harshavardhana	dc45a5010d	bring back minor DNS cache for k8s setups (#19341 ) k8s as it stands is flaky in DNS lookups, bring this change back such that we can cache DNS for at least a 30-second TTL.	2024-03-26 08:00:38 -07:00
jiuker	4b9192034c	fix: should return when error happened (#19342 )	2024-03-26 07:51:56 -07:00
Harshavardhana	deeadd1a37	fix: convert multiple callers to use toStorageErr(err) correctly (#19339 ) we must attempt to convert all errors at storage-rest-client into StorageErr() regardless of what functionality is being called in, this PR fixes this for multiple callers including some internally used functions.	2024-03-25 23:24:59 -07:00
Sveinn	1fc4203c19	Webhook targets refactor and bug fixes (#19275 ) - old version was unable to retain messages during config reload - old version could not go from memory to disk during reload - new version can batch disk queue entries into a single one to reduce I/O load - error logging has been improved, previous version would miss certain errors. - logic for spawning/despawning additional workers has been adjusted to trigger when half capacity is reached, instead of when the log queue becomes full. - old version would JSON marshal x2 and unmarshal x1 for every log item. Now we only marshal x1 and then we GetRaw from the store and send it without having to re-marshal.	2024-03-25 09:44:20 -07:00
Poorna	7fd76dbbb7	fix batch snowball to close channel after listing finishes (#19316 ) panic seen due to premature closing of slow channel while listing is still sending or list has already closed on the sender's side: ``` panic: close of closed channel goroutine 13666 [running]: github.com/minio/minio/internal/ioutil.SafeClose[...](0x101ff51e4?) /Users/kp/code/src/github.com/minio/minio/internal/ioutil/ioutil.go:425 +0x24 github.com/minio/minio/cmd.(erasureServerPools).Walk.func1() /Users/kp/code/src/github.com/minio/minio/cmd/erasure-server-pool.go:2142 +0x170 created by github.com/minio/minio/cmd.(erasureServerPools).Walk in goroutine 1189 /Users/kp/code/src/github.com/minio/minio/cmd/erasure-server-pool.go:1985 +0x228 ```	2024-03-21 16:13:43 -07:00
Krishnan Parthasarathi	da81c6cc27	Encode dir obj names before expiration (#19305 ) Object names of directory objects qualified for ExpiredObjectAllVersions must be encoded appropriately before calling on deletePrefix on their erasure set. e.g., a directory object and regular objects with overlapping prefixes could lead to the expiration of regular objects, which is not the intention of ILM. ``` bucket/dir/ ---> directory object bucket/dir/obj-1 ``` When `bucket/dir/` qualifies for expiration, the current implementation would remove regular objects under the prefix `bucket/dir/`, in this case, `bucket/dir/obj-1`.	2024-03-21 10:21:35 -07:00
Harshavardhana	a03dac41eb	use retry during policy reload from drives (#19307 )	2024-03-21 10:19:50 -07:00
Shireesh Anjal	55778ae278	fix: peer addr returned as empty string (#19308 ) In handlers related to health diagnostics e.g. CPU, Network, Partitions, etc, globalMinioHost was being passed as the addr, resulting in empty value for the same in the health report. Using globalLocalNodeName instead fixes the issue.	2024-03-21 10:19:14 -07:00
Poorna	d990661d1f	replication: enforce precondition for multipart (#19306 )	2024-03-20 18:12:37 -07:00
Harshavardhana	280526caf7	add IAM policyDB lookup fallbacks to drives (#19302 ) IAM loading is a lazy operation, allow these fallbacks to be in place when we cannot find in-memory state(). this allows us to honor the request even if we pay a small price for lookup and populating the data.	2024-03-20 09:24:04 -07:00
Harshavardhana	1173b26fc8	avoid triggering heals on metacache files if any (#19299 )	2024-03-19 20:21:15 -07:00
Krishnan Parthasarathi	383489d5d9	Handle zero versions qualified for expiration (#19301 ) Objects can have more versions than their ILM policy expects to retain via NewerNoncurrentVersions and yet not qualify for expiry because NoncurrentDays is configured in that rule. In this case, the applyNewerNoncurrentVersionsLimit method was enqueuing empty tasks, which led to a panic (panic: runtime error: index out of range [0] with length 0) in the newerNoncurrentTask.OpHash method, which assumes the task contains at least one version to expire.	2024-03-19 20:10:58 -07:00
Anis Eleuch	9370b11684	decom: Fix failed status after a failed decommission (#19300 ) When returning the status of a decommissioned pool, a pool with zero time StartedTime will be considered an active pool, which is unexpected. This commit will always ensure that a pool's canceled/failed/completed status is returned.	2024-03-19 20:09:59 -07:00
Anis Eleuch	235edd88aa	xl: Purge instead of moving to trash with near filled disks (#19294 ) Immediately remove objects from the trash when the disk is 95% full	2024-03-19 13:26:24 -07:00
Anis Eleuch	b5e074e54c	list: Fix IsTruncated and NextMarker when encountering expired objects (#19290 )	2024-03-19 13:23:12 -07:00
Harshavardhana	7213bd7131	add additional logs for the decom during metadata save (#19288 )	2024-03-18 15:25:45 -07:00
Harshavardhana	741de4cf94	fix: add a default requests deadline when deadline is 0 (#19287 )	2024-03-18 12:30:41 -07:00
Harshavardhana	f168ef9989	implement a flag to specify custom crossdomain.xml (#19262 ) fixes #16909	2024-03-17 23:42:40 -07:00
alingse	a0de56abb6	fix: wrong time.Parse params order for replication timestamp (#19279 )	2024-03-17 21:19:43 -07:00
Harshavardhana	c201d8bda9	write anything beyond 4k to be written in 4k pages (#19269 ) we were prematurely not writing 4k pages while we could have due to the fact that most buffers would be multiples of 4k up to some number and there shall be some remainder. We only need to write the remainder without O_DIRECT.	2024-03-15 12:27:59 -07:00
Harshavardhana	93fb7d62d8	allow dynamically changing max_object_versions per object (#19265 )	2024-03-14 18:07:19 -07:00
Harshavardhana	ce1c640ce0	feat: allow retaining parity SLA to be configurable (#19260 ) at scale customers might start with failed drives, causing skew in the overall usage ratio per EC set. make this configurable such that customers can turn this off as needed depending on how comfortable they are.	2024-03-14 03:38:33 -07:00
Anis Eleuch	24b4f9d748	Fix quorum calculation with zero parity objects (#19250 ) Currently, the code relies on object parity to decide whether it is a delete marker or a regular object. In the case of a delete marker, the return quorum is half of the disks in the erasure set. However, this calculation must be corrected with objects with EC = 0, mainly because EC is not a one-time fixed configuration. Though all data are correct, the manifested symptom is a 503 with an EC=0 object. This bug was manifested after we introduced the fast Get Object feature that does not read all data from all disks in case of inlined objects	2024-03-12 12:59:11 -07:00
Harshavardhana	81d7531f1f	only look for valid buckets (#19244 ) fixes #19239	2024-03-12 04:33:30 -07:00
Poorna	b4a23f720e	update build constants (#19243 )	2024-03-11 17:54:37 -07:00
Dennis Marttinen	6c964fede5	Improve handling of compression inclusion for objects (#19234 )	2024-03-11 04:55:34 -07:00
huajin tong	a25a8312d8	fix: some flyby typos in the code (#19212 ) Signed-off-by: thirdkeyword <fliterdashen@gmail.com>	2024-03-10 14:09:36 -07:00
Aditya Manthramurthy	b2c5b75efa	feat: Add Metrics V3 API (#19068 ) Metrics v3 is mainly a reorganization of metrics into smaller groups of metrics and the removal of internal aggregation of metrics received from peer nodes in a MinIO cluster. This change adds the endpoint `/minio/metrics/v3` as the top-level metrics endpoint and under this, various sub-endpoints are implemented. These are currently documented in `docs/metrics/v3.md` The handler will serve metrics at any path `/minio/metrics/v3/PATH`, as follows: when PATH is a sub-endpoint listed above => serves the group of metrics under that path; or when PATH is a (non-empty) parent directory of the sub-endpoints listed above => serves metrics from each child sub-endpoint of PATH. otherwise, returns a no resource found error All available metrics are listed in the `docs/metrics/v3.md`. More will be added subsequently.	2024-03-10 01:15:15 -08:00
Harshavardhana	88a89213ff	make immediate purge non-blocking up to 100,000 entries per drive (#19231 ) Bonus: turn-off O_DIRECT verification when FSType is 'XFS'	2024-03-09 18:53:48 -08:00
Poorna	8e2238ea09	some more cleanup for startup message (#19229 )	2024-03-08 22:42:32 -08:00
Poorna	31e8f7c525	Small reformatting of startup message (#19228 ) Also changing User-Agent format	2024-03-08 19:07:08 -08:00
Klaus Post	51f62a8da3	Port ListBuckets to websockets layer & some cleanup (#19199 )	2024-03-08 11:08:18 -08:00
Klaus Post	650efc2e96	Fix listing in objects split across pools (#19227 ) Merging same-object - multiple versions from different pools would not always result in correct ordering. When merging keep inputs separate. ``` λ mc ls --versions local/testbucket ------ before ------ [2024-03-05 20:17:19 CET] 228B STANDARD 1f163718-9bc5-4b01-bff7-5d8cf09caf10 v3 PUT hosts [2024-03-05 20:19:56 CET] 19KiB STANDARD null v2 PUT hosts [2024-03-05 20:17:15 CET] 228B STANDARD 73c9f651-f023-4566-b012-cc537fdb7ce2 v1 PUT hosts ------ after ------ λ mc ls --versions local/testbucket [2024-03-05 20:19:56 CET] 19KiB STANDARD null v3 PUT hosts [2024-03-05 20:17:19 CET] 228B STANDARD 1f163718-9bc5-4b01-bff7-5d8cf09caf10 v2 PUT hosts [2024-03-05 20:17:15 CET] 228B STANDARD 73c9f651-f023-4566-b012-cc537fdb7ce2 v1 PUT hosts ```	2024-03-08 09:50:48 -08:00
Harshavardhana	2cc4997d24	fix: crash on 32bit systems during pre-allocation (#19225 )	2024-03-08 05:55:28 -08:00
Poorna	934f6cabf6	sr: use site replicator creds to verify temp user claims (#19224 ) This PR continues #19209 which did not handle claims verification of temporary users created by root in site replication scenario. Fixes: #19217	2024-03-07 14:30:00 -08:00
Anis Eleuch	68dd74c5ab	batch: Separate batch job request and batch job stats (#19205 ) Currently, the progress of the batch job is saved inside the job request object, which is normally not supported by MinIO. Though there is no apparent bug, it is better to fix this now. Batch progress is saved in .minio.sys/batch-jobs/reports/ Co-authored-by: Anis Eleuch <anis@min.io>	2024-03-07 10:58:22 -08:00
Harshavardhana	48b590e14b	fix: same server to be part of multiple pools (#19216 ) our PoolNumber calculation was costly, while we already had this information per endpoint, we needed to deduce it appropriately. This PR addresses this by assigning PoolNumbers field that carries all the pool numbers that belong to a server. properties.PoolNumber still carries a valid value only when len(properties.PoolNumbers) == 1, otherwise properties.PoolNumber is set to math.MaxInt (indicating that this value is undefined) and then one must rely on properties.PoolNumbers for server participation in multiple pools. addresses the issue originating from #11327	2024-03-07 10:24:07 -08:00
Poorna	837a2a3d4b	sr: use service account cred for claims check (#19209 ) PR #19111 overlaid service account secret with site replicator secret during token claims check. Fixes: #19206	2024-03-06 16:19:24 -08:00
Harshavardhana	74ccee6619	avoid too much auditing during decom/rebalance make it more robust (#19174 ) there can be a sudden spike in tiny allocations, due to too much auditing being done, also don't hang on the ``` h.logCh <- entry ``` after initializing workers if you do not have a way to dequeue for some reason.	2024-03-06 03:43:16 -08:00
Poorna	89f759566c	bucket import: avoid overwriting bucket creation date (#19207 )	2024-03-05 16:05:28 -08:00
Harshavardhana	cd7551031b	fix: a regression in loading replication creds (#19204 ) fixes #19200 generating STS credentials fail with site-replicated setup, with this error on a fresh environment.	2024-03-05 11:06:17 -08:00
Praveen raj Mani	df57bfcd6c	fix: cluster read health check to return proper values (#19203 ) Fixes #19202	2024-03-05 10:25:49 -08:00
Justin Griffin	dfb1f39b57	Support custom endpoint for Azure remote storage tier (#19188 ) This commit adds support for using the `--endpoint` arg when creating a tier of type `azure`. This is needed to connect to Azure's Gov Cloud instance. For example, ``` mc ilm tier add azure TARGET TIER_NAME \ --account-name ACCOUNT \ --account-key KEY \ --bucket CONTAINER \ --endpoint https://ACCOUNT.blob.core.usgovcloudapi.net --prefix PREFIX \ --storage-class STORAGE_CLASS ``` Prior to this, the endpoint was hardcoded to `https://ACCOUNT.blob.core.windows.net`. The docs were even explicit about this, stating that `--endpoint` is: "Required for `s3` or `minio` tier types. This option has no effect for any other value of `TIER_TYPE`." Now, if the endpoint arg is present it will be used. If not, it will fall back to the same default behavior of `ACCOUNT.blob.core.windows.net`.	2024-03-05 08:44:08 -08:00
Harshavardhana	1b5f28e99b	fix: skip local disks properly in cluster health maintenance check (#19184 )	2024-03-04 20:48:44 -08:00
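
Note on 08d3d06a06 (Add drive metrics in metrics-v3, #19452): the commit message says the `per_sec` values are derived by capturing the IOStats data at the beginning, together with the capture time, and comparing it against current values later. A minimal sketch of that idea follows. The `IOSnapshot` type, its field names, and the `per_sec` helper are hypothetical and chosen only for illustration; they are not taken from the MinIO source.

```python
from dataclasses import dataclass


@dataclass
class IOSnapshot:
    """Hypothetical snapshot of cumulative drive counters (illustrative names)."""
    taken_at: float  # capture time in seconds
    reads: int       # cumulative completed read operations
    read_kb: int     # cumulative kilobytes read


def per_sec(baseline: IOSnapshot, current: IOSnapshot) -> dict:
    """Derive per-second rates from two snapshots.

    The rate is the counter delta divided by the time elapsed between the
    two captures, not by the time since the host booted.
    """
    elapsed = current.taken_at - baseline.taken_at
    if elapsed <= 0:
        # No measurable interval yet, so report zero rather than divide by zero.
        return {"reads_per_sec": 0.0, "reads_kb_per_sec": 0.0}
    return {
        "reads_per_sec": (current.reads - baseline.reads) / elapsed,
        "reads_kb_per_sec": (current.read_kb - baseline.read_kb) / elapsed,
    }


# Usage: a baseline captured at start, compared with a later reading.
baseline = IOSnapshot(taken_at=100.0, reads=1000, read_kb=4000)
current = IOSnapshot(taken_at=110.0, reads=1500, read_kb=8000)
rates = per_sec(baseline, current)
```

Dividing by the interval between the two captures keeps the result independent of host uptime, which the commit message names as the reason the simpler approach does not work in k8s environments.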


6226 Commits