minio

mirror of https://github.com/minio/minio.git synced 2025-11-27 04:46:53 -05:00

Author	SHA1	Message	Date
Harshavardhana	1526e7ece3	extend server config.yaml to support per pool set drive count (#19663 ) This is to support deployments migrating from a multi-pooled wider stripe to lower stripe. MINIO_STORAGE_CLASS_STANDARD is still expected to be same for all pools. So you can satisfy adding custom drive count based pools by adjusting the storage class value. ``` version: v2 address: ':9000' rootUser: 'minioadmin' rootPassword: 'minioadmin' console-address: ':9001' pools: # Specify the nodes and drives with pools - args: - 'node{11...14}.example.net/data{1...4}' - args: - 'node{15...18}.example.net/data{1...4}' - args: - 'node{19...22}.example.net/data{1...4}' - args: - 'node{23...34}.example.net/data{1...10}' set-drive-count: 6 ```	2024-05-03 08:54:03 -07:00
Krishnan Parthasarathi	6c07bfee8a	With retention, skip actions expiring all versions (#19657 ) ILM actions due to ExpiredObjectDeleteAllVersions and DelMarkerExpiration are ignored when object locking is enabled on a bucket. Note: This applies to object versions which may not have retention configured on them. This applies to all object versions in this bucket, including those created before the retention config was applied.	2024-05-03 04:18:58 -07:00
Poorna	446c760820	replication: Avoid proxying if requested object is a deletemarker (#19656 ) Fixes: #19654	2024-05-02 13:15:54 -07:00
Shireesh Anjal	04f92f1291	Change endpoint format for per-bucket metrics (#19655 ) Per-bucket metrics endpoints always start with /bucket and the bucket name is appended to the path. e.g. if the collector path is /bucket/api, the endpoint for the bucket "mybucket" would be /minio/metrics/v3/bucket/api/mybucket Change the existing bucket api endpoint accordingly from /api/bucket to /bucket/api	2024-05-02 10:37:57 -07:00
Bala FA	e5b16adb1c	Add cluster IAM metrics in metrics-v3 (#19595 ) Signed-off-by: Bala.FA <bala@minio.io>	2024-05-02 01:20:42 -07:00
Harshavardhana	402a3ac719	support compression after rotation of logs (#19647 )	2024-05-01 15:38:07 -07:00
Aditya Manthramurthy	f3d61c51fc	fix: Filter out cust. AssumeRole `Token` for audit (#19646 ) The `Token` parameter is a sensitive value that should not be output in the Audit log for STS AssumeRoleWithCustomToken API. Bonus: Add a simple tool that echoes audit logs to the console.	2024-05-01 14:31:13 -07:00
Klaus Post	0cde17ae5d	Return listing when exceeding min disk errors (#19644 ) When listing, with drives returning `errFileNotFound,` `errVolumeNotFound`, or `errUnformattedDisk,`, we could get below `minDisks` drives being left. This would result in a quorum never being reachable for any object. Therefore, the listing would continue, but no results would ever be produced. Include `fnf` in the mindisk check since it is incremented on these errors. This will stop listing when minDisks are left. Allow `opts.minDisks` to not return errVolumeNotFound or errFileNotFound and return that. That will allow for good results even if disks return something else. We switch `errUnformattedDisk` to a regular error. If we have enough of those, we should just fail.	2024-05-01 10:59:08 -07:00
Harshavardhana	8c1bba681b	add logrotate support for MinIO logs (#19641 )	2024-05-01 10:57:52 -07:00
Klaus Post	dbfb5e797b	Wait one minute after startup to restart decommissioning (#19645 ) Typically not all drives are connected, so we delay 3 minutes before resuming. This greatly reduces risk of starting to list unconnected drives, or drives we risk being disconnected soon. This delay is not applied when starting with an admin call.	2024-05-01 08:18:21 -07:00
Harshavardhana	08ff702434	enhance ListSVCs() API to return more info to avoid InfoSvc() (#19642 ) ConsoleUI like applications rely on combination of ListServiceAccounts() and InfoServiceAccount() to populate UI elements, however individually these calls can be slow causing the entire UI to load sluggishly.	2024-05-01 05:41:13 -07:00
Klaus Post	0e2148264a	Fix --stfp "mac-algos=..." overwrites cipher algorithms (#19643 ) Setting MAC algorithms overwrites cipher algorithms. Followup to #19636	2024-05-01 04:07:40 -07:00
Krishnan Parthasarathi	7926401cbd	ilm: Handle DeleteAllVersions action differently for DEL markers (#19481 ) i.e., this rule element doesn't apply to DEL markers. This is a breaking change to how ExpiredObejctDeleteAllVersions functions today. This is necessary to avoid the following highly probable footgun scenario in the future. Scenario: The user uses tags-based filtering to select an object's time to live(TTL). The application sometimes deletes objects, too, making its latest version a DEL marker. The previous implementation skipped tag-based filters if the newest version was DEL marker, voiding the tag-based TTL. The user is surprised to find objects that have expired sooner than expected. * Add DelMarkerExpiration action This ILM action removes all versions of an object if its the latest version is a DEL marker. ```xml <DelMarkerObjectExpiration> <Days> 10 </Days> </DelMarkerObjectExpiration> ``` 1. Applies only to objects whose, • The latest version is a DEL marker. • satisfies the number of days criteria 2. Deletes all versions of this object 3. Associated rule can't have tag-based filtering Includes, - New bucket event type for deletion due to DelMarkerExpiration	2024-04-30 18:11:10 -07:00
Harshavardhana	8161411c5d	fix: a crash in RemoveReplication target (#19640 ) calling a remote target remove with a perfectly well constructed ARN can lead to a crash for a bucket with no replication configured. This PR fixes, and adds a crash check for ImportMetadata as well.	2024-04-30 18:09:56 -07:00
Klaus Post	f64dea2aac	Allow custom SFTP algorithm selection (#19636 ) Algorithms are comma separated. Note that valid values does not in all cases represent default values. `--sftp=pub-key-algos=...` specifies the supported client public key authentication algorithms. Note that this doesn't include certificate types since those use the underlying algorithm. This list is sent to the client if it supports the server-sig-algs extension. Order is irrelevant. Valid values ``` ssh-ed25519 sk-ssh-ed25519@openssh.com sk-ecdsa-sha2-nistp256@openssh.com ecdsa-sha2-nistp256 ecdsa-sha2-nistp384 ecdsa-sha2-nistp521 rsa-sha2-256 rsa-sha2-512 ssh-rsa ssh-dss ``` `--sftp=kex-algos=...` specifies the supported key-exchange algorithms in preference order. Valid values: ``` curve25519-sha256 curve25519-sha256@libssh.org ecdh-sha2-nistp256 ecdh-sha2-nistp384 ecdh-sha2-nistp521 diffie-hellman-group14-sha256 diffie-hellman-group16-sha512 diffie-hellman-group14-sha1 diffie-hellman-group1-sha1 ``` `--sftp=cipher-algos=...` specifies the allowed cipher algorithms. If unspecified then a sensible default is used. Valid values: ``` aes128-ctr aes192-ctr aes256-ctr aes128-gcm@openssh.com aes256-gcm@openssh.com chacha20-poly1305@openssh.com arcfour256 arcfour128 arcfour aes128-cbc 3des-cbc ``` `--sftp=mac-algos=...` specifies a default set of MAC algorithms in preference order. This is based on RFC 4253, section 6.4, but with hmac-md5 variants removed because they have reached the end of their useful life. Valid values: ``` hmac-sha2-256-etm@openssh.com hmac-sha2-512-etm@openssh.com hmac-sha2-256 hmac-sha2-512 hmac-sha1 hmac-sha1-96 ```	2024-04-30 08:15:45 -07:00
Shubhendu	6579304d8c	Suppress metrics with zero values (#19638 ) This would reduce the size of data in response of metrics listing. While graphing we can default these metrics with a zero value if not found. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-04-30 08:05:22 -07:00
Klaus Post	3cf8a7c888	Always unfreeze when connection dies (#19634 ) Unfreeze as soon as the incoming connection is terminated and don't wait for everything to complete. We don't want to keep the services frozen if something becomes stuck.	2024-04-29 10:39:04 -07:00
Harshavardhana	a372c6a377	a bunch of fixes for error handling (#19627 ) - handle errFileCorrupt properly - micro-optimization of sending done() response quicker to close the goroutine. - fix logger.Event() usage in a couple of places - handle the rest of the client to return a different error other than lastErr() when the client is closed.	2024-04-28 10:53:50 -07:00
Poorna	9e95703efc	iam reload policy mapping of STS users properly (#19626 )	2024-04-27 03:04:10 -07:00
Anis Eleuch	d8e05aca81	heal/list: Fix rare incomplete listing with flaky internode connections (#19625 ) listPathRaw() counts errDiskNotFound as a valid error to indicate a listing stream end. However, storage.WalkDir() is allowed to return errDiskNotFound anytime since grid.ErrDisconnected is converted to errDiskNotFound. This affects fresh disk healing and should affect S3 listing as well.	2024-04-26 12:52:52 -07:00
Praveen raj Mani	410a1ac040	Handle failures in pool rebalancing (#19623 )	2024-04-26 12:29:28 -07:00
Shireesh Anjal	4caa3422bd	Add process metrics in `metrics-v3` (#19612 ) endpoint: /minio/metrics/v3/system/process metrics: - locks_read_total - locks_write_total - cpu_total_seconds - go_routine_total - io_rchar_bytes - io_read_bytes - io_wchar_bytes - io_write_bytes - start_time_seconds - uptime_seconds - file_descriptor_limit_total - file_descriptor_open_total - syscall_read_total - syscall_write_total - resident_memory_bytes - virtual_memory_bytes - virtual_memory_max_bytes Since the standard process collector implements only a subset of these metrics, remove it and implement our own custom process collector that captures all the process metrics we need.	2024-04-26 09:07:23 -07:00
Anis Eleuch	135874ebdc	heal: Avoid marking a bucket as done when remote drives are offline (#19587 )	2024-04-25 23:32:14 -07:00
Poorna	e7aa26dc29	fix: allow DeleteObject unversioned objects with insufficient read quorum (#19581 ) Since the object is being permanently deleted, the lack of read quorum should not matter as long as sufficient disks are online to complete the deletion with parity requirements. If several pools have the same object with insufficient read quorum, attempt to delete object from all the pools where it exists	2024-04-25 17:31:12 -07:00
Harshavardhana	c54ffde568	add metrics ioerror counter for alerts on I/O errors (#19618 )	2024-04-25 15:01:31 -07:00
Anis Eleuch	9a3c992d7a	heal: Fix regression in healing a new fresh drive (#19615 )	2024-04-25 14:55:41 -07:00
Aditya Manthramurthy	0c855638de	fix: LDAP init. issue when LDAP server is down (#19619 ) At server startup, LDAP configuration is validated against the LDAP server. If the LDAP server is down at that point, we need to cleanly disable LDAP configuration. Previously, LDAP would remain configured but error out in strange ways because initialization did not complete without errors.	2024-04-25 14:28:16 -07:00
Ramon de Klein	4c0acba62d	Fixes an internal error while force-deleting a bucket (#19614 )	2024-04-25 09:27:27 -07:00
Aditya Manthramurthy	62c3cdee75	fix: IAM LDAP access key import bug (#19608 ) When importing access keys (i.e. service accounts) for LDAP accounts, we are requiring groups to exist under one of the configured group base DNs. This is not correct. This change fixes this by only checking for existence and storing the normalized form of the group DN - we do not return an error if the group is not under a base DN. Test is updated to illustrate an import failure that would happen without this change.	2024-04-25 08:50:16 -07:00
Aditya Manthramurthy	3212d0c8cd	fix: IAM import for LDAP should replace mappings (#19607 ) Existing IAM import logic for LDAP creates new mappings when the normalized form of the mapping key differs from the existing mapping key in storage. This change effectively replaces the existing mapping key by first deleting it and then recreating with the normalized form of the mapping key. For e.g. if an older deployment had a policy mapped to a user DN - `UID=alice1,OU=people,OU=hwengg,DC=min,DC=io` instead of adding a mapping for the normalized form - `uid=alice1,ou=people,ou=hwengg,dc=min,dc=io` we should replace the existing mapping. This ensures that duplicates mappings won't remain after the import. Some additional cleanup cases are also covered. If there are multiple mappings for the name normalized key such as: `UID=alice1,OU=people,OU=hwengg,DC=min,DC=io` `uid=alice1,ou=people,ou=hwengg,DC=min,DC=io` `uid=alice1,ou=people,ou=hwengg,dc=min,dc=io` we check if the list of policies mapped to all these keys are exactly the same, and if so remove all of them and create a single mapping with the normalized key. However, if the policies mapped to such keys differ, the import operation returns an error as the server cannot automatically pick the "right" list of policies to map.	2024-04-25 08:49:53 -07:00
Harshavardhana	1d03bea965	support preserving renameData() on inlined content during overwrites (#19609 ) extending #19548 to inlined-data as well.	2024-04-24 18:14:08 -07:00
jiuker	df93ff92ba	fix: site-replication will reset group status when add user (#19594 )	2024-04-24 08:54:24 -07:00
Shireesh Anjal	77d5331e85	Fix few wrongly defined metric types (#19586 ) `minio_cluster_webhook_queue_length` was wrongly defined as `counter` where-as it should be `gauge` Following were wrongly defined as `gauge` when they should actually be `counter`: - minio_bucket_replication_sent_bytes - minio_bucket_replication_received_bytes - minio_bucket_replication_total_failed_bytes - minio_bucket_replication_total_failed_count	2024-04-23 23:19:40 -07:00
Bala FA	14cdadfb56	Add cluster notification metrics in metrics-v3 (#19533 ) Signed-off-by: Bala.FA <bala@minio.io>	2024-04-23 21:10:35 -07:00
Harshavardhana	f3a52cc195	simplify listener implementation setup customizations in right place (#19589 )	2024-04-23 21:08:47 -07:00
Aditya Manthramurthy	7640cd24c9	fix: avoid some IAM import errors if LDAP enabled (#19591 ) When LDAP is enabled, previously we were: - rejecting creation of users and groups via the IAM import functionality - throwing a `not a valid DN` error when non-LDAP group mappings are present This change allows for these cases as we need to support situations where the MinIO server contains users, groups and policy mappings created before LDAP was enabled.	2024-04-23 18:23:08 -07:00
Shireesh Anjal	f7b665347e	Add system CPU metrics to metrics-v3 (#19560 ) endpoint: /minio/metrics/v3/system/cpu metrics: - minio_system_cpu_avg_idle - minio_system_cpu_avg_iowait - minio_system_cpu_load - minio_system_cpu_load_perc - minio_system_cpu_nice - minio_system_cpu_steal - minio_system_cpu_system - minio_system_cpu_user	2024-04-23 16:56:12 -07:00
Harshavardhana	9693c382a8	make renameData() more defensive during overwrites (#19548 ) instead upon any error in renameData(), we still preserve the existing dataDir in some form for recoverability in strange situations such as out of disk space type errors. Bonus: avoid running list and heal() instead allow versions disparity to return the actual versions, uuid to heal. Currently limit this to 100 versions and lesser disparate objects. an undo now reverts back the xl.meta from xl.meta.bkp during overwrites on such flaky setups. Bonus: Save N depth syscalls via skipping the parents upon overwrites and versioned updates. Flaky setup examples are stretch clusters with regular packet drops etc, we need to add some defensive code around to avoid dangling objects.	2024-04-23 10:15:52 -07:00
jiuker	ee1047bd52	fix: can't get total disksize for `decom status` (#19585 )	2024-04-23 04:33:28 -07:00
Seiya	5ea5ab162b	Remove leading zero strings in return value of (*xlMetaV2)getDataDirs() (#19567 ) remove leading zero strings in return value of getDataDirs()	2024-04-22 22:07:37 -07:00
Klaus Post	b5a09ff96b	Fix RenameData data race (#19579 ) RenameData could start operating on inline data after timing out and the call returned due to WithDeadline. This could cause a buffer to write to the inline data being written. Since no writes are in `RenameData` and the call is canceled, this doesn't present a corruption issue. But a race is a race and should be fixed. Copy inline data to a fresh buffer.	2024-04-22 22:07:19 -07:00
Harshavardhana	95c65f4e8f	do not panic on rebalance during server restarts (#19563 ) This PR makes a feasible approach to handle all the scenarios that we must face to avoid returning "panic." Instead, we must return "errServerNotInitialized" when a bucketMetadataSys.Get() is called, allowing the caller to retry their operation and wait. Bonus fix the way data-usage-cache stores the object. Instead of storing usage-cache.bin with the bucket as `.minio.sys/buckets`, the `buckets` must be relative to the bucket `.minio.sys` as part of the object name. Otherwise, there is no way to decommission entries at `.minio.sys/buckets` and their final erasure set positions. A bucket must never have a `/` in it. Adds code to read() from existing data-usage.bin upon upgrade.	2024-04-22 10:49:30 -07:00
Harshavardhana	6bfff7532e	re-use transport and set stronger backwards compatible Ciphers (#19565 ) This PR fixes a few things - FIPS support for missing for remote transports, causing MinIO could end up using non-FIPS Ciphers in FIPS mode - Avoids too many transports, they all do the same thing to make connection pooling work properly re-use them. - globalTCPOptions must be set before setting transport to make sure the client conn deadlines are honored properly. - GCS warm tier must re-use our transport - Re-enable trailing headers support.	2024-04-21 04:43:18 -07:00
Harshavardhana	1aa8896ad6	Revert "cleanup: Simplify usage of MinIOSourceProxyRequest (#19553 )" This reverts commit `928c0181bf`. This change was not correct, reverting. We track 3 states with the ProxyRequest header - if replication process wants to know if object is already replicated with a HEAD, it shouldn't proxy back - Poorna	2024-04-20 02:05:54 -07:00
Krishnan Parthasarathi	3e32ceb39f	Disable trailing header support for MinIO tiers (#19561 ) AWS S3 trailing header support was recently enabled on the warm tier client connection to MinIO type remote tiers. With this enabled, we are seeing the following error message at http transport layer. > Unsolicited response received on idle HTTP channel starting with "HTTP/1.1 400 Bad Request\r\nContent-Type: text/plain; charset=utf-8\r\nConnection: close\r\n\r\n400 Bad Request"; err=<nil> This is an interim fix until we identify the root cause for this behaviour in the minio-go client package.	2024-04-19 19:32:25 -07:00
jiuker	9205434ed3	fix: ignore signaturev2 for policy header check (#19551 )	2024-04-19 09:45:54 -07:00
Harshavardhana	cd50e9b4bc	make LRU cache global for internode tokens (#19555 )	2024-04-19 09:45:14 -07:00
Klaus Post	ec816f3840	Reduce parallelReader allocs (#19558 )	2024-04-19 09:44:59 -07:00
Klaus Post	5f774951b1	Store object EC in metadata header (#19534 ) Keep the EC in header, so it can be retrieved easily for dynamic quorum calculations. To not force a full metadata decode on every read the value will be 0/0 for data written in previous versions. Size is expected to increase by 2 bytes per version, since all valid values can be represented with 1 byte each. Example: ``` λ xl-meta xl.meta { "Versions": [ { "Header": { "EcM": 4, "EcN": 8, "Flags": 6, "ModTime": "2024-04-17T11:46:25.325613+02:00", "Signature": "0a409875", "Type": 1, "VersionID": "8e03504e11234957b2727bc53eda0d55" }, ... ``` Not used for operations yet.	2024-04-19 09:43:43 -07:00
Harshavardhana	72f5cb577e	optimize ftp/sftp upload() implementations to avoid CPU load (#19552 )	2024-04-19 05:23:42 -07:00

... 2 3 4 5 6 ...

6193 Commits