minio

mirror of https://github.com/minio/minio.git synced 2025-04-30 06:31:32 -04:00

Author	SHA1	Message	Date
Harshavardhana	ce1c640ce0	feat: allow retaining parity SLA to be configurable (#19260 ) at scale customers might start with failed drives, causing skew in the overall usage ratio per EC set. make this configurable such that customers can turn this off as needed depending on how comfortable they are.	2024-03-14 03:38:33 -07:00
Anis Eleuch	24b4f9d748	Fix quorum calculation with zero parity objects (#19250 ) Currently, the code relies on object parity to decide whether it is a delete marker or a regular object. In the case of a delete marker, the return quorum is half of the disks in the erasure set. However, this calculation must be corrected with objects with EC = 0, mainly because EC is not a one-time fixed configuration. Though all data are correct, the manifested symptom is a 503 with an EC=0 object. This bug was manifested after we introduced the fast Get Object feature that does not read all data from all disks in case of inlined objects	2024-03-12 12:59:11 -07:00
Harshavardhana	81d7531f1f	only look for valid buckets (#19244 ) fixes #19239	2024-03-12 04:33:30 -07:00
Poorna	b4a23f720e	update build constants (#19243 )	2024-03-11 17:54:37 -07:00
Dennis Marttinen	6c964fede5	Improve handling of compression inclusion for objects (#19234 )	2024-03-11 04:55:34 -07:00
huajin tong	a25a8312d8	fix: some flyby typos in the code (#19212 ) Signed-off-by: thirdkeyword <fliterdashen@gmail.com>	2024-03-10 14:09:36 -07:00
Aditya Manthramurthy	b2c5b75efa	feat: Add Metrics V3 API (#19068 ) Metrics v3 is mainly a reorganization of metrics into smaller groups of metrics and the removal of internal aggregation of metrics received from peer nodes in a MinIO cluster. This change adds the endpoint `/minio/metrics/v3` as the top-level metrics endpoint and under this, various sub-endpoints are implemented. These are currently documented in `docs/metrics/v3.md` The handler will serve metrics at any path `/minio/metrics/v3/PATH`, as follows: when PATH is a sub-endpoint listed above => serves the group of metrics under that path; or when PATH is a (non-empty) parent directory of the sub-endpoints listed above => serves metrics from each child sub-endpoint of PATH. otherwise, returns a no resource found error All available metrics are listed in the `docs/metrics/v3.md`. More will be added subsequently.	2024-03-10 01:15:15 -08:00
Harshavardhana	88a89213ff	make immediate purge non-blocking up to 100,000 entries per drive (#19231 ) make immediate purge non-blocking upto 100000 entries per drive Bonus: turn-off O_DIRECT verification when FSType is 'XFS'	2024-03-09 18:53:48 -08:00
Poorna	8e2238ea09	some more cleanup for startup message (#19229 )	2024-03-08 22:42:32 -08:00
Poorna	31e8f7c525	Small reformatting of startup message (#19228 ) Also changing User-Agent format	2024-03-08 19:07:08 -08:00
Klaus Post	51f62a8da3	Port ListBuckets to websockets layer & some cleanup (#19199 )	2024-03-08 11:08:18 -08:00
Klaus Post	650efc2e96	Fix listing in objects split across pools (#19227 ) Merging same-object - multiple versions from different pools would not always result in correct ordering. When merging keep inputs separate. ``` λ mc ls --versions local/testbucket ------ before ------ [2024-03-05 20:17:19 CET] 228B STANDARD 1f163718-9bc5-4b01-bff7-5d8cf09caf10 v3 PUT hosts [2024-03-05 20:19:56 CET] 19KiB STANDARD null v2 PUT hosts [2024-03-05 20:17:15 CET] 228B STANDARD 73c9f651-f023-4566-b012-cc537fdb7ce2 v1 PUT hosts ------ after ------ λ mc ls --versions local/testbucket [2024-03-05 20:19:56 CET] 19KiB STANDARD null v3 PUT hosts [2024-03-05 20:17:19 CET] 228B STANDARD 1f163718-9bc5-4b01-bff7-5d8cf09caf10 v2 PUT hosts [2024-03-05 20:17:15 CET] 228B STANDARD 73c9f651-f023-4566-b012-cc537fdb7ce2 v1 PUT hosts ```	2024-03-08 09:50:48 -08:00
Harshavardhana	2cc4997d24	fix: crash on 32bit systems during pre-allocation (#19225 )	2024-03-08 05:55:28 -08:00
Poorna	934f6cabf6	sr: use site replicator creds to verify temp user claims (#19224 ) This PR continues #19209 which did not handle claims verification of temporary users created by root in site replication scenario. Fixes: #19217	2024-03-07 14:30:00 -08:00
Anis Eleuch	68dd74c5ab	batch: Separate batch job request and batch job stats (#19205 ) Currently, the progress of the batch job is saved in inside the job request object, which is normally not supported by MinIO. Though there is no apparent bug, it is better to fix this now. Batch progress is saved in .minio.sys/batch-jobs/reports/ Co-authored-by: Anis Eleuch <anis@min.io>	2024-03-07 10:58:22 -08:00
Harshavardhana	48b590e14b	fix: same server to be part of multiple pools (#19216 ) our PoolNumber calculation was costly, while we already had this information per endpoint, we needed to deduce it appropriately. This PR addresses this by assigning PoolNumbers field that carries all the pool numbers that belong to a server. properties.PoolNumber still carries a valid value only when len(properties.PoolNumbers) == 1, otherwise properties.PoolNumber is set to math.MaxInt (indicating that this value is undefined) and then one must rely on properties.PoolNumbers for server participation in multiple pools. addresses the issue originating from #11327	2024-03-07 10:24:07 -08:00
Poorna	837a2a3d4b	sr: use service account cred for claims check (#19209 ) PR #19111 overlaid service account secret with site replicator secret during token claims check. Fixes : #19206	2024-03-06 16:19:24 -08:00
Harshavardhana	74ccee6619	avoid too much auditing during decom/rebalance make it more robust (#19174 ) there can be a sudden spike in tiny allocations, due to too much auditing being done, also don't hang on the ``` h.logCh <- entry ``` after initializing workers if you do not have a way to dequeue for some reason.	2024-03-06 03:43:16 -08:00
Poorna	89f759566c	bucket import: avoid overwriting bucket creation date (#19207 )	2024-03-05 16:05:28 -08:00
Harshavardhana	cd7551031b	fix: a regression in loading replication creds (#19204 ) fixes #19200 generating STS credentials fail with site-replicated setup, with this error on a fresh environment.	2024-03-05 11:06:17 -08:00
Praveen raj Mani	df57bfcd6c	fix: cluster read health check to return proper values (#19203 ) Fixes #19202	2024-03-05 10:25:49 -08:00
Justin Griffin	dfb1f39b57	Support custom endpoint for Azure remote storage tier (#19188 ) This commits adds support for using the `--endpoint` arg when creating a tier of type `azure`. This is needed to connect to Azure's Gov Cloud instance. For example, ``` mc ilm tier add azure TARGET TIER_NAME \ --account-name ACCOUNT \ --account-key KEY \ --bucket CONTAINER \ --endpoint https://ACCOUNT.blob.core.usgovcloudapi.net --prefix PREFIX \ --storage-class STORAGE_CLASS ``` Prior to this, the endpoint was hardcoded to `https://ACCOUNT.blob.core.windows.net`. The docs were even explicit about this, stating that `--endpoint` is: "Required for `s3` or `minio` tier types. This option has no effect for any other value of `TIER_TYPE`." Now, if the endpoint arg is present it will be used. If not, it will fall back to the same default behavior of `ACCOUNT.blob.core.windows.net`.	2024-03-05 08:44:08 -08:00
Harshavardhana	1b5f28e99b	fix: skip local disks properly in cluster health maintenance check (#19184 )	2024-03-04 20:48:44 -08:00
Krishnan Parthasarathi	b69bcdcdc4	Fix ilm config at startup (#19189 ) Remove api.expiration_workers config setting which was inadvertently left behind. Per review comment https://github.com/minio/minio/pull/18926, expiration_workers can be configured via ilm.expiration_workers.	2024-03-04 18:50:24 -08:00
Harshavardhana	e385f54185	fix: nLink is unreliable on all filesystems (#19187 ) ext4, xfs support this behavior however btrfs, nfs may not support it properly. in-case when we see Nlink < 2 then we know that we need to fallback on readdir() fixes a regression from #19100 fixes #19181	2024-03-04 15:58:35 -08:00
Aditya Manthramurthy	9a4d003ac7	Add common middleware to S3 API handlers (#19171 ) The middleware sets up tracing, throttling, gzipped responses and collecting API stats. Additionally, this change updates the names of handler functions in metric labels to be the same as the name derived from Go lang reflection on the handler name. The metric api labels are now stored in memory the same as the handler name - they will be camelcased, e.g. `GetObject` instead of `getobject`. For compatibility, we lowercase the metric api label values when emitting the metrics.	2024-03-04 10:05:56 -08:00
Praveen raj Mani	d5656eeb65	fix: healthcheck to fail even if one erasure set doesn't have quorum (#19180 ) fix: healthcheck to return false even if one erasure set doesn't have quorum	2024-03-04 08:34:14 -08:00
Harshavardhana	6d08af61a0	for root disks add additional information in the error log (#19177 )	2024-03-02 23:45:39 -08:00
Krishnan Parthasarathi	a7577da768	Improve expiration of tiered objects (#18926 ) - Use a shared worker pool for all ILM expiry tasks - Free version cleanup executes in a separate goroutine - Add a free version only if removing the remote object fails - Add ILM expiry metrics to the node namespace - Move tier journal tasks to expiryState - Remove unused on-disk journal for tiered objects pending deletion - Distribute expiry tasks across workers such that the expiry of versions of the same object serialized - Ability to resize worker pool without server restart - Make scaling down of expiryState workers' concurrency safe; Thanks @klauspost - Add error logs when expiryState and transition state are not initialized (yet) * metrics: Add missed tier journal entry tasks * Initialize the ILM worker pool after the object layer	2024-03-01 21:11:03 -08:00
Harshavardhana	325fd80687	add retry logic upto 3 times for policy map and policy (#19173 )	2024-03-01 16:21:34 -08:00
Andreas Auernhammer	09626d78ff	automatically generate root credentials with KMS (#19025 ) With this commit, MinIO generates root credentials automatically and deterministically if: - No root credentials have been set. - A KMS (KES) is configured. - API access for the root credentials is disabled (lockdown mode). Before, MinIO defaults to `minioadmin` for both the access and secret keys. Now, MinIO generates unique root credentials automatically on startup using the KMS. Therefore, it uses the KMS HMAC function to generate pseudo-random values. These values never change as long as the KMS key remains the same, and the KMS key must continue to exist since all IAM data is encrypted with it. Backward compatibility: This commit should not cause existing deployments to break. It only changes the root credentials of deployments that have a KMS configured (KES, not a static key) but have not set any admin credentials. Such implementations should be rare or not exist at all. Even if the worst case would be updating root credentials in mc or other clients used to administer the cluster. Root credentials are anyway not intended for regular S3 operations. Signed-off-by: Andreas Auernhammer <github@aead.dev>	2024-03-01 13:09:42 -08:00
Anis Eleuch	8f03c6e0db	xl: Avoid called getdents for folders in listing (#19100 )	2024-03-01 08:01:28 -08:00
Harshavardhana	2c2f5d871c	debug: introduce support for configuring client connect WRITE deadline (#19170 ) just like client-conn-read-deadline, added a new flag that does client-conn-write-deadline as well. Both are not configured by default, since we do not yet know what is the right value. Allow this to be configurable if needed.	2024-03-01 08:00:42 -08:00
Harshavardhana	c599c11e70	fix: relax metadata checks for healing (#19165 ) we should do this to ensure that we focus on data healing as primary focus, fixing metadata as part of healing must be done but making data available is the main focus. the main reason is metadata inconsistencies can cause data availability issues, which must be avoided at all cost. will be bringing in an additional healing mechanism that involves "metadata-only" heal, for now we do not expect to have these checks. continuation of #19154 Bonus: add a pro-active healthcheck to perform a connection	2024-02-29 22:49:01 -08:00
Aditya Manthramurthy	6769d4dd54	Update API label names for metrics (#19162 ) This change makes the label names consistent with the handler names. This is in preparation to use reflection based API handler function names for the api labels so they will be the same as tracing, auditing and logging names for these API calls.	2024-02-29 16:14:27 -08:00
Harshavardhana	d7520f0ae6	fix: make sure maintenance=true is honored properly (#19156 ) fixes a regression from #18700	2024-02-29 08:37:57 -08:00
Harshavardhana	44b70eb646	allow creating missing parent folders during moveToTrash() (#19155 )	2024-02-29 08:28:33 -08:00
Harshavardhana	467714f33b	ignore x-amz-storage-class when its set to STANDARD (#19154 ) fixes #19135	2024-02-28 17:44:30 -08:00
Harshavardhana	f8696cc8f6	fallback to globalLocalDrives for non-distributed setups	2024-02-28 14:56:08 -08:00
Anis Eleuch	9a7c7ab2d0	fix: parsing v2 and v1 cgroup memory limit (#19153 ) Trim the newline at the end of the sysfs memory limit.	2024-02-28 14:52:20 -08:00
Harshavardhana	51874a5776	fix: allow DNS disconnection events to happen in k8s (#19145 ) in k8s things really do come online very asynchronously, we need to use implementation that allows this randomness. To facilitate this move WriteAll() as part of the websocket layer instead. Bonus: avoid instances of dnscache usage on k8s	2024-02-28 09:54:52 -08:00
Aditya Manthramurthy	62ce52c8fd	cachevalue: simplify exported interface (#19137 ) - Also add cache options type	2024-02-28 09:09:09 -08:00
Anis Eleuch	2bdb9511bd	heal: Add skipped objects to the heal summary (#19142 ) New disk healing code skips/expires objects that ILM supposed to expire. Add more visibility to the user about this activity by calculating those objects and print it at the end of healing activity.	2024-02-28 09:05:40 -08:00
Harshavardhana	9a012a53ef	initialize the disk healer early on (#19143 ) This PR fixes a bug that perhaps has been long introduced, with no visible workarounds. In any deployment, if an entire erasure set is deleted, there is no way the cluster recovers.	2024-02-27 23:02:14 -08:00
Harshavardhana	1dd8ef09a6	remove unnecessary 'recreate' code (#19136 )	2024-02-27 01:47:58 -08:00
Poorna	b1351e2dee	sr: use site replicator svcacct to sign STS session tokens (#19111 ) This change is to decouple need for root credentials to match between site replication deployments. Also ensuring site replication config initialization is re-tried until it succeeds, this deoendency is critical to STS flow in site replication scenario.	2024-02-26 13:30:28 -08:00
Praveen raj Mani	30c2596512	Read drive IO stats from sysfs instead of procfs (#19131 ) Currently, we read from `/proc/diskstats` which is found to be un-reliable in k8s environments. We can read from `sysfs` instead. Also, cache the latest drive io stats to find the diff and update the metrics.	2024-02-26 11:34:50 -08:00
Klaus Post	2b5e4b853c	Improve caching (#19130 ) * Remove lock for cached operations. * Rename "Relax" to `ReturnLastGood`. * Add `CacheError` to allow caching values even on errors. * Add NoWait that will return current value with async fetching if within 2xTTL. * Make benchmark somewhat representative. ``` Before: BenchmarkCache-12 16408370 63.12 ns/op 0 B/op After: BenchmarkCache-12 428282187 2.789 ns/op 0 B/op ``` * Remove `storageRESTClient.scanning`. Nonsensical - RPC clients will not have any idea about scanning. * Always fetch remote diskinfo metrics and cache them. Seems most calls are requesting metrics. * Do async fetching of usage caches.	2024-02-26 10:49:19 -08:00
Harshavardhana	92788e4cf4	fix: re-arrange console-sys to log properly in k8s/docker (#19129 ) fixes #19125	2024-02-26 01:33:48 -08:00
Harshavardhana	8a698fef71	fix: crash in ResourceMetrics RPC handling concurrent writers (#19123 ) Continuation of #19103 that had fixed the crash in peer metrics for cluster endpoint.	2024-02-25 00:51:38 -08:00

1 2 3 4 5 ...

5999 Commits