minio

mirror of https://github.com/minio/minio.git synced 2025-05-02 23:43:55 -04:00

Author	SHA1	Message	Date
Harshavardhana	2cc4997d24	fix: crash on 32bit systems during pre-allocation (#19225 )	2024-03-08 05:55:28 -08:00
Poorna	934f6cabf6	sr: use site replicator creds to verify temp user claims (#19224 ) This PR continues #19209 which did not handle claims verification of temporary users created by root in site replication scenario. Fixes: #19217	2024-03-07 14:30:00 -08:00
Anis Eleuch	68dd74c5ab	batch: Separate batch job request and batch job stats (#19205 ) Currently, the progress of the batch job is saved in inside the job request object, which is normally not supported by MinIO. Though there is no apparent bug, it is better to fix this now. Batch progress is saved in .minio.sys/batch-jobs/reports/ Co-authored-by: Anis Eleuch <anis@min.io>	2024-03-07 10:58:22 -08:00
Harshavardhana	48b590e14b	fix: same server to be part of multiple pools (#19216 ) our PoolNumber calculation was costly, while we already had this information per endpoint, we needed to deduce it appropriately. This PR addresses this by assigning PoolNumbers field that carries all the pool numbers that belong to a server. properties.PoolNumber still carries a valid value only when len(properties.PoolNumbers) == 1, otherwise properties.PoolNumber is set to math.MaxInt (indicating that this value is undefined) and then one must rely on properties.PoolNumbers for server participation in multiple pools. addresses the issue originating from #11327	2024-03-07 10:24:07 -08:00
Poorna	837a2a3d4b	sr: use service account cred for claims check (#19209 ) PR #19111 overlaid service account secret with site replicator secret during token claims check. Fixes : #19206	2024-03-06 16:19:24 -08:00
Harshavardhana	74ccee6619	avoid too much auditing during decom/rebalance make it more robust (#19174 ) there can be a sudden spike in tiny allocations, due to too much auditing being done, also don't hang on the ``` h.logCh <- entry ``` after initializing workers if you do not have a way to dequeue for some reason.	2024-03-06 03:43:16 -08:00
Poorna	89f759566c	bucket import: avoid overwriting bucket creation date (#19207 )	2024-03-05 16:05:28 -08:00
Harshavardhana	cd7551031b	fix: a regression in loading replication creds (#19204 ) fixes #19200 generating STS credentials fail with site-replicated setup, with this error on a fresh environment.	2024-03-05 11:06:17 -08:00
Praveen raj Mani	df57bfcd6c	fix: cluster read health check to return proper values (#19203 ) Fixes #19202	2024-03-05 10:25:49 -08:00
Justin Griffin	dfb1f39b57	Support custom endpoint for Azure remote storage tier (#19188 ) This commits adds support for using the `--endpoint` arg when creating a tier of type `azure`. This is needed to connect to Azure's Gov Cloud instance. For example, ``` mc ilm tier add azure TARGET TIER_NAME \ --account-name ACCOUNT \ --account-key KEY \ --bucket CONTAINER \ --endpoint https://ACCOUNT.blob.core.usgovcloudapi.net --prefix PREFIX \ --storage-class STORAGE_CLASS ``` Prior to this, the endpoint was hardcoded to `https://ACCOUNT.blob.core.windows.net`. The docs were even explicit about this, stating that `--endpoint` is: "Required for `s3` or `minio` tier types. This option has no effect for any other value of `TIER_TYPE`." Now, if the endpoint arg is present it will be used. If not, it will fall back to the same default behavior of `ACCOUNT.blob.core.windows.net`.	2024-03-05 08:44:08 -08:00
Harshavardhana	1b5f28e99b	fix: skip local disks properly in cluster health maintenance check (#19184 )	2024-03-04 20:48:44 -08:00
Krishnan Parthasarathi	b69bcdcdc4	Fix ilm config at startup (#19189 ) Remove api.expiration_workers config setting which was inadvertently left behind. Per review comment https://github.com/minio/minio/pull/18926, expiration_workers can be configured via ilm.expiration_workers.	2024-03-04 18:50:24 -08:00
Harshavardhana	e385f54185	fix: nLink is unreliable on all filesystems (#19187 ) ext4, xfs support this behavior however btrfs, nfs may not support it properly. in-case when we see Nlink < 2 then we know that we need to fallback on readdir() fixes a regression from #19100 fixes #19181	2024-03-04 15:58:35 -08:00
Aditya Manthramurthy	9a4d003ac7	Add common middleware to S3 API handlers (#19171 ) The middleware sets up tracing, throttling, gzipped responses and collecting API stats. Additionally, this change updates the names of handler functions in metric labels to be the same as the name derived from Go lang reflection on the handler name. The metric api labels are now stored in memory the same as the handler name - they will be camelcased, e.g. `GetObject` instead of `getobject`. For compatibility, we lowercase the metric api label values when emitting the metrics.	2024-03-04 10:05:56 -08:00
Praveen raj Mani	d5656eeb65	fix: healthcheck to fail even if one erasure set doesn't have quorum (#19180 ) fix: healthcheck to return false even if one erasure set doesn't have quorum	2024-03-04 08:34:14 -08:00
Harshavardhana	6d08af61a0	for root disks add additional information in the error log (#19177 )	2024-03-02 23:45:39 -08:00
Krishnan Parthasarathi	a7577da768	Improve expiration of tiered objects (#18926 ) - Use a shared worker pool for all ILM expiry tasks - Free version cleanup executes in a separate goroutine - Add a free version only if removing the remote object fails - Add ILM expiry metrics to the node namespace - Move tier journal tasks to expiryState - Remove unused on-disk journal for tiered objects pending deletion - Distribute expiry tasks across workers such that the expiry of versions of the same object serialized - Ability to resize worker pool without server restart - Make scaling down of expiryState workers' concurrency safe; Thanks @klauspost - Add error logs when expiryState and transition state are not initialized (yet) * metrics: Add missed tier journal entry tasks * Initialize the ILM worker pool after the object layer	2024-03-01 21:11:03 -08:00
Harshavardhana	325fd80687	add retry logic upto 3 times for policy map and policy (#19173 )	2024-03-01 16:21:34 -08:00
Andreas Auernhammer	09626d78ff	automatically generate root credentials with KMS (#19025 ) With this commit, MinIO generates root credentials automatically and deterministically if: - No root credentials have been set. - A KMS (KES) is configured. - API access for the root credentials is disabled (lockdown mode). Before, MinIO defaults to `minioadmin` for both the access and secret keys. Now, MinIO generates unique root credentials automatically on startup using the KMS. Therefore, it uses the KMS HMAC function to generate pseudo-random values. These values never change as long as the KMS key remains the same, and the KMS key must continue to exist since all IAM data is encrypted with it. Backward compatibility: This commit should not cause existing deployments to break. It only changes the root credentials of deployments that have a KMS configured (KES, not a static key) but have not set any admin credentials. Such implementations should be rare or not exist at all. Even if the worst case would be updating root credentials in mc or other clients used to administer the cluster. Root credentials are anyway not intended for regular S3 operations. Signed-off-by: Andreas Auernhammer <github@aead.dev>	2024-03-01 13:09:42 -08:00
Anis Eleuch	8f03c6e0db	xl: Avoid called getdents for folders in listing (#19100 )	2024-03-01 08:01:28 -08:00
Harshavardhana	2c2f5d871c	debug: introduce support for configuring client connect WRITE deadline (#19170 ) just like client-conn-read-deadline, added a new flag that does client-conn-write-deadline as well. Both are not configured by default, since we do not yet know what is the right value. Allow this to be configurable if needed.	2024-03-01 08:00:42 -08:00
Harshavardhana	c599c11e70	fix: relax metadata checks for healing (#19165 ) we should do this to ensure that we focus on data healing as primary focus, fixing metadata as part of healing must be done but making data available is the main focus. the main reason is metadata inconsistencies can cause data availability issues, which must be avoided at all cost. will be bringing in an additional healing mechanism that involves "metadata-only" heal, for now we do not expect to have these checks. continuation of #19154 Bonus: add a pro-active healthcheck to perform a connection	2024-02-29 22:49:01 -08:00
Aditya Manthramurthy	6769d4dd54	Update API label names for metrics (#19162 ) This change makes the label names consistent with the handler names. This is in preparation to use reflection based API handler function names for the api labels so they will be the same as tracing, auditing and logging names for these API calls.	2024-02-29 16:14:27 -08:00
Harshavardhana	d7520f0ae6	fix: make sure maintenance=true is honored properly (#19156 ) fixes a regression from #18700	2024-02-29 08:37:57 -08:00
Harshavardhana	44b70eb646	allow creating missing parent folders during moveToTrash() (#19155 )	2024-02-29 08:28:33 -08:00
Harshavardhana	467714f33b	ignore x-amz-storage-class when its set to STANDARD (#19154 ) fixes #19135	2024-02-28 17:44:30 -08:00
Harshavardhana	f8696cc8f6	fallback to globalLocalDrives for non-distributed setups	2024-02-28 14:56:08 -08:00
Anis Eleuch	9a7c7ab2d0	fix: parsing v2 and v1 cgroup memory limit (#19153 ) Trim the newline at the end of the sysfs memory limit.	2024-02-28 14:52:20 -08:00
Harshavardhana	51874a5776	fix: allow DNS disconnection events to happen in k8s (#19145 ) in k8s things really do come online very asynchronously, we need to use implementation that allows this randomness. To facilitate this move WriteAll() as part of the websocket layer instead. Bonus: avoid instances of dnscache usage on k8s	2024-02-28 09:54:52 -08:00
Aditya Manthramurthy	62ce52c8fd	cachevalue: simplify exported interface (#19137 ) - Also add cache options type	2024-02-28 09:09:09 -08:00
Anis Eleuch	2bdb9511bd	heal: Add skipped objects to the heal summary (#19142 ) New disk healing code skips/expires objects that ILM supposed to expire. Add more visibility to the user about this activity by calculating those objects and print it at the end of healing activity.	2024-02-28 09:05:40 -08:00
Harshavardhana	9a012a53ef	initialize the disk healer early on (#19143 ) This PR fixes a bug that perhaps has been long introduced, with no visible workarounds. In any deployment, if an entire erasure set is deleted, there is no way the cluster recovers.	2024-02-27 23:02:14 -08:00
Harshavardhana	1dd8ef09a6	remove unnecessary 'recreate' code (#19136 )	2024-02-27 01:47:58 -08:00
Poorna	b1351e2dee	sr: use site replicator svcacct to sign STS session tokens (#19111 ) This change is to decouple need for root credentials to match between site replication deployments. Also ensuring site replication config initialization is re-tried until it succeeds, this deoendency is critical to STS flow in site replication scenario.	2024-02-26 13:30:28 -08:00
Praveen raj Mani	30c2596512	Read drive IO stats from sysfs instead of procfs (#19131 ) Currently, we read from `/proc/diskstats` which is found to be un-reliable in k8s environments. We can read from `sysfs` instead. Also, cache the latest drive io stats to find the diff and update the metrics.	2024-02-26 11:34:50 -08:00
Klaus Post	2b5e4b853c	Improve caching (#19130 ) * Remove lock for cached operations. * Rename "Relax" to `ReturnLastGood`. * Add `CacheError` to allow caching values even on errors. * Add NoWait that will return current value with async fetching if within 2xTTL. * Make benchmark somewhat representative. ``` Before: BenchmarkCache-12 16408370 63.12 ns/op 0 B/op After: BenchmarkCache-12 428282187 2.789 ns/op 0 B/op ``` * Remove `storageRESTClient.scanning`. Nonsensical - RPC clients will not have any idea about scanning. * Always fetch remote diskinfo metrics and cache them. Seems most calls are requesting metrics. * Do async fetching of usage caches.	2024-02-26 10:49:19 -08:00
Harshavardhana	92788e4cf4	fix: re-arrange console-sys to log properly in k8s/docker (#19129 ) fixes #19125	2024-02-26 01:33:48 -08:00
Harshavardhana	8a698fef71	fix: crash in ResourceMetrics RPC handling concurrent writers (#19123 ) Continuation of #19103 that had fixed the crash in peer metrics for cluster endpoint.	2024-02-25 00:51:38 -08:00
Harshavardhana	c2b54d92f6	allow all disk full errors to be handled (#19117 )	2024-02-24 09:11:14 -08:00
Harshavardhana	f965434022	fix: re-use endpoint strings to avoid allocation during audit (#19116 )	2024-02-23 16:19:13 -08:00
Harshavardhana	a3ac62596c	move timedValue -> cachevalue package (#19114 )	2024-02-23 13:28:14 -08:00
Harshavardhana	2faba02d6b	fix: allow diskInfo at storageRPC to be cached (#19112 ) Bonus: convert timedValue into a typed implementation	2024-02-23 09:21:38 -08:00
Krishnan Parthasarathi	ee158e1610	ilm: Update action count only on success (#19093 ) It also fixes a long-standing bug in expiring transitioned objects. The expiration action was deleting the current version in the case' of tiered objects instead of adding a delete marker.	2024-02-22 15:00:32 -08:00
Anis Eleuch	fa68efb1e7	s3: CopyObject to disallow invalid dest object names (#19110 ) By not doing so, objects can risk being in a wrong erasure set if the destination object name contains e.g. '//'	2024-02-22 10:05:17 -08:00
Anis Eleuch	8c53a4405a	Add audit for folder excess (#19109 ) Also replace ilm:expiry with scanner to avoid user confusion	2024-02-22 08:18:13 -08:00
Harshavardhana	c32f699105	turn-off md5sum for SSE-KMS/SSE-C as optimization for multipart (#19106 ) only enable md5sum if explicitly asked by the client, otherwise its not necessary to compute md5sum when SSE-KMS/SSE-C is enabled. this is continuation of #17958	2024-02-22 04:24:11 -08:00
Harshavardhana	53aa8f5650	use typos instead of codespell (#19088 )	2024-02-21 22:26:06 -08:00
Klaus Post	92180bc793	Add array recycling safety (#19103 ) Nil entries when recycling arrays.	2024-02-21 12:27:35 -08:00
Poorna	526b829a09	site replication: Disallow removal of site-replicator account (#19092 )	2024-02-21 02:09:33 -08:00
Anis Eleuch	9ea5d08ecd	site-repl: Fix endpoint in the error with unexpected deployment-id (#19086 )	2024-02-20 15:02:35 -08:00

1 2 3 4 5 ...

5937 Commits