minio

Commit Graph

Author	SHA1	Message	Date
Krishnan Parthasarathi	383489d5d9	Handle zero versions qualified for expiration (#19301 ) When objects have more versions than their ILM policy expects to retain via NewerNoncurrentVersions, but they don't qualify for expiry due to NoncurrentDays are configured in that rule. In this case, applyNewerNoncurrentVersionsLimit method was enqueuing empty tasks, which lead to a panic (panic: runtime error: index out of range [0] with length 0) in newerNoncurrentTask.OpHash method, which assumes the task to contain at least one version to expire.	2024-03-19 20:10:58 -07:00
Krishnan Parthasarathi	b69bcdcdc4	Fix ilm config at startup (#19189 ) Remove api.expiration_workers config setting which was inadvertently left behind. Per review comment https://github.com/minio/minio/pull/18926, expiration_workers can be configured via ilm.expiration_workers.	2024-03-04 18:50:24 -08:00
Krishnan Parthasarathi	a7577da768	Improve expiration of tiered objects (#18926 ) - Use a shared worker pool for all ILM expiry tasks - Free version cleanup executes in a separate goroutine - Add a free version only if removing the remote object fails - Add ILM expiry metrics to the node namespace - Move tier journal tasks to expiryState - Remove unused on-disk journal for tiered objects pending deletion - Distribute expiry tasks across workers such that the expiry of versions of the same object serialized - Ability to resize worker pool without server restart - Make scaling down of expiryState workers' concurrency safe; Thanks @klauspost - Add error logs when expiryState and transition state are not initialized (yet) * metrics: Add missed tier journal entry tasks * Initialize the ILM worker pool after the object layer	2024-03-01 21:11:03 -08:00
Harshavardhana	9a012a53ef	initialize the disk healer early on (#19143 ) This PR fixes a bug that perhaps has been long introduced, with no visible workarounds. In any deployment, if an entire erasure set is deleted, there is no way the cluster recovers.	2024-02-27 23:02:14 -08:00
Krishnan Parthasarathi	ee158e1610	ilm: Update action count only on success (#19093 ) It also fixes a long-standing bug in expiring transitioned objects. The expiration action was deleting the current version in the case' of tiered objects instead of adding a delete marker.	2024-02-22 15:00:32 -08:00
Harshavardhana	1d3bd02089	avoid close 'nil' panics if any (#18890 ) brings a generic implementation that prints a stack trace for 'nil' channel closes(), if not safely closes it.	2024-01-28 10:04:17 -08:00
Krishnan Parthasarathi	56b7045c20	Export tier metrics (#18678 ) minio_node_tier_ttlb_seconds - Distribution of time to last byte for streaming objects from warm tier minio_node_tier_requests_success - Number of requests to download object from warm tier that were successful minio_node_tier_requests_failure - Number of requests to download object from warm tier that failed	2023-12-20 20:13:40 -08:00
Harshavardhana	e98172d72d	avoid hot-tier SLA to be tied to warm-tier SLA (#18581 ) it is okay if the warm-tier cannot keep up, we should continue to take I/O at hot-tier, only fail hot-tier or block it when we are disk full. Bonus: add metrics counter for these missed tasks, we will know for sure if one of the node is lagging behind or is losing too many tasks during transitioning.	2023-12-02 13:02:12 -08:00
Harshavardhana	506f121576	remove frivolous logging in transition object (#18526 ) AWS S3 closes keep-alive connections frequently leading to frivolous logs filling up the MinIO logs when the transition tier is an AWS S3 bucket. Ignore such transient errors, let MinIO retry it when it can.	2023-11-26 22:18:09 -08:00
Krishnan Parthasarathi	a93214ea63	ilm: ObjectSizeLessThan and ObjectSizeGreaterThan (#18500 )	2023-11-22 13:42:39 -08:00
Anis Eleuch	1bb7a2a295	Immediate transition ILM to avoid quick deferring to the scanner (#18475 ) Immediate transition use case and is mostly used to fill warm backend with a lot of data when a new deployment is created Currently, if the transition queue is complete, the transition will be deferred to the scanner; change this behavior by blocking the PUT request until the transition queue has a new place for a transition task.	2023-11-17 16:16:46 -08:00
Harshavardhana	80adc87a14	converge WARM tier object name to hash of deployment+bucket (#18410 ) this is to ensure that we can converge and save IOPs when hot-tier accesses MinIO.	2023-11-10 02:15:13 -08:00
Klaus Post	7926df0b80	Fix globalDeploymentID race (#18275 ) globalDeploymentID was being read while it was being set. Fixes race: ``` WARNING: DATA RACE Write at 0x0000079605a0 by main goroutine: github.com/minio/minio/cmd.connectLoadInitFormats() github.com/minio/minio/cmd/prepare-storage.go:269 +0x14f0 github.com/minio/minio/cmd.waitForFormatErasure() github.com/minio/minio/cmd/prepare-storage.go:294 +0x21d ... Previous read at 0x0000079605a0 by goroutine 105: github.com/minio/minio/cmd.newContext() github.com/minio/minio/cmd/utils.go:817 +0x31e github.com/minio/minio/cmd.adminMiddleware.func1() github.com/minio/minio/cmd/admin-router.go:110 +0x96 net/http.HandlerFunc.ServeHTTP() net/http/server.go:2136 +0x47 github.com/minio/minio/cmd.setBucketForwardingMiddleware.func1() github.com/minio/minio/cmd/generic-handlers.go:460 +0xb1a net/http.HandlerFunc.ServeHTTP() net/http/server.go:2136 +0x47 ... ```	2023-10-18 08:06:57 -07:00
Aditya Manthramurthy	1c99fb106c	Update to minio/pkg/v2 (#17967 )	2023-09-04 12:57:37 -07:00
Krishnan Parthasarathi	53abd25116	Don't log when object to be tiered is not found (#17924 )	2023-08-25 23:34:16 -07:00
Anis Eleuch	1664fd8bb1	Avoid logging errors twice during transitioned objects expiration (#17782 )	2023-08-02 09:06:03 -07:00
Klaus Post	ff5988f4e0	Reduce allocations (#17584 ) * Reduce allocations * Add stringsHasPrefixFold which can compare string prefixes, while ignoring case and not allocating. * Reuse all msgp.Readers * Reuse metadata buffers when not reading data. * Make type safe. Make buffer 4K instead of 8. * Unslice	2023-07-06 16:02:08 -07:00
Aditya Manthramurthy	5a1612fe32	Bump up madmin-go and pkg deps (#17469 )	2023-06-19 17:53:08 -07:00
Krishnan Parthasarathi	62df731006	Add updatedAt for GetBucketLifecycleConfig (#17271 )	2023-05-24 22:52:39 -07:00
Krishnan Parthasarathi	3e128c116e	Add lifecycle event source to audit log tags (#17248 )	2023-05-22 15:28:56 -07:00
jiuker	fd2959fa3a	fix: workers.New err must be returned (#17208 )	2023-05-16 08:08:00 -07:00
Krishnan Parthasarathi	0ec722bc54	Add tags to NewerNoncurrentVersions audit event (#17110 )	2023-05-02 12:56:33 -07:00
Krishnan Parthasarathi	e7cac8acef	Add tags to auditLogLifecycle (#17081 )	2023-04-26 17:49:00 -07:00
Praveen raj Mani	72802a5972	Use 'minio/pkg/sync/errgroup' and 'minio/pkg/workers' (#17069 )	2023-04-25 22:57:40 -07:00
Poorna	cd6dec49c0	Add trace support for ilm activity (#16993 )	2023-04-11 19:22:32 -07:00
Harshavardhana	c06e0bfef9	set correct `Host:` value for replication event notification (#16984 )	2023-04-06 10:20:53 -07:00
Harshavardhana	7a6c4e438e	allow more workers for ILM expiration (#16924 )	2023-03-30 10:47:15 -07:00
Harshavardhana	b66d7dc708	add missing x-amz-id-2 to event notification date (#16646 )	2023-02-20 15:41:47 +05:30
Krishnan Parthasarathi	d136ac0596	Don't close transition task channel on server exit (#16627 )	2023-02-15 22:09:25 -08:00
Krishnan Parthasarathi	9de26531e4	tiering: UpdateWorkers may be called before Init (#16573 )	2023-02-08 19:13:34 -08:00
Krishnan Parthasarathi	990fc415f7	Ensure safety of transitionState at startup (#16563 )	2023-02-07 23:11:42 -08:00
Krishnan Parthasarathi	cea2ca8c8e	Add restore-status header for multipart objects (#16508 )	2023-01-31 07:53:45 +05:30
Klaus Post	a713aee3d5	Run staticcheck on CI (#16170 )	2022-12-05 11:18:50 -08:00
Krishnan Parthasarathi	6eef9b4a23	lifecycle: simplify Eval and HasActiveRules (#16036 )	2022-11-10 07:17:45 -08:00
Harshavardhana	23b329b9df	remove gateway completely (#15929 )	2022-10-24 17:44:15 -07:00
Anis Elleuch	ac85c2af76	lifecycle: refactor rules filtering and tagging support (#15914 )	2022-10-21 10:46:53 -07:00
Harshavardhana	228c6686f8	allow non-standards fallback for all http.TimeFormats (#15662 ) fixes #15645	2022-09-07 07:24:54 -07:00
Krishnan Parthasarathi	3a1d3a7952	audit-log: Add time to get/restore object from remote-tier (#15602 )	2022-08-29 21:33:59 -07:00
Poorna	426c902b87	site replication: fix healing of bucket deletes. (#15377 ) This PR changes the handling of bucket deletes for site replicated setups to hold on to deleted bucket state until it syncs to all the clusters participating in site replication.	2022-07-25 17:51:32 -07:00
Krishnan Parthasarathi	ad8e611098	feat: implement prefix-level versioning exclusion (#14828 ) Spark/Hadoop workloads which use Hadoop MR Committer v1/v2 algorithm upload objects to a temporary prefix in a bucket. These objects are 'renamed' to a different prefix on Job commit. Object storage admins are forced to configure separate ILM policies to expire these objects and their versions to reclaim space. Our solution: This can be avoided by simply marking objects under these prefixes to be excluded from versioning, as shown below. Consequently, these objects are excluded from replication, and don't require ILM policies to prune unnecessary versions. - MinIO Extension to Bucket Version Configuration ```xml <VersioningConfiguration xmlns="http://s3.amazonaws.com/doc/2006-03-01/"> <Status>Enabled</Status> <ExcludeFolders>true</ExcludeFolders> <ExcludedPrefixes> <Prefix>app1-jobs//_temporary/</Prefix> </ExcludedPrefixes> <ExcludedPrefixes> <Prefix>app2-jobs//__magic/</Prefix> </ExcludedPrefixes> <!-- .. up to 10 prefixes in all --> </VersioningConfiguration> ``` Note: `ExcludeFolders` excludes all folders in a bucket from versioning. This is required to prevent the parent folders from accumulating delete markers, especially those which are shared across spark workloads spanning projects/teams. - To enable version exclusion on a list of prefixes ``` mc version enable --excluded-prefixes "app1-jobs//_temporary/,app2-jobs//_magic," --exclude-prefix-marker myminio/test ```	2022-05-06 19:05:28 -07:00
Harshavardhana	2a6a40e93b	enable go1.18.x builds (#14746 )	2022-04-13 14:21:55 -07:00
Krishnan Parthasarathi	cdab4a3b85	Update hourly tier-stats only on succesful tiering (#14330 )	2022-02-16 17:29:12 -08:00
Anis Elleuch	661ea57907	restore: Add quotes some fields in x-amz-restore header (#14281 ) S3 spec returns x-amz-restore header in HEAD/GET object with the following format: ``` x-amz-restore: ongoing-request="false", expiry-date="Fri, 21 Dec 2012 00:00:00 GMT" ``` This commit adds quotes as the current code does not support it. It will also supports the old format saved in the disk (in xl.meta) for backward compatibility.	2022-02-09 13:17:41 -08:00
Krishnan Parthasarathi	d2e5f01542	feat: maintain in-memory tier stats for the last 24hrs (#13782 )	2022-01-26 14:33:10 -08:00
Harshavardhana	f527c708f2	run gofumpt cleanup across code-base (#14015 )	2022-01-02 09:15:06 -08:00
Krishnan Parthasarathi	44a9339c0a	Newer noncurrent versions (#13815 ) - Rename MaxNoncurrentVersions tag to NewerNoncurrentVersions Note: We apply overlapping NewerNoncurrentVersions rules such that we honor the highest among applicable limits. e.g if 2 overlapping rules are configured with 2 and 3 noncurrent versions to be retained, we will retain 3. - Expire newer noncurrent versions after noncurrent days - MinIO extension: allow noncurrent days to be zero, allowing expiry of noncurrent version as soon as more than configured NewerNoncurrentVersions are present. - Allow NewerNoncurrentVersions rules on object-locked buckets - No x-amz-expiration when NewerNoncurrentVersions configured - ComputeAction should skip rules with NewerNoncurrentVersions > 0 - Add unit tests for lifecycle.ComputeAction - Support lifecycle rules with MaxNoncurrentVersions - Extend ExpectedExpiryTime to work with zero days - Fix all-time comparisons to be relative to UTC	2021-12-14 09:41:44 -08:00
Krishnan Parthasarathi	3da9ee15d3	Add MaxNoncurrentVersions to NoncurrentExpiration action (#13580 ) This unit allows users to limit the maximum number of noncurrent versions of an object. To enable this rule you need the following ilm.json ``` cat >> ilm.json <<EOF { "Rules": [ { "ID": "test-max-noncurrent", "Status": "Enabled", "Filter": { "Prefix": "user-uploads/" }, "NoncurrentVersionExpiration": { "MaxNoncurrentVersions": 5 } } ] } EOF mc ilm import myminio/mybucket < ilm.json ```	2021-11-19 17:54:10 -08:00
Harshavardhana	7752cdbfaf	fix: restored object to preserve x-amz-meta properly (#13664 ) with SelectRestoreRequest OutputLocation provides additional metadata for the object, this is not preserved due to argument order change.	2021-11-15 13:25:55 -08:00
Krishnan Parthasarathi	f3aeed77e5	Add immediate inline tiering support (#13298 )	2021-10-01 11:58:17 -07:00
Harshavardhana	50a68a1791	allow S3 gateway to support object locked buckets (#13257 ) - Supports object locked buckets that require PutObject() to set content-md5 always. - Use SSE-S3 when S3 gateway is being used instead of SSE-KMS for auto-encryption.	2021-09-21 09:02:15 -07:00

1 2

97 Commits