minio

mirror of https://github.com/minio/minio.git synced 2024-12-26 15:15:55 -05:00

Author	SHA1	Message	Date
Poorna	26c23b30f4	replication: set context timeout for NewMultipartUpload calls (#17807 )	2023-08-05 12:27:07 -07:00
Poorna	311380f8cb	replication resync: fix queueing (#17775 ) Assign resync of all versions of object to the same worker to avoid locking contention. Fixes parallel resync implementation in #16707	2023-08-01 11:51:15 -07:00
Poorna	1a42693d68	replication: limit larger uploads to a subset of workers (#17687 ) Limit large uploads (> 128MiB) to a max of 10 workers, intent is to avoid larger uploads from using all replication bandwidth, giving room for smaller uploads to sync faster.	2023-07-25 20:02:02 -07:00
Harshavardhana	005a4a275a	add more bootstrap messages to provide latency (#17650 ) - simplify refreshing bucket metadata, wait() to depend on how fast the bucket metadata can load. - simplify resync to start resync in single pass.	2023-07-14 04:00:29 -07:00
Poorna	5e2f8d7a42	replication: Simplify mrf requeueing and add backlog handler (#17171 ) Simplify MRF queueing and add backlog handler - Limit re-tries to 3 to avoid repeated re-queueing. Fall offs to be re-tried when the scanner revisits this object or upon access. - Change MRF to have each node process only its MRF entries. - Collect MRF backlog by the node to allow for current backlog visibility	2023-07-12 23:51:33 -07:00
Kaan Kabalak	f64d62b01d	Fix style of logOnceIf calls w/unique identifiers (#17631 )	2023-07-11 13:17:45 -07:00
Poorna	e8c98c3246	Avoid extra GetObjectInfo call in DeleteObject API (#17599 ) Optimize DeleteObject API to avoid extra GetObjectInfo call on the replicating side. For receiving side, it is just a regular DeleteObject call. Bonus: Fix a corner case where version purged is absent on target (either due to replication not yet complete or target version already deleted in a one-way replication or when replication was disabled). In such cases, mark version purge complete.	2023-07-10 07:57:56 -07:00
Klaus Post	ff5988f4e0	Reduce allocations (#17584 ) * Reduce allocations * Add stringsHasPrefixFold which can compare string prefixes, while ignoring case and not allocating. * Reuse all msgp.Readers * Reuse metadata buffers when not reading data. * Make type safe. Make buffer 4K instead of 8. * Unslice	2023-07-06 16:02:08 -07:00
Kaan Kabalak	21fbe88e1f	Print certain log messages once per error (#17484 )	2023-06-24 20:29:13 -07:00
jiuker	b6b68be052	fix: replication check for duplicate endpoints detection with wrong route (#17474 )	2023-06-20 09:27:54 -07:00
Aditya Manthramurthy	5a1612fe32	Bump up madmin-go and pkg deps (#17469 )	2023-06-19 17:53:08 -07:00
Harshavardhana	1443b5927a	allow quorum fileInfo to pick same parityBlocks (#17454 ) Bonus: allow replication to proceed for 503 errors such as with error code SlowDownRead	2023-06-18 18:20:15 -07:00
Poorna	c4d0c49a5f	ensure metadata updates go to same pool where version exists (#17451 ) This PR also returns the replication status in proxy calls and defers replication attempt if HEAD on object version returned a error different from NoSuchKey	2023-06-17 07:30:53 -07:00
Poorna Krishnamoorthy	f986b0c493	replication: perform bucket resync in parallel (#16707 ) Default number of parallel resync operations for a bucket to 10 to speed up resync.	2023-06-11 16:09:55 -07:00
Harshavardhana	b210ea79bc	do not save MTime in newMultipartUpload() to avoid side-affects (#17340 )	2023-06-02 14:38:09 -07:00
Poorna	e95825a42e	replication: use latest object info for metrics update (#17333 )	2023-06-01 18:52:55 -07:00
Poorna	2131046427	replication: fix audit log reporting (#17222 )	2023-05-16 15:35:08 -07:00
Klaus Post	aaf1abc993	simplify HardLimitReader by using LimitReader for internal usage (#17218 )	2023-05-16 13:14:37 -07:00
jiuker	413549bcf5	fix: loadStatsFromDisk() should return nil for configNotFound (#17217 )	2023-05-16 12:23:38 -07:00
Poorna	e07c2ab868	Use hash.NewLimitReader for internal multipart calls (#17191 )	2023-05-12 11:19:08 -07:00
Poorna	c5c1426262	Validate if replication config being added is self referential (#17142 )	2023-05-06 13:35:43 -07:00
Harshavardhana	6825bd7e75	fix: inlined objects don't need to honor long locks (#17039 )	2023-04-17 12:16:37 -07:00
Harshavardhana	c06e0bfef9	set correct `Host:` value for replication event notification (#16984 )	2023-04-06 10:20:53 -07:00
Anis Eleuch	d90d0c8931	Use one http response recorder per external http call (#16938 )	2023-03-31 09:37:29 -07:00
Allan Roger Reid	483b226cc1	fix: avoid logging when object/version not found in replication (#16919 )	2023-03-29 15:02:45 -07:00
Harshavardhana	8e02660a0d	update all our deps (#16899 )	2023-03-28 03:45:24 -07:00
Poorna	fb6ab1cca2	fix: allow replication of 'null' delete markers (#16773 )	2023-03-08 07:03:29 -08:00
Poorna	ee54643004	Avoid unnecessary replication heal attempts (#16769 )	2023-03-07 07:43:38 -08:00
Poorna	c33a237067	fix: under site replication disallow remote target modification (#16628 )	2023-02-15 20:22:13 -08:00
jiuker	a15b6f21b8	remove incorrect use of WaitGroup (#16596 )	2023-02-12 20:59:45 -08:00
Poorna	876e1a91b2	replication: Fix typo checking PreconditionFailed status code (#16517 )	2023-02-02 19:22:02 +05:30
Poorna	820d94447c	replication: fix target bucket passed on GET proxy (#16495 )	2023-01-27 10:24:51 -08:00
Poorna	ed20134a7b	replication: detect proxy header presence correctly (#16489 )	2023-01-27 01:29:32 -08:00
Harshavardhana	e64b9f6751	fix: disallow SSE-C encrypted objects on replicated buckets (#16467 )	2023-01-24 15:46:33 -08:00
Poorna	ddad231921	replication: Avoid logging PreConditionFailed error (#16450 )	2023-01-21 07:33:04 +05:30
Poorna	1b02e046c2	Fix bandwidth monitoring to be per remote target (#16360 )	2023-01-19 18:52:16 +05:30
Harshavardhana	2937711390	fix: DeleteObject() API with versionId under replication (#16325 )	2022-12-28 22:48:33 -08:00
Anis Elleuch	acc9c033ed	debug: Add X-Amz-Request-ID to lock/unlock calls (#16309 )	2022-12-23 19:49:07 -08:00
Poorna	de0b43de32	persist replication stats with leader lock (#16282 )	2022-12-22 14:25:13 -08:00
Poorna	6423e4c767	Remove site replication config if it succeeded locally (#16279 )	2022-12-22 01:31:20 -08:00
Harshavardhana	2fc182d8e6	fix: iso8601TimeFormat padding issue for certain nanoseconds (#16207 )	2022-12-12 10:28:30 -08:00
Aditya Manthramurthy	a30cfdd88f	Bump up madmin-go to v2 (#16162 )	2022-12-06 13:46:50 -08:00
Klaus Post	a713aee3d5	Run staticcheck on CI (#16170 )	2022-12-05 11:18:50 -08:00
Klaus Post	1cd875de1e	Persist updated metadata (#16160 )	2022-12-02 08:35:04 -08:00
Anis Elleuch	641ab24aec	repl: resync orchestrator to use global shared lock (#16154 )	2022-12-01 12:10:09 -08:00
Klaus Post	a22b4adf4c	distribute replication ops based on names (#16083 )	2022-11-17 15:20:09 -08:00
Klaus Post	b7bb122be8	fix: replication auto-scaling deadlock (#16084 )	2022-11-17 07:35:02 -08:00
Klaus Post	8a07000e58	fix: refactor getReplicationDiff for safe use (#16051 )	2022-11-15 07:59:21 -08:00
Poorna	d6bc141bd1	feat: Add support for site level resync (#15753 )	2022-11-14 07:16:40 -08:00
Poorna	34d28dd79f	replication: Avoid blocking on mrf save (#16045 )	2022-11-10 10:20:02 -08:00
Klaus Post	2894dd4d1a	fix: hold lock while serializing replication stats (#16007 )	2022-11-04 09:59:14 -07:00
Klaus Post	0f0e154315	fix: inconsistent replication delete marker timestamps (#15956 )	2022-10-27 09:46:52 -07:00
Harshavardhana	23b329b9df	remove gateway completely (#15929 )	2022-10-24 17:44:15 -07:00
Anis Elleuch	fc6c794972	Audit dangling object removal (#15933 )	2022-10-24 11:35:07 -07:00
Poorna	e4e90b53c1	fix: delete-marker replication check properly (#15923 )	2022-10-21 14:45:06 -07:00
Harshavardhana	59e33b3b21	validate setBucketTarget properly as per BucketExists() call (#15860 )	2022-10-13 17:46:49 -07:00
Poorna	0e3c92c027	attempt delete marker replication after object is replicated (#15857 ) Ensure delete marker replication success, especially since the recent optimizations to heal on HEAD, LIST and GET can force replication attempts on delete marker before underlying object version could have synced.	2022-10-13 17:45:23 -07:00
Harshavardhana	97112c69be	fix: replication stats() to not crash under any situation (#15851 ) Co-authored-by: Daniel Valdivia <18384552+dvaldivia@users.noreply.github.com>	2022-10-12 15:47:41 -07:00
Anis Elleuch	e856e10ac2	ignore VersionNotFound in addition to ObjectNotFound while replicating (#15814 )	2022-10-07 16:11:41 -07:00
Poorna	8ea6fb368d	Add auto configuration of replication workers (#15636 )	2022-09-24 16:20:28 -07:00
Harshavardhana	50a8ba6a6f	fix: parse and save retainUntilDate in correct time format (#15741 )	2022-09-23 08:49:27 -07:00
Harshavardhana	124544d834	add pre-conditions support for PUT calls during replication (#15674 ) PUT shall only proceed if pre-conditions are met, the new code uses - x-minio-source-mtime - x-minio-source-etag to verify if the object indeed needs to be replicated or not, allowing us to avoid StatObject() call.	2022-09-14 18:44:04 -07:00
Poorna	a0fb0c1835	panic if replication config could not be read from disk (#15685 ) If replication config could not be read from bucket metadata for some reason, issue a panic so that unexpected replication outcomes can be avoided for replicated buckets. For similar reasons, adding a panic while fetching object-lock config if it failed for reason other than non-existence of config.	2022-09-13 21:23:33 -07:00
Poorna	6b9fd256e1	Persist in-memory replication stats to disk (#15594 ) to avoid relying on scanner-calculated replication metrics. This will improve the accuracy of the replication stats reported. This PR also adds on to #15556 by handing replication traffic that could not be queued by available workers to the MRF queue so that entries in `PENDING` status are healed faster.	2022-09-12 12:40:02 -07:00
Anis Elleuch	cf52691959	Save resync status in the backend using a last update timestamp (#15638 ) Currently, there is a short time window where the code is allowed to save the status of a replication resync. Currently, the window is `now.Sub(st.EndTime) <= resyncTimeInterval`. Also, any failure to write in the backend disks is not retried. Refactor the code a little bit to rely on the last timestamp of a successful write of the resync status of any given bucket in the backend disks.	2022-09-01 16:53:36 -07:00
Anis Elleuch	10e75116ef	Avoid replicating dirs in listing with replication enabled (#15641 ) When replication is enabled in a particular bucket, the listing will send objects to bucket replication, but it is also sending prefixes for non recursive listing which is useless and shows a lot of error logs. This commit will ignore prefixes.	2022-09-01 15:22:11 -07:00
Harshavardhana	433b6fa8fe	upgrade golang-lint to the latest (#15600 )	2022-08-26 12:52:29 -07:00
Harshavardhana	edba7c987b	fix: objects matching prefixes should not leave delete markers (#15586 ) This is needed to ensure that we do not leave prefixes where version is suspended, instead we never leave versions on these paths.	2022-08-24 13:46:29 -07:00
Poorna	4155c5b695	replication: improve MRF healing. (#15556 ) This PR improves the replication failure healing by persisting most recent failures to disk and re-queuing them until the replication is successful. While this does not eliminate the need for healing during a full scan, queuing MRF vastly improves the ETA to keeping replicated buckets in sync as it does not wait for the scanner visit to detect unreplicated object versions.	2022-08-22 16:53:06 -07:00
Harshavardhana	ae4ee95d25	change default lock retry interval to 50ms (#15560 ) competing calls on the same object on versioned bucket mutating calls on the same object may unexpected have higher delays. This can be reproduced with a replicated bucket overwriting the same object writes, deletes repeatedly. For longer locks like scanner keep the 1sec interval	2022-08-19 16:21:05 -07:00
Harshavardhana	e9055e9ef7	fix: walk() should cancel itself upon context cancellation (#15553 ) This PR fixes possible leaks that may emanate from not listening on context cancelation or timeouts. ``` goroutine 60957610 [chan send, 16 minutes]: github.com/minio/minio/cmd.(erasureServerPools).Walk.func1.1.1(...) github.com/minio/minio/cmd/erasure-server-pool.go:1724 +0x368 github.com/minio/minio/cmd.listPathRaw({0x4a9a740, 0xc0666dffc0},... github.com/minio/minio/cmd/metacache-set.go:1022 +0xfc4 github.com/minio/minio/cmd.(erasureServerPools).Walk.func1.1() github.com/minio/minio/cmd/erasure-server-pool.go:1764 +0x528 created by github.com/minio/minio/cmd.(*erasureServerPools).Walk.func1 github.com/minio/minio/cmd/erasure-server-pool.go:1697 +0x1b7 ```	2022-08-18 17:49:08 -07:00
Poorna	21fe14201f	replication: centralize healthcheck for remote targets (#15516 ) This PR moves health check from minio-go client to being managed on the server. Additionally integrating health check into site replication	2022-08-16 17:46:22 -07:00
Poorna	21bf5b4db7	replication: heal proactively upon access (#15501 ) Queue failed/pending replication for healing during listing and GET/HEAD API calls. This includes healing of existing objects that were never replicated or those in the middle of a resync operation. This PR also fixes a bug in ListObjectVersions where lifecycle filtering should be done.	2022-08-09 15:00:24 -07:00
ebozduman	b57e7321e7	Replaces 'disk'=>'drive' visible to end user (#15464 )	2022-08-04 16:10:08 -07:00
Poorna	5e0776e96a	replication: Include replica object versions for resync (#15427 )	2022-07-28 13:43:02 -07:00
Harshavardhana	5e763b71dc	use logger.LogOnce to reduce printing disconnection logs (#15408 ) fixes #15334 - re-use net/url parsed value for http.Request{} - remove gosimple, structcheck and unusued due to https://github.com/golangci/golangci-lint/issues/2649 - unwrapErrs upto leafErr to ensure that we store exactly the correct errors	2022-07-27 09:44:59 -07:00
Poorna	cab8d3d568	feat: add API to return list of objects waiting to be replicated (#15091 )	2022-07-21 11:05:44 -07:00
Poorna	b4f6901903	resync: Avoid concurrent access/write on map (#15286 ) fixes a crash ``` fatal error: concurrent map iteration and map write minio[19309]: goroutine 18640 [running]: minio[19309]: runtime.throw({0x27a3399?, 0x1785?}) minio[19309]: runtime/panic.go:992 +0x71 fp=0xc0062f1c80 sp=0xc0062f1c50 pc=0x438671 minio[19309]: runtime.mapiternext(0xc0062f1e90?) minio[19309]: runtime/map.go:871 +0x4eb fp=0xc0062f1cf0 sp=0xc0062f1c80 pc=0x41002b minio[19309]: github.com/minio/minio/cmd.(*ReplicationPool).periodicResyncMetaSave(0xc0056c00c0, {0x4d06a48, 0xc0005b2480}, {0x4d22fc0, 0xc0015ea0 ```	2022-07-13 16:29:10 -07:00
Harshavardhana	0a8b78cb84	fix: simplify passing auditLog eventType (#15278 ) Rename Trigger -> Event to be a more appropriate name for the audit event. Bonus: fixes a bug in AddMRFWorker() it did not cancel the waitgroup, leading to waitgroup leaks.	2022-07-12 10:43:32 -07:00
Harshavardhana	31c4fdbf79	fix: resyncing 'null' version on pre-existing content (#15043 ) PR #15041 fixed replicating 'null' version however due to a regression from #14994 caused the target versions for these 'null' versioned objects to have different 'versions', this may cause confusion with bi-directional replication and cause double replication. This PR fixes this properly by making sure we replicate the correct versions on the objects.	2022-06-06 15:14:56 -07:00
Harshavardhana	48e367ff7d	reject resync start on misconfigured replication rules (#15041 ) we expect resync to start on buckets with replication rule ExistingObjects enabled, if not we reject such calls.	2022-06-06 02:54:39 -07:00
Harshavardhana	52221db7ef	fix: for unexpected errors in reading versioning config panic (#14994 ) We need to make sure if we cannot read bucket metadata for some reason, and bucket metadata is not missing and returning corrupted information we should panic such handlers to disallow I/O to protect the overall state on the system. In-case of such corruption we have a mechanism now to force recreate the metadata on the bucket, using `x-minio-force-create` header with `PUT /bucket` API call. Additionally fix the versioning config updated state to be set properly for the site replication healing to trigger correctly.	2022-05-31 02:57:57 -07:00
Harshavardhana	f1abb92f0c	feat: Single drive XL implementation (#14970 ) Main motivation is move towards a common backend format for all different types of modes in MinIO, allowing for a simpler code and predictable behavior across all features. This PR also brings features such as versioning, replication, transitioning to single drive setups.	2022-05-30 10:58:37 -07:00
Poorna	5c81d0d89a	site replication: heal missing/invalid replication config (#14979 ) Validate remote target ARNs and heal any stale rules in the replication config	2022-05-26 17:57:23 -07:00
Harshavardhana	f8650a3493	fetch bucket replication stats across peers in single call (#14956 ) current implementation relied on recursively calling one bucket at a time across all peers, this would be very slow and chatty when there are 100's of buckets which would mean 100*peerCount amount of network operations. This PR attempts to reduce this entire call into `peerCount` amount of network calls only. This functionality addresses also a concern where the Prometheus metrics would significantly slow down when one of the peers is offline.	2022-05-23 09:15:30 -07:00
Harshavardhana	6cfb1cb6fd	fix: timer usage across codebase (#14935 ) it seems in some places we have been wrongly using the timer.Reset() function, nicely exposed by an example shared by @donatello https://go.dev/play/p/qoF71_D1oXD this PR fixes all the usage comprehensively	2022-05-17 22:42:59 -07:00
Harshavardhana	62aa42cccf	avoid replication proxy on version excluded paths (#14878 ) no need to attempt proxying objects that were never replicated, but do have local `null` versions on them.	2022-05-08 16:50:31 -07:00
Harshavardhana	5cffd3780a	fix: multiple fixes in prefix exclude implementation (#14877 ) - do not need to restrict prefix exclusions that do not have `/` as suffix, relax this requirement as spark may have staging folders with other autogenerated characters , so we are better off doing full prefix March and skip. - multiple delete objects was incorrectly creating a null delete marker on a versioned bucket instead of creating a proper versioned delete marker. - do not suspend paths on the excluded prefixes during delete operations to avoid creating `null` delete markers, honor suspension of versioning only at bucket level for delete markers.	2022-05-07 22:06:44 -07:00
Krishnan Parthasarathi	ad8e611098	feat: implement prefix-level versioning exclusion (#14828 ) Spark/Hadoop workloads which use Hadoop MR Committer v1/v2 algorithm upload objects to a temporary prefix in a bucket. These objects are 'renamed' to a different prefix on Job commit. Object storage admins are forced to configure separate ILM policies to expire these objects and their versions to reclaim space. Our solution: This can be avoided by simply marking objects under these prefixes to be excluded from versioning, as shown below. Consequently, these objects are excluded from replication, and don't require ILM policies to prune unnecessary versions. - MinIO Extension to Bucket Version Configuration ```xml <VersioningConfiguration xmlns="http://s3.amazonaws.com/doc/2006-03-01/"> <Status>Enabled</Status> <ExcludeFolders>true</ExcludeFolders> <ExcludedPrefixes> <Prefix>app1-jobs//_temporary/</Prefix> </ExcludedPrefixes> <ExcludedPrefixes> <Prefix>app2-jobs//__magic/</Prefix> </ExcludedPrefixes> <!-- .. up to 10 prefixes in all --> </VersioningConfiguration> ``` Note: `ExcludeFolders` excludes all folders in a bucket from versioning. This is required to prevent the parent folders from accumulating delete markers, especially those which are shared across spark workloads spanning projects/teams. - To enable version exclusion on a list of prefixes ``` mc version enable --excluded-prefixes "app1-jobs//_temporary/,app2-jobs//_magic," --exclude-prefix-marker myminio/test ```	2022-05-06 19:05:28 -07:00
Poorna	3a64580663	Add support for site replication healing (#14572 ) heal bucket metadata and IAM entries for sites participating in site replication from the site with the most updated entry. Co-authored-by: Harshavardhana <harsha@minio.io> Co-authored-by: Aditya Manthramurthy <aditya@minio.io>	2022-04-24 02:36:31 -07:00
Krishnan Parthasarathi	7b81967a3c	Fix handling of object versions pending purge (#14555 ) - GetObject() with vid should return 405 - GetObject() without vid should return 404 - ListObjects() should ignore this object if this is the "latest" version of the object - ListObjectVersions() should list this object as "DELETE marker" - Remove data parts before sync'ing the version pending purge	2022-03-16 16:59:43 -07:00
Poorna	1e39ca39c3	fix: consistent replies for incorrect range requests on replicated buckets (#14345 ) Propagate error from replication proxy target correctly to the client if range GET is unsatisfiable.	2022-03-08 13:58:55 -08:00
Poorna	ed3418c046	Refactor replication resync to be an active process (#14266 ) When resync is triggered, walk the bucket namespace and resync objects that are unreplicated. This PR also adds an API to report resync progress.	2022-02-10 10:16:52 -08:00
Poorna	288e276abe	Specify tags in options while selecting replication targets (#14126 ) When the replication rule is based on tag matches, the replication process should pick up targets matching the tags specified in the replication rule. Fixing regression due to #12880	2022-01-19 10:45:42 -08:00
Harshavardhana	cc3f139d1f	replication: attempt abort multipart-upload at max 3 times on remote (#14087 ) this is mainly an attempt to relinquish space on the remote site, if this still doesn't do it we give and let the admin know with a log message.	2022-01-11 22:32:29 -08:00
Poorna	54a98773f8	fix: replication of tag removal (#14056 ) Currently tag removal leaves replication state as `PENDING` because the `HEAD` api returns just a tag count but not the actual tags, and this is treated as a no-op	2022-01-10 19:06:10 -08:00
Harshavardhana	f527c708f2	run gofumpt cleanup across code-base (#14015 )	2022-01-02 09:15:06 -08:00
Aditya Manthramurthy	997e808088	fix; race in bucket replication stats (#13942 ) - r.ulock was not locked when r.UsageCache was being modified Bonus: - simplify code by removing some unnecessary clone methods - we can do this because go arrays are values (not pointers/references) that are automatically copied on assignment. - remove some unnecessary map allocation calls	2021-12-17 15:33:13 -08:00
Poorna K	e270ab65b3	fix: healing of replication delete markers (#13933 ) A corner case can occur where the delete-marker was propagated but the metadata could not be updated on the primary. Sending a RemoveObject call with the Delete marker version would end up permanently deleting the version on target. Instead, perform a Stat on the delete-marker version on target and redo replication only if the delete-marker is missing on target.	2021-12-16 15:34:55 -08:00
Poorna K	d422d24278	replication: warn if insufficient workers (#13899 ) This should give an early warning if configured replication workers are insufficient to meet application workload.	2021-12-13 18:22:56 -08:00

1 2 3 4 5 ...

255 Commits