minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	909b169593	avoid source index to be same as destination index (#20238 ) During rebalance stop, it is possible for Put() to race by overwriting the same object again. If that overwrite completes "successfully", it can potentially proceed to delete the object from the pool, causing data loss. This PR enhances #20233 to handle more scenarios such as these.	2024-08-09 19:30:44 -07:00
jiuker	f7ff19cb18	fix: warning for decommissioned pool while start (#20019 )	2024-07-01 07:38:46 -07:00
Anis Eleuch	b94dd835c9	decom: Fix CurrentSize output when generating the status (#19883 ) StartSize starts with the raw free space of all disks in the given pool, however during the status, CurrentSize is not showing the current free raw space, as expected at least by `mc admin decom status` since it was written.	2024-06-06 07:30:43 -07:00
Klaus Post	0a63dc199c	Add trace sizes to more trace types (#19864 ) Add trace sizes to * ILM traces * Replication traces * Healing traces * Decommission traces * Rebalance traces * (s)ftp traces * http traces.	2024-06-03 08:45:54 -07:00
Aditya Manthramurthy	5f78691fcf	ldap: Add user DN attributes list config param (#19758 ) This change uses the updated ldap library in minio/pkg (bumped up to v3). A new config parameter is added for LDAP configuration to specify extra user attributes to load from the LDAP server and to store them as additional claims for the user. A test is added in sts_handlers.go that shows how to access the LDAP attributes as a claim. This is in preparation for adding SSH pubkey authentication to MinIO's SFTP integration.	2024-05-24 16:05:23 -07:00
Klaus Post	dbfb5e797b	Wait one minute after startup to restart decommissioning (#19645 ) Typically not all drives are connected, so we delay 3 minutes before resuming. This greatly reduces risk of starting to list unconnected drives, or drives we risk being disconnected soon. This delay is not applied when starting with an admin call.	2024-05-01 08:18:21 -07:00
Harshavardhana	95c65f4e8f	do not panic on rebalance during server restarts (#19563 ) This PR makes a feasible approach to handle all the scenarios that we must face to avoid returning "panic." Instead, we must return "errServerNotInitialized" when a bucketMetadataSys.Get() is called, allowing the caller to retry their operation and wait. Bonus fix the way data-usage-cache stores the object. Instead of storing usage-cache.bin with the bucket as `.minio.sys/buckets`, the `buckets` must be relative to the bucket `.minio.sys` as part of the object name. Otherwise, there is no way to decommission entries at `.minio.sys/buckets` and their final erasure set positions. A bucket must never have a `/` in it. Adds code to read() from existing data-usage.bin upon upgrade.	2024-04-22 10:49:30 -07:00
Anis Eleuch	95bf4a57b6	logging: Add subsystem to log API (#19002 ) Create new code paths for multiple subsystems in the code. This will make maintaining this easier later. Also introduce bugLogIf() for errors that should not happen in the first place.	2024-04-04 05:04:40 -07:00
Praveen raj Mani	ae4fb1b72e	Prioritize the bucket configs first during the decommissioning (#19393 )	2024-04-01 23:48:26 -07:00
Anis Eleuch	9370b11684	decom: Fix failed status after a failed decommission (#19300 ) When returning the status of a decommissioned pool, a pool with zero time StartedTime will be considered an active pool, which is unexpected. This commit will always ensure that a pool's canceled/failed/completed status is returned.	2024-03-19 20:09:59 -07:00
Harshavardhana	7213bd7131	add additional logs for the decom during metadata save (#19288 )	2024-03-18 15:25:45 -07:00
Harshavardhana	74ccee6619	avoid too much auditing during decom/rebalance make it more robust (#19174 ) there can be a sudden spike in tiny allocations, due to too much auditing being done, also don't hang on the `h.logCh <- entry` after initializing workers if you do not have a way to dequeue for some reason.	2024-03-06 03:43:16 -08:00
Anis Eleuch	68dde2359f	log: Add logger.Event to send to console and other logger targets (#19060 ) Add a new function logger.Event() to send the log to Console and http/kafka log webhooks. This will include some internal events such as disk healing and rebalance/decommissioning	2024-02-15 15:13:30 -08:00
Anis Eleuch	6ae97aedc9	xl: Disable rename2 in decommissioning/rebalance (#18964 ) Always disable rename2 optimization in decom/rebalance	2024-02-03 14:03:30 -08:00
Harshavardhana	32e668eb94	update() stale rebalance stats() object during pool expansion (#18882 ) it is entirely possible that a rebalance process which was running when it was asked to "stop" failed to write its last statistics to the disk. After this, a pool expansion can cause disruption and all S3 API calls would fail at the IsPoolRebalancing() function. This PR makes sure that we update rebalance.bin under such conditions to avoid any runtime crashes.	2024-01-27 10:14:03 -08:00
Harshavardhana	da55499db0	fix: reject clients that do not send proper payload (#18701 )	2023-12-22 01:26:17 -08:00
Harshavardhana	109a9e3f35	skip ILM expired objects from healing (#18569 )	2023-12-01 07:56:24 -08:00
Anis Eleuch	8317557f70	decom: Fix listing quorum to be equal to deletion quorum (#18476 ) With an odd number of drives per erasure set setup, the write/quorum is the half + 1; however the decommissioning listing will still list those objects and does not consider those as stale. Fix it by using (N+1)/2 formula. Co-authored-by: Anis Elleuch <anis@min.io>	2023-11-17 21:09:09 -08:00
Klaus Post	9a877734b2	Fix various poolmeta races (#18230 ) There is a fundamental race condition in `newErasureServerPools`, where setObjectLayer is called before the poolMeta has been loaded/populated. We add a placeholder value to this field but disable all saving of the value, so we don't risk overwriting the value on disk. Once the value has been loaded or created, it is replaced with the proper value, which will also be saved. Also fixes various accesses of `poolMeta` that were done without locks. We make the `poolMeta.IsSuspended` return false, even if we shouldn't risk out-of-bounds reads anymore.	2023-10-12 15:30:42 -07:00
Poorna	9dc29d7687	Avoid ILM expiry on deleted versions that are yet to replicate (#18175 ) Fixes #18167	2023-10-06 06:55:15 -06:00
Anis Eleuch	22d2dbc4e6	decom: Fix infinite retry when the decom is canceled (#18143 ) Also, use rand.Float64() since it is thread-safe; otherwise go race will complain.	2023-09-30 00:02:29 -07:00
jiuker	9947c01c8e	feat: SSE-KMS use uuid instead of read all data to md5. (#17958 )	2023-09-18 10:00:54 -07:00
Harshavardhana	fa6d082bfd	reduce all major allocations in replication path (#18032 ) - remove targetClient for passing around via replicationObjectInfo{} - remove cloning to object info unnecessarily - remove objectInfo from replicationObjectInfo{} (only require necessary fields)	2023-09-16 02:28:06 -07:00
Aditya Manthramurthy	1c99fb106c	Update to minio/pkg/v2 (#17967 )	2023-09-04 12:57:37 -07:00
Harshavardhana	bddd53d6d2	fix: retry listing in decommissioning if it fails perpetually (#17682 )	2023-07-19 13:09:37 -07:00
Harshavardhana	dfd7cca0d2	fix: allow cancel of decom only when its in progress (#17607 )	2023-07-10 07:55:38 -07:00
Harshavardhana	d2f5c3621f	fix: add additional decommission traces for ILM expired content (#17522 ) current decommission traces were missing for - Skipped ILM expired versions - Skipped single DELETE marked version - A success or failure in decommissioning DELETE marker - allow additional info to be shared in DecomStatus() API	2023-06-27 11:59:40 -07:00
Harshavardhana	eefa047974	fix: keep decommission in a go-routine (#17496 ) This was removed by mistake in #17491	2023-06-23 12:29:32 -07:00
Harshavardhana	d315d012a4	decom: during multiple pool decom preserve current pool status (#17491 ) removal of completed pools must retain pool status of other pools in draining, to resume any remaining draining operations.	2023-06-23 07:44:18 -07:00
Harshavardhana	9af6c6ceef	under rebalance look for expired versions v/s remaining versions (#17482 ) A continuation of PR #17479 for rebalance behavior must also match the decommission behavior. Fixes bug where rebalance would ignore rebalancing object versions after one of the versions returned "ObjectNotFound"	2023-06-21 13:23:20 -07:00
Harshavardhana	ccc5801112	always look for expired versions v/s remaining versions (#17479 ) while decommissioning it can so happen that the non-current versions are all expired but there is a DEL marker as the latest version. For such objects, we should not decommission them instead calculate the remaining versions and if the remaining versions is one and that version is a DEL marker consider such an object not to be scheduled for decommissioning.	2023-06-21 08:49:28 -07:00
Harshavardhana	15911c85f6	safely ignore out of band deletions while decommissioning (#17473 )	2023-06-20 08:31:42 -07:00
Aditya Manthramurthy	5a1612fe32	Bump up madmin-go and pkg deps (#17469 )	2023-06-19 17:53:08 -07:00
Anis Eleuch	38342b1df5	decom: Parallelize decommissioning (#17364 )	2023-06-07 14:27:51 -07:00
Anis Eleuch	1436858347	log: Add a log when saving pool.bin fails (#17338 ) Co-authored-by: Anis Elleuch <anis@min.io>	2023-06-04 14:20:21 -07:00
Harshavardhana	b210ea79bc	do not save MTime in newMultipartUpload() to avoid side-effects (#17340 )	2023-06-02 14:38:09 -07:00
Harshavardhana	9b5829c16e	avoid decommissioning DEL markers with single versions (#17274 )	2023-05-25 09:18:49 -07:00
Krishnan Parthasarathi	3e128c116e	Add lifecycle event source to audit log tags (#17248 )	2023-05-22 15:28:56 -07:00
Harshavardhana	06557fe8be	allow decommissioned pools to be removed while others are finishing (#17221 )	2023-05-16 16:00:57 -07:00
Klaus Post	aaf1abc993	simplify HardLimitReader by using LimitReader for internal usage (#17218 )	2023-05-16 13:14:37 -07:00
Poorna	e07c2ab868	Use hash.NewLimitReader for internal multipart calls (#17191 )	2023-05-12 11:19:08 -07:00
Anis Eleuch	883c98e26f	fix: remove objects when there are skipped versions due to ILM in decom (#17198 )	2023-05-12 10:37:38 -07:00
Harshavardhana	3637aad36e	do not count ILM expired objects and other skipped objects (#17184 )	2023-05-11 13:35:16 -07:00
Harshavardhana	b92cdea578	fix: start using pkg/workers to spawn parallel workers (#17170 )	2023-05-09 16:37:31 -07:00
Krishnan Parthasarathi	e7cac8acef	Add tags to auditLogLifecycle (#17081 )	2023-04-26 17:49:00 -07:00
Harshavardhana	6825bd7e75	fix: inlined objects don't need to honor long locks (#17039 )	2023-04-17 12:16:37 -07:00
Poorna	d1e775313d	support decommissioning of tiered objects (#16751 )	2023-03-16 07:48:05 -07:00
Harshavardhana	b984bf8d1a	allow expiration of all versions during Listing() (#16757 )	2023-03-09 15:15:30 -08:00
Harshavardhana	31188e9327	add parallel workers in batch replication (#16609 )	2023-02-13 12:07:58 -08:00
Florian Schwab	d67a846ec4	allow restarting of decommissioning if completed, failed or canceled (#16464 )	2023-01-24 07:07:59 -08:00
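
The message for commit 8317557f70 (#18476) mentions an (N+1)/2 formula for the decommission listing quorum on erasure sets with an odd number of drives. The sketch below only illustrates the arithmetic of that formula with Go integer division. The function names `listingQuorum` and `floorHalf` are hypothetical and are not taken from the MinIO source, and the plain N/2 value is shown purely for contrast; the commit message does not state which formula was used before the fix.

```go
package main

import "fmt"

// listingQuorum applies the (N+1)/2 formula quoted in the commit message.
// With integer division this is the ceiling of n/2.
// Hypothetical helper name, not MinIO's actual function.
func listingQuorum(n int) int {
	return (n + 1) / 2
}

// floorHalf is plain integer n/2, included only for comparison (assumption).
func floorHalf(n int) int {
	return n / 2
}

func main() {
	// Compare both values for even and odd drive counts per erasure set.
	for _, n := range []int{4, 5, 8, 9} {
		fmt.Printf("drives=%d  N/2=%d  (N+1)/2=%d\n", n, floorHalf(n), listingQuorum(n))
	}
}
```

For an even drive count the two values agree; for an odd count (N+1)/2 is one higher than N/2, which is the case the commit message singles out.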

101 Commits