minio

mirror of https://github.com/minio/minio.git synced 2024-12-27 23:55:56 -05:00

Author	SHA1	Message	Date
Harshavardhana	dd2542e96c	add codespell action (#18818 ) Original work here, #18474, refixed and updated.	2024-01-17 23:03:17 -08:00
Harshavardhana	38637897ba	fix: listing SSE encrypted multipart objects (#18786 ) GetActualSize() was heavily relying on o.Parts() to be non-empty to figure out if the object is multipart or not, However, we have many indicators of whether an object is multipart or not. Blindly assuming that o.Parts == nil is not a multipart, is an incorrect expectation instead, multipart must be obtained via - Stored metadata value indicating this is a multipart encrypted object. - Rely on <meta>-actual-size metadata to get the object's actual size. This value is preserved for additional reasons such as these. - ETag != 32 length	2024-01-15 00:57:49 -08:00
Anis Eleuch	04135fa6cd	audit: Add the drives where the dangling object is removed (#18737 )	2024-01-05 14:17:24 -08:00
Harshavardhana	a50ea92c64	feat: introduce list_quorum="auto" to prefer quorum drives (#18084 ) NOTE: This feature is not retro-active; it will not cater to previous transactions on existing setups. To enable this feature, please set ` _MINIO_DRIVE_QUORUM=on` environment variable as part of systemd service or k8s configmap. Once this has been enabled, you need to also set `list_quorum`. ``` ~ mc admin config set alias/ api list_quorum=auto` ``` A new debugging tool is available to check for any missing counters.	2023-12-29 15:52:41 -08:00
Anis Eleuch	8a0ba093dd	audit: Fix merrs and derrs object dangling message (#18714 ) merrs and derrs are empty when a dangling object is deleted. Fix the bug and adds invalid-meta data for data blocks	2023-12-27 22:27:04 -08:00
Harshavardhana	7c948adf88	allow pre-allocating buffers to reduce frequent GCs during growth (#18686 ) This PR also increases per node bpool memory from 1024 entries to 2048 entries; along with that, it also moves the byte pool centrally instead of being per pool.	2023-12-21 08:59:38 -08:00
Harshavardhana	196e7e072b	allow bitrot files to be healed in MRF (#18618 ) bitrot scanMode was ignored in MRF, allow it to heal relevant content if needed when seen as an error.	2023-12-08 12:26:01 -08:00
Harshavardhana	45b7253f39	parallelize renameData() cleanup upon error (#18591 )	2023-12-04 14:54:34 -08:00
Harshavardhana	8fdfcfb562	upon RenameData() quorum error delete any partial success (#18586 ) there is potential for danglingWrites when quorum failed, where only some drives took a successful write, generally this is left to the healing routine to pick it up. However it is better that we delete it right away to avoid potential for quorum issues on version signature when there are many versions of an object.	2023-12-04 11:33:39 -08:00
Harshavardhana	e7c144eeac	avoid double MRF heal when there is versions disparity (#18585 )	2023-12-04 11:13:50 -08:00
Anis Eleuch	b7d11141e1	rename Force to Immediate for clarity (#18540 )	2023-11-28 22:35:16 -08:00
Klaus Post	dc88865908	fix: shadowed error in getObjectFileInfo() (#18548 ) This will result in `done <- err == nil` always returning true for this path, which seems unintentional.	2023-11-28 09:47:41 -08:00
Harshavardhana	506f121576	remove frivolous logging in transition object (#18526 ) AWS S3 closes keep-alive connections frequently leading to frivolous logs filling up the MinIO logs when the transition tier is an AWS S3 bucket. Ignore such transient errors, let MinIO retry it when it can.	2023-11-26 22:18:09 -08:00
Harshavardhana	fba883839d	feat: bring new HDD related performance enhancements (#18239 ) Optionally allows customers to enable - Enable an external cache to catch GET/HEAD responses - Enable skipping disks that are slow to respond in GET/HEAD when we have already achieved a quorum	2023-11-22 13:46:17 -08:00
Harshavardhana	a4cfb5e1ed	return errors if dataDir is missing during HeadObject() (#18477 ) Bonus: allow replication to attempt Deletes/Puts when the remote returns quorum errors of some kind, this is to ensure that MinIO can rewrite the namespace with the latest version that exists on the source.	2023-11-20 21:33:47 -08:00
Anis Eleuch	22d59e757d	Remove stale data in HEAD/GET object (#18460 ) Currently if the object does not exist in quorum disks of an erasure set, the dangling code is never called because the returned error will be errFileNotFound or errFileVersionNotFound; With this commit, when errFileNotFound or errFileVersionNotFound is returning when trying to calculate the quorum of a given object, the code checks if a disk returned nil, which means a stale object exists in that disk, that will trigger deleteIfDangling() function	2023-11-16 08:39:53 -08:00
Harshavardhana	0663eb69ed	fix: do not preserve mtime during CopyObject() metadata updates (#18316 ) mtime must be preserved only if destination mtime is set. fixes #18314	2023-10-25 14:30:56 -07:00
Harshavardhana	5c8339e1e8	fix: veeam SOS API to higher layers (#18287 ) - support populating usage info from scanner info - support populating quota for the bucket via quota settings for the bucket	2023-10-23 13:55:45 -07:00
Aditya Manthramurthy	b3e7de010d	Remove usage of errors.Join for go1.19 compat (#18243 )	2023-10-13 15:14:16 -07:00
Harshavardhana	77e94087cf	fix: calling statfs() call moves the disk head (#18203 ) if erasure upgrade is needed rely on the in-memory values, instead of performing a "DiskInfo()" call. https://brendangregg.com/blog/2016-09-03/sudden-disk-busy.html for HDDs these are problematic, lets avoid this because there is no value in "being" absolutely strict here in terms of parity. We are okay to increase parity as we see based on the in-memory online/offline ratio.	2023-10-10 13:47:35 -07:00
Poorna	9dc29d7687	Avoid ILM expiry on deleted versions that are yet to replicate (#18175 ) Fixes #18167	2023-10-06 06:55:15 -06:00
Harshavardhana	c34bdc33fb	make sure to set Versioned field to ensure rename2 is not called (#18141 ) without this the rename2() can rename the previous dataDir causing issues for different versions of the object, only latest version is preserved due to this bug. Added healing code to ensure recovery of such content.	2023-09-29 09:08:24 -07:00
Harshavardhana	3c470a6b8b	fix: the inspect script to use scheme per deployment (#18118 )	2023-09-27 08:22:50 -07:00
jiuker	9947c01c8e	feat: SSE-KMS use uuid instead of read all data to md5. (#17958 )	2023-09-18 10:00:54 -07:00
Harshavardhana	9f7044aed0	fix: ignore transient errors in read path (#18006 ) Errors such as ``` returned an error (context deadline exceeded) (fmt.wrapError) ``` ``` (msgp: too few bytes left to read object) (fmt.wrapError) ```	2023-09-11 15:29:59 -07:00
Aditya Manthramurthy	1c99fb106c	Update to minio/pkg/v2 (#17967 )	2023-09-04 12:57:37 -07:00
Harshavardhana	3995355150	avoid repeated large allocations for large parts (#17968 ) objects with 10,000 parts and many of them can cause a large memory spike which can potentially lead to OOM due to lack of GC. with previous PR reducing the memory usage significantly in #17963, this PR reduces this further by 80% under repeated calls. Scanner sub-system has no use for the slice of Parts(), it is better left empty. ``` benchmark old ns/op new ns/op delta BenchmarkToFileInfo/ToFileInfo-8 295658 188143 -36.36% benchmark old allocs new allocs delta BenchmarkToFileInfo/ToFileInfo-8 61 60 -1.64% benchmark old bytes new bytes delta BenchmarkToFileInfo/ToFileInfo-8 1097210 227255 -79.29% ```	2023-09-02 07:49:24 -07:00
Harshavardhana	18b3655c99	with xlv2 format we never had to fill in checksumInfo() (#17963 ) - this PR avoids sending a large ChecksumInfo slice when its not needed - also for a file with XLV2 format there is no reason to allocate Checksum slice while reading	2023-09-01 13:45:58 -07:00
Harshavardhana	8a57b6bced	use renameat2 Linux extension syscall (#17757 ) this is a faster and safer alternative on newer kernel versions.	2023-08-27 09:57:11 -07:00
Harshavardhana	124e28578c	remove strict persistence requirements for List() .metacache objects (#17917 ) .metacache objects are transient in nature, and are better left to use page-cache effectively to avoid using more IOPs on the disks. this allows for incoming calls to be not taxed heavily due to multiple large batch listings.	2023-08-25 07:58:11 -07:00
Harshavardhana	c45bc32d98	skip disks under scanning when healing disks (#17822 ) Bonus: - avoid calling DiskInfo() calls when missing blocks instead heal the object using MRF operation. - change the max_sleep to 250ms beyond that we will not stop healing.	2023-08-09 12:51:47 -07:00
Harshavardhana	81be718674	fix: optimize DiskInfo() call avoid metrics when not needed (#17763 )	2023-07-31 15:20:48 -07:00
Harshavardhana	e7b60c4d65	Add slow drive timeouts to match with active disk monitoring (#17701 ) allow active disk-monitoring to be configurable, and use these add deadlines in various call layers for various syscalls.	2023-07-25 16:58:31 -07:00
Harshavardhana	a566bcf613	treat 0-byte objects to honor same quorum as delete marker (#17633 ) on unversioned buckets its possible that 0-byte objects might lose quorum on flaky systems, allow them to be same as DELETE markers. Since practically speak they have no content.	2023-07-11 21:53:49 -07:00
Kaan Kabalak	f64d62b01d	Fix style of logOnceIf calls w/unique identifiers (#17631 )	2023-07-11 13:17:45 -07:00
Poorna	e8c98c3246	Avoid extra GetObjectInfo call in DeleteObject API (#17599 ) Optimize DeleteObject API to avoid extra GetObjectInfo call on the replicating side. For receiving side, it is just a regular DeleteObject call. Bonus: Fix a corner case where version purged is absent on target (either due to replication not yet complete or target version already deleted in a one-way replication or when replication was disabled). In such cases, mark version purge complete.	2023-07-10 07:57:56 -07:00
Klaus Post	ff5988f4e0	Reduce allocations (#17584 ) * Reduce allocations * Add stringsHasPrefixFold which can compare string prefixes, while ignoring case and not allocating. * Reuse all msgp.Readers * Reuse metadata buffers when not reading data. * Make type safe. Make buffer 4K instead of 8. * Unslice	2023-07-06 16:02:08 -07:00
Kaan Kabalak	21fbe88e1f	Print certain log messages once per error (#17484 )	2023-06-24 20:29:13 -07:00
Harshavardhana	1f8b9b4bd5	fix: do not listAndHeal() inline with PutObject() (#17499 ) there is a possibility that slow drives can actually add latency to the overall call, leading to a large spike in latency. this can happen if there are other parallel listObjects() calls to the same drive, in-turn causing each other to sort of serialize. this potentially improves performance and makes PutObject() also non-blocking.	2023-06-24 19:31:04 -07:00
Harshavardhana	02c2ec3027	skip onlineDisks with parity mismatch (#17478 )	2023-06-20 13:18:24 -07:00
Aditya Manthramurthy	5a1612fe32	Bump up madmin-go and pkg deps (#17469 )	2023-06-19 17:53:08 -07:00
Harshavardhana	1443b5927a	allow quorum fileInfo to pick same parityBlocks (#17454 ) Bonus: allow replication to proceed for 503 errors such as with error code SlowDownRead	2023-06-18 18:20:15 -07:00
Harshavardhana	64de61d15d	fallback on etags if they match when mtime is not same (#17424 ) on "unversioned" buckets there are situations when successive concurrent I/O can lead to an inconsistent state() with mtime while the etag might be the same for the object on disk. in such a scenario it is possible for us to allow reading of the object since etag matches and if etag matches we are guaranteed that we have enough copies the object will be readable and same. This PR allows fallback in such scenarios.	2023-06-17 19:18:20 -07:00
Harshavardhana	ad4e511026	do not save plain-text ETag when encryption is requested (#17427 ) fixes an issue under bucket replication could cause ETags for replicated SSE-S3 single part PUT objects, to fail as we would attempt a decryption while listing, or stat() operation.	2023-06-15 12:43:26 -07:00
Harshavardhana	54e544e03e	allow lookup()/head() operations on Veeam SOS objects (#17331 )	2023-06-01 15:26:26 -07:00
Harshavardhana	ef54200db7	offline drives more than 50% of total drives return error (#17252 )	2023-05-23 07:57:57 -07:00
Krishnan Parthasarathi	3e128c116e	Add lifecycle event source to audit log tags (#17248 )	2023-05-22 15:28:56 -07:00
Klaus Post	aaf1abc993	simplify HardLimitReader by using LimitReader for internal usage (#17218 )	2023-05-16 13:14:37 -07:00
Poorna	e07c2ab868	Use hash.NewLimitReader for internal multipart calls (#17191 )	2023-05-12 11:19:08 -07:00
Klaus Post	7fad0c8b41	Remove checksums from HTTP range request, add part checksums (#17105 )	2023-04-28 08:26:32 -07:00
Krishnan Parthasarathi	e7cac8acef	Add tags to auditLogLifecycle (#17081 )	2023-04-26 17:49:00 -07:00
Praveen raj Mani	72802a5972	Use 'minio/pkg/sync/errgroup' and 'minio/pkg/workers' (#17069 )	2023-04-25 22:57:40 -07:00
Harshavardhana	b1f3935c5b	allow ListObjects() when a prefix is an object (#17074 )	2023-04-25 22:41:54 -07:00
Krishnan Parthasarathi	fae9000304	heal: Pick maximally occuring modTime in quorum (#17071 )	2023-04-25 10:13:57 -07:00
Harshavardhana	84f31ed45d	simplify MRF, converge it to regular healing (#17026 )	2023-04-19 07:47:42 -07:00
Harshavardhana	6825bd7e75	fix: inlined objects don't need to honor long locks (#17039 )	2023-04-17 12:16:37 -07:00
Klaus Post	c133979b8e	Add part count to checksum (#17035 )	2023-04-14 09:44:45 -07:00
Poorna	cd6dec49c0	Add trace support for ilm activity (#16993 )	2023-04-11 19:22:32 -07:00
Harshavardhana	c06e0bfef9	set correct `Host:` value for replication event notification (#16984 )	2023-04-06 10:20:53 -07:00
Poorna	699a24f7e5	batch: validate versioning on src/tgt buckets (#16955 )	2023-04-04 10:50:11 -07:00
Harshavardhana	216a471bbb	on quorum DeleteObject() errors attempt an MRF (#16932 )	2023-03-31 08:15:41 -07:00
Harshavardhana	6c11dbffd5	add crash protection from backend modifications (#16846 )	2023-03-20 09:08:42 -07:00
Poorna	d1e775313d	support decommissioning of tiered objects (#16751 )	2023-03-16 07:48:05 -07:00
Harshavardhana	e0f4dd6027	remove unncessary logs from WalkDir(), PutObject() (#16818 )	2023-03-15 11:52:23 -07:00
ferhat elmas	714283fae2	cleanup ignored static analysis (#16767 )	2023-03-06 08:56:10 -08:00
Klaus Post	9acf1024e4	Remove bloom filter (#16682 ) Removes the bloom filter since it has so limited usability, often gets saturated anyway and adds a bunch of complexity to the scanner. Also removes a tiny bit of CPU by each write operation.	2023-02-24 09:03:31 +05:30
Harshavardhana	a0f06eac2a	add Veeam SOS API first implementation (#16688 )	2023-02-22 19:54:57 +05:30
Krishnan Parthasarathi	d136ac0596	Don't close transition task channel on server exit (#16627 )	2023-02-15 22:09:25 -08:00
Krishnan Parthasarathi	cea2ca8c8e	Add restore-status header for multipart objects (#16508 )	2023-01-31 07:53:45 +05:30
Harshavardhana	67fce4a5b3	fix: dangling delete() upon success should return 404 (#16494 )	2023-01-27 12:43:45 -08:00
Anis Elleuch	d98116559b	Use async healing in PutObject call (#16431 )	2023-01-19 00:54:22 -08:00
Harshavardhana	2937711390	fix: DeleteObject() API with versionId under replication (#16325 )	2022-12-28 22:48:33 -08:00
Anis Elleuch	acc9c033ed	debug: Add X-Amz-Request-ID to lock/unlock calls (#16309 )	2022-12-23 19:49:07 -08:00
Krishnan Parthasarathi	2fa35def2c	Fix DeleteObject when only free versions remain (#16289 )	2022-12-21 16:24:07 -08:00
Anis Elleuch	89db3fdb5d	Do not return an error when version disparity is detected (#16269 )	2022-12-16 08:52:12 -08:00
Harshavardhana	dfe73629a3	fix: delete marker discrepancies via DeleteObject() API (#16195 )	2022-12-08 18:15:16 -08:00
Aditya Manthramurthy	a30cfdd88f	Bump up madmin-go to v2 (#16162 )	2022-12-06 13:46:50 -08:00
Klaus Post	a713aee3d5	Run staticcheck on CI (#16170 )	2022-12-05 11:18:50 -08:00
Klaus Post	1cd875de1e	Persist updated metadata (#16160 )	2022-12-02 08:35:04 -08:00
Poorna	63fc6ba2cd	preserve replicated ETag properly on target (#16129 )	2022-11-26 14:43:32 -08:00
Harshavardhana	91f45c4aa6	avoid inconsistent versions healing when versions are large (#16066 )	2022-11-14 18:35:26 -08:00
Anis Elleuch	7260241511	Remove some logs caused by external apps (#16027 )	2022-11-08 13:29:05 -08:00
Harshavardhana	fd6f6fc8df	cleanup stale parent multipart directories (#15980 )	2022-11-01 08:00:02 -07:00
Harshavardhana	136d41775f	remove numAvailableDisks check as it doesn't serve any purpose (#15954 )	2022-10-27 09:05:24 -07:00
Poorna	7dd8b6c8ed	ensure ILM expiry creates non null deleteMarker for versioned bucket (#15947 )	2022-10-26 16:09:27 -07:00
Anis Elleuch	fc6c794972	Audit dangling object removal (#15933 )	2022-10-24 11:35:07 -07:00
Anis Elleuch	ac85c2af76	lifecycle: refactor rules filtering and tagging support (#15914 )	2022-10-21 10:46:53 -07:00
Harshavardhana	928feb0889	remove unused debug param from evalActionFromLifecycle (#15813 )	2022-10-07 10:24:12 -07:00
Harshavardhana	9e5853ecc0	optimize double reads by reusing results from checkUploadIDExists() (#15692 ) Move to using `xl.meta` data structure to keep temporary partInfo, this allows for a future change where we move to different parts to different drives.	2022-09-15 12:43:49 -07:00
Harshavardhana	124544d834	add pre-conditions support for PUT calls during replication (#15674 ) PUT shall only proceed if pre-conditions are met, the new code uses - x-minio-source-mtime - x-minio-source-etag to verify if the object indeed needs to be replicated or not, allowing us to avoid StatObject() call.	2022-09-14 18:44:04 -07:00
Harshavardhana	8e997eba4a	fix: trigger Heal when xl.meta needs healing during PUT (#15661 ) This PR is a continuation of the previous change instead of returning an error, instead trigger a spot heal on the 'xl.meta' and return only after the healing is complete. This allows for future GETs on the same resource to be consistent for any version of the object.	2022-09-07 07:25:39 -07:00
Harshavardhana	2d9b5a65f1	verify RenameData() versions to be consistent (#15649 ) xl.meta gets written and never rolled back, however we definitely need to validate the state that is persisted on the disk, if there are inconsistencies - more than write quorum we should return an error to the client - if write quorum was achieved however there are inconsistent xl.meta's we should simply trigger an MRF on them	2022-09-05 16:51:37 -07:00
Harshavardhana	5ea629beb2	avoid printing io.ErrUnexpectedEOF for .metacache objects (#15642 )	2022-09-02 12:47:17 -07:00
Klaus Post	8e4a45ec41	fix: encrypt checksums in metadata (#15620 )	2022-08-31 08:13:23 -07:00
Klaus Post	a9f1ad7924	Add extended checksum support (#15433 )	2022-08-29 16:57:16 -07:00
Poorna	471467d310	fix: ensure metadata update happens after deletemarker replication (#15564 ) Fixes regression caused by #15521	2022-08-22 15:59:06 -07:00
Harshavardhana	d350b666ff	feat: add idempotent delete marker support (#15521 ) The bottom line is delete markers are a nuisance, most applications are not version aware and this has simply complicated the version management. AWS S3 gave an unnecessary complication overhead for customers, they need to now manage these markers by applying ILM settings and clean them up on a regular basis. To make matters worse all these delete markers get replicated as well in a replicated setup, requiring two ILM settings on each site. This PR is an attempt to address this inferior implementation by deviating MinIO towards an idempotent delete marker implementation i.e MinIO will never create any more than single consecutive delete markers. This significantly reduces operational overhead by making versioning more useful for real data. This is an S3 spec deviation for pragmatic reasons.	2022-08-18 16:41:59 -07:00
Anis Elleuch	b3edb25377	bloom: healObject to mark a path dirty only for dangling objects (#15458 ) The path is marked dirty automatically when healObject() is called, which is wrong. HealObject() is called during self-healing and this will lead to an increase in the false positive result of the bloom filter. Also move NSUpdated() from renameData() and call it directly in CompleteMultipart and PutObject, this is not a functional change but it will make it less prone to errors in the future.	2022-08-02 16:57:39 -07:00
Harshavardhana	aa874010e2	fix: regression in resolving the right versions (#15430 ) fix: regression in resolving right versions commit `d480022711` caused a regression in real resolver, by picking up incorrect versionID.	2022-07-29 10:03:53 -07:00
Harshavardhana	ce8397f7d9	use partInfo only for intermediate part.x.meta (#15353 )	2022-07-19 18:56:24 -07:00

1 2 3 4 5 ...

328 Commits