minio

mirror of https://github.com/minio/minio.git synced 2025-11-28 13:09:09 -05:00

Author	SHA1	Message	Date
Harshavardhana	186c477f3c	init console server after server config is initialized fixes #14259	2022-02-07 00:17:33 -08:00
Harshavardhana	6123377e66	speedup getFormatErasureInQuorum use driveCount (#14239 ) startup speed-up: currently getFormatErasureInQuorum() would spend up to 2-3 secs when there are, for example, 3000+ drives in a setup; simplify this implementation to use drive counts.	2022-02-04 12:21:21 -08:00
Harshavardhana	0256dae657	fix: quorum requirement for DeleteMarkers and parity upgraded objects (#14248 ) DeleteMarkers do not have a default quorum, i.e. it is possible that DeleteMarkers were created with n/2+1 quorum as well. To satisfy such situations, make sure delete markers only expect n/2 read quorum. Additionally, look at the metadata on the actual objects, which might have been "erasure" upgraded with new parity when disks were down. In such a scenario do not default to the standard storage class parity; instead use the parityBlocks present on the FileInfo to ensure that we are dealing with the correct quorum for READs and DELETEs.	2022-02-04 02:47:36 -08:00
Harshavardhana	84b121bbe1	return error with empty x-amz-copy-source-range headers (#14249 ) fixes #14246	2022-02-03 16:58:27 -08:00
Harshavardhana	01e550a9be	ignore unreadable metrics on certain closed systems (#14234 ) fixes #14233	2022-02-03 09:45:12 -08:00
Poorna	63a2e0bab6	Remove notification from NotificationSys on bucket deletion (#14236 )	2022-02-02 17:11:56 -08:00
Harshavardhana	24657859a8	when o_direct is disabled do not attempt fadvise call (#14230 )	2022-02-02 08:54:52 -08:00
Sidhartha Mani	d7df6bc738	add support for speedtest drive (#14182 )	2022-02-01 22:38:05 -08:00
Poorna	a4e1de93a7	Add API for removing site(s) from site replication (#14104 )	2022-02-01 17:26:09 -08:00
Klaus Post	067d21d0f2	fs: Retry listing if no marker (#14221 ) Retry listings, when no next marker is returned and the result isn't truncated. This can happen when an object is queued, but no info can be fetched. Fixes #14190	2022-02-01 10:00:14 -08:00
Shireesh Anjal	3882da6ac5	Add subnet proxy config (#14225 ) Will store the HTTP(S) proxy URL to use for connecting to SUBNET.	2022-02-01 09:52:38 -08:00
Anis Elleuch	127e8bf3b6	heal: Avoid printing repetitive error to heal a root disk (#14220 ) The healing code repeatedly tries to heal a root disk when it is empty. The reason is that connectEndpoint() returns errUnformattedDisk even if the disk is a root disk. Returning a different error instead avoids queueing the disk to the healing code in each connect-disks iteration.	2022-01-31 17:28:20 -08:00
Harshavardhana	74faed166a	Add quota usage as part of prometheus metrics (#14222 ) Bonus: pass caller context when needed to all bucket metadata handling calls.	2022-01-31 17:27:43 -08:00
Harshavardhana	dbd05d6e82	remove FIFO bucket quota, use ILM expiration instead (#14206 )	2022-01-31 11:07:04 -08:00
Harshavardhana	b5d35c7e09	ignore disk metrics for single drive mode (#14212 ) fixes #14211	2022-01-31 00:44:26 -08:00
Poorna	0f88cdc80e	Return all stats in SiteReplicationStatus API if options unset (#14207 )	2022-01-28 21:19:38 -08:00
Poorna	38e3c7a8f7	Added filters for SiteReplicationStatus API to support new UI changes (#14177 )	2022-01-28 15:37:55 -08:00
Poorna	a4be47d7ad	Validate config before saving changes after config reset (#14203 )	2022-01-27 18:28:16 -08:00
Harshavardhana	aaea94a48d	update quorum requirement to list all objects (#14201 ) some upgraded objects might not get listed due to different quorum ratios across objects. make sure to list all objects that satisfy the maximum possible quorum.	2022-01-27 17:00:15 -08:00
Aditya Manthramurthy	c3d9c45f58	Ensure that AssumeRole calls are sent to Audit log (#14202 ) When authentication fails MinIO was not sending out an Audit log event for this STS call	2022-01-27 16:17:11 -08:00
Klaus Post	a2a48cc065	Optimize read locker cleanup (#14200 ) When objects hold a lot of read locks cleanup time grows exponentially. ``` BEFORE: Unable to complete tests. AFTER: === RUN Test_localLocker_expireOldLocksExpire/100-locks/1-read local-locker_test.go:298: Scan Took: 0s. Left: 100/100 local-locker_test.go:317: Expire 50% took: 0s. Left: 44/44 local-locker_test.go:331: Expire rest took: 0s. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/100-locks/100-read local-locker_test.go:298: Scan Took: 0s. Left: 10000/100 local-locker_test.go:317: Expire 50% took: 1ms. Left: 5000/100 local-locker_test.go:331: Expire rest took: 1ms. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/100-locks/1000-read local-locker_test.go:298: Scan Took: 2ms. Left: 100000/100 local-locker_test.go:317: Expire 50% took: 55ms. Left: 50038/100 local-locker_test.go:331: Expire rest took: 29ms. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/10000-locks/1-read local-locker_test.go:298: Scan Took: 1ms. Left: 10000/10000 local-locker_test.go:317: Expire 50% took: 2ms. Left: 5019/5019 local-locker_test.go:331: Expire rest took: 2ms. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/10000-locks/100-read local-locker_test.go:298: Scan Took: 23ms. Left: 1000000/10000 local-locker_test.go:317: Expire 50% took: 160ms. Left: 499798/10000 local-locker_test.go:331: Expire rest took: 138ms. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/10000-locks/1000-read local-locker_test.go:298: Scan Took: 200ms. Left: 10000000/10000 local-locker_test.go:317: Expire 50% took: 5.888s. Left: 5000196/10000 local-locker_test.go:331: Expire rest took: 3.417s. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/1000000-locks/1-read local-locker_test.go:298: Scan Took: 133ms. Left: 1000000/1000000 local-locker_test.go:317: Expire 50% took: 348ms. Left: 500255/500255 local-locker_test.go:331: Expire rest took: 307ms. Left: 0/0 ```	2022-01-27 14:10:57 -08:00
Harshavardhana	cf407f7176	do not expect 'speedtest' to be a bucket (#14199 ) fixes #14196	2022-01-27 08:13:03 -08:00
Harshavardhana	d6dd17a483	make sure to pass groups for all credentials while verifying policies (#14193 ) fixes #14180	2022-01-26 21:53:36 -08:00
Aditya Manthramurthy	7dfa565d00	Identity LDAP: Allow multiple search base DNs (#14191 ) This change allows the MinIO server to lookup users in different directory sub-trees by allowing specification of multiple search bases separated by semicolons.	2022-01-26 15:05:59 -08:00
Krishnan Parthasarathi	d2e5f01542	feat: maintain in-memory tier stats for the last 24hrs (#13782 )	2022-01-26 14:33:10 -08:00
yfanswer	f4e373e0d2	de-couple cache completeMultipartUpload with caller context (#14181 )	2022-01-26 11:55:58 -08:00
Harshavardhana	57118919d2	cached diskIDs are not needed for scanner healing (#14170 ) This PR removes an unnecessary state that gets passed around for DiskIDs, which is not needed since each disk knows exactly which pool and which set it belongs to on a running system. Currently cached DiskIDs won't work properly because the code always ends up skipping offline disks and never runs healing when disks are offline, as it expects all the cached diskIDs to be present always. This also made things somewhat inflexible in terms of, perhaps, a new diskID for `format.json` (however this is not a big issue). Requiring all drives to be online for healing via scanner is unnecessary; instead healing should trigger even when only some nodes and drives are available. This ensures that we keep the SLA intact on the objects when disks are offline for a prolonged period of time.	2022-01-26 08:34:56 -08:00
Klaus Post	7db05a80dd	locking: Fix wrong map id (#14184 ) Wrong resource is being fetched, since idx is incremented, but mapID is reused. Regression caused by #13454 - that part didn't optimize anything anyway.	2022-01-26 08:34:09 -08:00
Anis Elleuch	45a99c3fd3	publish storage API latency through node metrics (#14117 ) Publish storage functions latency to help compare the performance of different disks in a single deployment. e.g.: ``` minio_node_disk_latency_us{api="storage.WalkDir",disk="/tmp/xl/1",server="localhost:9001"} 226 minio_node_disk_latency_us{api="storage.WalkDir",disk="/tmp/xl/2",server="localhost:9002"} 1180 minio_node_disk_latency_us{api="storage.WalkDir",disk="/tmp/xl/3",server="localhost:9003"} 1183 minio_node_disk_latency_us{api="storage.WalkDir",disk="/tmp/xl/4",server="localhost:9004"} 1625 ```	2022-01-25 16:31:44 -08:00
Harshavardhana	b68f0cbde4	ignore remote disks with diskID empty as offline (#14168 ) concurrent loading of erasure sets can now expose a situation in a distributed setup that might return diskID as empty, treat such disks as offline.	2022-01-24 19:40:02 -08:00
Krishnan Parthasarathi	ebc3627c73	further improvements to newXLStorage (#14166 ) - create internal erasure volumes only if the disk is unformatted - return a copy of format data in xlStorage.ReadAll - parse env vars only once, to be re-used by xl-storage	2022-01-24 17:09:12 -08:00
Harshavardhana	5a9f133491	speed up startup sequence for all operations (#14148 ) This speed-up is intended for faster startup times for almost all MinIO operations. Changes here are: - Drives are not re-read for 'format.json' on a regular basis; once read during init, it is remembered and refreshed at 5 second intervals. - Do not do O_DIRECT tests on drives with existing 'format.json'; only fresh setups need this check. - Parallelize initializing erasureSets for multiple sets. - Avoid re-reading format.json when migrating 'format.json' from really old V1->V2->V3. - Keep a copy of local drives for any given server in memory for a quick lookup.	2022-01-24 11:28:45 -08:00
Harshavardhana	f6d13f57bb	fix: correct parentUser lookup for OIDC auto expiration (#14154 ) fixes #14026 This is a regression from #13884	2022-01-22 16:36:11 -08:00
Poorna	48da4aeee0	Add API for removing site(s) from site replication (#14022 )	2022-01-21 08:48:21 -08:00
Harshavardhana	7f214a0e46	use dnscache resolver for resolving command line endpoints (#14135 ) this helps in caching the resolved values early on and avoids further resolution for individual nodes when the object layer comes online. this can speed up our startup time during upgrades etc. by an order of magnitude. additionally, change connectLoadInitFormats() and parallelize all calls that might be potentially blocking.	2022-01-20 13:03:15 -08:00
Klaus Post	e1a0a1e73c	fs: Return prefix as listing marker if no objects (#14143 ) Fixes #14132	2022-01-20 10:55:18 -08:00
Harshavardhana	9d588319dd	support site replication to replicate IAM users,groups (#14128 ) - Site replication was missing replicating users, groups when an empty site was added. - Add site replication for groups and users when they are disabled and enabled. - Add support for replicating bucket quota config.	2022-01-19 20:02:24 -08:00
Klaus Post	0012ca8ca5	Fix inconsistent metadata after healing (#14125 ) When calculating signatures, empty part ETags were not discarded, leading to a different signature compared to freshly created ones. This would mean that after a heal, the signature of the healed metadata would be different. Fixing the calculation of the signature will make these consistent. Furthermore, when there were inconsistent entries with zero version ID and the same mod times but different signatures, the one with the lowest signature would be picked for the quorum check. Since this is 50/50, we fall back to a simple quorum count on all signatures. Each of these fixes by itself will lead to quorum. Tests were added for regressions and expected outcomes.	2022-01-19 10:48:00 -08:00
Poorna	288e276abe	Specify tags in options while selecting replication targets (#14126 ) When the replication rule is based on tag matches, the replication process should pick up targets matching the tags specified in the replication rule. Fixing regression due to #12880	2022-01-19 10:45:42 -08:00
Jarbitz	f22e745514	fix: ListBucketUsers comment doc (#14129 )	2022-01-19 10:45:13 -08:00
Krishnan Parthasarathi	070c31eac5	Wait for updates collector when disk.NSScanner returns error (#14127 )	2022-01-19 00:46:43 -08:00
Harshavardhana	70e1cbda21	allow disabling O_DIRECT in certain environments for reads (#14115 ) repeated reads on single large objects in HPC-like workloads need an option to disable O_DIRECT for more effective usage of the kernel page-cache. However this option should be used in very specific situations only, and shouldn't be enabled on all servers. NVMe servers always benefit from keeping O_DIRECT on.	2022-01-17 08:34:14 -08:00
Harshavardhana	60f2df54e0	Add envVars for CLI arguments (#14114 ) fixes #14107	2022-01-15 16:20:02 -08:00
Harshavardhana	ba708f51f2	fix: copyMetrics to avoid map references elsewhere (#14113 ) map labels might have been referenced elsewhere; this can lead to concurrent access at lower layers. avoid this by copying the information while concurrently serving the metrics.	2022-01-14 16:48:19 -08:00
Harshavardhana	0df31f63ab	reject changing pools when there are pending decommissions in-progress (#14102 ) do not allow mutation to pool command line when there are unfinished decommissions in place, disallow such scenarios to avoid user mistakes. also add testcases to cover all relevant scenarios.	2022-01-14 10:32:35 -08:00
Klaus Post	64d4da5a37	Add Put input readahead (#14084 ) When reading input for PutObject or PutObjectPart add a readahead buffer for big inputs. This makes network reads+hashing run separately, async with erasure coding and writes. This will reduce overall latency in distributed setups where the input is from upstream and writes go to other servers. We read 2 buffers ahead, meaning one will always be ready/waiting and one is currently being read from. This improves PutObject and PutObjectParts for these cases.	2022-01-14 10:01:25 -08:00
Harshavardhana	7aec38a73e	Simplify the messaging for internode versions (#14103 ) provide a cleaner message instead of cryptic logs, also provide the relevant link on the recommended way to upgrade.	2022-01-13 17:25:08 -08:00
Klaus Post	a2fd8caa69	Ignore version not found in deleteVersions (#14093 ) When deleting multiple versions it "gives up" with an errFileVersionNotFound if a version cannot be found. This effectively skips deleting other versions sent in the same request. This can happen on inconsistent objects. We should ignore errFileVersionNotFound and continue with others. We already ignore these at the caller level; this PR is a continuation of `54a9877`	2022-01-13 14:28:07 -08:00
Harshavardhana	f546636c52	fix: use renameAll instead of deleteObject() for purging temporary files (#14096 ) This PR simplifies a few things: - Multipart parts are renamed and, upon failure, unrenamed(); keep this multipart-specific behavior, it is needed and works fine. - AbortMultipart should blindly delete once the lock is acquired instead of re-reading metadata and calculating quorum; abort is a delete() operation and the client has no business looking for errors on this. - Skip Access() calls for operations on the `.minio.sys/multipart` folder as well.	2022-01-13 11:07:41 -08:00
Harshavardhana	38ccc4f672	fix: make sure to avoid calling RenameData() on disconnected disks. (#14094 ) Large clusters with multiple sets, or multi-pool setups, at times might fail and report unexpected "file not found" errors. This can become a problem during the startup sequence when some files need to be created at multiple locations. - This PR ensures that we nil the erasure writers such that they are skipped in the RenameData() call. - RenameData() doesn't need "Access()" calls for `.minio.sys` folders; they always exist. - Make sure PutObject() never returns ObjectNotFound{} for any errors; make sure it always returns "WriteQuorum" when renameData() fails with ObjectNotFound{}. Return appropriate errors for all other cases.	2022-01-12 18:49:01 -08:00

4491 Commits