minio

mirror of https://github.com/minio/minio.git synced 2025-11-24 03:27:44 -05:00

Author	SHA1	Message	Date
Krishnan Parthasarathi	cdab4a3b85	Update hourly tier-stats only on succesful tiering (#14330 )	2022-02-16 17:29:12 -08:00
Klaus Post	b88c57ba93	Add fgprof profiles (#14321 ) https://github.com/felixge/fgprof#rocket-fgprof---the-full-go-profiler	2022-02-16 12:00:10 -08:00
Klaus Post	60cd513a33	Fix leaked healing goroutines (#14322 ) Only the first `listAndHeal` would ever be able to write on errCh, blocking all others infinitely. Instead read all errors but return the first non-nil, if any. The intention appears to be that this should cancel on any error, so that part is kept. Regression from #13990	2022-02-16 08:40:18 -08:00
Harshavardhana	03a6e8aee2	fix: creating steep directory structure on trash folder (#14314 ) weird directory structures get created on the '.trash' folder upon server restarts, this PR fixes this.	2022-02-15 16:34:03 -08:00
Anis Elleuch	4afbb89774	nas: Clean stale background appended files (#14295 ) When more than one gateway reads and writes from the same mount point and there is a load balancer pointing to those gateways. Each gateway will try to create its own temporary append file but fails to clear it later when not needed. This commit creates a routine that checks all upload IDs saved in multipart directory and remove any stale entry with the same upload id in the memory and in the temporary background append folder as well.	2022-02-15 09:25:47 -08:00
Klaus Post	5ec57a9533	Add GetObject gzip option (#14226 ) Enabled with `mc admin config set alias/ api gzip_objects=on` Standard filtering applies (1K response minimum, not compressed content type, not range request, gzip accepted by client).	2022-02-14 09:19:01 -08:00
Anis Elleuch	1f92fc3fc0	Always check for root disks unless MINIO_CI_CD is set (#14232 ) The current code considers a pool with all root disks to be as part of a testing environment even if there are other pools with mounted disks. This will result to illegitimate writing in root disks. Fix this by simplifing the logic: require MINIO_CI_CD in order to skip root disk check.	2022-02-13 15:42:07 -08:00
Harshavardhana	fad3d66093	parallelize background cleanup on local disks across sets (#14290 )	2022-02-11 14:22:48 -08:00
Poorna	ed3418c046	Refactor replication resync to be an active process (#14266 ) When resync is triggered, walk the bucket namespace and resync objects that are unreplicated. This PR also adds an API to report resync progress.	2022-02-10 10:16:52 -08:00
Anis Elleuch	71bab74148	Fix adding bucket forwarder handler in server mode (#14288 ) MinIO configuration is loaded after the initialization of the server handlers, which will miss the initialization of the bucket forwarder handler. Though the federation is deprecated, let's fix this for the time being.	2022-02-10 08:49:36 -08:00
Anis Elleuch	661ea57907	restore: Add quotes some fields in x-amz-restore header (#14281 ) S3 spec returns x-amz-restore header in HEAD/GET object with the following format: ``` x-amz-restore: ongoing-request="false", expiry-date="Fri, 21 Dec 2012 00:00:00 GMT" ``` This commit adds quotes as the current code does not support it. It will also supports the old format saved in the disk (in xl.meta) for backward compatibility.	2022-02-09 13:17:41 -08:00
Anis Elleuch	1f18efb0ba	gateway: Active bucket forwarding handler (#14277 ) A regression removed support of federation in the gateway mode. Enable it again. Federation is deprecated for a while but let's fix this for the time being.	2022-02-09 09:31:47 -08:00
Daniel	8ae46bce93	fix the error logs have been omitted because of retryCount never exceed 10 (#14268 )	2022-02-09 03:14:22 -08:00
Harshavardhana	f19a414e09	fix: allow danging objects to be purged properly deleteMultipleObjects() (#14273 ) Deleting bulk objects had an issue since the relevant versionID is not passed through the layers to ensure that the dangling object purge actually works cleanly. This is a continuation of quorum related error returned by multi-object delete API from #14248 This PR ensures that we pass down correct information as well as extend the scope of dangling object detection.	2022-02-08 20:08:23 -08:00
Krishnan Parthasarathi	0ee2933234	Export tier metrics via Prometheus (#13413 ) e.g ``` minio_cluster_ilm_transitioned_bytes{server="minio3:9000",tier="S3TIER-1"} 1.36317772e+08 minio_cluster_ilm_transitioned_bytes{server="minio3:9000",tier="S3TIER-2"} 2892 minio_cluster_ilm_transitioned_bytes{server="minio3:9000",tier="STANDARD"} 1.3631488e+08 minio_cluster_ilm_transitioned_objects{server="minio3:9000",tier="S3TIER-1"} 1 minio_cluster_ilm_transitioned_objects{server="minio3:9000",tier="S3TIER-2"} 0 minio_cluster_ilm_transitioned_objects{server="minio3:9000",tier="STANDARD"} 1 minio_cluster_ilm_transitioned_versions{server="minio3:9000",tier="S3TIER-1"} 3 minio_cluster_ilm_transitioned_versions{server="minio3:9000",tier="S3TIER-2"} 2 minio_cluster_ilm_transitioned_versions{server="minio3:9000",tier="STANDARD"} 1 ```	2022-02-08 12:45:28 -08:00
Shireesh Anjal	9890f579f8	Add subsystem level validation on `config set` (#14269 ) When setting a config of a particular sub-system, validate the existing config and notification targets of only that sub-system, so that existing errors related to one sub-system (e.g. notification target offline) do not result in errors for other sub-systems.	2022-02-08 10:36:41 -08:00
Anis Elleuch	2ee337ead5	prometheus: Add incoming requests metrics since last scrape (#14261 ) Some users running MinIO claim that their system became slow. One way to investigate is to look at this Prometheus history of the number of the requests reaching the server. The existing current S3 requests metric is not enough because it can increase of the system really becomes slow, due to disk issues for example.	2022-02-07 16:30:14 -08:00
Harshavardhana	3c87e1e60d	fix: rename some function names to avoid confusion (#14262 )	2022-02-07 11:49:07 -08:00
Harshavardhana	0cac868a36	speed-up startup time, do not block on ListBuckets() (#14240 ) Bonus fixes #13816	2022-02-07 10:39:57 -08:00
Harshavardhana	186c477f3c	init console server after server config is initialized fixes #14259	2022-02-07 00:17:33 -08:00
Harshavardhana	6123377e66	speedup getFormatErasureInQuorum use driveCount (#14239 ) startup speed-up, currently getFormatErasureInQuorum() would spend up to 2-3secs when there are 3000+ drives for example in a setup, simplify this implementation to use drive counts.	2022-02-04 12:21:21 -08:00
Harshavardhana	0256dae657	fix: quorum requirement for DeleteMarkers and parity upgraded objects (#14248 ) DeleteMarkers do not have a default quorum, i.e it is possible that DeleteMarkers were created with n/2+1 quorum as well to make sure that we satisfy situations such as those we need to make sure delete markers only expect n/2 read quorum. Additionally we should also look at additional metadata on the actual objects that might have been "erasure" upgraded with new parity when disks are down. In such a scenario do not default to the standard storage class parity, instead use the parityBlocks present on the FileInfo to ensure that we are dealing with the correct quorum for READs and DELETEs.	2022-02-04 02:47:36 -08:00
Harshavardhana	84b121bbe1	return error with empty x-amz-copy-source-range headers (#14249 ) fixes #14246	2022-02-03 16:58:27 -08:00
Harshavardhana	01e550a9be	ignore unreadable metrics on certain closed systems (#14234 ) fixes #14233	2022-02-03 09:45:12 -08:00
Poorna	63a2e0bab6	Remove notification from NotificationSys on bucket deletion (#14236 )	2022-02-02 17:11:56 -08:00
Harshavardhana	24657859a8	when o_direct is disabled do not attempt fadvise call (#14230 )	2022-02-02 08:54:52 -08:00
Sidhartha Mani	d7df6bc738	add support for speedtest drive (#14182 )	2022-02-01 22:38:05 -08:00
Poorna	a4e1de93a7	Add API for removing site(s) from site replication (#14104 )	2022-02-01 17:26:09 -08:00
Klaus Post	067d21d0f2	fs: Retry listing if no marker (#14221 ) Retry listings, when no next marker is returned and the result isn't truncated. This can happen when an object is queued, but no info can be fetched. Fixes #14190	2022-02-01 10:00:14 -08:00
Shireesh Anjal	3882da6ac5	Add subnet proxy config (#14225 ) Will store the HTTP(S) proxy URL to use for connecting to SUBNET.	2022-02-01 09:52:38 -08:00
Anis Elleuch	127e8bf3b6	heal: Avoid printing repetitive error to heal a root disk (#14220 ) The healing code repeatedly tries to heal a root disk when it is empty the reason is that connectEndpoint() returns errUnformattedDisk even if the disk is a root disk. Changing that to returning another error will avoid queueing the disk to the healing code in each connect disks iteration.	2022-01-31 17:28:20 -08:00
Harshavardhana	74faed166a	Add quota usage as part of prometheus metrics (#14222 ) Bonus: pass caller context when needed to all bucket metadata handling calls.	2022-01-31 17:27:43 -08:00
Harshavardhana	dbd05d6e82	remove FIFO bucket quota, use ILM expiration instead (#14206 )	2022-01-31 11:07:04 -08:00
Harshavardhana	b5d35c7e09	ignore disk metrics for single drive mode (#14212 ) fixes #14211	2022-01-31 00:44:26 -08:00
Poorna	0f88cdc80e	Return all stats in SiteReplicationStatus API if options unset (#14207 )	2022-01-28 21:19:38 -08:00
Poorna	38e3c7a8f7	Added filters for SiteReplicationStatus API to support new UI changes (#14177 )	2022-01-28 15:37:55 -08:00
Poorna	a4be47d7ad	Validate config before saving changes after config reset (#14203 )	2022-01-27 18:28:16 -08:00
Harshavardhana	aaea94a48d	update quorum requirement to list all objects (#14201 ) some upgraded objects might not get listed due to different quorum ratios across objects. make sure to list all objects that satisfy the maximum possible quorum.	2022-01-27 17:00:15 -08:00
Aditya Manthramurthy	c3d9c45f58	Ensure that AssumeRole calls are sent to Audit log (#14202 ) When authentication fails MinIO was not sending out an Audit log event for this STS call	2022-01-27 16:17:11 -08:00
Klaus Post	a2a48cc065	Optimize read locker cleanup (#14200 ) When objects hold a lot of read locks cleanup time grows exponentially. ``` BEFORE: Unable to complete tests. AFTER: === RUN Test_localLocker_expireOldLocksExpire/100-locks/1-read local-locker_test.go:298: Scan Took: 0s. Left: 100/100 local-locker_test.go:317: Expire 50% took: 0s. Left: 44/44 local-locker_test.go:331: Expire rest took: 0s. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/100-locks/100-read local-locker_test.go:298: Scan Took: 0s. Left: 10000/100 local-locker_test.go:317: Expire 50% took: 1ms. Left: 5000/100 local-locker_test.go:331: Expire rest took: 1ms. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/100-locks/1000-read local-locker_test.go:298: Scan Took: 2ms. Left: 100000/100 local-locker_test.go:317: Expire 50% took: 55ms. Left: 50038/100 local-locker_test.go:331: Expire rest took: 29ms. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/10000-locks/1-read local-locker_test.go:298: Scan Took: 1ms. Left: 10000/10000 local-locker_test.go:317: Expire 50% took: 2ms. Left: 5019/5019 local-locker_test.go:331: Expire rest took: 2ms. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/10000-locks/100-read local-locker_test.go:298: Scan Took: 23ms. Left: 1000000/10000 local-locker_test.go:317: Expire 50% took: 160ms. Left: 499798/10000 local-locker_test.go:331: Expire rest took: 138ms. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/10000-locks/1000-read local-locker_test.go:298: Scan Took: 200ms. Left: 10000000/10000 local-locker_test.go:317: Expire 50% took: 5.888s. Left: 5000196/10000 local-locker_test.go:331: Expire rest took: 3.417s. Left: 0/0 === RUN Test_localLocker_expireOldLocksExpire/1000000-locks/1-read local-locker_test.go:298: Scan Took: 133ms. Left: 1000000/1000000 local-locker_test.go:317: Expire 50% took: 348ms. Left: 500255/500255 local-locker_test.go:331: Expire rest took: 307ms. Left: 0/0 ```	2022-01-27 14:10:57 -08:00
Harshavardhana	cf407f7176	do not expect 'speedtest' to be a bucket (#14199 ) fixes #14196	2022-01-27 08:13:03 -08:00
Harshavardhana	d6dd17a483	make sure to pass groups for all credentials while verifying policies (#14193 ) fixes #14180	2022-01-26 21:53:36 -08:00
Aditya Manthramurthy	7dfa565d00	Identity LDAP: Allow multiple search base DNs (#14191 ) This change allows the MinIO server to lookup users in different directory sub-trees by allowing specification of multiple search bases separated by semicolons.	2022-01-26 15:05:59 -08:00
Krishnan Parthasarathi	d2e5f01542	feat: maintain in-memory tier stats for the last 24hrs (#13782 )	2022-01-26 14:33:10 -08:00
yfanswer	f4e373e0d2	de-couple cache completeMultipartUpload with caller context (#14181 )	2022-01-26 11:55:58 -08:00
Harshavardhana	57118919d2	cached diskIDs are not needed for scanner healing (#14170 ) This PR removes an unnecessary state that gets passed around for DiskIDs, which is not necessary since each disk exactly knows which pool and which set it belongs to on a running system. Currently cached DiskId's won't work properly because it always ends up skipping offline disks and never runs healing when disks are offline, as it expects all the cached diskIDs to be present always. This also sort of made things in-flexible in terms perhaps a new diskID for `format.json`. (however this is not a big issue) This is an unnecessary requirement that healing via scanner needs all drives to be online, instead healing should trigger even when partial nodes and drives are available this ensures that we keep the SLA in-tact on the objects when disks are offline for a prolonged period of time.	2022-01-26 08:34:56 -08:00
Klaus Post	7db05a80dd	locking: Fix wrong map id (#14184 ) Wrong resource is being fetched, since idx is incremented, but mapID is reused. Regression caused by #13454 - that part didn't optimize anything anyway.	2022-01-26 08:34:09 -08:00
Anis Elleuch	45a99c3fd3	publish storage API latency through node metrics (#14117 ) Publish storage functions latency to help compare the performance of different disks in a single deployment. e.g.: ``` minio_node_disk_latency_us{api="storage.WalkDir",disk="/tmp/xl/1",server="localhost:9001"} 226 minio_node_disk_latency_us{api="storage.WalkDir",disk="/tmp/xl/2",server="localhost:9002"} 1180 minio_node_disk_latency_us{api="storage.WalkDir",disk="/tmp/xl/3",server="localhost:9003"} 1183 minio_node_disk_latency_us{api="storage.WalkDir",disk="/tmp/xl/4",server="localhost:9004"} 1625 ```	2022-01-25 16:31:44 -08:00
Harshavardhana	b68f0cbde4	ignore remote disks with diskID empty as offline (#14168 ) concurrent loading of erasure sets can now expose a situation in a distributed setup that might return diskID as empty, treat such disks as offline.	2022-01-24 19:40:02 -08:00
Krishnan Parthasarathi	ebc3627c73	further improvements to newXLStorage (#14166 ) - create internal erasure volumes only if the disk is unformatted - return a copy of format data in xlStorage.ReadAll - parse env vars only once, to be re-used by xl-storage	2022-01-24 17:09:12 -08:00

1 2 3 4 5 ...

4260 Commits