minio

mirror of https://github.com/minio/minio.git synced 2025-04-27 21:35:05 -04:00

Author	SHA1	Message	Date
Aditya Manthramurthy	ed2c2a285f	Load STS accounts into IAM cache lazily (#17994 ) In situations with large number of STS credentials on disk, IAM load time is high. To mitigate this, STS accounts will now be loaded into memory only on demand - i.e. when the credential is used. In each IAM cache (re)load we skip loading STS credentials and STS policy mappings into memory. Since STS accounts only expire and cannot be deleted, there is no risk of invalid credentials being reused, because credential validity is checked when it is used.	2023-09-13 12:43:46 -07:00
Poorna	18e23bafd9	replication resync: report only the on-disk status (#18017 ) Avoid reporting in-memory status since results can vary if different nodes are queried, resync always runs at a single node.	2023-09-13 10:58:38 -07:00
Harshavardhana	8b8be2695f	optimize mkdir calls to avoid base-dir `Mkdir` attempts (#18021 ) Currently we have IOPs of these patterns ``` [OS] os.Mkdir play.min.io:9000 /disk1 2.718µs [OS] os.Mkdir play.min.io:9000 /disk1/data 2.406µs [OS] os.Mkdir play.min.io:9000 /disk1/data/.minio.sys 4.068µs [OS] os.Mkdir play.min.io:9000 /disk1/data/.minio.sys/tmp 2.843µs [OS] os.Mkdir play.min.io:9000 /disk1/data/.minio.sys/tmp/d89c8ceb-f8d1-4cc6-b483-280f87c4719f 20.152µs ``` It can be seen that we can save quite Nx levels such as if your drive is mounted at `/disk1/minio` you can simply skip sending an `Mkdir /disk1/` and `Mkdir /disk1/minio`. Since they are expected to exist already, this PR adds a way for us to ignore all paths upto the mount or a directory which ever has been provided to MinIO setup.	2023-09-13 08:14:36 -07:00
Poorna	96fbf18201	replication: queue existing objects to same workers as incoming (#18020 ) Previously existing objects were queued to single worker and MRF re-queues are also handled by same worker - this does not fully use the available bandwidth in case there is no incoming workload.	2023-09-12 21:59:15 -07:00
Harshavardhana	c8a57a8fa2	fix: send content-md5 for AWS S3 proactively (#18018 ) fixes #17977	2023-09-12 19:11:13 -07:00
Harshavardhana	b1c2dacab3	fix: allow dynamic ports for API only in non-distributed setups (#18019 ) fixes #17998	2023-09-12 19:10:49 -07:00
Harshavardhana	08b3a466e8	fix: allow concurrent SFTP connections (#18013 ) current implementation did not fully implement the concurrent SFTP connection implementation, this PR properly handles this. fixes #17914	2023-09-12 12:41:52 -07:00
Harshavardhana	1df5e31706	optimize MRF replication queue to avoid memory leaks (#18007 )	2023-09-11 20:59:11 -07:00
Harshavardhana	9f7044aed0	fix: ignore transient errors in read path (#18006 ) Errors such as ``` returned an error (context deadline exceeded) (fmt.wrapError) ``` ``` (msgp: too few bytes left to read object) (fmt.wrapError) ```	2023-09-11 15:29:59 -07:00
Anis Eleuch	41de53996b	heal: calculate the number of workers based on NRRequests (#17945 )	2023-09-11 14:48:54 -07:00
Harshavardhana	9878031cfd	fix: change DISK_ to DRIVE_ for some drive related envs (#18005 )	2023-09-11 12:19:22 -07:00
Poorna	703ed46d79	fix: replication of tags while removing (#17989 ) A tag removal was not being replicated prior to this change	2023-09-06 19:05:02 -07:00
Harshavardhana	f7ca6c63c2	fix: bucket quota clear and honor existing quota config (#17988 )	2023-09-06 19:03:58 -07:00
Harshavardhana	ad69b9907f	fix: report bucket metrics for only existing buckets (#17987 )	2023-09-06 12:50:46 -07:00
Shubhendu	bfddbb8b40	Embed file in ZIP with custom permissions (#17954 ) This change enables embedding files in ZIP with custom permissions. Also uses default creds for starting MinIO based on inspect data. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-09-06 09:24:01 -07:00
Poorna	13a2dc8485	replication resync: avoid blocking on results channel. (#17981 ) continues fix in #17775	2023-09-05 20:22:39 -07:00
Harshavardhana	1e51424e8a	use syscall.Rename() directly instead of os.Rename() (#17982 )	2023-09-05 20:22:23 -07:00
Harshavardhana	5b114b43f7	refactor bandwidth throttling for replication target (#17980 ) This refactor is to allow using the bandwidth throttling for other purposes.	2023-09-05 20:21:59 -07:00
Poorna	812f5a02d7	metrics: fix panic in replication stats reporting (#17979 )	2023-09-05 10:26:18 -07:00
Aditya Manthramurthy	1c99fb106c	Update to minio/pkg/v2 (#17967 )	2023-09-04 12:57:37 -07:00
Krishnan Parthasarathi	71c32e9b48	Return successorModTime in quorum when available (#17925 )	2023-09-04 08:24:17 -07:00
Harshavardhana	380a59520b	add missing testdata for benchmarking	2023-09-02 14:40:38 -07:00
Harshavardhana	3995355150	avoid repeated large allocations for large parts (#17968 ) objects with 10,000 parts and many of them can cause a large memory spike which can potentially lead to OOM due to lack of GC. with previous PR reducing the memory usage significantly in #17963, this PR reduces this further by 80% under repeated calls. Scanner sub-system has no use for the slice of Parts(), it is better left empty. ``` benchmark old ns/op new ns/op delta BenchmarkToFileInfo/ToFileInfo-8 295658 188143 -36.36% benchmark old allocs new allocs delta BenchmarkToFileInfo/ToFileInfo-8 61 60 -1.64% benchmark old bytes new bytes delta BenchmarkToFileInfo/ToFileInfo-8 1097210 227255 -79.29% ```	2023-09-02 07:49:24 -07:00
Harshavardhana	8208bcb896	remove all unnecessary logging, logOnce when absolutely needed (#17965 )	2023-09-01 16:19:18 -07:00
Poorna	d665e855de	replication: remove check for empty version id (#17964 )	2023-09-01 13:46:10 -07:00
Harshavardhana	18b3655c99	with xlv2 format we never had to fill in checksumInfo() (#17963 ) - this PR avoids sending a large ChecksumInfo slice when its not needed - also for a file with XLV2 format there is no reason to allocate Checksum slice while reading	2023-09-01 13:45:58 -07:00
Anis Eleuch	6a8d8f34a5	kafka: Do not require key when sending a message (#17962 ) Keys are helpful to ensure the strict ordering of messages, however currently the code uses a random request id for every log, hence using the request-id as a Kafka key is not serve any purpose; This commit removes the usage of the key, to also fix the audit issue from internal subsystem that does not have a request ID.	2023-09-01 08:37:22 -07:00
Harshavardhana	b1c1f02132	use buffers for pathJoin, to re-use buffers. (#17960 ) ``` benchmark old ns/op new ns/op delta BenchmarkPathJoin/PathJoin-8 79.6 55.3 -30.53% benchmark old allocs new allocs delta BenchmarkPathJoin/PathJoin-8 2 1 -50.00% benchmark old bytes new bytes delta BenchmarkPathJoin/PathJoin-8 48 24 -50.00% ```	2023-08-31 17:58:48 -07:00
yangw	b13fcaf666	fix: read atomic variable in clientDevNull round trip time (#17955 )	2023-08-31 08:31:01 -07:00
Harshavardhana	9458485e43	avoid double logging from healing (#17950 )	2023-08-30 18:46:04 -07:00
Poorna	b48bbe08b2	Add additional info for replication metrics API (#17293 ) to track the replication transfer rate across different nodes, number of active workers in use and in-queue stats to get an idea of the current workload. This PR also adds replication metrics to the site replication status API. For site replication, prometheus metrics are no longer at the bucket level - but at the cluster level. Add prometheus metric to track credential errors since uptime	2023-08-30 01:00:59 -07:00
Krishnan Parthasarathi	6a67c277eb	Reuse types for key-value, notification and retry (#17936 )	2023-08-29 11:27:23 -07:00
Harshavardhana	7cafdc0512	fix: skip access checks further for known buckets (#17934 )	2023-08-28 15:16:41 -07:00
Harshavardhana	8a57b6bced	use renameat2 Linux extension syscall (#17757 ) this is a faster and safer alternative on newer kernel versions.	2023-08-27 09:57:11 -07:00
Krishnan Parthasarathi	53abd25116	Don't log when object to be tiered is not found (#17924 )	2023-08-25 23:34:16 -07:00
Harshavardhana	1ea7826c0e	do not have to consider replicationTimestamp for healing and quorum (#17922 ) replicationTimestamp might differ if there were retries in replication and the retried attempt overwrote in quorum but enough shards with newer timestamp causing the existing timestamps on xl.meta to be invalid, we do not rely on this value for anything external. this is purely a hint for debugging purposes, but there is no real value in it considering the object itself is in-tact we do not have to spend time healing this situation. we may consider healing this situation in future but that needs to be decoupled to make sure that we do not over calculate how much we have to heal.	2023-08-25 15:31:15 -07:00
Anis Eleuch	0cde37be50	Reduce the number of calls to import bucket metadata (#17899 ) For each bucket, save the bucket metadata once, call the site replication hook once	2023-08-25 07:59:16 -07:00
jiuker	6aeca54ece	fix: replace context by timeout-context from parent-context when `selfSpeedTest` (#17906 )	2023-08-25 07:58:38 -07:00
Harshavardhana	124e28578c	remove strict persistence requirements for List() .metacache objects (#17917 ) .metacache objects are transient in nature, and are better left to use page-cache effectively to avoid using more IOPs on the disks. this allows for incoming calls to be not taxed heavily due to multiple large batch listings.	2023-08-25 07:58:11 -07:00
Harshavardhana	62c9e500de	remove mTime requirement from pre-condition checks (#17916 ) given a versionId the mtime is always the same, it can never be different than its original value. versionIds also do not conflict, since they are uuid's and unique practically forever.	2023-08-24 14:33:58 -07:00
jiuker	02cc18ff29	refactor the perf client for TTFB and TotalResponseTime (#17901 )	2023-08-24 10:21:08 -07:00
Harshavardhana	ba4566e86d	add missing IAM node metrics to cluster and node endpoint (#17908 )	2023-08-24 09:26:37 -07:00
Krishnan Parthasarathi	87cb0081ec	Retain current and upto NewerNoncurrentVersions versions (#17909 ) applyNewerNoncurrentVersionLimit method should pass along versions unaffected by NewerNoncurrentVersions rule for further ILM evaluation.	2023-08-24 09:26:29 -07:00
Poorna	4a6af93c83	mark replication target offline if network timeouts seen (#17907 ) regular target liveness check every 5 secs will toggle state back as target returns online.	2023-08-24 09:24:26 -07:00
Harshavardhana	af564b8ba0	allow bootstrap to capture time-spent for each initializers (#17900 )	2023-08-23 03:07:06 -07:00
Klaus Post	7c8746732b	Return cancelled storage calls as 499 (#17895 ) Make upstream cancels more visible - right now they are just reported as "forbidden".	2023-08-22 11:10:41 -07:00
Klaus Post	f506117edb	Reduce memory profiling rate (#17894 ) Change profiling from every 4KB to every 128K, reducing the lock contention by a factor of 32.	2023-08-22 07:21:49 -07:00
Harshavardhana	1c5af7c31a	serialize queueMRFHeal(), add timeouts and avoid normal build-ups (#17886 ) we expect a certain level of IOPs and latency so this is okay. fixes other miscellaneous bugs - such as hanging on mrfCh <- when the context is canceled - queuing MRF heal when the context is canceled - remove unused saveStateCh channel	2023-08-21 16:44:50 -07:00
Harshavardhana	3a0125fa1f	remove unexpected logging from peer calls (#17888 ) also make sure RequestID is set for system logs	2023-08-21 14:25:24 -07:00
Daniel Valdivia	328cb0a076	Pass environment variable to control session length to console (#17885 ) Signed-off-by: Daniel Valdivia <18384552+dvaldivia@users.noreply.github.com>	2023-08-21 11:55:43 -07:00

... 5 6 7 8 9 ...

5777 Commits