minio

mirror of https://github.com/minio/minio.git synced 2025-11-24 19:46:16 -05:00

Author	SHA1	Message	Date
Harshavardhana	4bf90ca67f	fix: handle a crash when AskDisks is set to -1 (#10777 )	2020-10-29 09:25:43 -07:00
Harshavardhana	e0655e24f2	fix: A possible crash when fi.Erasure.Distribution is empty (#10779 )	2020-10-28 19:24:01 -07:00
Klaus Post	bfc36aed89	Add update retry limit and compare error by string instead (#10776 )	2020-10-28 13:19:53 -07:00
Kaloyan Raev	be7f67268d	fix: Do not cleanup range files in cache SaveMetadata when total hits are false (#10728 )	2020-10-28 09:23:17 -07:00
Klaus Post	a982baff27	ListObjects Metadata Caching (#10648 ) Design: https://gist.github.com/klauspost/025c09b48ed4a1293c917cecfabdf21c Gist of improvements: * Cross-server caching and listing will use the same data across servers and requests. * Lists can be arbitrarily resumed at a constant speed. * Metadata for all files scanned is stored for streaming retrieval. * The existing bloom filters controlled by the crawler is used for validating caches. * Concurrent requests for the same data (or parts of it) will not spawn additional walkers. * Listing a subdirectory of an existing recursive cache will use the cache. * All listing operations are fully streamable so the number of objects in a bucket no longer dictates the amount of memory. * Listings can be handled by any server within the cluster. * Caches are cleaned up when out of date or superseded by a more recent one.	2020-10-28 09:18:35 -07:00
Krishna Srinivas	f53c5a020e	fix: heal object shards with ec.index and ec.distribution mismatches (#10773 ) Co-authored-by: Harshavardhana <harsha@minio.io>	2020-10-28 00:10:20 -07:00
Harshavardhana	5b30bbda92	fix: add more protection distribution to match EcIndex (#10772 ) allows for more stricter validation in picking up the right set of disks for reconstruction.	2020-10-28 00:09:15 -07:00
Shireesh Anjal	858e2a43df	Remove logging info from OBDInfoHandler (#10727 ) A lot of logging data is counterproductive. A better implementation with precise useful log data can be introduced later.	2020-10-27 17:41:48 -07:00
Kaloyan Raev	df9894e275	avoid caching http ranges in background goroutine (#10724 )	2020-10-26 23:04:48 -07:00
Krishna Srinivas	592f2f23a3	fix: heal rejects objects with disk re-ordering issue (#10766 )	2020-10-26 18:48:47 -07:00
Krishna Srinivas	c49a80db41	fix: use meta.Erasure.Index for GetObject() to reconstruct object (#10764 )	2020-10-26 16:19:42 -07:00
Poorna Krishnamoorthy	46275c6547	cache: rename function declarations (#10763 )	2020-10-26 15:41:24 -07:00
Poorna Krishnamoorthy	0994ed9783	cache: fix call in GetObjectNInfo (#10762 ) Fixes: #10751	2020-10-26 12:30:40 -07:00
Anis Elleuch	eb95353cb1	fix: Get/HeadObject return 404 on non quorum objects (#10753 )	2020-10-26 10:30:46 -07:00
Harshavardhana	029758cb20	fix: retain the previous UUID for newly replaced drives (#10759 ) only newly replaced drives get the new `format.json`, this avoids disks reloading their in-memory reference format, ensures that drives are online without reloading the in-memory reference format. keeping reference format in-tact means UUIDs never change once they are formatted.	2020-10-26 10:29:29 -07:00
Harshavardhana	646d6917ed	turn-off checking for updates completely if MINIO_UPDATE=off (#10752 )	2020-10-24 22:39:44 -07:00
Harshavardhana	d9db7f3308	expire lockers if lockers are offline (#10749 ) lockers currently might leave stale lockers, in unknown ways waiting for downed lockers. locker check interval is high enough to safely cleanup stale locks.	2020-10-24 13:23:16 -07:00
Harshavardhana	6a8c62f9fd	make sure to preserve UUID from reference format (#10748 ) reference format should be source of truth for inconsistent drives which reconnect, add them back to their original position remove automatic fix for existing offline disk uuids	2020-10-24 13:23:08 -07:00
Anis Elleuch	00124c56d9	erasure: Commit data before xl.meta in RenameData() (#10734 ) This will reduce the chance to have updated xl.meta without data.	2020-10-23 21:54:58 -07:00
Anis Elleuch	2c32c2149e	tests: Avoid running TestNSRace in short test mode (#10735 )	2020-10-23 21:23:12 -07:00
Harshavardhana	734f258878	fix: slow down auto healing more aggressively (#10730 ) Bonus fixes - logging improvements to ensure that we don't use `go logger.LogIf` to avoid runtime.Caller missing the function name. log where necessary. - remove unused code at erasure sets	2020-10-22 13:36:24 -07:00
Anis Elleuch	0e0c53bba4	tests: Lower expectation in addr selection in rand cache dialer (#10739 ) Test TestDialContextWithDNSCacheRand was failing sometimes because it depends on a random selection of addresses when testing random DNS resolution from cache. Lower addr selection exception to 10%	2020-10-22 09:35:32 -07:00
Poorna Krishnamoorthy	5cc23ae052	validate if iam store is initialized (#10719 ) Fixes panic - regression from `d6d770c1b1`	2020-10-20 21:28:24 -07:00
Harshavardhana	d6d770c1b1	initialize object layer right after config has loaded	2020-10-19 22:04:59 -07:00
Harshavardhana	b07df5cae1	initialize IAM as soon as object layer is initialized (#10700 ) Allow requests to come in for users as soon as object layer and config are initialized, this allows users to be authenticated sooner and would succeed automatically on servers which are yet to fully initialize.	2020-10-19 09:54:40 -07:00
Harshavardhana	c107728676	fix: s3 gateway DNS cache initialization (#10706 ) fixes #10705	2020-10-19 01:34:23 -07:00
Anis Elleuch	284a2b9021	ilm: Send delete marker creation event when appropriate (#10696 ) Before this commit, the crawler ILM will always send object delete event notification though this is wrong.	2020-10-16 21:22:12 -07:00
Ritesh H Shukla	0b53e30ecb	Clean up monitor on delete bucket (#10698 )	2020-10-16 17:59:31 -07:00
Harshavardhana	bd2131ba34	add DNS cache support to avoid DNS flooding (#10693 ) Go stdlib resolver doesn't support caching DNS resolutions, since we compile with CGO disabled we are more probe to DNS flooding for all network calls to resolve for DNS from the DNS server. Under various containerized environments such as VMWare this becomes a problem because there are no DNS caches available and we may end up overloading the kube-dns resolver under concurrent I/O. To circumvent this issue implement a DNSCache resolver which resolves DNS and caches them for around 10secs with every 3sec invalidation attempted.	2020-10-16 14:49:05 -07:00
ebozduman	1aec168c84	fix: azure gateway should reject bucket names with "." (#10635 )	2020-10-16 09:30:18 -07:00
Klaus Post	21a549a83b	fix: keep MRF channel open to avoid random CI crash (#10686 ) There doesn't seem to be any benefit to closing the channel, so just keep it open and let it die with the server.	2020-10-16 09:08:51 -07:00
Ritesh H Shukla	8a16a1a1a9	fix: misc fixes for bandwidth reporting amd monitoring (#10683 ) * Set peer for fetch bandwidth * Fix the limit for bandwidth that is reported. * Reduce CPU burn from bandwidth management.	2020-10-16 09:07:50 -07:00
Harshavardhana	ad726b49b4	rename zones to serverSets to avoid terminology conflict (#10679 ) we are bringing in availability zones, we should avoid zones as per server expansion concept.	2020-10-15 14:28:50 -07:00
Anis Elleuch	db2241066b	heal: Enable removing dangling delete markers (#10688 )	2020-10-15 13:06:40 -07:00
Harshavardhana	f1cc16e788	fix: background heal rely on getOnlineDisks() (#10687 )	2020-10-15 13:06:23 -07:00
Klaus Post	3820a905e0	in getOnlineDisks wait for disks to be populated (#10685 )	2020-10-15 06:37:10 -07:00
Harshavardhana	2042d4873c	rename crawler config option to heal (#10678 )	2020-10-14 13:51:51 -07:00
Harshavardhana	f9be783f3e	fix: allow crawler to crawl on disks without usage constraints (#10677 ) additionally also change the resolution usage wise return of disks, allows to small byte level differences to be masked.	2020-10-14 12:12:10 -07:00
Harshavardhana	71b97fd3ac	fix: connect disks pre-emptively during startup (#10669 ) connect disks pre-emptively upon startup, to ensure we have enough disks are connected at startup rather than wait for them. we need to do this to avoid long wait times for server to be online when we have servers come up in rolling upgrade fashion	2020-10-13 18:28:42 -07:00
Klaus Post	03991c5d41	crawler: Remove waitForLowActiveIO (#10667 ) Only use dynamic delays for the crawler. Even though the max wait was 1 second the number of waits could severely impact crawler speed. Instead of relying on a global metric, we use the stateless local delays to keep the crawler running at a speed more adjusted to current conditions. The only case we keep it is before bitrot checks when enabled.	2020-10-13 13:45:08 -07:00
飞雪无情	614060764d	fix: use the correct Action type for policy.Args and iampolicy.Args (#10650 )	2020-10-12 15:18:22 -07:00
Harshavardhana	a3ba8188d7	fix: allow locker to be niladic	2020-10-12 14:23:44 -07:00
Harshavardhana	2760fc86af	Bump default idleConnsPerHost to control conns in time_wait (#10653 ) This PR fixes a hang which occurs quite commonly at higher concurrency by allowing following changes - allowing lower connections in time_wait allows faster socket open's - lower idle connection timeout to ensure that we let kernel reclaim the time_wait connections quickly - increase somaxconn to 4096 instead of 2048 to allow larger tcp syn backlogs. fixes #10413	2020-10-12 14:19:46 -07:00
Ritesh H Shukla	8ceb2a93fd	fix: peer replication bandwidth monitoring in distributed setup (#10652 )	2020-10-12 09:04:55 -07:00
Ritesh H Shukla	c2f16ee846	Add basic bandwidth monitoring for replication. (#10501 ) This change tracks bandwidth for a bucket and object - [x] Add Admin API - [x] Add Peer API - [x] Add BW throttling - [x] Admin APIs to set replication limit - [x] Admin APIs for fetch bandwidth	2020-10-09 20:36:00 -07:00
Harshavardhana	6484453fc6	optionally allow strict quorum listing (#10649 ) ``` export MINIO_API_LIST_STRICT_QUORUM=on ``` would enable listing in quorum if necessary	2020-10-09 15:40:46 -07:00
Harshavardhana	a0d0645128	remove safeMode behavior in startup (#10645 ) In almost all scenarios MinIO now is mostly ready for all sub-systems independently, safe-mode is not useful anymore and do not serve its original intended purpose. allow server to be fully functional even with config partially configured, this is to cater for availability of actual I/O v/s manually fixing the server. In k8s like environments it will never make sense to take pod into safe-mode state, because there is no real access to perform any remote operation on them.	2020-10-09 09:59:52 -07:00
Harshavardhana	253194e491	do not hold write locks - if objects don't exist (#10644 )	2020-10-08 17:47:21 -07:00
Harshavardhana	736e58dd68	fix: handle concurrent lockers with multiple optimizations (#10640 ) - select lockers which are non-local and online to have affinity towards remote servers for lock contention - optimize lock retry interval to avoid sending too many messages during lock contention, reduces average CPU usage as well - if bucket is not set, when deleteObject fails make sure setPutObjHeaders() honors lifecycle only if bucket name is set. - fix top locks to list out always the oldest lockers always, avoid getting bogged down into map's unordered nature.	2020-10-08 12:32:32 -07:00
Poorna Krishnamoorthy	907a171edd	Generalize error messages for remote targets (#10638 ) This is to allow remote targets to be generalized for replication/ILM transition Also adding a field in BucketTarget to identify a remote target with a label.	2020-10-08 10:54:11 -07:00

1 2 3 4 5 ...

3001 Commits