minio

Commit Graph

Author	SHA1	Message	Date
Poorna Krishnamoorthy	50c10a5087	Add delete marker replication support (#10396 ) Delete marker replication is implemented for V2 configuration specified in AWS spec (though AWS allows it only in the V1 configuration). This PR also brings in a MinIO only extension of replicating permanent deletes, i.e. deletes specifying version id are replicated to target cluster.	2020-11-10 15:24:14 -08:00
Harshavardhana	b72cac4cf3	fix: dangling objects on actual namespace (#10822 )	2020-11-05 11:48:55 -08:00
Klaus Post	2294e53a0b	Don't retain context in locker (#10515 ) Use the context for internal timeouts, but disconnect it from outgoing calls so we always receive the results and cancel it remotely.	2020-11-04 08:25:42 -08:00
Harshavardhana	8527f22df1	optimize request URL encoding for internode (#10811 ) this reduces allocations in order of magnitude Also, revert "erasure: delete dangling objects automatically (#10765)" affects list caching should be investigated.	2020-11-02 15:15:12 -08:00
Anis Elleuch	b456292295	erasure: delete dangling objects automatically (#10765 )	2020-11-02 10:49:30 -08:00
Klaus Post	a982baff27	ListObjects Metadata Caching (#10648 ) Design: https://gist.github.com/klauspost/025c09b48ed4a1293c917cecfabdf21c Gist of improvements: * Cross-server caching and listing will use the same data across servers and requests. * Lists can be arbitrarily resumed at a constant speed. * Metadata for all files scanned is stored for streaming retrieval. * The existing bloom filters controlled by the crawler is used for validating caches. * Concurrent requests for the same data (or parts of it) will not spawn additional walkers. * Listing a subdirectory of an existing recursive cache will use the cache. * All listing operations are fully streamable so the number of objects in a bucket no longer dictates the amount of memory. * Listings can be handled by any server within the cluster. * Caches are cleaned up when out of date or superseded by a more recent one.	2020-10-28 09:18:35 -07:00
Harshavardhana	5b30bbda92	fix: add more protection distribution to match EcIndex (#10772 ) allows for more stricter validation in picking up the right set of disks for reconstruction.	2020-10-28 00:09:15 -07:00
Krishna Srinivas	c49a80db41	fix: use meta.Erasure.Index for GetObject() to reconstruct object (#10764 )	2020-10-26 16:19:42 -07:00
Anis Elleuch	eb95353cb1	fix: Get/HeadObject return 404 on non quorum objects (#10753 )	2020-10-26 10:30:46 -07:00
Harshavardhana	253194e491	do not hold write locks - if objects don't exist (#10644 )	2020-10-08 17:47:21 -07:00
Harshavardhana	736e58dd68	fix: handle concurrent lockers with multiple optimizations (#10640 ) - select lockers which are non-local and online to have affinity towards remote servers for lock contention - optimize lock retry interval to avoid sending too many messages during lock contention, reduces average CPU usage as well - if bucket is not set, when deleteObject fails make sure setPutObjHeaders() honors lifecycle only if bucket name is set. - fix top locks to list out always the oldest lockers always, avoid getting bogged down into map's unordered nature.	2020-10-08 12:32:32 -07:00
Harshavardhana	18063bf25c	fix: cleanup old directory handling code (#10633 ) we don't need them anymore, remove legacy code.	2020-10-06 12:03:57 -07:00
Harshavardhana	6fcbdd5607	remove unused putObjectDir code (#10528 )	2020-09-21 09:41:39 -07:00
Harshavardhana	02c1a08a5b	fix: make sure to lock CopyObject for in-place updates (#10492 )	2020-09-15 20:44:48 -07:00
Harshavardhana	0104af6bcc	delayed locks until we have started reading the body (#10474 ) This is to ensure that Go contexts work properly, after some interesting experiments I found that Go net/http doesn't cancel the context when Body is non-zero and hasn't been read till EOF. The following gist explains this, this can lead to pile up of go-routines on the server which will never be canceled and will die at a really later point in time, which can simply overwhelm the server. https://gist.github.com/harshavardhana/c51dcfd055780eaeb71db54f9c589150 To avoid this refactor the locking such that we take locks after we have started reading from the body and only take locks when needed. Also, remove contextReader as it's not useful, doesn't work as expected context is not canceled until the body reaches EOF so there is no point in wrapping it with context and putting a `select {` on it which can unnecessarily increase the CPU overhead. We will still use the context to cancel the lockers etc. Additional simplification in the locker code to avoid timers as re-using them is a complicated ordeal avoid them in the hot path, since locking is very common this may avoid lots of allocations.	2020-09-14 15:57:13 -07:00
Harshavardhana	48919de301	fix: for defer'ed deleteObject use internal context (#10463 )	2020-09-11 06:39:19 -07:00
Harshavardhana	c13afd56e8	Remove MaxConnsPerHost settings to avoid potential hangs (#10438 ) MaxConnsPerHost can potentially hang a call without any way to timeout, we do not need this setting for our proxy and gateway implementations instead IdleConn settings are good enough. Also ensure to use NewRequestWithContext and make sure to take the disks offline only for network errors. Fixes #10304	2020-09-08 14:22:04 -07:00
Klaus Post	2d58a8d861	Add storage layer contexts (#10321 ) Add context to all (non-trivial) calls to the storage layer. Contexts are propagated through the REST client. - `context.TODO()` is left in place for the places where it needs to be added to the caller. - `endWalkCh` could probably be removed from the walkers, but no changes so far. The "dangerous" part is that now a caller disconnecting will propagate down, so a "delete" operation will now be interrupted. In some cases we might want to disconnect this functionality so the operation completes if it has started, leaving the system in a cleaner state.	2020-09-04 09:45:06 -07:00
Harshavardhana	37da0c647e	fix: delete marker compatibility behavior for suspended bucket (#10395 ) - delete-marker should be created on a suspended bucket as `null` - delete-marker should delete any pre-existing `null` versioned object and create an entry `null`	2020-09-02 00:19:03 -07:00
poornas	79e21601b0	fix: web handlers to enforce replication (#10249 ) This PR also preserves source ETag for replication	2020-08-12 17:32:24 -07:00
Harshavardhana	5ce82b45da	add CopyObject optimization when source and destination are same (#10170 ) when source and destination are same and versioning is enabled on the destination bucket - we do not need to re-create the entire object once again to optimize on space utilization. Cases this PR is not supporting - any pre-existing legacy object will not be preserved in this manner, meaning a new dataDir will be created. - key-rotation and storage class changes of course will never re-use the dataDir	2020-08-03 16:21:10 -07:00
Harshavardhana	b68bc75dad	fix: quorum calculation mistake with reduced parity (#10186 ) With reduced parity our write quorum should be same as read quorum, but code was still assuming ``` readQuorum+1 ``` In all situations which is not necessary.	2020-08-03 12:15:08 -07:00
Harshavardhana	35212b673e	add unformatted disk as part of the error list (#10128 ) these errors should be ignored for quorum error calculation to ensure that we don't prematurely return unformatted disk error as part of API calls	2020-07-24 13:16:11 -07:00
poornas	c43da3005a	Add support for server side bucket replication (#9882 )	2020-07-21 17:49:56 -07:00
Harshavardhana	14b1c9f8e4	fix: return Range errors after If-Matches (#10045 ) closes #7292	2020-07-17 13:01:22 -07:00
Anis Elleuch	778e9c864f	Move dependency from minio-go v6 to v7 (#10042 )	2020-07-14 09:38:05 -07:00
Harshavardhana	2743d4ca87	fix: Add support for preserving mtime for replication (#9995 ) This PR is needed for bucket replication support	2020-07-08 17:36:56 -07:00
Harshavardhana	810a4f0723	fix: return proper errors Get/HeadObject for deleteMarkers (#9957 )	2020-07-02 16:17:27 -07:00
Harshavardhana	a38ce29137	fix: simplify background heal and trigger heal items early (#9928 ) Bonus fix during versioning merge one of the PR was missing the offline/online disk count fix from #9801 port it correctly over to the master branch from release. Additionally, add versionID support for MRF Fixes #9910 Fixes #9931	2020-06-29 13:07:26 -07:00
Harshavardhana	e79874f58e	[feat] Preserve version supplied by client (#9854 ) Just like GET/DELETE APIs it is possible to preserve client supplied versionId's, of course the versionIds have to be uuid, if an existing versionId is found it is overwritten if no object locking policies are found. - PUT /bucketname/objectname?versionId=<id> - POST /bucketname/objectname?uploads=&versionId=<id> - PUT /bucketname/objectname?verisonId=<id> (with x-amz-copy-source)	2020-06-17 11:13:41 -07:00
Harshavardhana	4ac31ea82b	fix: find current location of object multi-zones (#9840 ) PutObject on multiple-zone with versioning would not overwrite the correct location of the object if the object has delete marker, leading to duplicate objects on two zones. This PR fixes by adding affinity towards delete marker when GetObjectInfo() returns error, use the zone index which has the delete marker.	2020-06-17 08:33:14 -07:00
Anis Elleuch	2073b79633	fix: Remove unnecessary debug log line (#9834 )	2020-06-15 08:55:33 -07:00
Anis Elleuch	63e9005f01	fix: Avoid updating object tags on failed disks (#9819 )	2020-06-14 10:53:07 -07:00
Harshavardhana	4915433bd2	Support bucket versioning (#9377 ) - Implement a new xl.json 2.0.0 format to support, this moves the entire marshaling logic to POSIX layer, top layer always consumes a common FileInfo construct which simplifies the metadata reads. - Implement list object versions - Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	2020-06-12 20:04:01 -07:00

1 2 3 4 5

234 Commits