minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	e377bb949a	migrate bootstrap logic directly to websockets (#18855 ) improve performance for startup sequences by 2x for 300+ nodes.	2024-01-24 13:36:44 -08:00
Anis Eleuch	a47fc75c26	xl: Remove wrong wording for errCorruptedFormat (#18775 ) Also add errCorruptedBackend to make it easier to differentiate between corrupted content or something else wrong in the backend drive	2024-01-12 14:48:44 -08:00
Krishnan Parthasarathi	9fbd931058	Skip versions expired by DeleteAllVersionsAction (#18537 ) Object versions expired by DeleteAllVersionsAction must not be included toward data-usage accounting.	2023-11-28 08:39:21 -08:00
Harshavardhana	fba883839d	feat: bring new HDD related performance enhancements (#18239 ) Optionally allows customers to enable - Enable an external cache to catch GET/HEAD responses - Enable skipping disks that are slow to respond in GET/HEAD when we have already achieved a quorum	2023-11-22 13:46:17 -08:00
Harshavardhana	cb089dcb52	error out by default beyond 10000 versions per object (#17803 ) ``` You've exceeded the limit on the number of versions you can create on this object ```	2023-08-04 10:40:21 -07:00
Anis Elleuch	89db3fdb5d	Do not return an error when version disparity is detected (#16269 )	2022-12-16 08:52:12 -08:00
Harshavardhana	23b329b9df	remove gateway completely (#15929 )	2022-10-24 17:44:15 -07:00
Harshavardhana	8e997eba4a	fix: trigger Heal when xl.meta needs healing during PUT (#15661 ) This PR is a continuation of the previous change instead of returning an error, instead trigger a spot heal on the 'xl.meta' and return only after the healing is complete. This allows for future GETs on the same resource to be consistent for any version of the object.	2022-09-07 07:25:39 -07:00
ebozduman	b57e7321e7	Replaces 'disk'=>'drive' visible to end user (#15464 )	2022-08-04 16:10:08 -07:00
Harshavardhana	f1abb92f0c	feat: Single drive XL implementation (#14970 ) Main motivation is move towards a common backend format for all different types of modes in MinIO, allowing for a simpler code and predictable behavior across all features. This PR also brings features such as versioning, replication, transitioning to single drive setups.	2022-05-30 10:58:37 -07:00
Anis Elleuch	f5be8ba11f	Print log when EINVALID is encountered in storage layer (#13341 ) EINVALID from the OS is not a common case and should be logger.	2021-10-04 09:01:52 -07:00
Harshavardhana	ffd497673f	internode lockArgs should use messagepack (#13329 ) it would seem like using `bufio.Scan()` is very slow for heavy concurrent I/O, ie. when r.Body is slow , instead use a proper binary exchange format, to marshal and unmarshal the LockArgs datastructure in a cleaner way. this PR increases performance of the locking sub-system for tiny repeated read lock requests on same object. ``` BenchmarkLockArgs BenchmarkLockArgs-4 6417609 185.7 ns/op 56 B/op 2 allocs/op BenchmarkLockArgsOld BenchmarkLockArgsOld-4 1187368 1015 ns/op 4096 B/op 1 allocs/op ```	2021-09-30 11:53:01 -07:00
Harshavardhana	035882d292	fix: remove parentIsObject() check (#12851 ) we will allow situations such as ``` a/b/1.txt a/b ``` and ``` a/b a/b/1.txt ``` we are going to document that this usecase is not supported and we will never support it, if any application does this users have to delete the top level parent to make sure namespace is accessible at lower level. rest of the situations where the prefixes get created across sets are supported as is.	2021-08-03 13:26:57 -07:00
Harshavardhana	069432566f	update license change for MinIO Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 11:58:53 -07:00
Harshavardhana	8778828a03	fix: read metadata in O_DIRECT if configured and supported (#11594 ) reduce the page-cache pressure completely by moving the entire read-phase of our operations to O_DIRECT, primarily this is going to be very useful for chatty metadata operations such as listing, scanner, ilm, healing like operations to avoid filling up the page-cache upon repeated runs.	2021-02-22 01:36:17 -08:00
Harshavardhana	289e1d8b2a	fix: reduce crawler memory usage by orders of magnitude (#11556 ) currently crawler waits for an entire readdir call to return until it processes usage, lifecycle, replication and healing - instead we should pass the applicator all the way down to avoid building any special stack for all the contents in a single directory. This allows for - no need to remember the entire list of entries per directory before applying the required functions - no need to wait for entire readdir() call to finish before applying the required functions	2021-02-17 15:34:42 -08:00
Anis Elleuch	c9d502e6fa	parentDirIsObject() to return quickly with inexistant parent (#11204 ) Rewrite parentIsObject() function. Currently if a client uploads a/b/c/d, we always check if c, b, a are actual objects or not. The new code will check with the reverse order and quickly quit if the segment doesn't exist. So if a, b, c in 'a/b/c' does not exist in the first place, then returns false quickly.	2021-01-02 12:01:29 -08:00
Harshavardhana	df93102235	fix: unwrapping issues with os.Is* functions (#10949 ) reduces 3 stat calls, reducing the overall startup time significantly.	2020-11-23 08:36:49 -08:00
Harshavardhana	6a8c62f9fd	make sure to preserve UUID from reference format (#10748 ) reference format should be source of truth for inconsistent drives which reconnect, add them back to their original position remove automatic fix for existing offline disk uuids	2020-10-24 13:23:08 -07:00
Harshavardhana	66174692a2	add '.healing.bin' for tracking currently healing disk (#10573 ) add a hint on the disk to allow for tracking fresh disk being healed, to allow for restartable heals, and also use this as a way to track and remove disks. There are more pending changes where we should move all the disk formatting logic to backend drives, this PR doesn't deal with this refactor instead makes it easier to track healing in the future.	2020-09-28 19:39:32 -07:00
Harshavardhana	019fe69a57	fix: reduce an extra system call for writes instead fail later (#10187 )	2020-08-04 12:09:41 -07:00
Harshavardhana	b16781846e	allow server to start even with corrupted/faulty disks (#10175 )	2020-08-03 18:17:48 -07:00
Harshavardhana	4915433bd2	Support bucket versioning (#9377 ) - Implement a new xl.json 2.0.0 format to support, this moves the entire marshaling logic to POSIX layer, top layer always consumes a common FileInfo construct which simplifies the metadata reads. - Implement list object versions - Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	2020-06-12 20:04:01 -07:00
Harshavardhana	2dc46cb153	Report correct error when O_DIRECT is not supported (#9545 ) fixes #9537	2020-05-07 16:12:16 -07:00
Harshavardhana	0879a4f743	rest/storage: Remove racy LastError usage (#8817 ) instead perform a liveness check call to verify if server is online and print relevant errors. Also introduce a StorageErr string error type instead of errors.New() deprecate usage of VerifyFileError, DeleteFileError for gob, change in datastructure also requires bump in storage REST version to v13. Fixes #8811	2020-01-14 18:45:17 -08:00
Harshavardhana	ff5bf51952	admin/heal: Fix deep healing to heal objects under more conditions (#8321 ) - Heal if the part.1 is truncated from its original size - Heal if the part.1 fails while being verified in between - Heal if the part.1 fails while being at a certain offset Other cleanups include make sure to flush the HTTP responses properly from storage-rest-server, avoid using 'defer' to improve call latency. 'defer' incurs latency avoid them in our hot-paths such as storage-rest handlers. Fixes #8319	2019-10-02 01:42:15 +05:30
Anis Elleuch	000a60f238	xl: Heal empty parts (#7860 ) posix.VerifyFile() doesn't know how to check if a file is corrupted if that file is empty. We do have the part size in xl.json so we pass it to VerifyFile to return an error so healing empty parts can work properly.	2019-07-13 00:29:44 +01:00
Krishna Srinivas	58d90ed73c	Avoid network transfer for bitrot verification during healing (#7375 )	2019-07-08 13:51:18 -07:00
poornas	cf2a436bc8	Show SlowDown error message if backend is busy (#7521 ) or if there are too many open file descriptors.	2019-05-02 07:09:57 -07:00
kannappanr	5ecac91a55	Replace Minio refs in docs with MinIO and links (#7494 )	2019-04-09 11:39:42 -07:00
Harshavardhana	c184038b6a	Add proper custom errors object creations (#7387 ) In scenario 1 ``` - bucket/object-prefix - bucket/object-prefix/object ``` Server responds with `XMinioParentIsObject` In scenario 2 ``` - bucket/object-prefix/object - bucket/object-prefix ``` Server responds with `XMinioObjectExistsAsDirectory` Fixes #6566	2019-03-20 13:06:53 -07:00
Krishna Srinivas	98c950aacd	Streaming bitrot verification support (#7004 )	2019-01-17 18:28:18 +05:30
Krishna Srinivas	ce02ab613d	Simplify erasure code by separating bitrot from erasure code (#5959 )	2018-08-06 15:14:08 -07:00
Praveen raj Mani	ea76e72054	Incorrect error message for insufficient volume fix (#6099 ) Reply back with appropriate error message when the server is spawn with volume of insufficient size (< 1GiB). Fixes #5993.	2018-06-28 12:01:05 -07:00
Harshavardhana	fb96779a8a	Add large bucket support for erasure coded backend (#5160 ) This PR implements an object layer which combines input erasure sets of XL layers into a unified namespace. This object layer extends the existing erasure coded implementation, it is assumed in this design that providing > 16 disks is a static configuration as well i.e if you started the setup with 32 disks with 4 sets 8 disks per pack then you would need to provide 4 sets always. Some design details and restrictions: - Objects are distributed using consistent ordering to a unique erasure coded layer. - Each pack has its own dsync so locks are synchronized properly at pack (erasure layer). - Each pack still has a maximum of 16 disks requirement, you can start with multiple such sets statically. - Static sets set of disks and cannot be changed, there is no elastic expansion allowed. - Static sets set of disks and cannot be changed, there is no elastic removal allowed. - ListObjects() across sets can be noticeably slower since List happens on all servers, and is merged at this sets layer. Fixes #5465 Fixes #5464 Fixes #5461 Fixes #5460 Fixes #5459 Fixes #5458 Fixes #5460 Fixes #5488 Fixes #5489 Fixes #5497 Fixes #5496	2018-02-15 17:45:57 -08:00
Harshavardhana	eb2894233c	Convert gateways into respective packages (#5200 ) - Make azure gateway a package - Make b2 gateway a package - Make gcs gateway a package - Make s3 gateway a package - Make sia gateway a package	2017-12-05 17:58:09 -08:00
Harshavardhana	8efa82126b	Convert errors tracer into a separate package (#5221 )	2017-11-25 11:58:29 -08:00
Harshavardhana	879cef37a1	Fail to start server if detected cross-device mounts. (#4807 ) Fixes #4764	2017-08-15 15:10:50 -07:00
Frank Wessels	98b62cbec8	Implement an offline mode for a distributed node (#4646 ) Implement an offline mode for remote storage to cache the offline status of a node in order to prevent network calls that are bound to fail. After a time interval an attempt will be made to restore the connection and mark the node as online if successful. Fixes #4183	2017-08-11 11:49:35 -07:00
Harshavardhana	f5ce685aa1	Remove dead unused errs and constants. (#4627 )	2017-07-07 14:31:42 -07:00
Krishna Srinivas	3928c1e14c	gateway/gcs: Change in multipart backend format (#4455 )	2017-06-17 16:00:41 -07:00
Aditya Manthramurthy	8975da4e84	Add new ReadFileWithVerify storage-layer API (#4349 ) This is an enhancement to the XL/distributed-XL mode. FS mode is unaffected. The ReadFileWithVerify storage-layer call is similar to ReadFile with the additional functionality of performing bit-rot checking. It accepts additional parameters for a hashing algorithm to use and the expected hex-encoded hash string. This patch provides significant performance improvement because: 1. combines the step of reading the file (during erasure-decoding/reconstruction) with bit-rot verification; 2. limits the number of file-reads; and 3. avoids transferring the file over the network for bit-rot verification. ReadFile API is implemented as ReadFileWithVerify with empty hashing arguments. Credits to AB and Harsha for the algorithmic improvement. Fixes #4236.	2017-05-16 14:21:52 -07:00
Anis Elleuch	f2ed149714	Add slack channel link to corrupted disk err msg (#4270 )	2017-05-11 14:27:32 -07:00
Harshavardhana	48aa2ac392	server: Validate path for bad components in a handler. (#4170 )	2017-04-24 18:13:46 -07:00
Harshavardhana	0b9f0d14a1	auth/rpc: Take remote disk offline after maximum allowed attempts. (#3288 ) Disks when are offline for a long period of time, we should ignore the disk after trying Login upto 5 times. This is to reduce the network chattiness, this also reduces the overall time spent on `net.Dial`. Fixes #3286	2016-11-20 16:57:12 -08:00
Harshavardhana	6494b77d41	server: Add more elaborate startup messages. (#2731 ) These messages based on our prep stage during XL and prints more informative message regarding drive information. This change also does a much needed refactoring.	2016-10-05 12:48:07 -07:00
Harshavardhana	6aa2fc95c0	Revert "bucket: refactor policies and fix bugs related to enforcing policies. (#2766 )" This reverts commit `ca5ca8332b`.	2016-09-26 19:32:33 -07:00
Harshavardhana	ca5ca8332b	bucket: refactor policies and fix bugs related to enforcing policies. (#2766 ) This patch also addresses the problem of double caching at object layer once at XL and another at handler layer.	2016-09-22 23:47:48 -07:00
Krishnan Parthasarathi	e55926e8cf	distribute: Make server work with multiple remote disks This change initializes rpc servers associated with disks that are local. It makes object layer initialization on demand, namely on the first request to the object layer. Also adds lock RPC service vendorized minio/dsync	2016-09-13 21:18:30 -07:00
Harshavardhana	bccf549463	server: Move all the top level files into cmd folder. (#2490 ) This change brings a change which was done for the 'mc' package to allow for clean repo and have a cleaner github drop in experience.	2016-08-18 16:23:42 -07:00

50 Commits