minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	6ac48a65cb	fix: use unused cacheMetrics code in prometheus (#9588 ) remove all other unused/dead code	2020-05-13 08:15:26 -07:00
Harshavardhana	bc61417284	calculate automatic node based symmetry (#9446 ) It is possible in many scenarios that even if the divisible value is optimal, we may end up with uneven distribution due to the number of nodes present in the configuration. Added code to allow for affinity towards various ellipses to figure out the optimal value across ellipses such that we can always reach a symmetric value automatically. Fixes #9416	2020-04-27 14:39:57 -07:00
Harshavardhana	f44cfb2863	use GlobalContext whenever possible (#9280 ) This change is throughout the codebase to ensure that all codepaths honor GlobalContext	2020-04-09 09:30:02 -07:00
Harshavardhana	91f21ddc47	fix: ignore lost+found properly while reading disks (#9278 ) Fixes #9277	2020-04-06 16:51:18 -07:00
Harshavardhana	30707659b5	[feature] allow for an odd number of erasure packs (#9221 ) Too many deployments come up with an odd number of hosts or drives, to facilitate even distribution among those setups allow for odd and prime numbers based packs.	2020-03-31 09:32:16 -07:00
Harshavardhana	6f992134a2	fix: startup load time by reusing storageDisks (#9210 )	2020-03-27 14:48:30 -07:00
Klaus Post	8d98662633	re-implement data usage crawler to be more efficient (#9075 ) Implementation overview: https://gist.github.com/klauspost/1801c858d5e0df391114436fdad6987b	2020-03-18 16:19:29 -07:00
Harshavardhana	6a00eb10bf	fix: allow set drive count of proper divisible values (#9101 ) Currently the code assumed some orthogonal requirements, which led to situations where, in a setup with for example 168 drives, the final set_drive_count chosen was 14. Indeed 168 drives are divisible by 12, but this wasn't allowed due to an unexpected requirement for 12 to be a perfect modulo of 14, which is not possible. This assumption was incorrect. This PR fixes this old assumption properly, and also adds a few tests, including some negative tests. Error messages are improved as well.	2020-03-08 13:30:25 -07:00
Anis Elleuch	6d5d77f62c	usage typo: Fix creating .minio.sys/background-ops bucket (#8957 ) Due to a typo in the code, a cluster was not correctly creating `background-ops` in all disks and nodes print the following error: minio3_1 \| API: SYSTEM() minio3_1 \| Time: 19:32:45 UTC 02/06/2020 minio3_1 \| DeploymentID: d67c20fa-4a1e-41f5-b319-7e3e90f425d8 minio3_1 \| Error: Bucket not found: .minio.sys/background-ops minio3_1 \| 2: cmd/data-usage.go:109:cmd.runDataUsageInfo() minio3_1 \| 1: cmd/data-usage.go:56:cmd.runDataUsageInfoUpdateRoutine() This commit fixes the typo.	2020-02-06 13:12:36 -08:00
Harshavardhana	64fde1ab95	xl/zones: return errNoHealRequired when no heal is required (#8821 ) Zone abstraction of object layer was returning `nil` incorrectly under situations where disk healing is not required. Returning `nil` is considered as healing successful, which leads to unexpected ReloadFormat() peer notification calls during startup. This PR fixes this behavior properly for zones.	2020-01-15 17:19:13 -08:00
Anis Elleuch	069876e262	xl: All nodes create meta volumes in its local disks (#8786 ) Meta volume directories (tmp/, background-ops/, etc.) under .minio.sys are created when disks are formatted but also when the cluster is started. However, using MakeVolBulk() is not appropriate in the case of a user migrating from a version which does not have .minio.sys/background-ops/. The reason is that MakeVolBulk() exits early when an error occurs: errVolumeExists in this case, which is expected since some directories such as tmp/ already exist. This commit avoids MakeVolBulk and uses MakeVol instead. The PR also makes each node create meta volumes in its local disks and stops relying on the first disk, since the first node could be offline.	2020-01-15 12:36:52 -08:00
Klaus Post	37b32199e3	Validate XL sets on format (#8779 ) When formatting a set validate if a host failure will likely lead to data loss. While we don't know what config will be set in the future evaluate to our best knowledge, assuming default settings.	2020-01-13 13:09:10 -08:00
Harshavardhana	f68a7005c0	Improve disk formatting stage for large disk sets (#8690 )	2019-12-23 16:31:03 -08:00
Anis Elleuch	555969ee42	Add data usage collect with its new admin API (#8553 ) Admin data usage info API returns the following (Only FS & XL, for now) - Number of buckets - Number of objects - The total size of objects - Objects histogram - Bucket sizes	2019-12-12 06:02:37 -08:00
Harshavardhana	5d3d57c12a	Start using error wrapping with fmt.Errorf (#8588 ) Use fatih/errwrap to fix all the code to use error wrapping with fmt.Errorf()	2019-12-02 09:28:01 -08:00
Harshavardhana	4e9de58675	Avoid pointer based copy, instead use Clone() (#8547 ) This PR adds functional test to test expanded cluster syntax.	2019-11-21 17:54:51 +05:30
Harshavardhana	8392d2f510	Preserve same deploymentID on all zones (#8542 )	2019-11-20 15:39:30 +05:30
Harshavardhana	347b29d059	Implement bucket expansion (#8509 )	2019-11-19 17:42:27 -08:00
Harshavardhana	e9b2bf00ad	Support MinIO to be deployed on more than 32 nodes (#8492 ) This PR moves locking from a global entity to a more localized set-level entity, allowing locks to be held only on the resources which are writing to a collection of disks rather than at a global level. In the process, this PR also lifts the top-level limit of 32 nodes, allowing an unlimited number of nodes. This is a precursor change before bringing in bucket expansion.	2019-11-13 12:17:45 -08:00
Harshavardhana	68a519a468	Use errgroups instead of sync.WaitGroup as needed (#8354 )	2019-10-14 09:44:51 -07:00
Harshavardhana	127641731a	Parallelize initialization of storageDisks (#8288 )	2019-09-27 16:47:12 -07:00
Harshavardhana	c8fbc94329	Fix writing 'format.json' and make it atomic (#8296 ) - Choose a unique uuid such that under situations of duplicate mounts we do not append to an existing json entry. - Avoid AppendFile instead use WriteAll() to write the entire byte array atomically.	2019-09-24 18:47:26 -07:00
Harshavardhana	53e4887e02	Simplify and cleanup metadata r/w functions (#8146 )	2019-09-11 22:52:12 +05:30
Praveen raj Mani	b976521c83	Ignore faulty disks in xl-sets Storage info (#7878 )	2019-08-02 12:17:26 -07:00
Krishna Srinivas	338e9a9be9	Put object client disconnect (#7824 ) Fail putObject and postpolicy in case client prematurely disconnects Use request's context to cancel lock requests on client disconnects	2019-06-28 22:09:17 -07:00
kannappanr	5ecac91a55	Replace Minio refs in docs with MinIO and links (#7494 )	2019-04-09 11:39:42 -07:00
Krishnan Parthasarathi	93a9078b23	Assign deploymentID for first minio server in distributed setup (#7427 ) - Pass local endpoints to functions fixing formatXL during startup	2019-04-02 10:50:13 -07:00
Anis Elleuch	dc2348daa5	heal: Preserve deployment ID from reference format.json (#7126 ) Deployment ID is not copied into new formats after healing format. Although this is not critical, since a new deployment ID will be generated and set on the next cluster restart, it is still much better not to change the deployment ID of a cluster, for better tracking.	2019-01-22 18:32:06 -08:00
Harshavardhana	8608a84c23	Ignore hidden directory .snapshot for NetApp volumes (#6889 )	2018-11-29 11:39:21 +05:30
Harshavardhana	d2f240c791	Ignore windows hidden folders (#6735 ) On Windows, an erasure coding setup such as ``` ~ minio server V:\ W:\ X:\ Z:\ ``` is not possible due to NTFS creating a couple of hidden folders; this PR allows minio to use the entire drive.	2018-11-02 11:31:55 -07:00
Harshavardhana	2dede2fdc2	Add reliable RemoveAll to handle racy situations (#6227 )	2018-08-06 09:45:28 +05:30
kannappanr	43cc0096fa	Add support for deployment ID (#6144 ) deployment ID helps in identifying a minio deployment in the case of remote logging targets.	2018-07-18 20:17:35 -07:00
Krishna Srinivas	0c9f4c9092	formatMetaV1 should be "inherited" by disk format structs (#6134 )	2018-07-16 20:26:42 -07:00
Harshavardhana	5f9041571f	Heal only when at least one of the disks is unformatted (#5866 ) Current healing has an issue where disks are healed even when they are offline, without knowing if the disk is unformatted. This can lead to prematurely removing the disk from the set just because it was temporarily offline. There is an increasing number of `mc admin heal` usage on a cron or regular basis. It is possible that if the healing code saw a disk is offline it might prematurely take it down, which causes availability issues. Fixes #5826	2018-05-01 09:07:39 +05:30
Harshavardhana	1f07545e2a	Improve init messages for distributed setup (#5786 ) Fixes #5531	2018-04-12 15:43:38 -07:00
kannappanr	cef992a395	Remove error package and cause functions (#5784 )	2018-04-10 09:36:37 -07:00
kannappanr	f8a3fd0c2a	Create logger package and rename errorIf to LogIf (#5678 ) Removing message from error logging Replace errors.Trace with LogIf	2018-04-05 15:04:40 -07:00
Harshavardhana	85a57d2021	Make sure to close the disk connections (#5752 ) Since we do not re-use storageDisks after moving the connections to object layer we should close them appropriately otherwise we have a lot of connection leaks and these can compound as the time goes by. This PR also refactors the initialization code to re-use storageDisks for given set of endpoints until we have confirmed a valid reference format.	2018-04-04 10:28:48 +05:30
Harshavardhana	2c5f2e9669	Stop deleting 'format.json' upon unsuccessful save (#5747 ) An issue was reproduced when there are no more inodes available on an existing setup of 4 disks: we took one of the disks and reformatted it to relinquish inodes, then attempted to bring the fresh disk back into the setup and perform a heal. At this point creating a new `format.json` fails on the existing disks since they do not have more inodes available. Due to the quorum failure, we end up deleting the existing `format.json` as well. This PR removes the code which deletes the existing `format.json`, as there is no need to delete them.	2018-04-03 10:48:06 +05:30
Harshavardhana	2938e332ba	Fix format migration regression (#5668 ) Migration regression got introduced in `9083bc152e` adding more unit tests to catch this scenario, we need to fix this by re-writing the formats after the migration to 'V3'. This bug only happens when a user is migrating directly from V1 to V3, not from V1 to V2 and V2 to V3. Added additional unit tests to cover these situations as well. Fixes #5667	2018-03-19 21:43:00 +05:30
Krishna Srinivas	9083bc152e	Flat multipart backend implementation for Erasure backend (#5447 )	2018-03-15 13:55:23 -07:00
Krishna Srinivas	a00e052606	Provide more descriptive error during erasure init (#5282 ) fixes #5239	2018-02-20 18:42:09 -08:00
Harshavardhana	fb96779a8a	Add large bucket support for erasure coded backend (#5160 ) This PR implements an object layer which combines input erasure sets of XL layers into a unified namespace. This object layer extends the existing erasure coded implementation. It is assumed in this design that providing > 16 disks is a static configuration as well, i.e. if you started the setup with 32 disks with 4 sets of 8 disks per pack then you would need to provide 4 sets always. Some design details and restrictions: - Objects are distributed using consistent ordering to a unique erasure coded layer. - Each pack has its own dsync so locks are synchronized properly at pack (erasure layer). - Each pack still has a maximum of 16 disks requirement, you can start with multiple such sets statically. - Sets are static sets of disks and cannot be changed, there is no elastic expansion allowed. - Sets are static sets of disks and cannot be changed, there is no elastic removal allowed. - ListObjects() across sets can be noticeably slower since List happens on all servers, and is merged at this sets layer. Fixes #5465 Fixes #5464 Fixes #5461 Fixes #5460 Fixes #5459 Fixes #5458 Fixes #5460 Fixes #5488 Fixes #5489 Fixes #5497 Fixes #5496	2018-02-15 17:45:57 -08:00
Aditya Manthramurthy	a337ea4d11	Move admin APIs to new path and add redesigned heal APIs (#5351 ) - Changes related to moving admin APIs - admin APIs now have an endpoint under /minio/admin - admin APIs are now versioned - a new API to serve the version is added at "GET /minio/admin/version" and all API operations have the path prefix /minio/admin/v1/<operation> - new service stop API added - credentials change API is moved to /minio/admin/v1/config/credential - credentials change API and configuration get/set API now require TLS so that credentials are protected - all API requests now receive JSON - heal APIs are disabled as they will be changed substantially - Heal API changes Heal API is now provided at a single endpoint with the ability for a client to start a heal sequence on all the data in the server, a single bucket, or under a prefix within a bucket. When a heal sequence is started, the server returns a unique token that needs to be used for subsequent 'status' requests to fetch heal results. On each status request from the client, the server returns heal result records that it has accumulated since the previous status request. The server accumulates up to 1000 records and pauses healing further objects until the client requests status. If the client does not request any further records for a long time, the server aborts the heal sequence automatically. A heal result record is returned for each entity healed on the server, such as system metadata, object metadata, buckets and objects, and has information about the before and after states on each disk. A client may request to force restart a heal sequence - this causes the running heal sequence to be aborted at the next safe spot and starts a new heal sequence.	2018-01-22 14:54:55 -08:00
Krishna Srinivas	7c72d14027	Separate the codebase for XL and FS format.json related code (#5317 )	2018-01-08 14:30:55 -08:00
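
Note on commit 6a00eb10bf: the 168-drive example in that message can be checked with a small sketch. This is not the code from the PR; the function name `divisibleSetSizes` and the 4 to 16 range are assumptions made for illustration (the 16-disk maximum is mentioned in fb96779a8a, the minimum of 4 is assumed). It only lists which set sizes divide the drive count evenly, which shows that 12 is as valid a candidate as 14.

```go
package main

import "fmt"

// divisibleSetSizes is a hypothetical helper, not MinIO's implementation.
// It returns every set size in [minSize, maxSize] that divides
// totalDrives with no remainder.
func divisibleSetSizes(totalDrives, minSize, maxSize int) []int {
	var sizes []int
	for s := minSize; s <= maxSize; s++ {
		if totalDrives%s == 0 {
			sizes = append(sizes, s)
		}
	}
	return sizes
}

func main() {
	// 168 drives, assumed set size range of 4..16.
	fmt.Println(divisibleSetSizes(168, 4, 16)) // prints [4 6 7 8 12 14]
}
```

Both 12 and 14 appear in the result, so a requirement relating 12 to 14 is not needed to pick a valid set drive count.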

45 Commits