minio

mirror of https://github.com/minio/minio.git synced 2025-12-08 08:42:43 -05:00

Author	SHA1	Message	Date
Harshavardhana	62b1da3e2c	fix offline disk calculation (#9801 ) Current code was relying on globalEndpoints as the source of secondary truth to obtain the missing endpoints list when the disk is offline, this is problematic - there is no way to know if the getDisks() returned endpoints total is same as the ones list of globalEndpoints and it belongs to a particular set. - there is no order guarantee as getDisks() is ordered as per format.json, globalEndpoints may not be, so potentially end up including incorrect endpoints. To fix this bring getEndpoints() just like getDisks() to ensure that consistently ordered endpoints are always available for us to ensure that returned values are consistent with what each erasure set would observe.	2020-06-10 17:10:31 -07:00
poornas	d26b24f670	avoid storing X-Amz-Tagging-Directive in metadata (#9800 )	2020-06-10 14:29:24 -07:00
kannappanr	2c372a9894	Send Partscount only when partnumber is specified (#9793 ) Fixes #9789	2020-06-10 09:22:15 -07:00
poornas	3d3b75fb8d	Avoid overwriting object tags when changing lock (#9794 )	2020-06-10 08:16:30 -07:00
Klaus Post	142b057be8	Check object names on windows (#9798 ) Uploading files with names that could not be written to disk would result in "reduce your request" errors returned. Instead check explicitly for disallowed characters and reject files with `Object name contains unsupported characters.`	2020-06-10 08:14:22 -07:00
Harshavardhana	4790868878	allow background IAM load to speed up startup (#9796 ) Also fix healthcheck handler to run success only if object layer has initialized fully for S3 API access call.	2020-06-09 19:19:03 -07:00
Harshavardhana	342ade03f6	deprecate listDir usage for healing (#9792 ) listDir was incorrectly used for healing which is slower, instead use Walk() to heal the entire set.	2020-06-09 17:09:19 -07:00
P R	9407dbf387	display proper used space based on disk usage (#9551 ) Fixes #9346	2020-06-09 15:05:39 -07:00
Harshavardhana	423aeb0d81	allow large buffer to list more entries per directory (#9785 )	2020-06-09 09:44:50 -07:00
Anis Elleuch	790323ac37	lifecycle: Fix object expiration date (#9791 ) re-use PredictExpiryTime() in ComputeAction()	2020-06-09 09:40:53 -07:00
Harshavardhana	febe9cc26a	fix: avoid timer leaks in dsync/lsync (#9781 ) At a customer setup with lots of concurrent calls it can be observed that in newRetryTimer there were lots of tiny alloations which are not relinquished upon retries, in this codepath we were only interested in re-using the timer and use it wisely for each locker. ``` (pprof) top Showing nodes accounting for 8.68TB, 97.02% of 8.95TB total Dropped 1198 nodes (cum <= 0.04TB) Showing top 10 nodes out of 79 flat flat% sum% cum cum% 5.95TB 66.50% 66.50% 5.95TB 66.50% time.NewTimer 1.16TB 13.02% 79.51% 1.16TB 13.02% github.com/ncw/directio.AlignedBlock 0.67TB 7.53% 87.04% 0.70TB 7.78% github.com/minio/minio/cmd.xlObjects.putObject 0.21TB 2.36% 89.40% 0.21TB 2.36% github.com/minio/minio/cmd.(posix).Walk 0.19TB 2.08% 91.49% 0.27TB 2.99% os.statNolog 0.14TB 1.59% 93.08% 0.14TB 1.60% os.(File).readdirnames 0.10TB 1.09% 94.17% 0.11TB 1.25% github.com/minio/minio/cmd.readDirN 0.10TB 1.07% 95.23% 0.10TB 1.07% syscall.ByteSliceFromString 0.09TB 1.03% 96.27% 0.09TB 1.03% strings.(Builder).grow 0.07TB 0.75% 97.02% 0.07TB 0.75% path.(lazybuf).append ```	2020-06-08 11:28:40 -07:00
Praveen raj Mani	2ce2e88adf	Support mTLS Authentication in Webhooks (#9777 )	2020-06-08 05:55:44 -07:00
Harshavardhana	c7599d323b	fix: throw error if symmetry cannot be obtained (#9780 ) For example `{1...17}/{1...52}` symmetrical distribution of drives cannot be obtained - Because 17 is a prime number - Is not divisible by any pre-defined setCounts i.e from 1 to 16	2020-06-06 22:13:48 -07:00
Harshavardhana	d93bdea433	fix remove LDAPPassword from audit logs (#9773 ) the previous fix for #9707 was not correct, fix this properly passing the right filter keys to be filtered from the audit log output. Fixes #9767	2020-06-04 22:07:55 -07:00
Harshavardhana	5e529a1c96	simplify context timeout for readiness (#9772 ) additionally also add CORS support to restrict for specific origin, adds a new config and updated the documentation as well	2020-06-04 14:58:34 -07:00
Harshavardhana	5686a7e273	fix NAS gateway support for policy/notification (#9765 ) Fixes #9764	2020-06-03 13:18:54 -07:00
Harshavardhana	566e0e2048	allow deleting of dropped multiparts (#9753 ) bonus change trigger MRF heal when single offline disk is found, break out early.	2020-06-02 15:27:03 -07:00
Anis Elleuch	3aad09be28	heal: Fix passing healing opts (#9756 ) Manual healing (as background healing) creates a heal task with a possiblity to override healing options, such as deep or normal mode. Use a pointer type in heal opts so nil would mean use the default healing options.	2020-06-02 09:07:16 -07:00
Harshavardhana	f0358acb32	concurrently load bucket metadata (#9749 )	2020-06-01 22:32:53 -07:00
Anis Elleuch	fd0de4ab32	azure: Show better message when credentials are wrong (#9748 )	2020-06-01 18:23:48 -07:00
Anis Elleuch	73a308502f	Relax content-md5 requirement in set encryption handler (#9750 ) aws cli fails to set a bucket encryption configuration to MinIO server. The reason is that aws cli does not send MD5-Content header. It seems that MD5-Content is not required anymore. This commit also returns Not Implemented header early to help mint tests to ignore testing this API in gateway modes.	2020-06-01 18:08:19 -07:00
Anis Elleuch	bd59f150b8	azure: Implement CopyPart API (#9747 )	2020-06-01 11:12:18 -07:00
Harshavardhana	f90422a890	fix prometheus calculation of offline disks per instance (#9744 ) This was a regression introduced in `9baeda7` for prometheus calculation of offline disks which should be local to an instance. fixes #9742	2020-06-01 07:35:40 -07:00
Harshavardhana	8befedef14	simplify FS multipart cleanup (#9740 ) fixes #9671	2020-05-30 13:56:31 -07:00
Nathan Brown	2af3004409	Use registry to check Atime support on Windows (#9741 )	2020-05-30 09:47:42 -07:00
Harshavardhana	38ee40d59c	move to upstream code colinmarc/hdfs (#9738 ) - supports SASL based authentication now - upgrades to new changes in gokrb library - implement force delete feature Fixes #8206	2020-05-29 18:38:50 -07:00
kannappanr	d583f1ac0e	check if container is empty before invoking DeleteContainer (#9733 )	2020-05-29 13:24:39 -07:00
Harshavardhana	2bcb02f628	Avoid '\n' from constant strings (#9737 ) Fixes #9736	2020-05-29 11:40:57 -07:00
Klaus Post	167ddf9c9c	Workaround for Windows Docker Engine 19.03.8 (#9735 ) Add workaround for issue preventing servers from starting on Windows Docker Engine 19.03.8 Fixes #9726	2020-05-29 07:05:19 -07:00
Anton Huck	f833e41e69	IAM: Fix nil panic due to uninit. iamGroupPolicyMap. Fixes #9730 (#9734 )	2020-05-29 06:13:54 -07:00
Harshavardhana	41688a936b	fix: CopyObject behavior on expanded zones (#9729 ) CopyObject was not correctly figuring out the correct destination object location and would end up creating duplicate objects on two different zones, reproduced by doing encryption based key rotation.	2020-05-28 14:36:38 -07:00
Harshavardhana	b2db8123ec	Preserve errors returned by diskInfo to detect disk errors (#9727 ) This PR basically reverts #9720 and re-implements it differently	2020-05-28 13:03:04 -07:00
Harshavardhana	b330c2c57e	Introduce simpler GetMultipartInfo call for performance (#9722 ) Advantages avoids 100's of stats which are needed for each upload operation in FS/NAS gateway mode when uploading a large multipart object, dramatically increases performance for multipart uploads by avoiding recursive calls. For other gateway's simplifies the approach since azure, gcs, hdfs gateway's don't capture any specific metadata during upload which needs handler validation for encryption/compression. Erasure coding was already optimized, additionally just avoids small allocations of large data structure. Fixes #7206	2020-05-28 12:36:20 -07:00
kannappanr	7214a0160a	allow bucket policy to set/removed in NAS gateway (#9706 )	2020-05-28 08:31:16 -07:00
Anis Elleuch	375b79f11b	storage: Implement GetDiskID request in REST server side (#9720 ) GetDiskID() in storage rest client does not really issue a REST request to the remote disk, but returns an in-memory value instead. However, GetDiskID() should return an error when format.json is not found or for other similar issues (unmounted disks, etc..) GetDiskID() is only called when formatting disks and getting storage informatio, hence this commit should not have a performance degradation.	2020-05-28 08:17:42 -07:00
Harshavardhana	3da1869d5e	Avoid double reads on metadata during GetObject() (#9719 ) Overall TTFB can see a dramatic improvement with this change - did not do any benchmark as such but the change itself is self-explanatory	2020-05-27 16:14:26 -07:00
Harshavardhana	7cedc5369d	fix: send valid claims in AuditLogs for browser requests (#9713 ) Additionally also fix STS logs to filter out LDAP password to be sent out in audit logs. Bonus fix handle the reload of users properly by making sure to preserve the newer users during the reload to be not invalidated. Fixes #9707 Fixes #9644 Fixes #9651	2020-05-27 12:38:44 -07:00
Harshavardhana	53aaa5d2a5	Export bucket usage counts as part of bucket metrics (#9710 ) Bonus fixes in quota enforcement to use the new datastructure and use timedValue to cache a value/reload automatically avoids one less global variable.	2020-05-27 06:45:43 -07:00
P R	9d39fb3604	add copyobject tagging replace directive for gateway (#9711 )	2020-05-26 17:32:53 -07:00
Klaus Post	4a007e3767	Prefer local disks when fetching data blocks (#9563 ) If the requested server is part of the set this will always read from the local disk, even if the disk contains a parity shard. In default setup there is a 50% chance that at least one shard that otherwise would have been fetched remotely will be read locally instead. It basically trades RPC call overhead for reed-solomon. On distributed localhost this seems to be fairly break-even, with a very small gain in throughput and latency. However on networked servers this should be a bigger 1MB objects, before: ``` Operation: GET. Concurrency: 32. Hosts: 4. Requests considered: 76257: * Avg: 25ms 50%: 24ms 90%: 32ms 99%: 42ms Fastest: 7ms Slowest: 67ms * First Byte: Average: 23ms, Median: 22ms, Best: 5ms, Worst: 65ms Throughput: * Average: 1213.68 MiB/s, 1272.63 obj/s (59.948s, starting 14:45:44 CEST) ``` After: ``` Operation: GET. Concurrency: 32. Hosts: 4. Requests considered: 78845: * Avg: 24ms 50%: 24ms 90%: 31ms 99%: 39ms Fastest: 8ms Slowest: 62ms * First Byte: Average: 22ms, Median: 21ms, Best: 6ms, Worst: 57ms Throughput: * Average: 1255.11 MiB/s, 1316.08 obj/s (59.938s, starting 14:43:58 CEST) ``` Bonus fix: Only ask for heal once on an object.	2020-05-26 16:47:23 -07:00
Klaus Post	95814359bd	cache disk info to avoid repeated calls (#9682 ) This value is requested on every upload when there are multiple zones. Since this will result in an RPC call to every remote disk this scales quite badly in a distributed setup. Load every 1second interval. 2 servers, localhost only. In large distributed setups much bigger gains can be expected. ``` Operations: 21743 -> 22454 * Average: +3.28% (+0.0 MiB/s) throughput, +3.28% (+11.9) obj/s * Fastest: +3.37% (+0.0 MiB/s) throughput, +3.37% (+13.0) obj/s * 50% Median: +3.03% (+0.0 MiB/s) throughput, +3.03% (+11.2) obj/s * Slowest: +8.03% (+0.0 MiB/s) throughput, +8.03% (+22.8) obj/s ``` For easy management of this a generic helper has been added.	2020-05-26 12:52:24 -07:00
Harshavardhana	d0ae69087c	fix: add proper errors for disks with preexisting content (#9703 )	2020-05-26 09:32:33 -07:00
Harshavardhana	7ea026ff1d	fix: reply back user-metadata in lower case form (#9697 ) some clients such as veeam expect the x-amz-meta to be sent in lower cased form, while this does indeed defeats the HTTP protocol contract it is harder to change these applications, while these applications get fixed appropriately in future. x-amz-meta is usually sent in lowercased form by AWS S3 and some applications like veeam incorrectly end up relying on the case sensitivity of the HTTP headers. Bonus fixes - Fix the iso8601 time format to keep it same as AWS S3 response - Increase maxObjectList to 50,000 and use maxDeleteList as 10,000 whenever multi-object deletes are needed.	2020-05-25 16:51:32 -07:00
Harshavardhana	6e0575a53d	Revert "Disable crawler in FS/NAS gateway mode (#9695 )" (#9702 ) This reverts commit `eba423bb9d`. Additionally also address the FS crawler to properly calculate the sizes for encrypted/compressed content.	2020-05-25 11:32:53 -07:00
Harshavardhana	eba423bb9d	Disable crawler in FS/NAS gateway mode (#9695 ) No one really uses FS for large scale accounting usage, neither we crawl in NAS gateway mode. It is worthwhile to simply disable this feature as its not useful for anyone. Bonus disable bucket quota ops as well in, FS and gateway mode	2020-05-25 00:17:52 -07:00
Erkki Eilonen	301de169e9	in cache build ranges metadata as needed (#9698 )	2020-05-25 00:17:03 -07:00
Harshavardhana	0c71ce3398	fix size accounting for encrypted/compressed objects (#9690 ) size calculation in crawler was using the real size of the object instead of its actual size i.e either a decrypted or uncompressed size. this is needed to make sure all other accounting such as bucket quota and mcs UI to display the correct values.	2020-05-24 11:19:17 -07:00
Krishna Srinivas	7d19ab9f62	readiness returns error quickly if any of the set is down (#9662 ) This PR adds a new configuration parameter which allows readiness check to respond within 10secs, this can be reduced to a lower value if necessary using ``` mc admin config set api ready_deadline=5s ``` or ``` export MINIO_API_READY_DEADLINE=5s ```	2020-05-23 17:38:39 -07:00
P R	3f6d624c7b	add gateway object tagging support (#9124 )	2020-05-23 11:09:35 -07:00
Harshavardhana	c138272d63	reject object lock requests on existing buckets (#9684 ) a regression was introduced fix it to ensure that we do not allow object locking settings on existing buckets without object locking	2020-05-23 10:01:01 -07:00

1 2 3 4 5 ...

2724 Commits