minio

mirror of https://github.com/minio/minio.git synced 2024-12-26 07:05:55 -05:00

Author	SHA1	Message	Date
Andreas Auernhammer	18725679c4	crypto: allow multiple KES endpoints (#10383 ) This commit addresses a maintenance / automation problem when MinIO-KES is deployed on bare-metal. In orchestrated env. the orchestrator (K8S) will make sure that `n` KES servers (IPs) are available via the same DNS name. There it is sufficient to provide just one endpoint.	2020-08-31 18:10:52 -07:00
Anis Elleuch	ba8a8ad818	ListObjectsV1 requests unnecessarily fail with offline nodes (#10386 ) ListObjectsV1 requests are actually redirected to a specific node, depending on the bucket name. The purpose of this behavior was to optimize listing. However, the current code sends a Bad Gateway error if the target node is offline, which is a bad behavior because it means that the list request will fail, although this is unnecessary since we can still use the current node to list as well (the default behavior without using proxying optimization) Currently, you can see mint fails when there is one offline node, after this PR, mint will always succeed.	2020-08-31 12:37:31 -07:00
Harshavardhana	102ad60dee	simplify removing temporary files (#10389 )	2020-08-31 12:35:40 -07:00
Gaige B Paulsen	859ef52886	update for smartos build (solaris too) (#10378 )	2020-08-31 10:19:25 -07:00
Harshavardhana	e730da1438	fix: referesh JWKS public keys upon failure (#10368 ) fixes #10359	2020-08-28 08:15:12 -07:00
Anis Elleuch	46ee8659b4	fix write quorum calculation for bucket operations (#10364 ) When the number of disks is odd, the calculation of quorum for bucket operations were not correct, fix it.	2020-08-27 12:55:32 -07:00
Harshavardhana	a359e36e35	tolerate listing with only readQuorum disks (#10357 ) We can reduce this further in the future, but this is a good value to keep around. With the advent of continuous healing, we can be assured that namespace will eventually be consistent so we are okay to avoid the necessity to a list across all drives on all sets. Bonus Pop()'s in parallel seem to have the potential to wait too on large drive setups and cause more slowness instead of gaining any performance remove it for now. Also, implement load balanced reply for local disks, ensuring that local disks have an affinity for - cleanupStaleMultipartUploads()	2020-08-26 19:29:35 -07:00
Jorge Israel Peña	0a2e6d58a5	hdfs gateway handle listing single files (#10362 )	2020-08-26 16:03:53 -07:00
Klaus Post	1b119557c2	getDisksInfo: Attribute failed disks to correct endpoint (#10360 ) If DiskInfo calls failed the information returned was used anyway resulting in no endpoint being set. This would make the drive be attributed to the local system since `disk.Endpoint == disk.DrivePath` in that case. Instead, if the call fails record the endpoint and the error only.	2020-08-26 10:11:26 -07:00
Harshavardhana	7778fef6bb	update continous heal metrics appropriately for scanned items (#10352 ) bonus make sure to ignore objectNotFound, and versionNotFound errors properly at all layers, since HealObjects() returns objectNotFound error if the bucket or prefix is empty.	2020-08-26 08:53:33 -07:00
飞雪无情	ea1803417f	Use constants for gateway names to avoid bugs caused by spelling. (#10355 )	2020-08-26 08:52:46 -07:00
Harshavardhana	d19b434ffc	fix: bring back delayed leaf detection in listing (#10346 )	2020-08-25 12:26:48 -07:00
Klaus Post	17a1eda702	Disregard healing disks in crawling (#10349 ) When crawling never use a disk we know is healing. Most of the change involves keeping track of the original endpoint on xlStorage and this also fixes DiskInfo.Endpoint never being populated. Heal master will print `data-crawl: Disk "http://localhost:9001/data/mindev/data2/xl1" is Healing, skipping` once on a cycle (no more often than every 5m).	2020-08-25 10:55:15 -07:00
Daniel Valdivia	7d1734d033	indicate through HTTP header cluster healing in progress (#10342 )	2020-08-24 15:20:50 -07:00
Harshavardhana	03ec6adfd0	fix: KES http2.0 communication support (#10341 )	2020-08-24 14:37:53 -07:00
Harshavardhana	309b10f201	keep crawler cycle at 5 minutes	2020-08-24 14:05:16 -07:00
Klaus Post	c097ce9c32	continous healing based on crawler (#10103 ) Design: https://gist.github.com/klauspost/792fe25c315caf1dd15c8e79df124914	2020-08-24 13:47:01 -07:00
Harshavardhana	caad314faa	add ruleguard support, fix all the reported issues (#10335 )	2020-08-24 12:11:20 -07:00
Klaus Post	bc2ebe0021	Only enforce quota on success (#10339 ) We should only enforce quotas if no error has been returned. firstErr is safe to access since all goroutines have exited at this point. If `firstErr` hasn't been set by something else return the context error if cancelled.	2020-08-24 10:15:46 -07:00
Harshavardhana	11aa393ba7	Allow region errors to be dynamic (#10323 ) remove other FIXMEs as we are not planning to fix these, instead we will add dynamism case by case basis. fixes #10250	2020-08-23 22:06:22 -07:00
Praveen raj Mani	d0c910a6f3	Support https and basic-auth for elasticsearch notification target (#10332 )	2020-08-23 09:43:48 -07:00
kannappanr	d15a5ad4cc	S3 Gateway: Check for encryption headers properly (#10309 )	2020-08-22 11:41:49 -07:00
Harshavardhana	95411228db	add missing cleanupStaleMultipartUploads (#10325 ) fixes #10319	2020-08-21 21:39:54 -07:00
ebozduman	23774353b7	get_object() returns NoSuchKey error when object is a prefix (#10315 )	2020-08-21 13:08:01 -07:00
poornas	a2a5ec93d3	fix: use global context for filling cache in the background (#10308 )	2020-08-20 14:23:24 -07:00
Harshavardhana	27a774cbe9	fix: FS mode should reject putBucketVersioning (#10307 )	2020-08-20 13:18:06 -07:00
Klaus Post	8e6787a302	Fix TestDataUpdateTracker hanging (#10302 ) Keep dataUpdateTracker while goroutine is starting. This will ensure the object is updated one `start` returns Tested with ``` λ go test -cpu=1,2,4,8 -test.run TestDataUpdateTracker -count=1000 PASS ok github.com/minio/minio/cmd 8.913s ``` Fixes #10295	2020-08-20 13:17:42 -07:00
Harshavardhana	59352d0ac2	load all blocking metadata in background (#10298 ) most of this metadata already has fallbacks and there is no good reason to load them in blocking fashion	2020-08-20 10:38:53 -07:00
Harshavardhana	75d44b3bae	add disk for more context in bitrot errors (#10296 )	2020-08-20 09:41:15 -07:00
Klaus Post	95ae6c4b49	Fix missing unlock in *healSequence.hasEnded() (#10305 ) The background healing sequence would always hang when this function is called.	2020-08-20 08:48:09 -07:00
KevinSmile	0ebb73ee2e	use const instead of literals (#10292 )	2020-08-19 16:43:52 -07:00
Harshavardhana	c8b84a0e9e	Add nancy vulnerability scanner (#10289 )	2020-08-19 14:25:21 -07:00
Ritesh H Shukla	3acb5cff45	Update code comment (#10287 )	2020-08-19 14:24:58 -07:00
Harshavardhana	74116204ce	handle fresh setup with mixed drives (#10273 ) fresh drive setups when one of the drive is a root drive, we should ignore such a root drive and not proceed to format. This PR handles this properly by marking the disks which are root disk and they are taken offline.	2020-08-18 14:37:26 -07:00
Harshavardhana	e4a44f6224	fix: commonPrefixes behavior in ListObjectVersions (#10286 ) ``` $ aws s3api --profile minio --endpoint-url http://localhost:9003 \ list-object-versions --bucket testbucket \ --delimiter / --prefix Veeam/Archive/ { "CommonPrefixes": [ { "Prefix": "Veeam/Archive/003/" } ] } ``` Also add coverage tests similar to ListObjects to catch errors in future, skip these tests in FS mode	2020-08-18 12:19:44 -07:00
poornas	0272973175	Fix regression in web ui for retention (#10285 ) Fixes: #10283 regression from PR #9259	2020-08-18 12:09:42 -07:00
Harshavardhana	d2a3f92452	fix: health handler for lockers (#10280 )	2020-08-18 07:27:41 -07:00
Harshavardhana	ede86845e5	docs: Add policy variables for resource and conditions (#10278 ) Bonus fix adds LDAP policy variable and clarifies the usage of policy variables for temporary credentials. fixes #10197	2020-08-17 17:39:55 -07:00
Harshavardhana	e57c742674	use single dynamic timeout for most locked API/heal ops (#10275 ) newDynamicTimeout should be allocated once, in-case of temporary locks in config and IAM we should have allocated timeout once before the `for loop` This PR doesn't fix any issue as such, but provides enough dynamism for the timeout as per expectation.	2020-08-17 11:29:58 -07:00
Klaus Post	bb5976d727	healbucket: Send object version ID (#10263 ) Based on our previous conversations I assume we should send the version id when healing an object. Maybe we should even list object versions and heal all?	2020-08-17 08:25:44 -07:00
Harshavardhana	f7c1a59de1	add validation logs for configured Logger/Audit HTTP targets (#10274 ) extra logs in-case of misconfiguration of audit/logger targets	2020-08-16 10:25:00 -07:00
Anis Elleuch	51ba1dac49	listing: Fix result when prefix is an object with a slash (#10267 ) In a non recursive mode, issuing a list request where prefix is an existing object with a slash and delimiter is a slash will return entries in the object directory (data dir IDs) ``` $ aws s3api --profile minioadmin --endpoint-url http://localhost:9000 \ list-objects-v2 --bucket testbucket --prefix code_of_conduct.md/ --delimiter '/' { "CommonPrefixes": [ { "Prefix": "code_of_conduct.md/ec750fe0-ea7e-4b87-bbec-1e32407e5e47/" } ] } ``` This commit adds a fast exit track in Walk() in this specific case.	2020-08-14 20:13:24 -07:00
Harshavardhana	a4463dd40f	fix: storageClass shouldn't set the value upon failure (#10271 )	2020-08-14 19:48:04 -07:00
Harshavardhana	83a82d818e	allow lock tolerance to match storage-class drive tolerance (#10270 )	2020-08-14 18:17:14 -07:00
Harshavardhana	1d1c4430b2	decrypt ETags in parallel around 500 at a time (#10261 ) Listing speed-up gained from 10secs for just 400 entries to 2secs for 400 entries	2020-08-14 11:56:35 -07:00
Harshavardhana	43e6d1ce2d	fix: missing proxy request by bucket for ListVersions (#10260 )	2020-08-13 16:31:58 -07:00
Harshavardhana	30da442a85	rootDisk on containers can have different device Id (#10259 ) use `/etc/hosts` instead of `/` to check for common device id, if the device is same for `/etc/hosts` and the --bind mount to detect root disks. Bonus enhance healthcheck logging by adding maintenance tags, for all messages.	2020-08-13 15:21:20 -07:00
Harshavardhana	038d91feaa	fix: add public certs automatically as part of global CAs (#10256 )	2020-08-13 09:46:50 -07:00
Harshavardhana	e7ba78beee	use GlobalContext instead of context.Background when possible (#10254 )	2020-08-13 09:16:01 -07:00
Harshavardhana	b32d0a5b60	use the correct endpoints for offline drives	2020-08-12 19:17:49 -07:00
poornas	79e21601b0	fix: web handlers to enforce replication (#10249 ) This PR also preserves source ETag for replication	2020-08-12 17:32:24 -07:00
Harshavardhana	34253aa595	feat: cache env value in-case network is not reachable (#10251 )	2020-08-12 16:53:15 -07:00
Harshavardhana	79ed7ce451	fs: listObjects shouldn't take FS locks while listing (#10248 )	2020-08-12 15:23:14 +05:30
Harshavardhana	0dd3a08169	move the certPool loader function into pkg/certs (#10239 )	2020-08-11 08:29:50 -07:00
Klaus Post	f8f290e848	security: Remove insecure custom headers (#10244 ) Background: https://github.com/google/security-research/security/advisories/GHSA-76wf-9vgp-pj7w Remove these custom headers from incoming and outgoing requests.	2020-08-11 08:29:29 -07:00
Harshavardhana	1e2ebc9945	feat: time to bring back http2.0 support (#10230 ) Bonus move our CI/CD to go1.14	2020-08-10 09:02:29 -07:00
Harshavardhana	2a9819aff8	fix: refactor background heal for cluster health (#10225 )	2020-08-07 19:43:06 -07:00
Harshavardhana	6c6137b2e7	add cluster maintenance healthcheck drive heal affinity (#10218 )	2020-08-07 13:22:53 -07:00
Anis Elleuch	9138b2b503	Avoid duplicate headers when proxying S3 listing requests (#10220 )	2020-08-07 04:10:16 -07:00
Harshavardhana	77509ce391	Support looking up environment remotely (#10215 ) adds a feature where we can fetch the MinIO command-line remotely, this is primarily meant to add some stateless nature to the MinIO deployment in k8s environments, MinIO operator would run a webhook service endpoint which can be used to fetch any environment value in a generalized approach.	2020-08-06 18:03:16 -07:00
poornas	adcaa6f9de	fix: Change ListBucketTargets handler (#10217 ) to list all targets across a tenant. Also fixing some validations.	2020-08-06 17:10:21 -07:00
poornas	121164db56	fix: relax some replication validations (#10210 ) Also inherit storage class from source object if replication configuration does not have a storage class specified for destination bucket.	2020-08-05 20:01:20 -07:00
Harshavardhana	a20d4568a2	fix: make sure to use uniform drive count calculation (#10208 ) It is possible in situations when server was deployed in asymmetric configuration in the past such as ``` minio server ~/fs{1...4}/disk{1...5} ``` Results in setDriveCount of 10 in older releases but with fairly recent releases we have moved to having server affinity which means that a set drive count ascertained from above config will be now '4' While the object layer make sure that we honor `format.json` the storageClass configuration however was by mistake was using the global value obtained by heuristics. Which leads to prematurely using lower parity without being requested by the an administrator. This PR fixes this behavior.	2020-08-05 13:31:12 -07:00
Harshavardhana	e656beb915	feat: allow service accounts to be generated with OpenID STS (#10184 ) Bonus also fix a bug where we did not purge relevant service accounts generated by rotating credentials appropriately, service accounts should become invalid as soon as its corresponding parent user becomes invalid. Since service account themselves carry parent claim always we would never reach this problem, as the access get rejected at IAM policy layer.	2020-08-05 13:08:40 -07:00
poornas	88daaef76b	Validate object lock when setting replication config. (#10200 ) Check if object lock is enabled on destination bucket while setting replication configuration on a object lock enabled bucket.	2020-08-04 23:02:27 -07:00
Harshavardhana	0b8255529a	fix: proxies set keep-alive timeouts to be system dependent (#10199 ) Split the DialContext's one for internode and another for all other external communications especially proxy forwarders, gateway transport etc.	2020-08-04 14:55:53 -07:00
Harshavardhana	019fe69a57	fix: reduce an extra system call for writes instead fail later (#10187 )	2020-08-04 12:09:41 -07:00
Anis Elleuch	6ae30b21c9	fix ILM should not remove a protected version (#10189 )	2020-08-03 23:04:40 -07:00
Harshavardhana	b16781846e	allow server to start even with corrupted/faulty disks (#10175 )	2020-08-03 18:17:48 -07:00
Harshavardhana	5ce82b45da	add CopyObject optimization when source and destination are same (#10170 ) when source and destination are same and versioning is enabled on the destination bucket - we do not need to re-create the entire object once again to optimize on space utilization. Cases this PR is not supporting - any pre-existing legacy object will not be preserved in this manner, meaning a new dataDir will be created. - key-rotation and storage class changes of course will never re-use the dataDir	2020-08-03 16:21:10 -07:00
Harshavardhana	e99bc177c0	fix: allow FS mode situations when conflicting files exist (#10185 ) conflicting files can exist on FS at `.minio.sys/buckets/testbucket/policy.json/`, this is an expected valid scenario for FS mode allow it to work, i.e ignore and move forward	2020-08-03 13:20:49 -07:00
Harshavardhana	b68bc75dad	fix: quorum calculation mistake with reduced parity (#10186 ) With reduced parity our write quorum should be same as read quorum, but code was still assuming ``` readQuorum+1 ``` In all situations which is not necessary.	2020-08-03 12:15:08 -07:00
Harshavardhana	d61eac080b	fix: connection_string should override other params (#10180 ) closes #9965	2020-08-03 09:16:00 -07:00
poornas	a8dd7b3eda	Refactor replication target management. (#10154 ) Generalize replication target management so that remote targets for a bucket can be managed with ARNs. `mc admin bucket remote` command will be used to manage targets.	2020-07-30 19:55:22 -07:00
Harshavardhana	25a55bae6f	fix: avoid buffering of server sent events by proxies (#10164 )	2020-07-30 19:45:12 -07:00
Harshavardhana	fe157166ca	fix: Pass context all the way down to the network call in lockers (#10161 ) Context timeout might race on each other when timeouts are lower i.e when two lock attempts happened very quickly on the same resource and the servers were yet trying to establish quorum. This situation can lead to locks held which wouldn't be unlocked and subsequent lock attempts would fail. This would require a complete server restart. A potential of this issue happening is when server is booting up and we are trying to hold a 'transaction.lock' in quick bursts of timeout.	2020-07-29 23:15:34 -07:00
Adam Brown	f7259adf83	Update LastUpdate timestamp before save (#10152 )	2020-07-28 13:20:50 -07:00
Harshavardhana	6669560cb9	turn-off bucket usage metrics in gateway mode (#10150 ) closes #10147	2020-07-28 13:04:26 -07:00
poornas	b46ab7e921	Rename replication target handler (#10142 ) Rename replication target handler to a generic bucket target handler	2020-07-28 11:50:47 -07:00
Harshavardhana	27266f8a54	fix: if OPA set do not enforce policy claim (#10149 )	2020-07-28 11:47:57 -07:00
poornas	1b6ba0d062	Add validation in cache for offline drives (#10146 ) closes #10144	2020-07-28 10:06:52 -07:00
Harshavardhana	f200a7fb6a	fix: speed up OBD tests avoid unnecessary memory allocation (#10141 ) replace dummy buffer with nullReader{} instead, to avoid large memory allocations in memory constrainted environments. allows running obd tests in such environments.	2020-07-27 14:51:59 -07:00
Harshavardhana	47e304d03c	fix: add missing content-disposition from CORS handler (#10137 )	2020-07-27 09:03:38 -07:00
Harshavardhana	9108abf204	fix: allow shareable URLs with rotating creds (#10135 ) closes #8935	2020-07-27 09:02:53 -07:00
Harshavardhana	6529dcb3b5	fix: gateway Walk() implementation to list correct contents (#10131 ) closes #10122	2020-07-26 22:56:05 -07:00
Harshavardhana	abbf6ce6cc	simplify JWKS decoding in OpenID and more tests (#10119 ) add tests for non-compliant Azure AD behavior with "nonce" to fail properly and treat it as expected behavior for non-standard JWT tokens.	2020-07-25 08:42:41 -07:00
Harshavardhana	5ffc733eec	fix: enforce bucket quota from browser uploads (#10129 )	2020-07-24 21:16:54 -07:00
Harshavardhana	35212b673e	add unformatted disk as part of the error list (#10128 ) these errors should be ignored for quorum error calculation to ensure that we don't prematurely return unformatted disk error as part of API calls	2020-07-24 13:16:11 -07:00
Harshavardhana	57ff9abca2	Apply quota usage cache invalidation per second (#10127 ) Allow faster lookups for quota check enforcement	2020-07-24 12:24:21 -07:00
Jorge Israel Peña	4752323e1c	Use hdfs.Readdir() to optimize HDFS directory listings (#10121 ) Currently, listing directories on HDFS incurs a per-entry remote Stat() call penalty, the cost of which can really blow up on directories with many entries (+1,000) especially when considered in addition to peripheral calls (such as validation) and the fact that minio is an intermediary to the client (whereas other clients listed below can query HDFS directly). Because listing directories this way is expensive, the Golang HDFS library provides the [`Client.Open()`] function which creates a [`FileReader`] that is able to batch multiple calls together through the [`Readdir()`] function. This is substantially more efficient for very large directories. In one case we were witnessing about +20 seconds to list a directory with 1,500 entries, admittedly large, but the Java hdfs ls utility as well as the HDFS library sample ls utility were much faster. Hadoop HDFS DFS (4.02s): λ ~/code/minio → use-readdir » time hdfs dfs -ls /directory/with/1500/entries/ … hdfs dfs -ls 5.81s user 0.49s system 156% cpu 4.020 total Golang HDFS library (0.47s): λ ~/code/hdfs → master » time ./hdfs ls -lh /directory/with/1500/entries/ … ./hdfs ls -lh 0.13s user 0.14s system 56% cpu 0.478 total mc and minio without optimization (16.96s): λ ~/code/minio → master » time mc ls myhdfs/directory/with/1500/entries/ … ./mc ls 0.22s user 0.29s system 3% cpu 16.968 total mc and minio with optimization (0.40s): λ ~/code/minio → use-readdir » time mc ls myhdfs/directory/with/1500/entries/ … ./mc ls 0.13s user 0.28s system 102% cpu 0.403 total [`Client.Open()`]: https://godoc.org/github.com/colinmarc/hdfs#Client.Open [`FileReader`]: https://godoc.org/github.com/colinmarc/hdfs#FileReader [`Readdir()`]: https://godoc.org/github.com/colinmarc/hdfs#FileReader.Readdir	2020-07-24 11:31:51 -07:00
Klaus Post	11593c6cc4	Usage: Reset merged info when updating (#10126 ) When merging multiple buckets reset between each update. Avoids merging the same usage metrics multiple times resulting in duplicate data entries.	2020-07-24 11:02:10 -07:00
Harshavardhana	10025bda45	fix: add missing response headers to CORS handler (#10124 )	2020-07-24 00:46:51 -07:00
Harshavardhana	3a73f1ead5	refactor server update behavior (#10107 )	2020-07-23 08:03:31 -07:00
poornas	b9be841fd2	Add missing validation for replication API conditions (#10114 )	2020-07-22 17:39:40 -07:00
Anis Elleuch	456b2ef6eb	Avoid healing to be stuck with many concurrent event listeners (#10111 ) If there are many listeners to bucket notifications or to the trace subsystem, healing fails to work properly since it suspends itself when the number of concurrent connections is above a certain threshold. These connections are also continuous and not costly (no disk access), it is okay to just ignore them in waitForLowHTTPReq().	2020-07-22 13:16:55 -07:00
poornas	c43da3005a	Add support for server side bucket replication (#9882 )	2020-07-21 17:49:56 -07:00
Harshavardhana	a880283593	Send the lower level error directly from GetDiskID() (#10095 ) this is to detect situations of corruption disk format etc errors quickly and keep the disk online in such scenarios for requests to fail appropriately.	2020-07-21 13:54:06 -07:00
Harshavardhana	eb6bf454f1	fix: copyObject encryption from unencrypted object (#10102 ) This is a continuation of #10085	2020-07-21 12:25:01 -07:00
Harshavardhana	ec06089eda	fix: re-implement cluster healthcheck (#10101 )	2020-07-20 18:31:22 -07:00
Harshavardhana	0c4be55936	fix: fix lockup in merge-walk pool (#10098 ) Fixes two different types of problems - continuation of the problem seen in FS #9992 as not fixed for erasure coded deployments, reproduced this issue with spark and its fixed now - another issue was leaking walk go-routines which would lead to high memory usage and crash the system this is simply because all the walks which were purged at the top limit had leaking end walkers which would consume memory endlessly. closes #9966 closes #10088	2020-07-20 17:28:26 -07:00

1 2 3 4 5 ...

2884 Commits