minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	89bb9f17d7	fix: when parityDrives hits > len(storageDisks)/2, keep maxParity (#12387 ) Additionally move out `x-minio-internal-erasure-upgraded` from HTTP headers list, as its an internal header, rename elsewhere accordingly.	2021-05-27 13:38:04 -07:00
Klaus Post	acc452b7ce	Add more erasure codes on degraded systems. (#11852 ) In cases where a cluster is degraded, we do not uphold our consistency guarantee and we will write fewer erasure codes and rely on healing to recreate the missing shards. In some cases replacing known bad disks in practice take days. We want to change the behavior of a known degraded system to keep the erasure code promise of the storage class for each object. This will create the objects with the same confidence as a fully functional cluster. The tradeoff will be that objects created during a partial outage will take up slightly more space. This means that when the storage class is EC:4, there should always be written 4 parity shards, even if some disks are unavailable. When an object is created on a set, the disks are immediately checked. If any disks are unavailable additional parity shards will be made for each offline disk, up to 50% of the number of disks. We add an internal metadata field with the actual and intended erasure code level, this can optionally be picked up later by the scanner if we decide that data like this should be re-sharded.	2021-05-27 11:38:09 -07:00
Harshavardhana	be541dba8a	feat: introduce listUsers, listPolicies for any bucket (#12372 ) Bonus change LDAP settings such as user, group mappings are now listed as part of `mc admin user list` and `mc admin group list` Additionally this PR also deprecates the `/v2` API that is no longer in use.	2021-05-27 10:15:02 -07:00
Harshavardhana	b5ebfd35b4	fix: always prefer DataBlocks present in FileInfo (#12386 )	2021-05-27 10:11:50 -07:00
Anis Elleuch	530b703902	audit/logger: Increase http request timeout (#12385 ) A configured audit logger or HTTP logger is validated during MinIO server startup. Relax the timeout to 10 seconds in that case, otherwise, both loggers won't be used. 1 second could be too low for a busy HTTP endpoint.	2021-05-27 09:54:10 -07:00
Andreas Auernhammer	e8a12cbfdd	etag: compute ETag as MD5 for compressed single-part objects (#12375 ) This commit fixes a bug causing the MinIO server to compute the ETag of a single-part object as MD5 of the compressed content - not as MD5 of the actual content. This usually does not affect clients since the MinIO appended a `-1` to indicate that the ETag belongs to a multipart object. However, this behavior was problematic since: - A S3 client being very strict should reject such an ETag since the client uploaded the object via single-part API but got a multipart ETag that is not the content MD5. - The MinIO server leaks (via the ETag) that it compressed the object. This commit addresses both cases. Now, the MinIO server returns an ETag equal to the content MD5 for single-part objects that got compressed. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-05-27 08:18:41 -07:00
Anis Elleuch	e63908c391	Update bloom module (#12383 ) To fix dependency import issues when importing madmin-go v0.7.1	2021-05-27 08:02:39 -07:00
Harshavardhana	b251ae5f3d	fix: update default values for listing, replication workers	2021-05-26 11:55:46 -07:00
Anis Elleuch	0e80b5fe63	tests: Add test for upload of the same object inlined and not inlined (#12374 ) Upload an object smaller than small file threshold and upload another file bigger than small file threshold and tries to read it.	2021-05-26 08:09:23 -07:00
Harshavardhana	225d8c51fd	fix: missing path in admin trace (#12373 ) PR #12360 introduced a change which seems to have added a regression, the RawPath in r.URL seems to be empty, if it is fallback to r.URL.Path instead.	2021-05-26 08:04:12 -07:00
Klaus Post	3fff50120b	Revert heal locks (#12365 ) A lot of healing is likely to be on non-existing objects and locks are very expensive and will slow down scanning significantly. In cases where all are valid or, all are broken allow rejection without locking. Keep the existing behavior, but move the check for dangling objects to after the lock has been acquired. ``` _, err = getLatestFileInfo(ctx, partsMetadata, errs) if err != nil { return er.purgeObjectDangling(ctx, bucket, object, versionID, partsMetadata, errs, []error{}, opts) } ``` Revert "heal: Hold lock when reading xl.meta from disks (#12362)" This reverts commit `abd32065aa`	2021-05-25 17:02:06 -07:00
Harshavardhana	4840974d7a	fix: inline data upon overwrites should be readable (#12369 ) This PR fixes two bugs - Remove fi.Data upon overwrite of objects from inlined-data to non-inlined-data - Workaround for an existing bug on disk with latest releases to ignore fi.Data and instead read from the disk for non-inlined-data - Addtionally add a reserved metadata header to indicate data is inlined for a given version.	2021-05-25 16:33:06 -07:00
Harshavardhana	4fd1378242	fix: lint errors after upgrading golangci-lint (#12368 )	2021-05-25 14:17:33 -07:00
Harshavardhana	ed4941a5f3	fix: calculate dataBlocks properly in healing (#12364 )	2021-05-25 09:34:27 -07:00
Harshavardhana	cacdeca8cc	fix: return error for unexpected quorum in pickValidFileInfo (#12363 )	2021-05-24 18:31:56 -07:00
Anis Elleuch	abd32065aa	heal: Hold lock when reading xl.meta from disks (#12362 ) Lock is hold in healObject() after reading xl.meta from disks the first time. This commit will held the lock since the beginning of HealObject() Co-authored-by: Anis Elleuch <anis@min.io>	2021-05-24 13:39:38 -07:00
Harshavardhana	2baabd455b	docs: fix per tenant limits docs formatting	2021-05-24 09:37:17 -07:00
Harshavardhana	ebf75ef10d	fix: remove all unused code (#12360 )	2021-05-24 09:28:19 -07:00
Klaus Post	f01820a4ee	fix: invalid multipart offset when compressed+encrypted. (#12340 ) Fixes `testSSES3EncryptedGetObjectReadSeekFunctional` mint test. ``` { "args": { "bucketName": "minio-go-test-w53hbpat649nhvws", "objectName": "6mdswladz4vfpp2oit1pkn3qd11te5" }, "duration": 7537, "error": "We encountered an internal error, please try again.: cause(The requested range \"bytes 251717932 -> -116384170 of 135333762\" is not satisfiable.)", "function": "GetObject(bucketName, objectName)", "message": "CopyN failed", "name": "minio-go: testSSES3EncryptedGetObjectReadSeekFunctional", "status": "FAIL" } ``` Compressed files always start at the beginning of a part so no additional offset should be added.	2021-05-21 14:07:16 -07:00
Harshavardhana	0287711dc9	fix: implement readMetadata common function for re-use (#12353 ) Previous PR #12351 added functions to read from the reader stream to reduce memory usage, use the same technique in few other places where we are not interested in reading the data part.	2021-05-21 11:41:25 -07:00
Klaus Post	9d1b6fb37d	Add XL reader without data (#12351 ) Add XL metadata reader that reads metadata only on larger files. Use for scanning and listing for now.	2021-05-21 09:10:54 -07:00
Harshavardhana	32d8a48d4e	reduce memory usage in metacache reader (#12334 )	2021-05-20 09:00:11 -07:00
Harshavardhana	6060b755c6	fix: migrate users properly from older releases to newer (#12333 )	2021-05-19 19:25:44 -07:00
Krishnan Parthasarathi	cfa94cc35c	Simplify remote tier validation in lifecycle rule validation (#12329 )	2021-05-19 18:51:23 -07:00
Klaus Post	2ca9c533ef	feat: implement in-progress partial bucket updates (#12279 )	2021-05-19 14:38:30 -07:00
Anis Elleuch	866593fd94	heal: Ignore disks with non quorum modtime and dataDir (#12328 )	2021-05-19 12:04:08 -07:00
Harshavardhana	ecb5525c91	fix: muxing order for rejected APIs (#12321 )	2021-05-19 09:21:34 -07:00
Klaus Post	c2c803dd30	Fix list entry deduplication (#12325 ) File infos would always be the same. Add numversions as a final tiebreaker.	2021-05-19 09:21:18 -07:00
Harshavardhana	4f5d75f22b	fix: speed up drive mux registration (#12319 ) in setups with lots of drives the server startup is slow, initialize all local drives in parallel before registering with muxer. this speeds up when there are multiple pools and large collection of drives.	2021-05-18 17:25:00 -07:00
Harshavardhana	bb7fbcdc09	fix: generating service accounts for group only LDAP accounts (#12318 ) fixes #12315	2021-05-18 15:19:20 -07:00
Andreas Auernhammer	82c53ac260	sse-kms: set KMS key ID response header (#12316 ) This commit adds the `X-Amz-Server-Side-Encryption-Aws-Kms-Key-Id` response header to the GET, HEAD, PUT and Download API. Based on AWS documentation [1] AWS S3 returns the KMS key ID as part of the response headers. [1] https://docs.aws.amazon.com/AmazonS3/latest/userguide/specifying-kms-encryption.html Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-05-18 14:21:20 -07:00
Harshavardhana	a70e0da19e	use direntPool, direntNamePool for reusable buffers (#12314 ) - in readDirFn re-use buffers from direntPool() - in readDirN use separate dirent name buffer direntNamePool()	2021-05-18 10:29:50 -07:00
Harshavardhana	c6b7dc012a	fix: use key.Ciphertext for DecryptKey in KeyStatus (#12313 ) enhance GlobalKMS.Stat() for kes to actually perform a network call to check Version() of kes and also implicitly that its reachable.	2021-05-18 07:22:31 -07:00
Harshavardhana	2daba018d6	reduce allocations on multi-disk clusters (#12311 ) multi-disk clusters initialize buffer pools per disk, this is perhaps expensive and perhaps not useful, for a running server instance. As this may disallow re-use of buffers across sets, this change ensures that buffers across sets can be re-used at drive level, this can reduce quite a lot of memory on large drive setups.	2021-05-17 17:49:48 -07:00
Harshavardhana	d610578d84	fix: de-couple IAM migration and loading context from lock context (#12312 ) fixes #12307	2021-05-17 16:50:47 -07:00
Harshavardhana	a096a92c63	add io.ErrUnexpectedEOF for config retriable errors (#12309 ) fixes #12307	2021-05-17 15:13:14 -07:00
Harshavardhana	3d9873106d	feat: distributed setup can start now with default credentials (#12303 ) In lieu of new changes coming for server command line, this change is to deprecate strict requirement for distributed setups to provide root credentials. Bonus: remove MINIO_WORM warning from April 2020, it is time to remove this warning.	2021-05-17 08:45:22 -07:00
Klaus Post	cde6469b88	Fix hanging erasure writes (#12253 ) However, this slice is also used for closing the writers, so close is never called on these. Furthermore when an error is returned from a write it is now reported to the reader. bonus: remove unused heal param from `newBitrotWriter`. * Remove copy, now that we don't mutate.	2021-05-17 08:32:28 -07:00
Klaus Post	55375fa7f6	Update probabilities for bloom filter. (#12305 ) See https://github.com/minio/minio/discussions/12285 Results in M=958506 K=7 and 119840 bytes per filter when serialized compared to 26176 bytes before.	2021-05-17 08:31:04 -07:00
Harshavardhana	f1e479d274	remove more duplicate bloom filter trackers (#12302 ) At some places bloom filter tracker was getting updated for `.minio.sys/tmp` bucket, there is no reason to update bloom filters for those. And add a missing bloom filter update for MakeBucket() Bonus: purge unused function deleteEmptyDir()	2021-05-17 08:25:48 -07:00
Harshavardhana	2ab9dc7609	do not update bloomFilters for temporary objects	2021-05-15 19:54:07 -07:00
Harshavardhana	4d876d03e8	fix: do not fail upon faulty/non-writable drives gracefully start the server, if there are other drives available - print enough information for administrator to notice the errors in console. Bonus: for really large streams use larger buffer for writes.	2021-05-15 12:57:18 -07:00
Harshavardhana	d84261aa6d	fix: ensure proper usage of DataDir (#12300 ) - GetObject() should always use a common dataDir to read from when it starts reading, this allows the code in erasure decoding to have sane expectations. - Healing should always heal on the common dataDir, this allows the code in dangling object detection to purge dangling content. These both situations can happen under certain types of retries during PUT when server is restarting etc, some namespace entries might be left over.	2021-05-14 16:50:47 -07:00
Harshavardhana	5b18c57a54	fix: for deleteBucket delete on dnsStore first (#12298 ) attempt a delete on remote DNS store first before attempting locally, because removing at DNS store is cheaper than deleting locally, in case of errors locally we can cheaply recreate the bucket on dnsStore instead of.	2021-05-14 12:40:54 -07:00
Andreas Auernhammer	a1f70b106f	sse: add support for SSE-KMS bucket configurations (#12295 ) This commit adds support for SSE-KMS bucket configurations. Before, the MinIO server did not support SSE-KMS, and therefore, it was not possible to specify an SSE-KMS bucket config. Now, this is possible. For example: ``` mc encrypt set sse-kms some-key <alias>/my-bucket ``` Further, this commit fixes an issue caused by not supporting SSE-KMS bucket configuration and switching to SSE-KMS as default SSE method. Before, the server just checked whether an SSE bucket config was present (not which type of SSE config) and applied the default SSE method (which was switched from SSE-S3 to SSE-KMS). This caused objects to get encrypted with SSE-KMS even though a SSE-S3 bucket config was present. This issue is fixed as a side-effect of this commit. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-05-14 00:59:05 -07:00
Poorna Krishnamoorthy	951acf561c	Add support for syncing replica modifications (#11104 ) when bidirectional replication is set up. If ReplicaModifications is enabled in the replication configuration, sync metadata updates to source if replication rules are met. By default, if this configuration is unset, MinIO automatically sync's metadata updates on replica back to the source.	2021-05-13 19:20:45 -07:00
Harshavardhana	397391c89f	fix: parentUser mapped policy for OIDC creds (#12293 ) missing parentUser for OIDC STS creds can lead to fail to authenticate, this PR attempts to fix the parentUser policy map for distributed setups.	2021-05-13 16:21:06 -07:00
Andreas Auernhammer	9cd9f5a0b3	check that we can reach KES server and that the default key exists (#12291 ) This commit adds a check to the MinIO server setup that verifies that MinIO can reach KES, if configured, and that the default key exists. If the default key does not exist it will create it automatically. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-05-13 11:13:31 -07:00
Harshavardhana	5c0a7189c7	fix: LDAP authentication with groups only (#12283 ) fixes #12282	2021-05-12 21:25:07 -07:00
Harshavardhana	57aed841dd	do not return error for usage-cache version v4 (#12276 )	2021-05-12 08:07:02 -07:00
Klaus Post	229d83bb75	feat: add dynamic usage cache (#12229 ) A cache structure will be kept with a tree of usages. The cache is a tree structure where each keeps track of its children. An uncompacted branch contains a count of the files only directly at the branch level, and contains link to children branches or leaves. The leaves are "compacted" based on a number of properties. A compacted leaf contains the totals of all files beneath it. A leaf is only scanned once every dataUsageUpdateDirCycles, rarer if the bloom filter for the path is clean and no lifecycles are applied. Skipped leaves have their totals transferred from the previous cycle. A clean leaf will be included once every healFolderIncludeProb for partial heal scans. When selected there is a one in healObjectSelectProb that any object will be chosen for heal scan. Compaction happens when either: - The folder (and subfolders) contains less than dataScannerCompactLeastObject objects. - The folder itself contains more than dataScannerCompactAtFolders folders. - The folder only contains objects and no subfolders. - A bucket root will never be compacted. Furthermore, if a has more than dataScannerCompactAtChildren recursive children (uncompacted folders) the tree will be recursively scanned and the branches with the least number of objects will be compacted until the limit is reached. This ensures that any branch will never contain an unreasonable amount of other branches, and also that small branches with few objects don't take up unreasonable amounts of space. Whenever a branch is scanned, it is assumed that it will be un-compacted before it hits any of the above limits. This will make the branch rebalance itself when scanned if the distribution of objects has changed. TLDR; With current values: No bucket will ever have more than 10000 child nodes recursively. No single folder will have more than 2500 child nodes by itself. All subfolders are compacted if they have less than 500 objects in them recursively. We accumulate the (non-deletemarker) version count for paths as well, since we are changing the structure anyway.	2021-05-11 18:36:15 -07:00
Harshavardhana	fe21aa356c	fix: if targetUser empty use parentUser for serviceAccounts (#12275 )	2021-05-11 13:02:00 -07:00
Anis Elleuch	56d4d7b8b1	MRF: Better detection of non stable disks (#12252 ) MRF does not detect when a node is disconnected and reconnected quickly this change will ensure that MRF is alerted by comparing the last disk reconnection timestamp with the last MRF check time. Signed-off-by: Anis Elleuch <anis@min.io> Co-authored-by: Klaus Post <klauspost@gmail.com>	2021-05-11 09:19:15 -07:00
Harshavardhana	e84f533c6c	add missing wait groups for certain io.Pipe() usage (#12264 ) wait groups are necessary with io.Pipes() to avoid races when a blocking function may not be expected and a Write() -> Close() before Read() races on each other. We should avoid such situations.. Co-authored-by: Klaus Post <klauspost@gmail.com>	2021-05-11 09:18:37 -07:00
Anis Elleuch	0b34dfb479	lock: Timeout Unlock RPC call (#12213 ) RPC unlock call needs to be timed out otherwise this can block indefinitely. Signed-off-by: Anis Elleuch <anis@min.io>	2021-05-11 02:11:29 -07:00
Harshavardhana	b81fada834	use json unmarshal/marshal from jsoniter in hotpaths (#12269 )	2021-05-11 02:02:32 -07:00
Andreas Auernhammer	d8eb7d3e15	kms: replace KES client implementation with minio/kes (#12207 ) This commit replaces the custom KES client implementation with the KES SDK from https://github.com/minio/kes The SDK supports multi-server client load-balancing and requests retry out of the box. Therefore, this change reduces the overall complexity within the MinIO server and there is no need to maintain two separate client implementations. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-05-10 18:15:11 -07:00
Andreas Auernhammer	c03a06cca8	config: enforce AES-GCM in FIPS mode (#12265 ) This commit enforces the usage of AES-256 for config and IAM data en/decryption in FIPS mode. Further, it improves the implementation of `fips.Enabled` by making it a compile time constant. Now, the compiler is able to evaluate the any `if fips.Enabled { ... }` at compile time and eliminate unused code. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-05-10 08:24:11 -07:00
Harshavardhana	2d79d6d847	fix: do not niladic p.writers upon failure (#12255 ) p.writers is a verbatim value of bitrotWriter backed by a pipe() that should never be nil'ed, instead use the captured errors to skip the writes. additionally detect also short writes, and reject them as errors.	2021-05-10 08:20:23 -07:00
Harshavardhana	8b52d70012	fix: IAM not initialized then checkKeyValid() should return 503s (#12260 ) currently GetUser() returns 403 when IAM is not initialized this can lead to applications crashing, instead return 503 so that the applications can retry and backoff. fixes #12078	2021-05-09 08:14:19 -07:00
Harshavardhana	39d681a04a	update fsSimpleRenameFile contrib	2021-05-08 22:31:41 -07:00
Harshavardhana	764721e2c6	add root_disk threshold detection (#12259 ) as there is no automatic way to detect if there is a root disk mounted on / or /var for the container environments due to how the root disk information is masked inside overlay root inside container. this PR brings an environment variable to set root disk size threshold manually to detect the root disks in such situations.	2021-05-08 15:40:29 -07:00
Andreas Auernhammer	adaae26bbc	sse-kms: fix single-part object decryption (#12257 ) This commit fixes a bug in the single-part object decryption that is triggered in case of SSE-KMS. Before, it was assumed that the encryption is either SSE-C or SSE-S3. In case of SSE-KMS the SSE-C branch was executed. This lead to an invalid SSE-C algorithm error. This commit fixes this by inverting the `if-else` logic. Now, the SSE-C branch only gets executed when SSE-C headers are present. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-05-07 14:40:57 -07:00
Andreas Auernhammer	0ba8c0a19b	sse-kms: fix assignment to potential nil map (#12250 ) This commit fixes a bug introduced by `af0c65b`. When there is no / an empty client-provided SSE-KMS context the `ParseMetadata` may return a nil map (`kms.Context`). When unsealing the object key we must check that the context is nil before assigning a key-value pair. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-05-07 09:16:49 -07:00
Anis Elleuch	cb0b36f8c2	svcacct: Fix updating service account and add missing check (#12251 ) UpdateServiceAccount ignores updating fields when not passed from upper layer, such as empty policy, empty account status, and empty secret key. This PR will check for a secret key only if it is empty and add more check on the value of the account status. Signed-off-by: Anis Elleuch <anis@min.io>	2021-05-07 09:13:30 -07:00
Klaus Post	254698f126	fix: minor allocation improvements in xlMetaV2 (#12133 )	2021-05-07 09:11:05 -07:00
Krishnan Parthasarathi	0bab1c1895	Heal restored object contents on disk (#12238 )	2021-05-06 16:06:57 -07:00
Harshavardhana	d495cb68d3	fix: crash in prometherus metrics collector (#12244 ) node_health metrics crashes in gateway mode, in gateway mode ignore node health metrics. fixes #12243	2021-05-06 15:43:34 -07:00
Andreas Auernhammer	af0c65be93	add SSE-KMS support and use SSE-KMS for auto encryption (#12237 ) This commit adds basic SSE-KMS support. Now, a client can specify the SSE-KMS headers (algorithm, optional key-id, optional context) such that the object gets encrypted using the SSE-KMS method. Further, auto-encryption now defaults to SSE-KMS. This commit does not try to do any refactoring and instead tries to implement SSE-KMS as a minimal change to the code base. However, refactoring the entire crypto-related code is planned - but needs a separate effort. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-05-06 15:24:01 -07:00
Nitish Tiwari	776589f0da	Add free inode metric for Prometheus (#12225 )	2021-05-06 12:50:48 -07:00
Harshavardhana	361940706d	fix: avoid races in NewMultipartUpload under multiple pools (#12233 ) It is possible in some scenarios that in multiple pools, two concurrent calls for the same object as a multipart operation can lead to duplicate entries on two different pools. This PR fixes this - hold locks to serialize multiple callers so that we don't race. - make sure to look for existing objects on the namespace as well not just for existing uploadIDs	2021-05-06 10:45:33 -07:00
Harshavardhana	1aa5858543	move madmin to github.com/minio/madmin-go (#12239 )	2021-05-06 08:52:02 -07:00
Harshavardhana	f4623ea8dc	fix: validate secret key before updating service accounts	2021-05-05 16:41:47 -07:00
Harshavardhana	b8833c2947	do not change targetUser after permission validation for service accounts make sure that targetUser is always the one that is presented/validated from the incoming request, not the parentUser.	2021-05-05 16:13:52 -07:00
Anis Elleuch	af1b6e3458	iam: Do not create service accounts for non existant IAM users (#12236 ) When running MinIO server without LDAP/OpenID, we should error out when the code tries to create a service account for a non existant regular user. Bonus: refactor the check code to be show all cases more clearly Signed-off-by: Anis Elleuch <anis@min.io> Co-authored-by: Anis Elleuch <anis@min.io>	2021-05-05 16:04:50 -07:00
Harshavardhana	0eeb0a4e04	Revert "add SSE-KMS support and use SSE-KMS for auto encryption (#11767 )" This reverts commit `26f1fcab7d`.	2021-05-05 15:20:46 -07:00
Andreas Auernhammer	26f1fcab7d	add SSE-KMS support and use SSE-KMS for auto encryption (#11767 ) This commit adds basic SSE-KMS support. Now, a client can specify the SSE-KMS headers (algorithm, optional key-id, optional context) such that the object gets encrypted using the SSE-KMS method. Further, auto-encryption now defaults to SSE-KMS. This commit does not try to do any refactoring and instead tries to implement SSE-KMS as a minimal change to the code base. However, refactoring the entire crypto-related code is planned - but needs a separate effort. Signed-off-by: Andreas Auernhammer <aead@mail.de> Co-authored-by: Klaus Post <klauspost@gmail.com>	2021-05-05 11:24:14 -07:00
Harshavardhana	3a0e7347ca	support startTLS with serverName TLSConfig (#12219 ) fixes #12216	2021-05-04 20:13:24 -07:00
Harshavardhana	67001e3ce9	fix: allow root credentials to generate STS, service accounts (#12210 )	2021-05-04 11:58:19 -07:00
Harshavardhana	804a23a06d	update docs to remove _OLD credential references also update the docs about config, IAM on encryption.	2021-05-04 10:27:51 -07:00
Nitish Tiwari	c8aa56ccd7	Add node cpu & memory metrics to Prometheus cluster endpoint (#12214 )	2021-05-04 10:17:10 -07:00
Harshavardhana	ff36baeaa7	fix: attempt to drain the ReadFileStream for connection pooling (#12208 ) avoid time_wait build up with getObject requests if there are pending callers and they timeout, can lead to time_wait states Bonus share the same buffer pool with erasure healing logic, additionally also fixes a race where parallel readers were never cleanup during Encode() phase, because pipe.Reader end was never closed(). Added closer right away upon an error during Encode to make sure to avoid racy Close() while stream was still being Read().	2021-05-04 10:12:08 -07:00
Krishnan Parthasarathi	860bf1bab2	Add IsRemote method on FileInfo, ObjectInfo (#12209 ) Provides a convenient method to know if an object's contents are in its remote tier.	2021-05-04 08:40:42 -07:00
Andreas Auernhammer	4815f92fa8	fix MINIO_KMS_SECRET_KEY env. variable parsing (#12200 ) This commit fixes a bug when parsing the env. variable `MINIO_KMS_SECRET_KEY`. Before, the env. variable name - instead of its value - was parsed. This (obviously) did not work properly. This commit fixes this. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-04-30 18:47:30 -07:00
Harshavardhana	0d3ddf7286	fix: improve NewObjectReader implementation for careful cleanup usage (#12199 ) cleanup functions should never be cleaned before the reader is instantiated, this type of design leads to situations where order of lockers and places for them to use becomes confusing. Allow WithCleanupFuncs() if the caller wishes to add cleanupFns to be run upon close() or an error during initialization of the reader. Also make sure streams are closed before we unlock the resources, this allows for ordered cleanup of resources.	2021-04-30 18:37:58 -07:00
Harshavardhana	3524c00090	fix: nats testdata relocation fix	2021-04-30 17:22:56 -07:00
Harshavardhana	f7a87b30bf	Revert "deprecate embedded browser (#12163 )" This reverts commit `736d8cbac4`. Bring contrib files for older contributions	2021-04-30 08:50:39 -07:00
Harshavardhana	64f6020854	fix: cleanup locking, cancel context upon lock timeout (#12183 ) upon errors to acquire lock context would still leak, since the cancel would never be called. since the lock is never acquired - proactively clear it before returning.	2021-04-29 20:55:21 -07:00
Harshavardhana	0faa4e6187	fix: make sure failed requests only to failed queue (#12196 ) failed queue should be used for retried requests to avoid cascading the failures into incoming queue, this would allow for a more fair retry for failed replicas. Additionally also avoid taking context in queue task to avoid confusion, simplifies its usage.	2021-04-29 18:20:39 -07:00
Poorna Krishnamoorthy	90112b5644	Update ReplicationStatus if metadata not updated correctly (#12191 ) There can be situations where replication completed but the `X-Amz-Replication-Status` metadata update failed such as when the server returns 503 under high load. This object version will continue to be picked up by the scanner and replicateObject would perform no action since the versions match between source and target. The metadata would never reflect that replication was successful without this fix, leading to repeated re-queuing.	2021-04-29 16:46:26 -07:00
Harshavardhana	c4b21ac7fa	fix: remove healthcheck routine for replication targets (#12192 ) Bonus also fix a racy lookup on arnsMap() without a read lock, hold read locks to avoid such race. moving the healthcheck logic to minio-go	2021-04-29 16:41:28 -07:00
Andreas Auernhammer	e5ec1325fc	docs: add QuickStart section to KMS encryption of IAM data (#12190 ) This commit enhances the docs about IAM encryption. It adds a quick-start section that explains how to get started quickly with `MINIO_KMS_SECRET_KEY` instead of setting up KES. It also removes the startup message that gets printed when the server migrates IAM data to plaintext. We will point this out in the release notes. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-04-29 14:20:28 -07:00
Harshavardhana	c5a80ca5d5	support service accounts for OpenID connect properly (#12178 ) OpenID connect generated service accounts do not work properly after console logout, since the parentUser state is lost - instead use sub+iss claims for parentUser, according to OIDC spec both the claims provide the necessary stability across logins etc.	2021-04-29 13:01:42 -07:00
Harshavardhana	8cd89e10ea	Revert "fix: remove deprecated MINIO_ACCESS_KEY, MINIO_SECRET_KEY envs (#12173 )" This reverts commit `b0baaeaa3d`.	2021-04-29 10:56:53 -07:00
Harshavardhana	091845df39	fix: return quorum error upon decode failures (#12184 )	2021-04-29 10:00:03 -07:00
Harshavardhana	336c8ac99f	fix: do not heal when disks are down (#12186 ) HeadObject() was erroneously attempting a heal when disks are down, avoid it.	2021-04-29 09:54:16 -07:00
Harshavardhana	b3c8a1864f	fix: optimize ListBuckets for anonymous users (#12182 ) anonymous users are never allowed to listBuckets(), we do not need to further validate the policy, we can simply reject if credentials are empty.	2021-04-28 21:37:02 -07:00
Poorna Krishnamoorthy	632252ff1d	fix: change SetRemoteTarget API to allow editing remote target granularly (#12175 ) Currently, only credentials could be updated with `mc admin bucket remote edit`. Allow updating synchronous replication flag, path, bandwidth and healthcheck duration on buckets, and a flag to disable proxying in active-active replication.	2021-04-28 15:26:20 -07:00
Harshavardhana	77f9c71133	Revert "redirect to console project for browser (#12172 )" This reverts commit `301669cf7b`. fixes #12179	2021-04-28 12:22:15 -07:00
Krishnan Parthasarathi	0c9d095deb	ilm: Close warmBackend GetObject reader (#12174 )	2021-04-27 22:42:18 -07:00
Harshavardhana	b0baaeaa3d	fix: remove deprecated MINIO_ACCESS_KEY, MINIO_SECRET_KEY envs (#12173 )	2021-04-27 22:41:24 -07:00
Harshavardhana	301669cf7b	redirect to console project for browser (#12172 )	2021-04-27 16:39:41 -07:00
Anis Elleuch	9e797532dc	lock: Always cancel the returned Get(R)Lock context (#12162 ) * lock: Always cancel the returned Get(R)Lock context There is a leak with cancel created inside the locking mechanism. The cancel purpose was to cancel operations such erasure get/put that are holding non-refreshable locks. This PR will ensure the created context.Cancel is passed to the unlock API so it will cleanup and avoid leaks. * locks: Avoid returning nil cancel in local lockers Since there is no Refresh mechanism in the local locking mechanism, we do not generate a new context or cancel. Currently, a nil cancel function is returned but this can cause a crash. Return a dummy function instead.	2021-04-27 16:12:50 -07:00
Harshavardhana	736d8cbac4	deprecate embedded browser (#12163 ) https://github.com/minio/console takes over the functionality for the future object browser development Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-27 10:52:12 -07:00
Harshavardhana	cf335f6c63	service accounts should use LDAP user DN to assign credentials (#12166 ) LDAP DN should be used when allowing setting service accounts for LDAP users instead of just simple user, Bonus root owner should be allowed full access to all service account APIs. Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-27 10:04:08 -07:00
Harshavardhana	c8050bc079	fix: sleeper behavior in data scanner (#12164 ) do not apply healReplication() for ILM expired, transitioned objects	2021-04-27 08:24:44 -07:00
Harshavardhana	edda244066	move pkg/rpc, pkg/csvparser, pkg/argon2 to contrib Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-26 18:24:40 -07:00
Poorna Krishnamoorthy	4be0f92067	Fix multipart restore to remove part match (#12161 ) Part ETags are not available after multipart finalizes, removing this check as not useful. Signed-off-by: Poorna Krishnamoorthy <poorna@minio.io> Co-authored-by: Harshavardhana <harsha@minio.io>	2021-04-26 18:24:06 -07:00
Harshavardhana	26544848ea	remove legacy master_key support by June (#12153 ) Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-26 16:02:05 -07:00
Harshavardhana	2966823818	use jsoniter for json marshal/unmarshal in KMS (#12146 ) Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-26 16:01:52 -07:00
Harshavardhana	d501c5e38b	add missing responseBody drain (#12147 ) Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-26 08:59:54 -07:00
Harshavardhana	d825d92499	rename production to release directory, rebuild assets	2021-04-25 16:51:29 -07:00
Andreas Auernhammer	f7feff8665	avoid parsing MINIO_KMS_MASTER_KEY as base64 (#12149 ) This commit reverts a change that added support for parsing base64-encoded keys set via `MINIO_KMS_MASTER_KEY`. The env. variable `MINIO_KMS_MASTER_KEY` is deprecated and should ONLY support parsing existing keys - not the new format. Any new deployment should use `MINIO_KMS_SECRET_KEY`. The legacy env. variable `MINIO_KMS_MASTER_KEY` will be removed at some point in time. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-04-25 11:04:31 -07:00
Harshavardhana	4eb9b6eaf8	preserve metadata multipart restore (#12139 ) avoid re-read of xl.meta instead just use the success criteria from PutObjectPart() and check the ETag matches per Part, if they match then the parts have been successfully restored as is. Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-24 19:07:27 -07:00
Harshavardhana	f420996dfa	fix: allow parsing keys in both new and old format (#12144 ) Bonus fix fallback to decrypt previously encrypted content as well using older master key ciphertext format. Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-24 19:05:25 -07:00
Poorna Krishnamoorthy	5d954ea228	fix: versionID and MTime for restored object (#12145 ) Signed-off-by: Poorna Krishnamoorthy <poorna@minio.io>	2021-04-24 19:04:35 -07:00
Harshavardhana	25d3c73162	add HEAD for cluster healthcheck (#12140 ) fixes #12130 Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 22:47:39 -07:00
Harshavardhana	82dc6aff1c	add support for configurable replication MRF workers (#12125 ) just like replication workers, allow failed replication workers to be configurable in situations like DR failures etc to catch up on replication sooner when DR is back online. Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 21:58:45 -07:00
Poorna Krishnamoorthy	014e419151	fix: ensure pending replication queued to MRF queue (#12138 ) Signed-off-by: Poorna Krishnamoorthy <poorna@minio.io>	2021-04-23 16:52:57 -07:00
Harshavardhana	799691eded	fix: reload LDAP users properly with latest mapping (#12137 ) peer nodes would not update if policy is unset on a user, until policies reload every 5minutes. Make sure to reload the policies properly, if no policy is found make sure to delete such users and groups fixes #12074 Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 15:11:01 -07:00
Harshavardhana	cbfdf97abf	Use CompleteMultipartUpload in RestoreTransitionedObject Signed-off-by: Krishnan Parthasarathi <kp@minio.io>	2021-04-23 11:58:53 -07:00
Krishnan Parthasarathi	3831027c54	fix: compiler errors in restoreTransitionedObject (#12120 )	2021-04-23 11:58:53 -07:00
Harshavardhana	4d53054f8c	update internode API for FileInfo change Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 11:58:53 -07:00
Krishnan Parthasarathi	c829e3a13b	Support for remote tier management (#12090 ) With this change, MinIO's ILM supports transitioning objects to a remote tier. This change includes support for Azure Blob Storage, AWS S3 compatible object storage incl. MinIO and Google Cloud Storage as remote tier storage backends. Some new additions include: - Admin APIs remote tier configuration management - Simple journal to track remote objects to be 'collected' This is used by object API handlers which 'mutate' object versions by overwriting/replacing content (Put/CopyObject) or removing the version itself (e.g DeleteObjectVersion). - Rework of previous ILM transition to fit the new model In the new model, a storage class (a.k.a remote tier) is defined by the 'remote' object storage type (one of s3, azure, GCS), bucket name and a prefix. * Fixed bugs, review comments, and more unit-tests - Leverage inline small object feature - Migrate legacy objects to the latest object format before transitioning - Fix restore to particular version if specified - Extend SharedDataDirCount to handle transitioned and restored objects - Restore-object should accept version-id for version-suspended bucket (#12091) - Check if remote tier creds have sufficient permissions - Bonus minor fixes to existing error messages Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io> Co-authored-by: Krishna Srinivas <krishna@minio.io> Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 11:58:53 -07:00
Harshavardhana	069432566f	update license change for MinIO Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 11:58:53 -07:00
Klaus Post	e0d3a8c1f4	Alloc less for metacache decompression (#12134 ) Network streams are limited to 16K blocks. Don't alloc more upfront. Signed-off-by: Klaus Post <klauspost@gmail.com>	2021-04-23 10:27:42 -07:00
Harshavardhana	bb1198c2c6	revert CreateFile waitForResponse (#12124 ) instead use expect continue timeout, and have higher response header timeout, the new higher timeout satisfies worse case scenarios for total response time on a CreateFile operation. Also set the "expect" continue header to satisfy expect continue timeout behavior. Some clients seem to cause CreateFile body to be truncated, leading to no errors which instead fails with ObjectNotFound on a PUT operation, this change avoids such failures appropriately. Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 10:18:18 -07:00
Anis Elleuch	c9dfa0d87b	audit: Add field to know who triggered the operation (#12129 ) This is for now needed to know if an external S3 request deleted a file or it was the scanner. Signed-off-by: Anis Elleuch <anis@min.io>	2021-04-23 09:51:12 -07:00
Harshavardhana	d0d67f9de0	feat: allow prometheus for only authorized users (#12121 ) allow restrictions on who can access Prometheus endpoint, additionally add prometheus as part of diagnostics canned policy. Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-22 18:55:30 -07:00
Andreas Auernhammer	3455f786fa	kms: encrypt IAM/config data with the KMS (#12041 ) This commit changes the config/IAM encryption process. Instead of encrypting config data (users, policies etc.) with the root credentials MinIO now encrypts this data with a KMS - if configured. Therefore, this PR moves the MinIO-KMS configuration (via env. variables) to a "top-level" configuration. The KMS configuration cannot be stored in the config file since it is used to decrypt the config file in the first place. As a consequence, this commit also removes support for Hashicorp Vault - which has been deprecated anyway. Signed-off-by: Andreas Auernhammer <aead@mail.de>	2021-04-22 09:51:09 -07:00
Harshavardhana	a7acfa6158	fix: pick valid FileInfo additionally based on dataDir (#12116 ) * fix: pick valid FileInfo additionally based on dataDir historically we have always relied on modTime to be consistent and same, we can now add additional reference to look for the same dataDir value. A dataDir is the same for an object at a given point in time for a given version, let's say a `null` version is overwritten in quorum we do not by mistake pick up the fileInfo's incorrectly. * make sure to not preserve fi.Data Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-21 19:06:08 -07:00
Anis Elleuch	cebada2cc7	svcacct: Always search for parent user policy svcacct implied policy (#12117 ) InfoServiceAccount admin API does not correctly calculate the policy for a given service account in case if the policy is implied. Fix it. Signed-off-by: Anis Elleuch <anis@min.io>	2021-04-21 18:12:02 -07:00
Harshavardhana	38a9f87a56	Revert "svc: Disallow creating services accounts by root (#12062 )" This reverts commit `150f3677d6`. Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-21 11:59:23 -07:00
Harshavardhana	4a41222310	fix: newMultipartUpload should go to same pool (#12106 ) avoid potential for duplicates under multi-pool setup, additionally also make sure CompleteMultipart is using a more optimal API for uploadID lookup and never delete the object there is a potential to create a delete marker during complete multipart. Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-21 10:57:36 -07:00
Klaus Post	6235bd825b	Grab read lock while reading usage cache (#12111 ) Signed-off-by: Klaus Post <klauspost@gmail.com>	2021-04-21 08:39:00 -07:00
Harshavardhana	2ef824bbb2	collapse two distinct calls into single RenameData() call (#12093 ) This is an optimization by reducing one extra system call, and many network operations. This reduction should increase the performance for small file workloads.	2021-04-20 10:44:39 -07:00
Klaus Post	3d685b7fff	fix: zip error races in WebDownload (#12086 ) When an error is reported it is ignored and zipping continues with the next object. However, if there is an error it will write a response to `writeWebErrorResponse(w, err)`, but responses are still being built. Fixes #12082 Bonus: Exclude common compressed image types.	2021-04-19 08:44:18 -07:00
Poorna Krishnamoorthy	c9bf6007b4	Use custom transport for remote targets (#12080 )	2021-04-16 18:58:26 -07:00
Harshavardhana	7a0a5bdc0d	remove legacy path for LDAP during policy map removal (#12081 ) Thanks to @Alevsk for noticing this nuanced behavior change between releases from 03-04 to 03-20, make sure that we handle the legacy path removal as well.	2021-04-16 18:18:55 -07:00
Harshavardhana	0a9d8dfb0b	fix: crash in single drive mode for lifecycle (#12077 ) also make sure to close the channel on the producer side, not in a separate go-routine, this can lead to races between a writer and a closer. fixes #12073	2021-04-16 14:09:25 -07:00
Harshavardhana	a334554f99	fix: add helper for expected path.Clean behavior (#12068 ) current usage of path.Clean returns "." for empty strings instead we need `""` string as-is, make relevant changes as needed.	2021-04-15 16:32:13 -07:00
Poorna Krishnamoorthy	d30c5d1cf0	Avoid metadata update for incoming replication failure (#12054 ) This is an optimization to save IOPS. The replication failures will be re-queued once more to re-attempt replication. If it still does not succeed, the replication status is set as `FAILED` and will be caught up on scanner cycle.	2021-04-15 16:32:00 -07:00
Harshavardhana	75ac4ea840	remove possible double locks in bandwidth monitor (#12067 ) additionally reject bandwidth limits with synchronous replication for now.	2021-04-15 16:20:45 -07:00
Anis Elleuch	b6f5785a6d	svc: Display the correct policy of a particular service account (#12064 ) For InfoServiceAccount API, calculating the policy before showing it to the user was not correctly done (only UX issue, not a security issue) This commit fixes it.	2021-04-15 14:47:58 -07:00
Harshavardhana	39dd9b6483	fix: do not return an error on expired credentials (#12057 ) policy might have an associated mapping with an expired user key, do not return an error during DeletePolicy for such situations - proceed normally as its an expected situation.	2021-04-15 08:51:01 -07:00
Andreas Auernhammer	885c170a64	introduce new package pkg/kms (#12019 ) This commit introduces a new package `pkg/kms`. It contains basic types and functions to interact with various KMS implementations. This commit also moves KMS-related code from `cmd/crypto` to `pkg/kms`. Now, it is possible to implement a KMS-based config data encryption in the `pkg/config` package.	2021-04-15 08:47:33 -07:00
Harshavardhana	1456f9f090	fix: preserve shared dataDir during suspend overwrites (#12058 ) CopyObject() when shares dataDir needs to be preserved, and upon versioning suspended overwrites should still preserve the dataDir.	2021-04-15 08:44:05 -07:00
Anis Elleuch	150f3677d6	svc: Disallow creating services accounts by root (#12062 )	2021-04-15 08:43:44 -07:00
Anis Elleuch	291d2793ca	ldap: Create services accounts for LDAP and STS temp accounts (#11808 )	2021-04-14 22:51:14 -07:00
Harshavardhana	b70c298c27	update findDataDir to skip inline data (#12050 )	2021-04-14 22:44:27 -07:00
Harshavardhana	94e1bacd16	STS call should be rejected for missing policies (#12056 ) fixes #12055	2021-04-14 22:35:42 -07:00
Andreas Auernhammer	97aa831352	add new pkg/fips for FIPS 140-2 (#12051 ) This commit introduces a new package `pkg/fips` that bundles functionality to handle and configure cryptographic protocols in case of FIPS 140. If it is compiled with `--tags=fips` it assumes that a FIPS 140-2 cryptographic module is used to implement all FIPS compliant cryptographic primitives - like AES, SHA-256, ... In "FIPS mode" it excludes all non-FIPS compliant cryptographic primitives from the protocol parameters.	2021-04-14 08:29:56 -07:00
ebozduman	b4eeeb8449	PutObjectRetention : return matching error XML as AWS S3 (#11973 )	2021-04-14 00:01:53 -07:00
Harshavardhana	e85b28398b	fix: pre-allocate certain slices with expected capacity (#12044 ) Avoids append() based tiny allocations on known allocated slices repeated access.	2021-04-12 13:45:06 -07:00
Anis Elleuch	8ab111cfb6	scanner: Shuffle disks to scan (#12036 ) Ensure random association between disk and bucket in each crawling iteration to ensure that ILM applies correctly to objects not present in all disks.	2021-04-12 07:55:40 -07:00
Harshavardhana	641150f2a2	change updateVersion to only update keys, no deletes (#12032 ) there are situations where metadata can have keys with empty values, preserve existing behavior	2021-04-10 09:13:12 -07:00
sgandon	0ddc4f0075	fix: allow S3 gateway passthrough for SSE-S3 header on copy object (#12029 )	2021-04-09 08:56:09 -07:00
Harshavardhana	928ee1a7b2	remove null version dataDir upon overwrites (#12023 )	2021-04-08 19:55:44 -07:00
Harshavardhana	8f98e3acfa	fix build with fips tags	2021-04-08 19:31:10 -07:00
Harshavardhana	89d58bec16	avoid frequent DNS lookups for baremetal setups (#11972 ) bump up the DNS cache for baremetal setups upto 10 minutes	2021-04-08 17:51:59 -07:00
Klaus Post	f0ca0b3ca9	Add metadata checksum (#12017 ) - Add 32-bit checksum (32 LSB part of xxhash64) of the serialized metadata. This will ensure that we always reject corrupted metadata. - Add automatic repair of inline data, so the data structure can be used. If data was corrupted, we remove all unreadable entries to ensure that operations can succeed on the object. Since higher layers add bitrot checks this is not a big problem. Cannot downgrade to v1.1 metadata, but since that isn't released, no need for a major bump.	2021-04-08 17:29:54 -07:00
Harshavardhana	0e4794ea50	fix: allow S3 gateway passthrough for SSE-S3 header (#12020 ) only in case of S3 gateway we have a case where we need to allow for SSE-S3 headers as passthrough, If SSE-C headers are passed then they are rejected if KMS is not configured.	2021-04-08 16:40:38 -07:00
Harshavardhana	16ce7fb70c	fix: legacy object should be overwritten for metadataOnly updates (#12012 )	2021-04-08 14:29:27 -07:00
Harshavardhana	641e564b65	fips build tag uses relevant binary link for updates (#12014 ) This code is necessary for `mc admin update` command to work with fips compiled binaries, with fips tags the releaseInfo will automatically point to fips specific binaries.	2021-04-08 09:51:11 -07:00
Harshavardhana	835d2cb9a3	handle dns.ErrBucketConflict as BucketAlreadyExists (#12013 )	2021-04-08 08:24:55 -07:00
Andreas Auernhammer	cda570992e	set SSE headers in put-part response (#12008 ) This commit fixes a bug in the put-part implementation. The SSE headers should be set as specified by AWS - See: https://docs.aws.amazon.com/AmazonS3/latest/API/API_UploadPart.html Now, the MinIO server should set SSE-C headers, like `x-amz-server-side-encryption-customer-algorithm`. Fixes #11991	2021-04-07 15:05:00 -07:00
Harshavardhana	0b33fa50ae	fix: calculate correct content-range with partNumber query (#11992 ) fixes #11989 fixes #11824	2021-04-07 14:37:10 -07:00
Harshavardhana	4223ebab8d	fix: remove auto-close GetObjectReader (#12009 ) locks can get relinquished when Read() sees io.EOF leading to prematurely closing of the readers concurrent writes on the same object can have undesired consequences here when these locks are relinquished.	2021-04-07 13:29:27 -07:00
Klaus Post	48c5e7e5b6	Add runtime mem stats to server info (#11995 ) Adds information about runtime+gc memory use.	2021-04-07 10:40:51 -07:00
Klaus Post	d267d152ba	healing: re-read metadata after lock (#12004 ) Do no use potentially wrong metadata from before acquiring lock. Plus remove unused NoLock option.	2021-04-07 10:39:48 -07:00
Klaus Post	d2ac2f758e	odirectReader: handle EOF correctly (#11998 ) EOF may be sent along with data so queue it up and return it when the buffer is empty. Also, when reading data without direct io don't add a buffer that only results in extra memcopy.	2021-04-07 08:32:59 -07:00
Klaus Post	788a8bc254	Fix disk info race (#11984 ) Protect updated members in xlStorage. ``` WARNING: DATA RACE Write at 0x00c004b4ee78 by goroutine 1491: github.com/minio/minio/cmd.(xlStorage).GetDiskID() d:/minio/minio/cmd/xl-storage.go:590 +0x1078 github.com/minio/minio/cmd.(xlStorageDiskIDCheck).checkDiskStale() d:/minio/minio/cmd/xl-storage-disk-id-check.go:195 +0x84 github.com/minio/minio/cmd.(xlStorageDiskIDCheck).StatVol() d:/minio/minio/cmd/xl-storage-disk-id-check.go:284 +0x16a github.com/minio/minio/cmd.erasureObjects.getBucketInfo.func1() d:/minio/minio/cmd/erasure-bucket.go:100 +0x1a5 github.com/minio/minio/pkg/sync/errgroup.(Group).Go.func1() d:/minio/minio/pkg/sync/errgroup/errgroup.go:122 +0xd7 Previous read at 0x00c004b4ee78 by goroutine 1087: github.com/minio/minio/cmd.(xlStorage).CheckFile.func1() d:/minio/minio/cmd/xl-storage.go:1699 +0x384 github.com/minio/minio/cmd.(xlStorage).CheckFile() d:/minio/minio/cmd/xl-storage.go:1726 +0x13c github.com/minio/minio/cmd.(xlStorageDiskIDCheck).CheckFile() d:/minio/minio/cmd/xl-storage-disk-id-check.go:446 +0x23b github.com/minio/minio/cmd.erasureObjects.parentDirIsObject.func1() d:/minio/minio/cmd/erasure-common.go:173 +0x194 github.com/minio/minio/pkg/sync/errgroup.(Group).Go.func1() d:/minio/minio/pkg/sync/errgroup/errgroup.go:122 +0xd7 ```	2021-04-06 11:33:42 -07:00
Klaus Post	111c02770e	Fix data race when connecting disks (#11983 ) Multiple disks from the same set would be writing concurrently. ``` WARNING: DATA RACE Write at 0x00c002100ce0 by goroutine 166: github.com/minio/minio/cmd.(erasureSets).connectDisks.func1() d:/minio/minio/cmd/erasure-sets.go:254 +0x82f Previous write at 0x00c002100ce0 by goroutine 129: github.com/minio/minio/cmd.(erasureSets).connectDisks.func1() d:/minio/minio/cmd/erasure-sets.go:254 +0x82f Goroutine 166 (running) created at: github.com/minio/minio/cmd.(erasureSets).connectDisks() d:/minio/minio/cmd/erasure-sets.go:210 +0x324 github.com/minio/minio/cmd.(erasureSets).monitorAndConnectEndpoints() d:/minio/minio/cmd/erasure-sets.go:288 +0x244 Goroutine 129 (finished) created at: github.com/minio/minio/cmd.(erasureSets).connectDisks() d:/minio/minio/cmd/erasure-sets.go:210 +0x324 github.com/minio/minio/cmd.(erasureSets).monitorAndConnectEndpoints() d:/minio/minio/cmd/erasure-sets.go:288 +0x244 ```	2021-04-06 11:33:10 -07:00
Poorna Krishnamoorthy	40409437cd	Add initial usage in GetBucketReplicationMetrics API (#11985 )	2021-04-06 11:32:52 -07:00
iternity-dotcom	02f797a23b	remove redundant GetBucketLifecycleHandler call (#11982 )	2021-04-06 09:21:37 -07:00
Andreas Auernhammer	d5d2fc9850	bitrot: add selftest for server startup (#11917 ) This commit adds a self-test for all bitrot algorithms: - SHA-256 - BLAKE2b - HighwayHash The self-test computes an incremental checksum of pseudo-random messages. If a bitrot algorithm implementation stops working on some CPU architecture or with a certain Go version this self-test will prevent the server from starting and silently corrupting data. For additional context see: minio/highwayhash#19	2021-04-06 08:38:22 -07:00
Poorna Krishnamoorthy	075bccda42	Fix cluster bucket stats API for prometheus (#11970 ) Metrics calculation was accumulating inital usage across all nodes rather than using initial usage only once. Also fixing: - bug where all peer traffic was going to the same node. - reset counters when replication status changes from PENDING -> FAILED	2021-04-06 08:36:54 -07:00
Klaus Post	0276652f26	Fix Access Key requests (#11979 ) Fix accessing claims when auth error is unchecked. Only replaced when unchecked and when clearly without side effects. Fixes #11959	2021-04-06 08:35:46 -07:00
Harshavardhana	abb55bd49e	fix: properly close leaking bandwidth monitor channel (#11967 ) This PR fixes - close leaking bandwidth report channel leakage - remove the closer requirement for bandwidth monitor instead if Read() fails remember the error and return error for all subsequent reads. - use locking for usage-cache.bin updates, with inline data we cannot afford to have concurrent writes to usage-cache.bin corrupting xl.meta	2021-04-05 16:07:53 -07:00
Poorna Krishnamoorthy	bb6561fe55	fix: route for replication-metrics API (#11968 )	2021-04-05 13:36:39 -07:00
Harshavardhana	5cce9361bc	fix: avoid an extra rename when there is no dataDir (#11964 ) also perform globalSync() in defer when enabled for RenameData(), to ensure all calls are flushed to disk.	2021-04-05 08:52:28 -07:00
Harshavardhana	09ee303244	add cluster support for realtime bucket stats (#11963 ) implementation in #11949 only catered from single node, but we need cluster metrics by capturing from all peers. introduce bucket stats API that will be used for capturing in-line bucket usage as well eventually	2021-04-04 15:34:33 -07:00
Harshavardhana	d46386246f	api: Introduce metadata update APIs to update only metadata (#11962 ) Current implementation heavily relies on readAllFileInfo but with the advent of xl.meta inlined with data, we cannot easily avoid reading data when we are only interested is updating metadata, this leads to invariably write amplification during metadata updates, repeatedly reading data when we are only interested in updating metadata. This PR ensures that we implement a metadata only update API at storage layer, that handles updates to metadata alone for any given version - given the version is valid and present. This helps reduce the chattiness for following calls.. - PutObjectTags - DeleteObjectTags - PutObjectLegalHold - PutObjectRetention - ReplicateObject (updates metadata on replication status)	2021-04-04 13:32:31 -07:00
Poorna Krishnamoorthy	47c09a1e6f	Various improvements in replication (#11949 ) - collect real time replication metrics for prometheus. - add pending_count, failed_count metric for total pending/failed replication operations. - add API to get replication metrics - add MRF worker to handle spill-over replication operations - multiple issues found with replication - fixes an issue when client sends a bucket name with `/` at the end from SetRemoteTarget API call make sure to trim the bucket name to avoid any extra `/`. - hold write locks in GetObjectNInfo during replication to ensure that object version stack is not overwritten while reading the content. - add additional protection during WriteMetadata() to ensure that we always write a valid FileInfo{} and avoid ever writing empty FileInfo{} to the lowest layers. Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io> Co-authored-by: Harshavardhana <harsha@minio.io>	2021-04-03 09:03:42 -07:00
Harshavardhana	bf106453b8	add policy conditions support for signatureVersion and authType (#11947 ) https://docs.aws.amazon.com/AmazonS3/latest/API/bucket-policy-s3-sigv4-conditions.html fixes #11944	2021-04-02 09:34:15 -07:00
Harshavardhana	434e5c0cfe	allow preserving legacyXLv1 with inline data format (#11951 ) current master breaks this important requirement we need to preserve legacyXLv1 format, this is simply ignored and overwritten causing a myriad of issues by leaving stale files on the namespace etc. for now lets still use the two-phase approach of writing to `tmp` and then renaming the content to the actual namespace.	2021-04-01 22:12:03 -07:00
Harshavardhana	204c610d84	do not use dataDir to reference inline data use versionID (#11942 ) versionID is the one that needs to be preserved and as well as overwritten in case of replication, transition etc - dataDir is an ephemeral entity that changes during overwrites - make sure that versionID is used to save the object content. this would break things if you are already running the latest master, please wipe your current content and re-do your setup after this change.	2021-04-01 13:09:23 -07:00
Harshavardhana	f966fbc4a3	make sure to preserve checksumInfo to lookup older hash (#11940 ) upgrading from 2yr old releases is expected to work, the issue was we were missing checksum info to be passed down to newBitrotReader() for whole bitrot calculation	2021-03-31 21:14:08 -07:00
Harshavardhana	3c571472e0	avoid network read errors crashing CreateFile call (#11939 ) Thanks to @dvaldivia for reproducing this	2021-03-31 18:44:45 -07:00
Harshavardhana	f60eaabfcd	fix: notify parent user in notification events (#11934 ) fixes #11885	2021-03-31 13:21:10 -07:00
Harshavardhana	18dee6a333	add stringer for ErrorCodes (#11933 )	2021-03-31 09:30:52 -07:00
Klaus Post	4dcce17eb9	Determine small objects on shard size (#11935 ) Use shard size to determine when to inline data. For unversioned objects, use 128K/shard and for versioned 16K thresholds.	2021-03-31 09:19:14 -07:00
Klaus Post	0d8c74358d	Add erasure and compression self-tests (#11918 ) Ensure that we don't use potentially broken algorithms for critical functions, whether it be a runtime problem or implementation problem for a specific platform.	2021-03-31 09:11:37 -07:00
Anis Elleuch	6b484f45c6	crawling: Apply lifecycle then decide healing action (#11563 ) It is inefficient to decide to heal an object before checking its lifecycle for expiration or transition. This commit will just reverse the order of action: evaluate lifecycle and heal only if asked and lifecycle resulted a NoneAction.	2021-03-31 02:15:08 -07:00
Ritesh H Shukla	3ddd8b04d1	fix: handle unsupported APIs more granularly (#11674 )	2021-03-30 23:19:36 -07:00
Harshavardhana	8e6e287729	fix: delete/delete marker replication versions consistent (#11932 ) replication didn't work as expected when deletion of delete markers was requested in DeleteMultipleObjects API, this is due to incorrect lookup elements being used to look for delete markers.	2021-03-30 17:15:36 -07:00
Harshavardhana	014edd3462	allow configuring scanner cycles dynamically (#11931 ) This allows us to speed up or slow down sleeps between multiple scanner cycles, helps in testing as well as some deployments might want to run scanner more frequently. This change is also dynamic can be applied on a running cluster, subsequent cycles pickup the newly set value.	2021-03-30 13:59:02 -07:00
Steven Reitsma	e9fede88b3	fix: multi delete when using S3 Gateway with SSE (#11929 )	2021-03-30 13:09:48 -07:00
Harshavardhana	edf053c5c9	disksWithAllParts should use parts if present (#11923 )	2021-03-30 01:51:00 -07:00
Klaus Post	2623338dc5	Inline small file data in xl.meta file (#11758 )	2021-03-29 17:00:55 -07:00
Anis Elleuch	f5831174e6	iam: Use 'on' for enabled accounts for consistency (#11913 ) This commit does not fix any bug, just ensure consistency.	2021-03-29 09:32:36 -07:00
Harshavardhana	d93c6cb9c7	use Access() instead of Lstat() for frequent use (#11911 ) using Lstat() is causing tiny memory allocations, that are usually wasted and never used, instead we can simply uses Access() call that does 0 memory allocations.	2021-03-29 08:07:23 -07:00
Harshavardhana	7c5b35d20f	trace: enhance trace experience further	2021-03-27 13:19:14 -07:00
Anis Elleuch	07ab4d1250	trace: Add prefix to func names of OS & Storage (#11912 )	2021-03-27 10:07:07 -07:00
Anis Elleuch	d8b5adfd10	trace: Add storage & OS tracing (#11889 )	2021-03-26 23:24:07 -07:00
Poorna Krishnamoorthy	95096e31a7	Improve error message from SetRemoteTargetHandler (#11909 )	2021-03-26 18:58:13 -07:00
Harshavardhana	d8bda2dd92	[feat] Add targz transparent extract support (#11849 ) This feature brings in support for auto extraction of objects onto MinIO's namespace from an incoming tar gzipped stream, the only expected metadata sent by the client is to set `snowball-auto-extract`. All the contents from the tar stream are saved as folders and objects on the namespace. fixes #8715	2021-03-26 17:15:09 -07:00
Harshavardhana	df42b128db	fix: service accounts policy enforcement regression (#11910 ) service accounts were not inheriting parent policies anymore due to refactors in the PolicyDBGet() from the latest release, fix this behavior properly.	2021-03-26 13:55:42 -07:00
Anis Elleuch	2c296652f7	Simplify access to local node name (#11907 ) The local node name is heavily used in tracing, create a new global variable to store it. Multiple goroutines can access it since it won't be changed later.	2021-03-26 11:37:58 -07:00
Klaus Post	9efcb9e15c	Fix listPathRaw/WalkDir cancelation (#11905 ) In #11888 we observe a lot of running, WalkDir calls. There doesn't appear to be any listerners for these calls, so they should be aborted. Ensure that WalkDir aborts when upstream cancels the request. Fixes #11888	2021-03-26 11:18:30 -07:00
Anis Elleuch	8d5456c15a	Fix error returned by HealObject in some cases (#11906 ) The background healing can return NoSuchUpload error, the reason is that healing code can return errFileNotFound with three parameters. Simplify the code by returning exact errUploadNotFound error in multipart code. Also ensure that a typed error is always returned whatever the number of parameters because it is better than showing internal error.	2021-03-26 11:17:23 -07:00
Harshavardhana	cf87303094	do not call LocalStorageInfo on gateways (#11903 ) fixes https://github.com/minio/mc/issues/3665	2021-03-25 15:26:22 -07:00
Harshavardhana	90d8ec6310	fix: reject duplicate keys in PostPolicyJSON document (#11902 ) fixes #11894	2021-03-25 13:57:57 -07:00
Klaus Post	b383522743	fix error could not read /proc ion windows. (#11868 ) Bonus: Prealloc reasonable sizes for metrics.	2021-03-25 12:58:43 -07:00
Aditya Manthramurthy	b4d8bcf644	Converge PolicyDBGet functions in IAM (#11891 )	2021-03-25 00:38:15 -07:00
Harshavardhana	d7f32ad649	xl: avoid sending Delete() remote call for fully successful runs an optimization to avoid extra syscalls in PutObject(), adds up to our PutObject response times.	2021-03-24 17:32:12 -07:00
Aditya Manthramurthy	906d68c356	Fix LDAP policy application on user policy (#11887 )	2021-03-24 12:29:25 -07:00
Klaus Post	749e9c5771	metrics: Add canceled requests (#11881 ) Add metric for canceled requests	2021-03-24 10:25:27 -07:00
Harshavardhana	410e84d273	xl: add checks for minioTmpMetaBucket in CreateFile	2021-03-24 09:36:10 -07:00
Harshavardhana	75741dbf4a	xl: remove cleanupDir instead use Delete() (#11880 ) use a single call to remove directly at disk instead of doing recursively at network layer.	2021-03-24 09:08:05 -07:00
Anis Elleuch	fad7b27f15	metrics: Change type of minio_s3_requests_waiting_total to gauge (#11884 )	2021-03-24 09:06:37 -07:00
Harshavardhana	79564656eb	xl: CreateFile shouldn't prematurely timeout (#11878 ) For large objects taking more than '3 minutes' response times in a single PUT operation can timeout prematurely as 'ResponseHeader' timeout hits for 3 minutes. Avoid this by keeping the connection active during CreateFile phase.	2021-03-24 09:05:03 -07:00
Harshavardhana	21cfc4aa49	Revert "xl: CreateFile shouldn't prematurely timeout (#11854 )" This reverts commit `922c7b57f5`.	2021-03-23 23:47:45 -07:00
Harshavardhana	e80239a661	simplify OS instrumentation remove functions for global variables	2021-03-23 22:32:44 -07:00
Ritesh H Shukla	6a2ed44095	fix: optionally enable tracing posix calls	2021-03-23 22:23:08 -07:00
Aditya Manthramurthy	8adfeb0d84	fix: AccountInfo API for LDAP users (#11874 ) Also, ensure admin APIs auth additionally validates groups	2021-03-23 17:39:20 -07:00
Harshavardhana	d23485e571	fix: LDAP groups handling and group mapping (#11855 ) comprehensively handle group mapping for LDAP users across IAM sub-subsytem.	2021-03-23 15:15:51 -07:00
Harshavardhana	da70e6ddf6	avoid healObjects recursively healing at empty path (#11856 ) baseDirFromPrefix(prefix) for object names without parent directory incorrectly uses empty path, leading to long listing at various paths that are not useful for healing - avoid this listing completely if "baseDir" returns empty simple use the "prefix" as is. this improves startup performance significantly	2021-03-23 07:57:07 -07:00
Harshavardhana	922c7b57f5	xl: CreateFile shouldn't prematurely timeout (#11854 ) For large objects taking more than '3 minutes' response times in a single PUT operation can timeout prematurely as 'ResponseHeader' timeout hits for 3 minutes. Avoid this by keeping the connection active during CreateFile phase.	2021-03-22 18:25:05 -07:00
Harshavardhana	726d80dbb7	fix: merge duplicate keys in post policy (#11843 ) some SDKs might incorrectly send duplicate entries for keys such as "conditions", Go stdlib unmarshal for JSON does not support duplicate keys - instead skips the first duplicate and only preserves the last entry. This can lead to issues where a policy JSON while being valid might not properly apply the required conditions, allowing situations where POST policy JSON would end up allowing uploads to unauthorized buckets and paths. This PR fixes this properly.	2021-03-20 22:16:30 -07:00
Ritesh H Shukla	23b03dadb8	Add process uptime metric (#11844 )	2021-03-20 21:23:27 -07:00
Andreas Auernhammer	7b3719c17b	crypto: simplify Context encoding (#11812 ) This commit adds a `MarshalText` implementation to the `crypto.Context` type. The `MarshalText` implementation replaces the `WriteTo` and `AppendTo` implementation. It is slightly slower than the `AppendTo` implementation ``` goos: darwin goarch: arm64 pkg: github.com/minio/minio/cmd/crypto BenchmarkContext_AppendTo/0-elems-8 381475698 2.892 ns/op 0 B/op 0 allocs/op BenchmarkContext_AppendTo/1-elems-8 17945088 67.54 ns/op 0 B/op 0 allocs/op BenchmarkContext_AppendTo/3-elems-8 5431770 221.2 ns/op 72 B/op 2 allocs/op BenchmarkContext_AppendTo/4-elems-8 3430684 346.7 ns/op 88 B/op 2 allocs/op ``` vs. ``` BenchmarkContext/0-elems-8 135819834 8.658 ns/op 2 B/op 1 allocs/op BenchmarkContext/1-elems-8 13326243 89.20 ns/op 128 B/op 1 allocs/op BenchmarkContext/3-elems-8 4935301 243.1 ns/op 200 B/op 3 allocs/op BenchmarkContext/4-elems-8 2792142 428.2 ns/op 504 B/op 4 allocs/op goos: darwin ``` However, the `AppendTo` benchmark used a pre-allocated buffer. While this improves its performance it does not match the actual usage of `crypto.Context` which is passed to a `KMS` and always encoded into a newly allocated buffer. Therefore, this change seems acceptable since it should not impact the actual performance but reduces the overall code for Context marshaling.	2021-03-20 02:48:48 -07:00
Harshavardhana	9a6487319a	remove MINIO_IO_DEADLINE support (#11841 ) this feature in actual deployment was found to be not that useful, remove support for this for now.	2021-03-20 02:47:04 -07:00
Aditya Manthramurthy	94ff624242	Fix querying LDAP group/user policy (#11840 )	2021-03-20 02:37:52 -07:00
Anis Elleuch	98ff91b484	xl: Reduce usage of isDirEmpty() (#11838 ) When an object is removed, its parent directory is inspected to check if it is empty to remove if that is the case. However, we can use os.Remove() directly since it is only able to remove a file or an empty directory.	2021-03-19 15:42:01 -07:00
Anis Elleuch	4d86384dc7	xl: Remove non needed check for empty dir (#11835 ) RenameData renames xl.meta and data dir and removes the parent directory if empty, however, there is a duplicate check for empty dir, since the parent dir of xl.meta is always the same as the data-dir.	2021-03-19 12:26:53 -07:00
Ritesh H Shukla	b5dcaaccb4	Introduce metrics caching for performant metrics (#11831 )	2021-03-19 00:04:29 -07:00
Harshavardhana	b92a220db1	fix: handle weird drives sporadic read O_DIRECT behavior (#11832 ) on freshReads if drive returns errInvalidArgument, we should simply turn-off DirectIO and read normally, there are situations in k8s like environments where the drives behave sporadically in a single deployment and may not have been implemented properly to handle O_DIRECT for reads.	2021-03-18 20:16:50 -07:00
Harshavardhana	51a8619a79	[feat] Add configurable deadline for writers (#11822 ) This PR adds deadlines per Write() calls, such that slow drives are timed-out appropriately and the overall responsiveness for Writes() is always up to a predefined threshold providing applications sustained latency even if one of the drives is slow to respond.	2021-03-18 14:09:55 -07:00
Anis Elleuch	14d89eaae4	mrf: Enhance behavior for better results (#11788 ) MRF was starting to heal when it receives a disk connection event, which is not good when a node having multiple disks reconnects to the cluster. Besides, MRF needs Remove healing option to remove stale files.	2021-03-18 11:19:02 -07:00
Harshavardhana	add3cd4e44	allow configuring delete cleanup interval from default 10minutes (#11818 )	2021-03-17 15:15:58 -07:00
Harshavardhana	60b0f2324e	storage write call path optimizations (#11805 ) - write in o_dsync instead of o_direct for smaller objects to avoid unaligned double Write() situations that may arise for smaller objects < 128KiB - avoid fallocate() as its not useful since we do not use Append() semantics anymore, fallocate is not useful for streaming I/O we can save on a syscall - createFile() doesn't need to validate `bucket` name with a Lstat() call since createFile() is only used to write at `minioTmpBucket` - use io.Copy() when writing unAligned writes to allow usage of ReadFrom() from *os.File providing zero buffer writes().	2021-03-17 09:38:38 -07:00
Anis Elleuch	0eb146e1b2	add additional metrics per disk API latency, API call counts #11250 ) ``` mc admin info --json ``` provides these details, for now, we shall eventually expose this at Prometheus level eventually. Co-authored-by: Harshavardhana <harsha@minio.io>	2021-03-16 20:06:57 -07:00
Andreas Auernhammer	e197800f90	s3v4: read and verify S3 signature v4 chunks separately (#11801 ) This commit fixes a security issue in the signature v4 chunked reader. Before, the reader returned unverified data to the caller and would only verify the chunk signature once it has encountered the end of the chunk payload. Now, the chunk reader reads the entire chunk into an in-memory buffer, verifies the signature and then returns data to the caller. In general, this is a common security problem. We verifying data streams, the verifier MUST NOT return data to the upper layers / its callers as long as it has not verified the current data chunk / data segment: ``` func (r *Reader) Read(buffer []byte) { if err := r.readNext(r.internalBuffer); err != nil { return err } if err := r.verify(r.internalBuffer); err != nil { return err } copy(buffer, r.internalBuffer) } ```	2021-03-16 13:33:40 -07:00
Klaus Post	771dea175c	erasure pools enable faster checks for file not found (#11799 ) For operations that require the object to exist make it possible to detect if the file isn't found in any pool. This will allow these to return the error early without having to re-check.	2021-03-16 11:02:20 -07:00
Harshavardhana	6160188bf3	fix: erasure index based reading based on actual ParityBlocks (#11792 ) in some setups with ordering issues in drive configuration, we should rely on expected parityBlocks instead of `len(disks)/2`	2021-03-15 20:03:13 -07:00
Steve Wills	642ba3f2d6	fix: runtime issue on FreeBSD due to missing O_NOATIME/O_DSYNC support (#11790 ) See also: https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=253937	2021-03-15 14:02:36 -07:00
Harshavardhana	afbd3e41eb	add missing principalId in web notifications (#11777 ) fixes #11561	2021-03-13 10:52:43 -08:00
Poorna Krishnamoorthy	5e003549cc	Replication: Enforce DeleteMarker disable setting (#11720 ) This PR also enforces DeleteReplication disable setting	2021-03-13 10:28:35 -08:00
Nitish Tiwari	7fa3e4106b	Add consoleAdmin as a default canned policy (#11770 )	2021-03-12 12:51:43 -08:00
Philip Brown	75db500e85	cmd/os-readdir_other.go - return nil with err (#11772 )	2021-03-12 07:22:25 -08:00
Harshavardhana	feafccf007	handle trimming '/' if present in the object names (#11765 ) - MultipleDeletes should handle '/' prefix for objectnames - Trimming the slash alone is enough for ListObjects() prefix and markers fixes #11769	2021-03-11 13:57:03 -08:00
Anis Elleuch	f92b7a5621	Browser: Shared link has content-disposition header (#11712 ) The shared link will be automatically downloadable when the user opens the shared link in a browser.	2021-03-10 23:02:16 -08:00
Poorna Krishnamoorthy	c25e75f0b5	Fix redact LDAP password properly (#11762 ) fixes #11742 previous pull request #11750 fixed only the web trace	2021-03-10 11:05:38 -08:00
Harshavardhana	777344a594	add release build-arg to docker multiarch builds (#11754 ) additional paths to ignore for healing	2021-03-10 09:38:35 -08:00
Poorna Krishnamoorthy	878bc6c72b	Redact LDAP password if any in request trace (#11750 ) Fixes: #11742	2021-03-09 14:43:16 -08:00
Klaus Post	fdc2f69218	truncate xl.meta files upon rewrites #11749 ) If the destination files exist and is larger - junk data will be left at the end of the file.	2021-03-09 14:42:24 -08:00
Anis Elleuch	0d124095ea	lc: Return expiration header only when version id is unspecified (#11718 ) Follow S3 specification to return Expiration header in HEAD/GET call only when version-id is not passed in the request.	2021-03-09 13:19:08 -08:00
Harshavardhana	691035832a	fix: normalize object layer inputs (#11534 ) Cases where we have applications making request for `//` in object names make sure that all are normalized to `/` and all such requests that are prefixed '/' are removed. To ensure a consistent view from all operations.	2021-03-09 12:58:22 -08:00
Anis Elleuch	eac66e67ec	Use maximum parity for config files (#11740 ) Some deployments have low parity (EC:2), but we really do not need to save our config data with the same parity configuration. N/2 would be better to keep MinIO configurations intact when unexpected a number of drives fail.	2021-03-09 10:19:47 -08:00
Anis Elleuch	57f3ed22d4	erasure: Reduce the interval of cleaning up .trash folder (#11741 ) Reduce from 30 to 10 minutes.	2021-03-09 09:45:38 -08:00
Poorna Krishnamoorthy	2f29719e6b	resize replication worker pool dynamically after config update (#11737 )	2021-03-09 02:56:42 -08:00
Andreas Auernhammer	209fe61dcc	vault: disable Hashicorp Vault with opt-in (#11711 ) This commit disables the Hashicorp Vault support but provides a way to temp. enable it via the `MINIO_KMS_VAULT_DEPRECATION=off` Vault support has been deprecated long ago and this commit just requires users to take action if they maintain a Vault integration.	2021-03-09 00:02:35 -08:00
Harshavardhana	8ecffdb7a7	Revert "Revert "heal: Heal bucket metadata when a fresh disk is inserted (#11734 )"" This reverts commit `806df164b2`.	2021-03-08 16:12:17 -08:00
Harshavardhana	806df164b2	Revert "heal: Heal bucket metadata when a fresh disk is inserted (#11734 )" This reverts commit `64662a49ff`.	2021-03-08 14:43:24 -08:00
Klaus Post	4ac9ed4248	CopyObject: Do not remove crypto info when compressed (#11702 ) Removing crypto info makes it impossible to copy encrypted+compressed objects. Disable destination compression when encrypted.	2021-03-08 12:57:54 -08:00
Klaus Post	3ff5f55dcb	Fetch fileinfo concurrently (#11700 ) For non-erasure setups fetch up to 10 fileinfos concurrently. Fixes #11625	2021-03-08 11:30:43 -08:00
Max Xu	097e5eba9f	feat: remove go-bindata-assetfs in favor of embed by upgrading to go1.16 (#11733 )	2021-03-08 11:26:43 -08:00
Anis Elleuch	64662a49ff	heal: Heal bucket metadata when a fresh disk is inserted (#11734 ) Replacing disk with a fresh one never heals bucket metadata (policy, notification, etc..). This commit fixes the issue.	2021-03-08 10:54:13 -08:00
Harshavardhana	78e867e145	ignore healing .trash, .metacache amd .multipart paths (#11725 )	2021-03-07 09:38:31 -08:00
Harshavardhana	9ccc483df6	[feat]: change erasure coding default block size from 10MiB to 1MiB (#11721 ) major performance improvements in range GETs to avoid large read amplification when ranges are tiny and random ``` ------------------- Operation: GET Operations: 142014 -> 339421 Duration: 4m50s -> 4m56s * Average: +139.41% (+1177.3 MiB/s) throughput, +139.11% (+658.4) obj/s * Fastest: +125.24% (+1207.4 MiB/s) throughput, +132.32% (+612.9) obj/s * 50% Median: +139.06% (+1175.7 MiB/s) throughput, +133.46% (+660.9) obj/s * Slowest: +203.40% (+1267.9 MiB/s) throughput, +198.59% (+753.5) obj/s ``` TTFB from 10MiB BlockSize ``` * First Access TTFB: Avg: 81ms, Median: 61ms, Best: 20ms, Worst: 2.056s ``` TTFB from 1MiB BlockSize ``` * First Access TTFB: Avg: 22ms, Median: 21ms, Best: 8ms, Worst: 91ms ``` Full object reads however do see a slight change which won't be noticeable in real world, so not doing any comparisons TTFB still had improvements with full object reads with 1MiB ``` * First Access TTFB: Avg: 68ms, Median: 35ms, Best: 11ms, Worst: 1.16s ``` v/s TTFB with 10MiB ``` * First Access TTFB: Avg: 388ms, Median: 98ms, Best: 20ms, Worst: 4.156s ``` This change should affect all new uploads, previous uploads should continue to work with business as usual. But dramatic improvements can be seen with these changes.	2021-03-06 14:09:34 -08:00
Anis Elleuch	abce040088	fix: Remove repetitive IAM ready message (#11723 ) "IAM initialization complete" is printed each 5 minutes, avoid this by printing it only during the first initialization of IAM.	2021-03-06 09:27:46 -08:00
Anis Elleuch	558762bdf6	iam: Return a slice of policies for a group (#11722 ) A group can have multiple policies, a user subscribed to readwrite & diagnostics can perform S3 operations & admin operations as well. However, the current code only returns one policy for one group.	2021-03-06 09:27:06 -08:00
Harshavardhana	d971061305	use listPathRaw for HealObjects() instead of expensive WalkVersions() (#11675 )	2021-03-06 09:25:48 -08:00
Andreas Auernhammer	509bcc01ad	fips: do not use SHA-3 when building a FIPS-140 2 binary (#11710 ) This commit disables SHA-3 for OpenID when building a FIPS-140 2 compatible binary. While SHA-3 is a crypto. hash function accepted by NIST there is no FIPS-140 2 compliant implementation available when using the boringcrypto Go branch. Therefore, SHA-3 must not be used when building a FIPS-140 2 binary.	2021-03-05 20:43:42 -08:00
Krishnan Parthasarathi	bcf9825082	Data usage should account for transitioned objects (#11717 )	2021-03-05 14:15:53 -08:00
sgandon	124816f6a6	fix : IAM Intialization failing with a large number of users/policies (#11701 )	2021-03-05 08:36:16 -08:00
Klaus Post	fa9cf1251b	Imporve healing and reporting (#11312 ) * Provide information on actively healing, buckets healed/queued, objects healed/failed. * Add concurrent healing of multiple sets (typically on startup). * Add bucket level resume, so restarts will only heal non-healed buckets. * Print summary after healing a disk is done.	2021-03-04 14:36:23 -08:00
Harshavardhana	d73d756a80	fix: incorrect errors thrown by lint (#11699 ) fixes #11698	2021-03-04 14:27:38 -08:00
Aditya Manthramurthy	7488c77e7c	Test LDAP connection configuration at startup (#11684 )	2021-03-04 12:17:36 -08:00
Harshavardhana	786585009e	fix: capture disks when entire peer is offline (#11697 ) currently when one of the peer is down, the drives from that peer are reported as '0/0' offline instead we should capture/filter the drives from the peer and populate it appropriately such that `mc admin info` displays correct info.	2021-03-04 10:07:05 -08:00
Anis Elleuch	7be7109471	locking: Add Refresh for better locking cleanup (#11535 ) Co-authored-by: Anis Elleuch <anis@min.io> Co-authored-by: Harshavardhana <harsha@minio.io>	2021-03-03 18:36:43 -08:00
Klaus Post	c3217bd6eb	Use actual size for buffer selection (#11687 ) For compressed inputs, this will be -1, but the object may be small.	2021-03-03 16:28:10 -08:00
Andreas Auernhammer	f14cc6c943	etag: add FromContentMD5 to parse content-md5 as ETag (#11688 ) This commit adds the `FromContentMD5` function to parse a client-provided content-md5 as ETag. Further, it also adds multipart ETag computation for future needs.	2021-03-03 12:58:28 -08:00
Harshavardhana	2c198ae7b6	fix: prometheus metrics disks_online count when disks are down (#11689 ) prometheus metrics was using total disks instead of online disk count, when disks were down, this PR fixes this and also adds a new metric for total_disk_count	2021-03-03 11:18:41 -08:00
Poorna Krishnamoorthy	690434514d	Avoid notification event for replicas (#11683 ) Creating notification events for replica creation is not particularly useful to send as the notification event generated at source already includes replication completion events. For applications using replica cluster as failover, avoiding duplicate notifications for replica event will allow seamless failover.	2021-03-03 11:13:31 -08:00
Harshavardhana	039f59b552	fix: missing user policy enforcement in PostPolicyHandler (#11682 )	2021-03-03 08:47:08 -08:00
Harshavardhana	c6a120df0e	fix: Prometheus metrics to re-use storage disks (#11647 ) also re-use storage disks for all `mc admin server info` calls as well, implement a new LocalStorageInfo() API call at ObjectLayer to lookup local disks storageInfo also fixes bugs where there were double calls to StorageInfo()	2021-03-02 17:28:04 -08:00
Klaus Post	cd9e30c0f4	IAM: Block while loading users (#11671 ) While starting up a request that needs all IAM data will start another load operation if the first on startup hasn't finished. This slows down both operations. Block these requests until initial load has completed. Blocking calls will be ListPolicies, ListUsers, ListServiceAccounts, ListGroups - and the calls that eventually trigger these. These will wait for the initial load to complete. Fixes issue seen in #11305	2021-03-02 17:08:25 -08:00
Harshavardhana	f96d4cf7d3	fix: do not deny admins to change other passwords fixes a regression from #11680	2021-03-02 17:02:32 -08:00
Harshavardhana	879599b0cf	fix: enforce deny if present for implicit permissions (#11680 ) Implicit permissions for any user is to be allowed to change their own password, we need to restrict this further even if there is an implicit allow for this scenario - we have to honor Deny statements if they are specified.	2021-03-02 15:35:50 -08:00
Harshavardhana	b1bb3f7016	[feat]: implement GetBucketPolicyStatus API (#11673 ) additionally also add more APIs in notImplemented list, adjust routing rules appropriately	2021-03-01 23:10:33 -08:00
Anis Elleuch	e8d8dfa3ae	Add metric for internode RPC calls errors (#11669 )	2021-03-01 12:31:33 -08:00
Nitish Tiwari	bbd1244a88	Add support for mTLS for Audit log target (#11645 )	2021-03-01 09:19:13 -08:00
Klaus Post	10bdb78699	fix: listObjectVersions Include object in marker (#11562 ) ListObjectVersions would skip past the object in the marker when version id is specified. Make `listPath` return the object with the marker and truncate it if not needed. Avoid having to parse unintended objects to find a version marker.	2021-03-01 08:12:02 -08:00
Shireesh Anjal	289b22d911	fix: pool number not added for one server (#11670 ) The previous code was iterating over replies from peers and assigning pool numbers to them, thus missing to add it for the local server. Fixed by iterating over the server properties of all the servers including the local one.	2021-03-01 08:09:43 -08:00
Harshavardhana	0b9c17443e	update gopsutil to use the v3 API (#11638 )	2021-03-01 00:15:46 -08:00
Bala FA	23f7ab40b3	Add PoolNumber field to madmin.ServerProperties (#11327 )	2021-02-28 21:26:28 -08:00
Harshavardhana	2f4af09c01	fix: alow changes to readAllData to decrement activeCount()	2021-02-28 20:09:23 -08:00
Harshavardhana	37960cbc2f	fix: avoid writing more content on network with O_DIRECT reads (#11659 ) There was an io.LimitReader was missing for the 'length' parameter for ranged requests, that would cause client to get truncated responses and errors. fixes #11651	2021-02-28 15:33:03 -08:00
cbows	c67d1bf120	add unauthenticated lookup-bind mode to LDAP identity (#11655 ) Closes #11646	2021-02-28 12:57:31 -08:00
Klaus Post	c5b3a675fa	Block profiling tweaks (#11612 ) The base profiles contains no valuable data, don't record them. Reduce block rate by 2 orders of magnitude, should still capture just as valuable data with less CPU strain.	2021-02-27 09:22:14 -08:00
Harshavardhana	b690304eed	use faster way for siphash (#11640 )	2021-02-26 16:53:06 -08:00
Harshavardhana	9171d6ef65	rename all references from crawl -> scanner (#11621 )	2021-02-26 15:11:42 -08:00
Harshavardhana	6386b45c08	[feat] use rename instead of recursive deletes (#11641 ) most of the delete calls today spend time in a blocking operation where multiple calls need to be recursively sent to delete the objects, instead we can use rename operation to atomically move the objects from the namespace to `tmp/.trash` we can schedule deletion of objects at this location once in 15, 30mins and we can also add wait times between each delete operation. this allows us to make delete's faster as well less chattier on the drives, each server runs locally a groutine which would clean this up regularly.	2021-02-26 09:52:27 -08:00
Andreas Auernhammer	1f659204a2	remove GetObject from ObjectLayer interface (#11635 ) This commit removes the `GetObject` method from the `ObjectLayer` interface. The `GetObject` method is not longer used by the HTTP handlers implementing the high-level S3 semantics. Instead, they use the `GetObjectNInfo` method which returns both, an object handle as well as the object metadata. Therefore, it is no longer necessary that a concrete `ObjectLayer` implements `GetObject`.	2021-02-26 09:52:02 -08:00
Harshavardhana	f9f6fd0421	fix: service account permissions generated from LDAP user (#11637 ) service accounts generated from LDAP parent user did not inherit correct permissions, this PR fixes this fully.	2021-02-25 13:49:59 -08:00
Klaus Post	85620dfe93	use bucket in path in distribution hash (#11634 ) Use bucket in erasure distribution hash. For the rare cases where objects with the same names are uploaded to many buckets.	2021-02-25 10:11:31 -08:00
Harshavardhana	a8e4f64ff3	Revert "fix: remove persistence layer for metacache store in memory (#11538 )" This reverts commit `b23659927c`.	2021-02-24 22:24:51 -08:00
Krishnan Parthasarathi	ca5c6e3160	fix: translate empty versionID string to null version where appropriate (#11629 ) We store the null version as empty string. We should translate it to null version for bucket with version suspended too.	2021-02-24 18:39:10 -08:00
Harshavardhana	b23659927c	fix: remove persistence layer for metacache store in memory (#11538 ) store the cache in-memory instead of disks to avoid large write amplifications for list heavy workloads, store in memory instead and let it auto expire.	2021-02-24 15:51:41 -08:00
Andreas Auernhammer	c1a49be639	use crypto/sha256 for FIPS 140-2 compliance (#11623 ) This commit replaces the usage of github.com/minio/sha256-simd with crypto/sha256 of the standard library in all non-performance critical paths. This is necessary for FIPS 140-2 compliance which requires that all crypto. primitives are implemented by a FIPS-validated module. Go can use the Google FIPS module. The boringcrypto branch of the Go standard library uses the BoringSSL FIPS module to implement crypto. primitives like AES or SHA256. We only keep github.com/minio/sha256-simd when computing the content-SHA256 of an object. Therefore, this commit relies on a build tag `fips`. When MinIO is compiled without the `fips` flag it will use github.com/minio/sha256-simd. When MinIO is compiled with the fips flag (go build --tags "fips") then MinIO uses crypto/sha256 to compute the content-SHA256.	2021-02-24 09:00:15 -08:00
Klaus Post	03172b89e2	Ensure cache has finished deserializing (#11620 ) Make sure that response has been fully deserialized before returning.	2021-02-24 02:59:49 -08:00
Harshavardhana	b517c791e9	[feat]: use DSYNC for xl.meta writes and NOATIME for reads (#11615 ) Instead of using O_SYNC, we are better off using O_DSYNC instead since we are only ever interested in data to be persisted to disk not the associated filesystem metadata. For reads we ask customers to turn off noatime, but instead we can proactively use O_NOATIME flag to avoid atime updates upon reads.	2021-02-24 00:14:16 -08:00
Petr Tichý	14aef52004	remove Content-MD5 on Range requests (#11611 ) This removes the Content-MD5 response header on Range requests in Azure Gateway mode. The partial content MD5 doesn't match the full object MD5 in metadata.	2021-02-23 19:32:56 -08:00
Andreas Auernhammer	d4b822d697	pkg/etag: add new package for S3 ETag handling (#11577 ) This commit adds a new package `etag` for dealing with S3 ETags. Even though ETag is often viewed as MD5 checksum of an object, handling S3 ETags correctly is a surprisingly complex task. While it is true that the ETag corresponds to the MD5 for the most basic S3 API operations, there are many exceptions in case of multipart uploads or encryption. In worse, some S3 clients expect very specific behavior when it comes to ETags. For example, some clients expect that the ETag is a double-quoted string and fail otherwise. Non-AWS compliant ETag handling has been a source of many bugs in the past. Therefore, this commit adds a dedicated `etag` package that provides functionality for parsing, generating and converting S3 ETags. Further, this commit removes the ETag computation from the `hash` package. Instead, the `hash` package (i.e. `hash.Reader`) should focus only on computing and verifying the content-sha256. One core feature of this commit is to provide a mechanism to communicate a computed ETag from a low-level `io.Reader` to a high-level `io.Reader`. This problem occurs when an S3 server receives a request and has to compute the ETag of the content. However, the server may also wrap the initial body with several other `io.Reader`, e.g. when encrypting or compressing the content: ``` reader := Encrypt(Compress(ETag(content))) ``` In such a case, the ETag should be accessible by the high-level `io.Reader`. The `etag` provides a mechanism to wrap `io.Reader` implementations such that the `ETag` can be accessed by a type-check. This technique is applied to the PUT, COPY and Upload handlers.	2021-02-23 12:31:53 -08:00
Harshavardhana	aa7244a9a4	fix: make sure to convert the error properly in HealBucket() (#11610 ) server startup code expects the object layer to properly convert error into a proper type, so that in situations when servers are coming up and quorum is not available servers wait on each other.	2021-02-23 09:23:11 -08:00
Harshavardhana	2a79ea0332	isServerResolvable its sufficient to check server is reachable (#11609 ) using isServerResolvable for expiration can lead to chicken and egg problems, a lock might expire knowingly when server is booting up causing perpetual locks getting expired.	2021-02-22 16:29:53 -08:00
Aditya Manthramurthy	02e7de6367	LDAP config: fix substitution variables (#11586 ) - In username search filter and username format variables we support %s for replacing with the username. - In group search filter we support %s for username and %d for the full DN of the username.	2021-02-22 13:20:36 -08:00
Harshavardhana	da676ac298	remove network calls for getLocalDisks (#11603 )	2021-02-22 13:19:44 -08:00
Harshavardhana	18ec933085	fix: for containers use root-disk detection cleverly (#11593 ) root-disk implemented currently had issues where root disk partitions getting modified might race and provide incorrect results, to avoid this lets rely again back on DeviceID and match it instead. In-case of containers `/data` is one such extra entity that needs to be verified for root disk, due to how 'overlay' filesystem works and the 'overlay' presents a completely different 'device' id - using `/data` as another entity for fallback helps because our containers describe 'VOLUME' parameter that allows containers to automatically have a virtual `/data` that points to the container root path this can either be at `/` or `/var/lib/` (on different partition)	2021-02-22 10:32:21 -08:00
Harshavardhana	c31d2c3fdc	fix: CrawlAndGetDataUsage close pipe() before using a new one (#11600 ) also additionally make sure errors during deserializer closes the reader with right error type such that Write() end actually see the final error, this avoids a waitGroup usage and waiting.	2021-02-22 10:04:32 -08:00
Harshavardhana	8778828a03	fix: read metadata in O_DIRECT if configured and supported (#11594 ) reduce the page-cache pressure completely by moving the entire read-phase of our operations to O_DIRECT, primarily this is going to be very useful for chatty metadata operations such as listing, scanner, ilm, healing like operations to avoid filling up the page-cache upon repeated runs.	2021-02-22 01:36:17 -08:00
Sarasa Kisaragi	48b212dd8e	Fix HDFS wrong filepath if subpath provided (#11574 )	2021-02-20 15:32:18 -08:00
Harshavardhana	be7de911c4	fix: update minio-go to fix an issue with S3 gateway (#11591 ) since we have changed our default envs to MINIO_ROOT_USER, MINIO_ROOT_PASSWORD this was not supported by minio-go credentials package, update minio-go to v7.0.10 for this support. This also addresses few bugs related to users had to specify AWS_ACCESS_KEY_ID as well to authenticate with their S3 backend if they only used MINIO_ROOT_USER.	2021-02-20 11:10:21 -08:00
Harshavardhana	8cad407e0b	fix: Bring support for symlink on regular files on NAS (#11383 ) fixes #11203	2021-02-20 00:30:12 -08:00
Poorna Krishnamoorthy	85d2187c20	fix: ETag mismatch for large upload in replica (#11587 )	2021-02-20 00:22:17 -08:00
Anis Elleuch	98d3f94996	metrics: Add the number of requests in the waiting queue (#11580 ) We can use this metric to check if there are too many S3 clients in the queue and could explain why some of those S3 clients are timing out. ``` minio_s3_requests_waiting_total{server="127.0.0.1:9000"} 9981 ``` If max_requests is 10000 then there is a strong possibility that clients are timing out because of the queue deadline.	2021-02-20 00:21:55 -08:00
mailsmail	173284903b	fix incorrect http range in SelectObjectContentHandler (#11585 )	2021-02-19 17:55:28 -08:00
Poorna Krishnamoorthy	2dce5d9442	fix: delete marker permanent delete replication (#11581 )	2021-02-18 16:35:37 -08:00
Anis Elleuch	f28b063091	heal: Use healDeleteDangling global const in self healing (#11579 ) A small fix, use healDeleteDangling constant instead of 'true' in the self-healing code.	2021-02-18 15:16:20 -08:00
Klaus Post	c5b2a8441b	fix: faster healing when disk is replaced. (#11520 )	2021-02-18 11:06:54 -08:00
Klaus Post	8a6b13c239	Avoid synchronizing usage writes (#11560 ) If the periodic `case <-t.C:` save gets held up for a long time it will end up synchronize all disk writes for saving the caches. We add jitter to per set writes so they don't sync up and don't hold a lock for the write, since it isn't needed anyway. If an outage prevents writes for a long while we also add individual waits for each disk in case there was a queue. Furthermore limit the number of buffers kept to 2GiB, since this could get huge in large clusters. This will not act as a hard limit but should be enough for normal operation.	2021-02-18 00:38:37 -08:00
Poorna Krishnamoorthy	8e8a792d9d	Allow delete marker replication from replica (#11566 ) in the case of active-active replication. This PR also has the following changes: - add docs on replication design - fix corner case of completing versioned delete on a delete marker when the target is down and `mc rm --vid` is performed repeatedly. Instead the version should still be retained in the `PENDING\|FAILED` state until replication sync completes. - remove `s3:Replication:OperationCompletedReplication` and `s3:Replication:OperationFailedReplication` from ObjectCreated events type	2021-02-18 00:33:51 -08:00
Harshavardhana	95e0acbb26	fix: allow accountInfo with creds with parentUsers (#11568 )	2021-02-17 20:57:17 -08:00
Poorna Krishnamoorthy	55037e6e54	lifecycle:Fix args passed to determine expiry header (#11567 )	2021-02-17 19:25:19 -08:00
Harshavardhana	289e1d8b2a	fix: reduce crawler memory usage by orders of magnitude (#11556 ) currently crawler waits for an entire readdir call to return until it processes usage, lifecycle, replication and healing - instead we should pass the applicator all the way down to avoid building any special stack for all the contents in a single directory. This allows for - no need to remember the entire list of entries per directory before applying the required functions - no need to wait for entire readdir() call to finish before applying the required functions	2021-02-17 15:34:42 -08:00
Harshavardhana	ffea6fcf09	fix: rename crawler as scanner in config (#11549 )	2021-02-17 12:04:11 -08:00
Klaus Post	11b2220696	Don't autoheal if disks are healing (#11558 ) Don't spawn automatic healing ops if a disk is healing.	2021-02-17 10:18:12 -08:00
Harshavardhana	aa8450a2a1	fix: parallelize getPoolIdx() for object lookup (#11547 )	2021-02-16 19:36:15 -08:00
Harshavardhana	7d4a2d2b68	fix: multiple pool reads parallelize when possible (#11537 )	2021-02-16 02:43:47 -08:00
Anis Elleuch	c4e12dc846	fix: in MultiDelete API return MalformedXML upon empty input (#11532 ) To follow S3 spec	2021-02-13 09:48:25 -08:00
Harshavardhana	a94a9c37fa	fix: support IAM policy handling for wildcard actions (#11530 ) This PR fixes - allow 's3:versionid` as a valid conditional for Get,Put,Tags,Object locking APIs - allow additional headers missing for object APIs - allow wildcard based action matching	2021-02-12 23:05:09 -08:00
Harshavardhana	79b6a43467	fix: avoid timed value for network calls (#11531 ) additionally simply timedValue to have RWMutex to avoid concurrent calls to DiskInfo() getting serialized, this has an effect on all calls that use GetDiskInfo() on the same disks. Such as getOnlineDisks, getOnlineDisksWithoutHealing	2021-02-12 18:17:52 -08:00
Shireesh Anjal	928de04f7a	fix: osinfos incomplete in case of warnings (#11505 ) The function used for getting host information (host.SensorsTemperaturesWithContext) returns warnings in some cases. Returning with error in such cases means we miss out on the other useful information already fetched (os info). If the OS info has been succesfully fetched, it should always be included in the output irrespective of whether the other data (CPU sensors, users) could be fetched or not.	2021-02-12 17:57:57 -08:00
Poorna Krishnamoorthy	93fd248b52	fix: save ModTime properly in disk cache (#11522 ) fix #11414	2021-02-11 19:25:47 -08:00
Harshavardhana	2a7b123895	turn off http2 for TLS setups for now (#11523 ) due to lots of issues with x/net/http2, as well as the bundled h2_bundle.go in the go runtime should be avoided for now. https://github.com/golang/go/issues/23559 https://github.com/golang/go/issues/42534 https://github.com/golang/go/issues/43989 https://github.com/golang/go/issues/33425 https://github.com/golang/go/issues/29246 With collection of such issues present, it make sense to remove HTTP2 support for now	2021-02-11 15:53:04 -08:00
Harshavardhana	b3c56b53fb	fix: metacache should only rename entries during cleanup (#11503 ) To avoid large delays in metacache cleanup, use rename instead of recursive delete calls, renames are cheaper move the content to minioMetaTmpBucket and then cleanup this folder once in 24hrs instead. If the new cache can replace an existing one, we should let it replace since that is currently being saved anyways, this avoids pile up of 1000's of metacache entires for same listing calls that are not necessary to be stored on disk.	2021-02-11 10:22:03 -08:00
Poorna Krishnamoorthy	f24d8127ab	fix: DeleteMultipleObjectsHandler to process deleted objects correctly (#11515 ) DeleteMarkerVersionID which is returned by the lower layer should not be used in the key to lookup ObjectToDelete map	2021-02-10 23:41:41 -08:00
Harshavardhana	7875d472bc	avoid notification for non-existent delete objects (#11514 ) Skip notifications on objects that might have had an error during deletion, this also avoids unnecessary replication attempt on such objects. Refactor some places to make sure that we have notified the client before we - notify - schedule for replication - lifecycle etc.	2021-02-10 22:00:42 -08:00
Harshavardhana	711adb9652	remove ipv6 fallbackdelay leave it as default	2021-02-10 17:35:09 -08:00
Poorna Krishnamoorthy	e6b4ea7618	More fixes for delete marker replication (#11504 ) continuation of PR#11491 for multiple server pools and bi-directional replication. Moving proxying for GET/HEAD to handler level rather than server pool layer as this was also causing incorrect proxying of HEAD. Also fixing metadata update on CopyObject - minio-go was not passing source version ID in X-Amz-Copy-Source header	2021-02-10 17:25:04 -08:00
Aditya Manthramurthy	466e95bb59	Return group DN instead of group name in LDAP STS (#11501 ) - Additionally, check if the user or their groups has a policy attached during the STS call. - Remove the group name attribute configuration value.	2021-02-10 16:52:49 -08:00
Harshavardhana	881f98e511	fix: use getPoolIdx in DeleteObjects() (#11513 ) filter out relevant objects for each pool to avoid calling, further delete operations on subsequent pools where some of these objects might not exist. This is mainly useful to avoid situations during bi-directional bucket replication.	2021-02-10 14:25:43 -08:00
Harshavardhana	cbf4bb62e0	fix: getPoolIdx decouple from top level options (#11512 ) top-level options shouldn't be passed down for GetObjectInfo() while verifying the objects in different pools, this is to make sure that we always get the value from the pool where the object exists.	2021-02-10 11:45:02 -08:00
Anis Elleuch	682482459d	Change the default object content-type to binary/octet-stream (#11508 )	2021-02-10 08:56:37 -08:00
Krishnan Parthasarathi	b87fae0049	Simplify PutObjReader for plain-text reader usage (#11470 ) This change moves away from a unified constructor for plaintext and encrypted usage. NewPutObjReader is simplified for the plain-text reader use. For encrypted reader use, WithEncryption should be called on an initialized PutObjReader. Plaintext: func NewPutObjReader(rawReader hash.Reader) PutObjReader The hash.Reader is used to provide payload size and md5sum to the downstream consumers. This is different from the previous version in that there is no need to pass nil values for unused parameters. Encrypted: func WithEncryption(encReader hash.Reader, key crypto.ObjectKey) (*PutObjReader, error) This method sets up encrypted reader along with the key to seal the md5sum produced by the plain-text reader (already setup when NewPutObjReader was called). Usage: ``` pReader := NewPutObjReader(rawReader) // ... other object handler code goes here // Prepare the encrypted hashed reader pReader, err = pReader.WithEncryption(encReader, objEncKey) ```	2021-02-10 08:52:50 -08:00
Shireesh Anjal	5a18d437ce	fix: drive hw info incomplete when smartinfo fails (#11509 ) Collection of SMART information doesn't work in certain scenarios e.g. in a container based setup. In such cases, instead of returning an error (without any data), we should only set the error on the smartinfo struct, so that other important drive hw info like device, mountpoint, etc is retained in the output.	2021-02-10 08:48:14 -08:00
Poorna Krishnamoorthy	93eb549a83	fix: duplicate delete marker attempts in bi-directional replication (#11491 )	2021-02-09 15:11:43 -08:00
Harshavardhana	fe3c39b583	use the new errgroup API whereever applicable (#11466 ) start using the new errgroup concurrency control API introduced in #11457	2021-02-09 12:08:25 -08:00
Harshavardhana	84d400487f	fix: accountInfo API to cater for federated setups (#11484 ) when MinIO is deployed in a federated setup, use etcd based listing of buckets to provide appropriate filtering of buckets per user.	2021-02-09 09:53:07 -08:00
Shireesh Anjal	3afa499885	fix: empty buckets/objects nodes in new setup (#11493 )	2021-02-09 09:52:38 -08:00
Krishna Srinivas	876b79b8d8	read-health check endpoint returns success if cluster can serve read requests (#11310 )	2021-02-09 01:00:44 -08:00
Ritesh H Shukla	3d74efa6b1	fux: copy object for encrypted objects (#11490 )	2021-02-08 19:58:17 -08:00
Harshavardhana	68d299e719	fix: case-insensitive lookups for metadata (#11489 ) continuation of #11487, with more changes	2021-02-08 18:12:28 -08:00
Poorna Krishnamoorthy	f9c5636c2d	fix: lookup metdata case insensitively (#11487 ) while setting replication options	2021-02-08 16:19:05 -08:00
Klaus Post	9b10118d34	Metacache add abs entry limit (#11483 ) Add an absolute limit to the number of metacaches for a bucket. Delete excess caches if they haven't been handed out in an hour.	2021-02-08 11:36:16 -08:00
Harshavardhana	0e3211f4ad	fix: server upgrades should have more descriptive error messages (#11476 ) during rolling upgrade, provide a more descriptive error message and discourage rolling upgrade in such situations, allowing users to take action. additionally also rename `slashpath -> pathutil` to avoid a slighly mis-pronounced usage of `path` package.	2021-02-08 10:15:12 -08:00
Harshavardhana	2e4d9124ad	honor region specified for remote targets (#11480 ) fixes #11472	2021-02-08 08:54:27 -08:00
Harshavardhana	6fef4c21b9	fix: align atomic variables for 32bit arch (#11475 ) fixes #11474	2021-02-08 08:51:12 -08:00
Poorna Krishnamoorthy	8e1bbd989a	replication:alloc UserDefined map before use (#11478 )	2021-02-07 22:01:10 -08:00
Sarasa Kisaragi	152d7cd95b	HDFS support keytab (#11473 )	2021-02-07 17:29:47 -08:00
Harshavardhana	0d057c777a	remove restriction for multi pool distribution algo	2021-02-06 16:19:05 -08:00
Anis Elleuch	275f7a63e8	lc: Apply DeleteAction correctly to objects (#11471 ) When lifecycle decides to Delete an object and not a version in a versioned bucket, the code should create a delete marker and not removing the scanned version. This commit fixes the issue.	2021-02-06 16:10:33 -08:00
Shireesh Anjal	97fe57bba9	Remove Connections from SysProcess struct (#11373 ) The connections info of the processes takes up a huge amount of space, and is not important for adding any useful health checks. Removing it will significantly reduce the size of the subnet health report.	2021-02-05 21:32:28 -08:00
Harshavardhana	88c1bb0720	fix: improper ticker usage in goroutines (#11468 ) - lock maintenance loop was incorrectly sleeping as well as using ticker badly, leading to extra expiration routines getting triggered that could flood the network. - multipart upload cleanup should be based on timer instead of ticker, to ensure that long running jobs don't get triggered twice. - make sure to get right lockers for object name	2021-02-05 19:23:48 -08:00
Harshavardhana	1fdafaf72f	fix: listing for directory object when delimiter is present (#11463 ) When you have heirarchy of prefixes with directory objects our current master would list directory objects as prefixes when delimiter is present, this is inconsistent with AWS S3 ``` aws s3api list-objects --endpoint-url http://localhost:9000 \ --profile minio --bucket testbucket-v --prefix new/ --delimiter / { "CommonPrefixes": [ { "Prefix": "new/" }, { "Prefix": "new/new/" } ] } ``` Instead this PR fixes this to behave like AWS S3 ``` aws s3api list-objects --endpoint-url http://localhost:9000 \ --profile minio --bucket testbucket-v --prefix new/ --delimiter / { "Contents": [ { "Key": "new/", "LastModified": "2021-02-05T06:27:42.660Z", "ETag": "\"d41d8cd98f00b204e9800998ecf8427e\"", "Size": 0, "StorageClass": "STANDARD", "Owner": { "DisplayName": "", "ID": "02d6176db174dc93cb1b899f7c6078f08654445fe8cf1b6ce98d8855f66bdbf4" } } ], "CommonPrefixes": [ { "Prefix": "new/new/" } ] } ```	2021-02-05 16:24:40 -08:00
Ritesh H Shukla	5fe4bb6b36	Reduce redundant crawler logging (#11448 )	2021-02-05 15:51:11 -08:00
Harshavardhana	99b733d44c	fix: deletion of delete marker regression (#11465 ) fixes #11440 fixes #11451 fixes #11454	2021-02-05 15:06:23 -08:00
Klaus Post	b4ac05523b	Add parallel bucket healing during startup (#11457 ) Replaces #11449 Does concurrent healing but limits concurrency to 50 buckets. Aborts on first error. `errgroup.Group` is extended to facilitate this in a generic way.	2021-02-05 13:04:26 -08:00
Anis Elleuch	c7eacba41c	health-info: Add tags to errors (#11412 ) We use multiple libraries in health info, but the returned error does not indicate exactly what library call is failing, hence adding named tags to returned errors whenever applicable.	2021-02-05 12:37:15 -08:00
Anis Elleuch	1887c25279	xl: Fix feeding NumVersions & SuccessorModTime to lifecycle (#11462 ) After recent refactor where lifecycle started to rely on ObjectInfo to make decisions, it turned out there are some issues calculating Successor Modtime and NumVersions, hence the lifecycle is not working as expected in a versioning bucket in some cases. This commit fixes the behavior.	2021-02-05 11:59:08 -08:00
Harshavardhana	c9b0f595b9	support directory objects in listing in certain scenarios (#11452 ) When a directory object is presented as a `prefix` param our implementation tend to only list objects present common to the `prefix` than the `prefix` itself, to mimic AWS S3 like flat key behavior this PR ensures that if `prefix` is directory object, it should be automatically considered to be part of the eventual listing result. fixes #11370	2021-02-05 10:12:25 -08:00
Harshavardhana	8bb580abfc	fix: use getObjectNInfo to avoid bytes.Buffer usage (#11428 ) few places were still using legacy call GetObject() which was mainly designed for client response writer, use GetObjectNInfo() for internal calls instead.	2021-02-05 09:57:30 -08:00
Harshavardhana	da55a05587	fix aggressive expiration detection (#11446 ) for some flaky networks this may be too fast of a value choose a defensive value, and let this be addressed properly in a new refactor of dsync with renewal logic. Also enable faster fallback delay to cater for misconfigured IPv6 servers refer - https://golang.org/pkg/net/#Dialer - https://tools.ietf.org/html/rfc6555	2021-02-04 16:56:40 -08:00
Harshavardhana	3fc4d6f620	update dependenices for relevant projects (#11445 ) - minio-go -> v7.0.8 - ldap/v3 -> v3.2.4 - reedsolomon -> v1.9.11 - sio-go -> v0.3.1 - msgp -> v1.1.5 - simdjson-go, md5-simd, highwayhash	2021-02-04 13:49:52 -08:00
Ritesh H Shukla	67a8f37df0	fix: disk usage capacity metric reporting (#11435 )	2021-02-04 12:26:58 -08:00
ArthurMa	df0c678167	fix: ldap config parsing issue for UserDNSearchFilter (#11437 )	2021-02-04 11:07:29 -08:00
Harshavardhana	f108873c48	fix: replication metadata comparsion and other fixes (#11410 ) - using miniogo.ObjectInfo.UserMetadata is not correct - using UserTags from Map->String() can change order - ContentType comparison needs to be removed. - Compare both lowercase and uppercase key names. - do not silently error out constructing PutObjectOptions if tag parsing fails - avoid notification for empty object info, failed operations should rely on valid objInfo for notification in all situations - optimize copyObject implementation, also introduce a new replication event - clone ObjectInfo() before scheduling for replication - add additional headers for comparison - remove strings.EqualFold comparison avoid unexpected bugs - fix pool based proxying with multiple pools - compare only specific metadata Co-authored-by: Poorna Krishnamoorthy <poornas@users.noreply.github.com>	2021-02-03 20:41:33 -08:00
Andreas Auernhammer	871b450dbd	crypto: add support for decrypting SSE-KMS metadata (#11415 ) This commit refactors the SSE implementation and add S3-compatible SSE-KMS context handling. SSE-KMS differs from SSE-S3 in two main aspects: 1. The client can request a particular key and specify a KMS context as part of the request. 2. The ETag of an SSE-KMS encrypted object is not the MD5 sum of the object content. This commit only focuses on the 1st aspect. A client can send an optional SSE context when using SSE-KMS. This context is remembered by the S3 server such that the client does not have to specify the context again (during multipart PUT / GET / HEAD ...). The crypto. context also includes the bucket/object name to prevent renaming objects at the backend. Now, AWS S3 behaves as following: - If the user does not provide a SSE-KMS context it does not store one - resp. does not include the SSE-KMS context header in the response (e.g. HEAD). - If the user specifies a SSE-KMS context without the bucket/object name then AWS stores the exact context the client provided but adds the bucket/object name internally. The response contains the KMS context without the bucket/object name. - If the user specifies a SSE-KMS context with the bucket/object name then AWS again stores the exact context provided by the client. The response contains the KMS context with the bucket/object name. This commit implements this behavior w.r.t. SSE-KMS. However, as of now, no such object can be created since the server rejects SSE-KMS encryption requests. This commit is one stepping stone for SSE-KMS support. Co-authored-by: Harshavardhana <harsha@minio.io>	2021-02-03 15:19:08 -08:00
Harshavardhana	f71e192343	avoid listing an empty dir without __XLDIR__ (#11427 ) ``` minio server /tmp/disk{1...4} mc mb myminio/testbucket/ mkdir -p /tmp/disk{1..4}/testbucket/test-prefix/ ``` This would end up being listed in the current master, this PR fixes this situation. If a directory is a leaf dir we should it being listed, since it cannot be deleted anymore with DeleteObject, DeleteObjects() API calls because we natively support directories now. Avoid listing it and let healing purge this folder eventually in the background.	2021-02-03 14:06:54 -08:00
Anis Elleuch	b3f81e75f6	xl: Make it clear when to create delete marker for a non existant object (#11423 )	2021-02-03 10:33:43 -08:00
Klaus Post	a71e0483c9	Fix nil disks in getOnlineDisksWithHealing (#11419 ) If a disk is skipped when nil it is still returned.	2021-02-02 17:04:37 -08:00
Klaus Post	4a9d9c8585	Update colinmarc/hdfs (#11417 ) Updates needed dependency as well. Fixes #11416	2021-02-02 15:37:30 -08:00
Harshavardhana	c885777ac6	Add support for TCP_QUICKACK (#11369 ) TCP_QUICKACK is a setting that allows TCP endpoints to acknowledge the receipt of data instantly in situations where they would normally wait to see if more data would be arriving. https://assets.extrahop.com/whitepapers/TCP-Optimization-Guide-by-ExtraHop.pdf	2021-02-02 09:44:18 -08:00
Poorna Krishnamoorthy	fe3aca70c3	Make number of replication workers configurable. (#11379 ) MINIO_API_REPLICATION_WORKERS env.var and `mc admin config set api` allow number of replication workers to be configurable. Defaults to half the number of cpus available. Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io>	2021-02-02 16:45:06 +05:30
Ritesh H Shukla	c4848f9b4f	Add process start time to cluster metrics. (#11405 )	2021-02-01 23:02:18 -08:00
Andreas Auernhammer	838d4dafbd	gateway: don't use encrypted ETags for If-Match (#11400 ) This commit fixes a bug in the S3 gateway that causes GET requests to fail when the object is encrypted by the gateway itself. The gateway was not able to GET the object since it always specified a `If-Match` pre-condition checking that the object ETag matches an expected ETag - even for encrypted ETags. The problem is that an encrypted ETag will never match the ETag computed by the backend causing the `If-Match` pre-condition to fail. This commit fixes this by not sending an `If-Match` header when the ETag is encrypted. This is acceptable because: 1. A gateway-encrypted object consists of two objects at the backend and there is no way to provide a concurrency-safe implementation of two consecutive S3 GETs in the deployment model of the S3 gateway. Ref: S3 gateways are self-contained and isolated - and there may be multiple instances at the same time (no lock across instances). 2. Even if the data object changes (concurrent PUT) while gateway A has download the metadata object (but not issued the GET to the data object => data race) then we don't return invalid data to the client since the decryption (of the currently uploaded data) will fail - given the metadata of the previous object.	2021-02-01 23:02:08 -08:00
Anis Elleuch	e96fdcd5ec	tagging: Add event notif for PUT object tagging (#11366 ) An optimization to avoid double calling for during PutObject tagging	2021-02-01 13:52:51 -08:00
Anis Elleuch	6ef678663e	xl: Create a delete-marker when no other version exists (#11362 ) Currently, it is not possible to create a delete-marker when xl.meta does not exist (no version is created for that object yet). This makes a problem for replication and mc mirroring with versioning enabled. This also follows S3 specification.	2021-02-01 13:23:50 -08:00
Harshavardhana	f737a027cf	fix: regression introduced in federated listing buckets regression was introduced in `6cd255d516` fix it properly.	2021-02-01 12:06:58 -08:00
Anis Elleuch	65aa2bc614	ilm: Remove object in HEAD/GET if having an applicable ILM rule (#11296 ) Remove an object on the fly if there is a lifecycle rule with delete expiry action for the corresponding object.	2021-02-01 09:52:11 -08:00
Andreas Auernhammer	33554651e9	crypto: deprecate native Hashicorp Vault support (#11352 ) This commit deprecates the native Hashicorp Vault support and removes the legacy Vault documentation. The native Hashicorp Vault documentation is marked as outdated and deprecated for over a year now. We give another 6 months before we start removing Hashicorp Vault support and show a deprecation warning when a MinIO server starts with a native Vault configuration.	2021-01-29 17:55:37 -08:00
Poorna Krishnamoorthy	c82aef0a56	fix ObjectInfo returned by CopyObject (#11377 ) erasure CopyObject was returning old metadata	2021-01-29 14:49:18 -08:00
Harshavardhana	1e53bf2789	fix: allow expansion with newer constraints for older setups (#11372 ) currently we had a restriction where older setups would need to follow previous style of "stripe" count being same expansion, we can relax that instead newer pools can be expanded for older setups with newer constraints of common parity ratio.	2021-01-29 11:40:55 -08:00
Ritesh H Shukla	c8489a8f0c	fix: log notification errors only once (#11350 )	2021-01-28 13:40:31 -08:00
Klaus Post	2680772d4b	Don't mark remotes online when shutting down (#11368 ) Shutting down will mark remotes online when the shutdown has started since the context is canceled. For example: ``` API: SYSTEM() Time: 16:21:31 CET 01/28/2021 DeploymentID: 313b0065-c5a1-4aa3-9233-07223e77a730 Error: Storage resources are insufficient for the write operation .minio.sys/tmp/ced455c4-3d27-4bdd-95fc-b4707a179b8a/fd934ef3-8fc8-4330-abc1-f039fbbb9700/part.1 (cmd.InsufficientWriteQuorum) 1: d:\minio\minio\cmd\data-usage.go:56:cmd.storeDataUsageInBackend() Exiting on signal: INTERRUPT Client http://127.0.0.1:9002/minio/lock/v5 online Client http://127.0.0.1:9002/minio/storage/data/distxl/s2/d3/v24 online Client http://127.0.0.1:9002/minio/storage/data/distxl/s2/d2/v24 online Client http://127.0.0.1:9002/minio/storage/data/distxl/s2/d1/v24 online Client http://127.0.0.1:9002/minio/peer/v12 online Client http://127.0.0.1:9002/minio/storage/data/distxl/s2/d4/v24 online ``` Use a fresh context for health checks.	2021-01-28 13:38:12 -08:00
Harshavardhana	567f7bdd05	fix: verify overlapping domains when > 1	2021-01-28 13:08:53 -08:00
Harshavardhana	6cd255d516	fix: allow updated domain names in federation (#11365 ) additionally also disallow overlapping domain names	2021-01-28 11:44:48 -08:00
Aditya Manthramurthy	e79829b5b3	Bind to lookup user after user auth to lookup ldap groups (#11357 )	2021-01-27 17:31:21 -08:00
Poorna Krishnamoorthy	fd3f02637a	fix: replication regression due to proxying requests (#11356 ) In PR #11165 due to incorrect proxying for 2 way replication even when the object was not yet replicated Additionally, fix metadata comparisons when deciding to do full replication vs metadata copy. fixes #11340	2021-01-27 11:22:34 -08:00
Harshavardhana	e019f21bda	fix: trigger heal if one of the parts are not found (#11358 ) Previously we added heal trigger when bit-rot checks failed, now extend that to support heal when parts are not found either. This healing gets only triggered if we can successfully decode the object i.e read quorum is still satisfied for the object.	2021-01-27 10:21:14 -08:00
Anis Elleuch	e9ac7b0fb7	heal: Remove empty directories (#11354 ) Since the introduction of __XLDIR__, an empty directory does not have a meaning anymore in erasure mode. Make healing removes it wherever it finds it.	2021-01-27 02:19:28 -08:00
Harshavardhana	1debd722b5	rename last remaining Zone->Pool	2021-01-26 20:47:42 -08:00
massintha azamoum	e7f6051f19	Send bucket name to peers when bucket notification is enabled (#11351 )	2021-01-26 13:48:28 -08:00
Harshavardhana	6717295e18	fix: rename audit log docs and datastructure	2021-01-26 13:39:55 -08:00
Anis Elleuch	00cff1aac5	audit: per object send pool number, set number and servers per operation (#11233 )	2021-01-26 13:21:51 -08:00
Harshavardhana	9722531817	fix: purge LDAP deprecated keys	2021-01-26 09:53:29 -08:00
Harshavardhana	5c6bfae4c7	fix: load credentials from etcd directly when possible (#11339 ) under large deployments loading credentials might be time consuming, while this is okay and we will not respond quickly for `mc admin user list` like queries but it is possible to support `mc admin user info` just like how we handle authentication by fetching the user directly from persistent store. additionally support service accounts properly, reloaded from etcd during watch() - this was missing This PR is also half way remedy for #11305	2021-01-25 20:01:49 -08:00
Aditya Manthramurthy	5f51ef0b40	Add LDAP Lookup-Bind mode (#11318 ) This change allows the MinIO server to be configured with a special (read-only) LDAP account to perform user DN lookups. The following configuration parameters are added (along with corresponding environment variables) to LDAP identity configuration (under `identity_ldap`): - lookup_bind_dn / MINIO_IDENTITY_LDAP_LOOKUP_BIND_DN - lookup_bind_password / MINIO_IDENTITY_LDAP_LOOKUP_BIND_PASSWORD - user_dn_search_base_dn / MINIO_IDENTITY_LDAP_USER_DN_SEARCH_BASE_DN - user_dn_search_filter / MINIO_IDENTITY_LDAP_USER_DN_SEARCH_FILTER This lookup-bind account is a service account that is used to lookup the user's DN from their username provided in the STS API. When configured, searching for the user DN is enabled and configuration of the base DN and filter for search is required. In this "lookup-bind" mode, the username format is not checked and must not be specified. This feature is to support Active Directory setups where the DN cannot be simply derived from the username. When the lookup-bind is not configured, the old behavior is enabled: the minio server performs LDAP lookups as the LDAP user making the STS API request and the username format is checked and configuring it is required.	2021-01-25 14:26:10 -08:00
Harshavardhana	7e266293e6	fix: notify bucket replication after replication/ilm (#11343 )	2021-01-25 14:04:41 -08:00
Harshavardhana	eb6871ecd9	fix: LoginSTS should be an inline implementation (#11337 ) STS tokens can be obtained by using local APIs once the remote JWT token is presented, current code was not validating the incoming token in the first place and was incorrectly making a network operation using that token. For the most part this always works without issues, but under adversarial scenarios it exposes client to hand-craft a request that can reach internal services without authentication. This kind of proxying should be avoided before validating the incoming token.	2021-01-25 10:15:03 -08:00
Harshavardhana	9cdd981ce7	fix: expire locks only on participating lockers (#11335 ) additionally also add a new ForceUnlock API, to allow forcibly unlocking locks if possible.	2021-01-25 10:01:27 -08:00
Anis Elleuch	bd8020aba8	heal: Decode object name in healing result (#11348 ) The user can see __XLDIR__ prefix in mc admin heal when the command heals an empty object with a trailing slash. This commit decodes the name of the object before sending it back to the upper level.	2021-01-25 09:53:37 -08:00
Harshavardhana	09bc49bd51	fix: healBucket across sets should capture results properly (#11341 ) healing `.minio.sys/config` returns incorrect quorum errors across sets, healing of the buckets.	2021-01-25 09:45:09 -08:00
Harshavardhana	82f0471d1b	honor maxWait heal config when maxIO hits (#11338 )	2021-01-25 07:53:12 -08:00
Harshavardhana	6a95f412c9	avoid double CORS headers in federation (#11334 ) CORS proxying adds double headers one by the receiving server, one by proxied server. Remove them before proxying when 'Origin' header is found.	2021-01-23 18:27:23 -08:00
Ritesh H Shukla	7575c24037	Add open FD and FD limit to cluster metrics (#11328 )	2021-01-22 18:30:16 -08:00
Harshavardhana	43f973c4cf	fix: check for O_DIRECT support for reads and writes (#11331 ) In-case user enables O_DIRECT for reads and backend does not support it we shall proceed to turn it off instead and print a warning. This validation avoids any unexpected downtimes that users may incur.	2021-01-22 15:38:21 -08:00
Harshavardhana	1b453728a3	initialize forwarder after init() to avoid crashes (#11330 ) DNSCache dialer is a global value initialized in init(), whereas `go` keeps `var =` before `init()` , also we don't need to keep proxy routers as global entities - register the forwarder as necessary to avoid crashes.	2021-01-22 15:37:41 -08:00
Harshavardhana	a6c146bd00	validate storage class across pools when setting config (#11320 ) ``` mc admin config set alias/ storage_class standard=EC:3 ``` should only succeed if parity ratio is valid for all server pools, if not we should fail proactively. This PR also needs to bring other changes now that we need to cater for variadic drive counts per pool. Bonus fixes also various bugs reproduced with - GetObjectWithPartNumber() - CopyObjectPartWithOffsets() - CopyObjectWithMetadata() - PutObjectPart,PutObject with truncated streams	2021-01-22 12:09:24 -08:00
Klaus Post	2167ba0111	Feed correct part number to sio (#11326 ) When offsets were specified we relied on the first part number to be correct. Recalculate based on offset.	2021-01-21 08:43:03 -08:00
Klaus Post	4e6d717f39	Compress profiling data (#11313 ) Trace data can be rather large and compresses fine. Compress profile data in zip files: ``` 277.895.314 before.profiles.zip 152.800.318 after.profiles.zip ```	2021-01-20 15:49:53 -08:00
Poorna Krishnamoorthy	845e251fa9	fix: crash in notificationsys when peers online is 0 (#11307 ) Check if the number of peers online > 0 before using peerClient	2021-01-20 13:13:05 -08:00
Harshavardhana	d1a8f0b786	fix possible crashes on deleteMarker replication (#11308 ) Delete marker can have `metaSys` set to nil, that can lead to crashes after the delete marker has been healed. Additionally also fix isObjectDangling check for transitioned objects, that do not have parts should be treated similar to Delete marker.	2021-01-20 13:12:12 -08:00
Klaus Post	dac19d7272	Clarify root disk error (#11314 ) Make it clearer what the problem is and how to resolve it.	2021-01-20 13:11:42 -08:00
Harshavardhana	7624c8b9bb	fix: honor storage class uniformity for multiple pools (#11309 )	2021-01-20 01:41:18 -08:00
Klaus Post	19fb1086b2	select: Fix leak on compressed files (#11302 ) Properly close gzip reader when done reading fixes #11300	2021-01-19 17:51:46 -08:00
Harshavardhana	a5e23a40ff	fix: allow delayed etcd updates to have fallbacks (#11151 ) fixes #11149	2021-01-19 10:05:41 -08:00
Harshavardhana	1ad2b7b699	fix: add stricter validation for erasure server pools (#11299 ) During expansion we need to validate if - new deployment is expanded with newer constraints - existing deployment is expanded with older constraints - multiple server pools rejected if they have different deploymentID and distribution algo	2021-01-19 10:01:31 -08:00
Harshavardhana	b5049d541f	fix: reduce an extra readdir() attempted on non-legacy setups (#11301 ) to verify moving content and preserving legacy content, we have way to detect the objects through readdir() this path is not necessary for most common cases on newer setups, avoid readdir() to save multiple system calls. also fix the CheckFile behavior for most common use case i.e without legacy format.	2021-01-19 10:01:06 -08:00
Harshavardhana	e0055609bb	fix: crawler to skip healing the drives in a set being healed (#11274 ) If an erasure set had a drive replacement recently, we don't need to attempt healing on another drive with in the same erasure set - this would ensure we do not double heal the same content and also prioritizes usage for such an erasure set to be calculated sooner.	2021-01-19 02:40:52 -08:00
Klaus Post	e8ce348da1	crypto: Escape JSON text (#10794 ) Escape the JSON keys+values from the context. We do not add the HTML escapes, since that is an extra escape level not mandatory for JSON.	2021-01-19 01:39:04 -08:00
Ritesh H Shukla	b4add82bb6	Updated Prometheus metrics (#11141 ) * Add metrics for nodes online and offline * Add cluster capacity metrics * Introduce v2 metrics	2021-01-18 20:35:38 -08:00
Harshavardhana	3ca6330661	fix: optimize parentDirIsObject by moving isObject to storage layer (#11291 ) For objects with `N` prefix depth, this PR reduces `N` such network operations by converting `CheckFile` into a single bulk operation. Reduction in chattiness here would allow disks to be utilized more cleanly, while maintaining the same functionality along with one extra volume check stat() call is removed. Update tests to test multiple sets scenario	2021-01-18 12:25:22 -08:00
Aditya Manthramurthy	3163a660aa	Fix support for multiple LDAP user formats (#11276 ) Fixes support for using multiple base DNs for user search in the LDAP directory allowing users from different subtrees in the LDAP hierarchy to request credentials. - The username in the produced credentials is now the full DN of the LDAP user to disambiguate users in different base DNs.	2021-01-17 21:54:32 -08:00
Harshavardhana	0dadfd1b3d	fix: do not compute usage for not found lifecycle operations (#11288 ) Currently we would proceed to apply incorrect lifecycle policies for non-existent objects, this PR handles them appropriately.	2021-01-17 13:58:41 -08:00
Harshavardhana	4315f93421	fix: make sure parentDirIsObject is used at set level (#11280 ) parentDirIsObject is not using set level understanding to check for parent objects, without this it can lead to objects that can actually reside on a separate set as objects and would conflict.	2021-01-17 01:11:48 -08:00
Harshavardhana	ddb5d7043a	fix: standard storage class is allowed to be '0'	2021-01-16 17:32:25 -08:00
Harshavardhana	f903cae6ff	Support variable server pools (#11256 ) Current implementation requires server pools to have same erasure stripe sizes, to facilitate same SLA and expectations. This PR allows server pools to be variadic, i.e they do not have to be same erasure stripe sizes - instead they should have SLA for parity ratio. If the parity ratio cannot be guaranteed by the new server pool, the deployment is rejected i.e server pool expansion is not allowed.	2021-01-16 12:08:02 -08:00
Poorna Krishnamoorthy	7090bcc8e0	fix: doc links and delete replication permissions enforcement (#11285 )	2021-01-15 15:22:55 -08:00
Harshavardhana	c222bde14b	fix: use common logging implementation for DNSCache (#11284 )	2021-01-15 14:04:56 -08:00
Poorna Krishnamoorthy	feaf8dfb9a	Fix replication status reported on completion (#11273 ) Fixes: #11272	2021-01-13 11:52:28 -08:00
Harshavardhana	628ef081d1	fix: preserve cache calculated previously while moving from v2 to v3 (#11269 ) This ensures that all the prometheus monitoring and usage trackers to avoid alerts configured, although we cannot support v1 to v2 here - we can v2 to v3.	2021-01-13 09:58:08 -08:00
Harshavardhana	44dff36ff7	listing with prefix prefixed with '/' should be ignored (#11268 ) fixes #11265	2021-01-13 09:44:11 -08:00
Poorna Krishnamoorthy	b97d53b29c	fix remote target healthcheck (#11267 )	2021-01-12 20:48:04 -08:00
Harshavardhana	1a5775e2e8	enable small and large file optimization (#11260 ) - for large objects we found that 1MiB block for r/w respectively. - for small objects we found that 128KiB block for r/w respectively.	2021-01-12 10:20:39 -08:00
Anis Elleuch	e2579b1f5a	azure: Use default upload parameters to avoid consuming too much memory (#11251 ) A lot of memory is consumed when uploading small files in parallel, use the default upload parameters and add MINIO_AZURE_UPLOAD_CONCURRENCY for users to tweak.	2021-01-11 22:48:09 -08:00
Poorna Krishnamoorthy	7824e19d20	Allow synchronous replication if enabled. (#11165 ) Synchronous replication can be enabled by setting the --sync flag while adding a remote replication target. This PR also adds proxying on GET/HEAD to another node in a active-active replication setup in the event of a 404 on the current node.	2021-01-11 22:36:51 -08:00
Harshavardhana	317305d5f9	fix: regression in adding new replication targets (#11257 )	2021-01-11 09:08:42 -08:00
Harshavardhana	e4e117faab	fix: enable xl.json to xl.meta only if legacy drive is found (#11255 ) another optimization is renameLegacyMetadata() never needs to validate bucket with os.Stat() again, leading to reduction in one extra syscall.	2021-01-11 02:27:04 -08:00
Klaus Post	51dad1d130	Fix missing GetObjectNInfo Closure (#11243 ) Review for missing Close of returned value from `GetObjectNInfo`. This was often obscured by the stuff that auto-unlocks when reaching EOF.	2021-01-08 10:12:26 -08:00
Harshavardhana	4593b146be	fix: print errors only when metacache status has errors (#11248 )	2021-01-08 16:52:19 +05:30
Harshavardhana	f21d650ed4	fix: readData in bulk call using messagepack byte wrappers (#11228 ) This PR refactors the way we use buffers for O_DIRECT and to re-use those buffers for messagepack reader writer. After some extensive benchmarking found that not all objects have this benefit, and only objects smaller than 64KiB see this benefit overall. Benefits are seen from almost all objects from 1KiB - 32KiB Beyond this no objects see benefit with bulk call approach as the latency of bytes sent over the wire v/s streaming content directly from disk negate each other with no remarkable benefits. All other optimizations include reuse of msgp.Reader, msgp.Writer using sync.Pool's for all internode calls.	2021-01-07 19:27:31 -08:00
Harshavardhana	a4f6705874	expire stale locks when owner is down (#11247 ) fixes #11246	2021-01-07 19:16:18 -08:00
Poorna Krishnamoorthy	b35b537e3f	Pass versionID to checkReplicateDelete in web handler (#11244 )	2021-01-07 15:28:27 -08:00
Harshavardhana	5c52d5ffc7	fix: treat errVolumeNotFound as EOF error in listPathRaw (#11238 )	2021-01-07 09:52:53 -08:00
Harshavardhana	f0808bb2e5	fix: getObject fd leaks in transition and replication code (#11237 )	2021-01-06 16:13:10 -08:00
Harshavardhana	a6dee21092	initialize IAM store before Init() to avoid any crash (#11236 )	2021-01-06 13:40:20 -08:00
Anis Elleuch	6f781c5e7a	heal: Reduce whitespace ticker to 5 seconds (#11234 ) 30 seconds white spaces is long for some setups which time out when no read activity in short time, reduce the subnet health white space ticker to 5 seconds, since it has no cost at all.	2021-01-06 13:29:50 -08:00
Harshavardhana	f8ca859790	fix: server/gateway banner formatting (#11230 )	2021-01-06 10:38:07 -08:00
Harshavardhana	76e2713ffe	fix: use buffers only when necessary for io.Copy() (#11229 ) Use separate sync.Pool for writes/reads Avoid passing buffers for io.CopyBuffer() if the writer or reader implement io.WriteTo or io.ReadFrom respectively then its useless for sync.Pool to allocate buffers on its own since that will be completely ignored by the io.CopyBuffer Go implementation. Improve this wherever we see this to be optimal. This allows us to be more efficient on memory usage. ``` 385 // copyBuffer is the actual implementation of Copy and CopyBuffer. 386 // if buf is nil, one is allocated. 387 func copyBuffer(dst Writer, src Reader, buf []byte) (written int64, err error) { 388 // If the reader has a WriteTo method, use it to do the copy. 389 // Avoids an allocation and a copy. 390 if wt, ok := src.(WriterTo); ok { 391 return wt.WriteTo(dst) 392 } 393 // Similarly, if the writer has a ReadFrom method, use it to do the copy. 394 if rt, ok := dst.(ReaderFrom); ok { 395 return rt.ReadFrom(src) 396 } ``` From readahead package ``` // WriteTo writes data to w until there's no more data to write or when an error occurs. // The return value n is the number of bytes written. // Any error encountered during the write is also returned. func (a *reader) WriteTo(w io.Writer) (n int64, err error) { if a.err != nil { return 0, a.err } n = 0 for { err = a.fill() if err != nil { return n, err } n2, err := w.Write(a.cur.buffer()) a.cur.inc(n2) n += int64(n2) if err != nil { return n, err } ```	2021-01-06 09:36:55 -08:00
Harshavardhana	b5d291ea88	fix: rename remaining zone -> pool (#11231 )	2021-01-06 09:35:47 -08:00
Klaus Post	eb9172eecb	Allow Compression + encryption (#11103 )	2021-01-05 20:08:35 -08:00
Poorna Krishnamoorthy	64bddf47d8	Pass deletemarker correctly to replicate opts (#11227 ) fixes: #11180	2021-01-05 14:12:37 -08:00
Harshavardhana	4ed45ce543	fix: healing buckets during pool expansion (#11224 ) fixes #11209	2021-01-05 13:24:22 -08:00
Klaus Post	ad511b0eb8	tests: Fix occasional data race (#11223 ) CI tests could trigger a data race. Servers are generally not expected to reinitialize, so tests could trigger data races when reinitializing and async operations are running. We add the option to safely reset global vars instead of overwriting. Fixes races like: ``` WARNING: DATA RACE Read at 0x00000477ab18 by goroutine 1159: github.com/minio/minio/cmd.FileInfo.ToObjectInfo() /home/runner/work/minio/minio/cmd/erasure-metadata.go:105 +0x16d github.com/minio/minio/cmd.erasureObjects.putObject() /home/runner/work/minio/minio/cmd/erasure-object.go:748 +0x13f8 github.com/minio/minio/cmd.(erasureObjects).listPath.func3.2() /home/runner/work/minio/minio/cmd/metacache-set.go:682 +0x7d3 github.com/minio/minio/cmd.newMetacacheBlockWriter.func1.2() /home/runner/work/minio/minio/cmd/metacache-stream.go:777 +0x1c4 github.com/minio/minio/cmd.newMetacacheBlockWriter.func1() /home/runner/work/minio/minio/cmd/metacache-stream.go:806 +0x614 Previous write at 0x00000477ab18 by goroutine 1269: [failed to restore the stack] Goroutine 1159 (running) created at: github.com/minio/minio/cmd.newMetacacheBlockWriter() /home/runner/work/minio/minio/cmd/metacache-stream.go:760 +0x112 github.com/minio/minio/cmd.(erasureObjects).listPath.func3() /home/runner/work/minio/minio/cmd/metacache-set.go:672 +0xe22 Goroutine 1269 (running) created at: testing.(T).Run() /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1095 +0x537 testing.runTests.func1() /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1339 +0xa6 testing.tRunner() /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1050 +0x1eb testing.runTests() /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1337 +0x594 testing.(M).Run() /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1252 +0x2ff github.com/minio/minio/cmd.TestMain() /home/runner/work/minio/minio/cmd/test-utils_test.go:120 +0x44e main.main() _testmain.go:1408 +0x223 ================== ================== WARNING: DATA RACE Read at 0x00000477aae8 by goroutine 1159: github.com/minio/minio/cmd.(BucketVersioningSys).Enabled() /home/runner/work/minio/minio/cmd/bucket-versioning.go:26 +0x52 github.com/minio/minio/cmd.FileInfo.ToObjectInfo() /home/runner/work/minio/minio/cmd/erasure-metadata.go:105 +0x197 github.com/minio/minio/cmd.erasureObjects.putObject() /home/runner/work/minio/minio/cmd/erasure-object.go:748 +0x13f8 github.com/minio/minio/cmd.(erasureObjects).listPath.func3.2() /home/runner/work/minio/minio/cmd/metacache-set.go:682 +0x7d3 github.com/minio/minio/cmd.newMetacacheBlockWriter.func1.2() /home/runner/work/minio/minio/cmd/metacache-stream.go:777 +0x1c4 github.com/minio/minio/cmd.newMetacacheBlockWriter.func1() /home/runner/work/minio/minio/cmd/metacache-stream.go:806 +0x614 Previous write at 0x00000477aae8 by goroutine 1269: [failed to restore the stack] Goroutine 1159 (running) created at: github.com/minio/minio/cmd.newMetacacheBlockWriter() /home/runner/work/minio/minio/cmd/metacache-stream.go:760 +0x112 github.com/minio/minio/cmd.(erasureObjects).listPath.func3() /home/runner/work/minio/minio/cmd/metacache-set.go:672 +0xe22 Goroutine 1269 (running) created at: testing.(T).Run() /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1095 +0x537 testing.runTests.func1() /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1339 +0xa6 testing.tRunner() /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1050 +0x1eb testing.runTests() /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1337 +0x594 testing.(*M).Run() /opt/hostedtoolcache/go/1.14.13/x64/src/testing/testing.go:1252 +0x2ff github.com/minio/minio/cmd.TestMain() /home/runner/work/minio/minio/cmd/test-utils_test.go:120 +0x44e main.main() _testmain.go:1408 +0x223 ================== ```	2021-01-05 10:45:26 -08:00
Harshavardhana	cb0eaeaad8	feat: migrate to ROOT_USER/PASSWORD from ACCESS/SECRET_KEY (#11185 )	2021-01-05 10:22:57 -08:00
Harshavardhana	d0027c3c41	do not use large buffers if not necessary (#11220 ) without this change, there is a performance regression for small objects GETs, this makes the overall speed to go back to pre '59d363' commit days.	2021-01-04 18:51:52 -08:00
Anis Elleuch	cb7fc99368	handlers: Avoid initializing a struct in each handler call (#11217 )	2021-01-04 09:54:22 -08:00
Harshavardhana	a4383051d9	remove/deprecate crawler disable environment (#11214 ) with changes present to automatically throttle crawler at runtime, there is no need to have an environment value to disable crawling. crawling is a fundamental piece for healing, lifecycle and many other features there is no good reason anyone would need to disable this on a production system. * Apply suggestions from code review	2021-01-04 09:43:31 -08:00
Harshavardhana	e7ae49f9c9	fix: calculate prometheus disks_offline/disks_total correctly (#11215 ) fixes #11196	2021-01-04 09:42:09 -08:00
Anis Elleuch	153d4be032	tracing: NumSubscribers() to use atomic instead of mutex (#11219 ) globalSubscribers.NumSubscribers() is heavily used in S3 requests and it uses mutex, use atomic.Load instead since it is faster Co-authored-by: Anis Elleuch <anis@min.io>	2021-01-04 09:40:30 -08:00
Anis Elleuch	dfd99b6d8f	handlers: Little bit more optimizations (#11211 )	2021-01-04 00:01:06 -08:00
Harshavardhana	c4b1d394d6	erasure: avoid io.Copy in hotpaths to reduce allocation (#11213 )	2021-01-03 16:27:34 -08:00
Harshavardhana	c4131c2798	feat: Small object optimization read data in single bulk call (#11207 )	2021-01-03 11:27:57 -08:00
Anis Elleuch	c9d502e6fa	parentDirIsObject() to return quickly with inexistant parent (#11204 ) Rewrite parentIsObject() function. Currently if a client uploads a/b/c/d, we always check if c, b, a are actual objects or not. The new code will check with the reverse order and quickly quit if the segment doesn't exist. So if a, b, c in 'a/b/c' does not exist in the first place, then returns false quickly.	2021-01-02 12:01:29 -08:00
Anis Elleuch	677e80c0f8	xl: Remove check-dir in ReadVersion (#11200 ) The only purpose of check-dir flag in ReadVersion is to return 404 when an object has xl.meta but without data. This is causing an extract call to the disk which can be penalizing in case of busy system where disks receive many concurrent access.	2021-01-02 10:35:57 -08:00
Harshavardhana	aa85af4d1a	fix: missing CopyObjectPart maxClients reorder	2021-01-01 23:07:37 -08:00
Anis Elleuch	ae731d232f	trace: Reorder http/trace maxClients wrapping for correct tracing (#11202 ) mc admin trace does not show the correct handler name in the output: it is printing `maxClients' for all handlers. The reason is that the wrong order of handler wrapping.	2021-01-01 23:06:07 -08:00
Anis Elleuch	a317d220ed	xl-storage: Do not stat bucket assuming the object exists (#11201 ) In HEAD/GET, only STAT the bucket if the object does not exist to return the correct error response.	2021-01-01 09:44:36 -08:00
Harshavardhana	3e1221a01c	fix: log once updating dataUsageCache versions (#11190 ) also reduce usage of *bytes.Buffer for reading `usage-cache.bin`	2020-12-31 09:45:09 -08:00
Ritesh H Shukla	36fc2f98ed	fix: admin trace throttled requests (#11192 )	2020-12-30 21:04:55 -08:00
Ritesh H Shukla	556524c715	Reduce logging when peer is offline (#11184 )	2020-12-30 14:38:54 -08:00
Harshavardhana	cc457f1798	fix: enhance logging in crawler use console.Debug instead of logger.Info (#11179 )	2020-12-29 01:57:28 -08:00
Harshavardhana	ca0d31b09a	fix: re-arrange handlers to handle requests on /minio (#11177 ) fixes #11175	2020-12-28 17:10:33 -08:00
Harshavardhana	445a9bd827	fix: heal optimizations in crawler to avoid multiple healing attempts (#11173 ) Fixes two problems - Double healing when bitrot is enabled, instead heal attempt once in applyActions() before lifecycle is applied. - If applyActions() is successful and getSize() returns proper value, then object is accounted for and should be removed from the oldCache namespace map to avoid double heal attempts.	2020-12-28 10:31:00 -08:00
Harshavardhana	d8d25a308f	fix: use HealObject for cleaning up dangling objects (#11171 ) main reason is that HealObjects starts a recursive listing for each object, this can be a really really long time on large namespaces instead avoid recursive listing just perform HealObject() instead at the prefix. delete's already handle purging dangling content, we don't need to achieve this by doing recursive listing, this in-turn can delay crawling significantly.	2020-12-27 15:42:20 -08:00
Harshavardhana	c19e6ce773	avoid a crash in crawler when lifecycle is not initialized (#11170 ) Bonus for static buffers use bytes.NewReader instead of bytes.NewBuffer, to use a more reader friendly implementation	2020-12-26 22:58:06 -08:00
Harshavardhana	59d3639396	fix: inherit heal opts globally, including bitrot settings (#11166 ) Bonus re-use ReadFileStream internal io.Copy buffers, fixes lots of chatty allocations when reading metacache readers with many sustained concurrent listing operations ``` 17.30GB 1.27% 84.80% 35.26GB 2.58% io.copyBuffer ```	2020-12-24 23:04:03 -08:00
Harshavardhana	027e17468a	fix: discarding results do not attempt in-memory metacache writer (#11163 ) Optimizations include - do not write the metacache block if the size of the block is '0' and it is the first block - where listing is attempted for a transient prefix, this helps to avoid creating lots of empty metacache entries for `minioMetaBucket` - avoid the entire initialization sequence of cacheCh , metacacheBlockWriter if we are simply going to skip them when discardResults is set to true. - No need to hold write locks while writing metacache blocks - each block is unique, per bucket, per prefix and also is written by a single node.	2020-12-24 15:02:02 -08:00
Harshavardhana	45ea161f8d	webUI: change listing to 1000 keys from browser UI (#11159 ) gateway implementations do not handle maxKeys being `-1` properly unlike MinIO implementation, handle it by setting an appropriate value. fixes #11158	2020-12-23 19:58:15 -08:00
Harshavardhana	6a66f142d4	fix: strict quorum in list should list on all drives (#11157 ) current implementation was incorrect, it in-fact assumed only read quorum number of disks. in-fact that value is only meant for read quorum good entries from all online disks. This PR fixes this behavior properly.	2020-12-23 09:26:40 -08:00
Harshavardhana	5982965839	fix: re-use bytes.Buffer using sync.Pool (#11156 )	2020-12-22 23:22:37 -08:00
Harshavardhana	8565cefe4e	fix: allow HTTP2.0 to be always configured	2020-12-22 16:32:58 -08:00
Andreas Auernhammer	8cdf2106b0	refactor cmd/crypto code for SSE handling and parsing (#11045 ) This commit refactors the code in `cmd/crypto` and separates SSE-S3, SSE-C and SSE-KMS. This commit should not cause any behavior change except for: - `IsRequested(http.Header)` which now returns the requested type {SSE-C, SSE-S3, SSE-KMS} and does not consider SSE-C copy headers. However, SSE-C copy headers alone are anyway not valid.	2020-12-22 09:19:32 -08:00
Harshavardhana	35fafb837b	fix: issues with handling delete markers in metacache (#11150 ) Additional cases handled - fix address situations where healing is not triggered on failed writes and deletes. - consider object exists during listing when metadata can be successfully decoded.	2020-12-22 09:16:43 -08:00
Harshavardhana	274bbad5cb	fix: select always online peers for remote listing (#11153 ) always find the right set of online peers for remote listing, this may have an effect on listing if the server is down - we should do this to avoid always performing transient operations on bucket->peerClient that is permanently or down for a long period.	2020-12-22 09:16:07 -08:00
Harshavardhana	5c451d1690	update x/net/http2 to address few bugs (#11144 ) additionally also configure http2 healthcheck values to quickly detect unstable connections and let them timeout. also use single transport for proxying requests	2020-12-21 21:42:38 -08:00
Poorna Krishnamoorthy	c987313431	Encrypt remote target if kms is configured (#11034 ) Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io>	2020-12-21 16:21:33 -08:00
Anis Elleuch	2ecaab55a6	admin: ServerInfo returns info without object layer initialized (#11142 )	2020-12-21 09:35:19 -08:00
Harshavardhana	3e792ae2a2	fix: change defaults for DNS cache dialer (#11145 )	2020-12-21 09:33:29 -08:00
Harshavardhana	4cc500a041	normalize users with double // in accessKeys (#11143 ) Bonus fix, use constant time compare for secret keys in web-handlers.go:SetAuth()	2020-12-20 10:09:51 -08:00
Harshavardhana	d8e28830cf	fix: allow STS creds for admin accounts to add users (#11138 ) Allow rotating creds with privileges to add users fixes https://github.com/minio/console/issues/529	2020-12-19 13:24:21 -08:00
Harshavardhana	3e16ec457a	fix: support user/groups with '/' character (#11127 ) NOTE: user/groups with `//` shall be normalized to `/` fixes #11126	2020-12-19 09:36:37 -08:00
Harshavardhana	e5d378931d	fix: delimiter based listing was broken without marker (#11136 ) with missing nextMarker with delimiter based listing, top level prefixes beyond 4500 or max-keys value wouldn't be sent back for client to ask for the next batch. reproduced at a customer deployment, create prefixes as shown below ``` for year in $(seq 2017 2020) do for month in {01..12} do for day in {01..31} do mc -q cp file myminio/testbucket/dir/day_id=$year-$month-$day/; done done done ``` Then perform ``` aws s3api --profile minio --endpoint-url http://localhost:9000 list-objects \ --bucket testbucket --prefix dir/ --delimiter / --max-keys 1000 ``` You shall see missing NextMarker, this would disallow listing beyond max-keys requested and also disallow beyond 4500 (maxKeyObjectList) prefixes being listed because client wouldn't know the NextMarker available. This PR addresses this situation properly by making the implementation more spec compatible. i.e NextMarker in-fact can be either an object, a prefix with delimiter depending on the input operation. This issue was introduced after the list caching changes and has been present for a while.	2020-12-19 09:36:04 -08:00
Anis Elleuch	e63a10e505	Profiling does not required object layer to be initialized (#11133 )	2020-12-18 11:51:15 -08:00
Anis Elleuch	5434088c51	replication: Ensure to always use nano precision source modtime (#11135 )	2020-12-18 11:37:28 -08:00
Harshavardhana	a773cf48d8	fix: overlapping object and prefix rejected (#11130 ) fixes #11129	2020-12-18 08:51:09 -08:00
Harshavardhana	f714840da7	add _MINIO_SERVER_DEBUG env for enabling debug messages (#11128 )	2020-12-17 16:52:47 -08:00
Harshavardhana	7c9ef76f66	fix: timer deadlock on expired timers (#11124 ) issue was introduced in #11106 the following pattern <-t.C // timer fired if !t.Stop() { <-t.C // timer hangs } Seems to hang at the last `t.C` line, this issue happens because a fired timer cannot be Stopped() anymore and t.Stop() returns `false` leading to confusing state of usage. Refactor the code such that use timers appropriately with exact requirements in place.	2020-12-17 12:35:02 -08:00
Anis Elleuch	cffdb01279	azure/s3 gateways: Pass ETag during GET call to avoid data corruption (#11024 ) Both Azure & S3 gateways call for object information before returning the stream of the object, however, the object content/length could be modified meanwhile, which means it can return a corrupted object. Use ETag to ensure that the object was not modified during the GET call	2020-12-17 09:11:14 -08:00
Harshavardhana	b390a2a0b9	fix: reuser timers in erasure set hotpaths (#11106 ) reuser timers in - connectDisks() monitoring - healMRFRoutine() channel timeouts	2020-12-16 14:33:05 -08:00
Harshavardhana	90158f1e33	fix: avoid logging for Heal APIs in FS mode (#11121 ) fixes #11120	2020-12-16 09:46:13 -08:00
Harshavardhana	c606c76323	fix: prioritized latest buckets for crawler to finish the scans faster (#11115 ) crawler should only ListBuckets once not for each serverPool, buckets are same across all pools, across sets and ListBuckets always returns an unified view, once list buckets returns sort it by create time to scan the latest buckets earlier with the assumption that latest buckets would have lesser content than older buckets allowing them to be scanned faster and also to be able to provide more closer to latest view.	2020-12-15 17:34:54 -08:00
Klaus Post	e7d3b49a20	metacache: Make very small requests transient (#11109 )	2020-12-15 11:25:36 -08:00
Harshavardhana	5df61ab96b	fix: remove gorilla/rpc/ deps fully after our fork (#11108 )	2020-12-15 11:18:06 -08:00
Poorna Krishnamoorthy	3456b03b12	Ignore ObjectNotFound errors in delete api while enforcing locking (#11114 ) AWS does not report this or version not found as errors in the response.	2020-12-15 11:15:49 -08:00
Klaus Post	f6fb27e8f0	Don't copy interesting ids, clean up logging (#11102 ) When searching the caches don't copy the ids, instead inline the loop. ``` Benchmark_bucketMetacache_findCache-32 19200 63490 ns/op 8303 B/op 5 allocs/op Benchmark_bucketMetacache_findCache-32 20338 58609 ns/op 111 B/op 4 allocs/op ``` Add a reasonable, but still the simplistic benchmark. Bonus - make nicer zero alloc logging	2020-12-14 13:13:33 -08:00
Harshavardhana	8368ab76aa	fix: remove the requirement for healing buckets in ListBucketsHeal (#11098 ) With new refactor of bucket healing, healing bucket happens automatically including its metadata, there is no need to redundant heal buckets also in ListBucketsHeal remove it.	2020-12-14 12:07:07 -08:00
Harshavardhana	3e83643320	lifecycle improvements and additional debug logging (#11096 ) Bonus change fix browser assets	2020-12-13 12:05:54 -08:00
Harshavardhana	2eb52ca5f4	fix: heal bucket metadata right before healing bucket (#11097 ) optimization mainly to avoid listing the entire `.minio.sys/buckets/.minio.sys` directory, this can get really huge and comes in the way of startup routines, contents inside `.minio.sys/buckets/.minio.sys` are rather transient and not necessary to be healed.	2020-12-13 11:57:08 -08:00
Anis Elleuch	f164085227	xl: Always set root disk to true in test environment (#11094 ) Tests environments (go test or manual testing) should always consider the passed disks are root disks and should not rely on disk.IsRootDisk() function. The reason is that this latter can return a false negative when called in a busy system. However, returning a false negative will only occur in a testing environment and not in a production, so we can accept this trade-off for now.	2020-12-12 16:10:07 -08:00
Harshavardhana	48191dd748	return NoSuchVersion if invalid version-id is specified (#11091 )	2020-12-11 20:44:08 -08:00
Anis Elleuch	c4f29d24da	metacache: Ask all disks when drive count is 4 (#11087 )	2020-12-11 17:54:31 -08:00
Harshavardhana	db7890660e	fix: a crash when disk is nil, safe access on erasureDisks (#11089 ) fixes #11088	2020-12-11 16:58:36 -08:00
Poorna Krishnamoorthy	9adc33efbb	Return version-id header in DeleteObject response (#11090 ) even when the object version is non-existent To make this consistent with aws behavior. Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io>	2020-12-11 16:58:15 -08:00
Poorna Krishnamoorthy	8f65aba04b	ignore NoSuchVersion error in DeleteObjects API (#11086 ) Currently, the error response reports NoSuchVersion for a non-existent version-id, whereas AWS ignores it.	2020-12-11 12:39:09 -08:00
Harshavardhana	3a0082f0f1	fix: TTFB prometheus metrics calculation (#11082 ) until now metrics was reporting entire call duration instead of ttfb's this PR fixes it	2020-12-10 23:02:25 -08:00
Klaus Post	4bca62a0bd	crawler: Stream bucket usage cache data (#11068 ) Stream bucket caches to storage and through RPC calls.	2020-12-10 13:03:22 -08:00
Klaus Post	82e2be4239	metacache: Speed up cleanup operation (#11078 ) Perform cleanup operations on copied data. Avoids read locking data while determining which caches to keep. Also, reduce the log(NN) operation to log(NM) where M caches with the same root or below when checking potential replacements.	2020-12-10 12:30:28 -08:00
Harshavardhana	4550ac6fff	fix: refactor locks to apply them uniquely per node (#11052 ) This refactor is done for few reasons below - to avoid deadlocks in scenarios when number of nodes are smaller < actual erasure stripe count where in N participating local lockers can lead to deadlocks across systems. - avoids expiry routines to run 1000 of separate network operations and routes per disk where as each of them are still accessing one single local entity. - it is ideal to have since globalLockServer per instance. - In a 32node deployment however, each server group is still concentrated towards the same set of lockers that partipicate during the write/read phase, unlike previous minio/dsync implementation - this potentially avoids send 32 requests instead we will still send at max requests of unique nodes participating in a write/read phase. - reduces overall chattiness on smaller setups.	2020-12-10 07:28:37 -08:00
Klaus Post	e65ed2e44f	listcache: Add path index (#11063 ) Add a root path index. ``` Before: Benchmark_bucketMetacache_findCache-32 10000 730737 ns/op With excluded prints: Benchmark_bucketMetacache_findCache-32 10000 207100 ns/op With the root path: Benchmark_bucketMetacache_findCache-32 705765 1943 ns/op ``` Benchmark used (not linear): ```Go func Benchmark_bucketMetacache_findCache(b *testing.B) { bm := newBucketMetacache("", false) for i := 0; i < b.N; i++ { bm.findCache(listPathOptions{ ID: mustGetUUID(), Bucket: "", BaseDir: "prefix/" + mustGetUUID(), Prefix: "", FilterPrefix: "", Marker: "", Limit: 0, AskDisks: 0, Recursive: false, Separator: slashSeparator, Create: true, CurrentCycle: 0, OldestCycle: 0, }) } } ``` Replaces #11058	2020-12-09 08:37:43 -08:00
Anis Elleuch	d90044b847	federation: Redirect Lifecycle PUT request by bucket name (#11062 ) The bucket forwarder handler considers MakeBucket to be always local but it mistakenly thinks that PUT bucket lifecycle to be a MakeBucket call. Fix the check of the MakeBucket call by ensuring that the query is empty in the PUT url.	2020-12-09 07:25:26 -08:00
Harshavardhana	d8c1f93de6	reject mixed drive situations with drives on root disks (#11057 ) till now we used to match the inode number of the root drive and the drive path minio would use, if they match we knew that its a root disk. this may not be true in all situations such as running inside a container environment where the container might be mounted from a different partition altogether, root disk detection might fail.	2020-12-09 00:27:02 -08:00
Anis Elleuch	a51488cbaa	s3: Fix reading GET with partNumber specified (#11032 ) partNumber was miscalculting the start and end of parts when partNumber query is specified in the GET request. This commit fixes it and also fixes the ContentRange header in that case.	2020-12-08 13:12:42 -08:00
Harshavardhana	dc819afa44	fix: auto update crawler meta version PR `038bcd9079` introduced version '3', we need to make sure that we do not print an unexpected error instead log a message to indicate we will auto update the version.	2020-12-08 10:40:51 -08:00
Harshavardhana	4a564336fe	Revert "Add metrics for nodes online and offline (#11050 )" This reverts commit `f60bbdf86b`.	2020-12-08 09:23:35 -08:00
Ritesh H Shukla	f60bbdf86b	Add metrics for nodes online and offline (#11050 )	2020-12-08 01:06:27 -08:00
Poorna Krishnamoorthy	f3beb1236a	Add cache usage, total capacity to prometheus metrics (#11026 )	2020-12-07 16:35:11 -08:00
Poorna Krishnamoorthy	934bed47fa	Add transition event notification (#11047 ) This is a MinIO specific extension to allow monitoring of transition events.	2020-12-07 13:53:28 -08:00
Ritesh H Shukla	038bcd9079	Add replication capacity metrics support in crawler (#10786 )	2020-12-07 13:47:48 -08:00
Harshavardhana	ce93b2681b	fix: re-use er.getDisks() properly in certain calls (#11043 )	2020-12-07 10:04:07 -08:00
Harshavardhana	8d036ed6d8	fix: allow sub-admin to modify password for other users (#11039 ) fixes #11037	2020-12-06 20:36:34 -08:00
Harshavardhana	9c53cc1b83	fix: heal multiple buckets in bulk (#11029 ) makes server startup, orders of magnitude faster with large number of buckets	2020-12-05 13:00:44 -08:00
Harshavardhana	3514e89eb3	support envs as well for new crawler sub-system (#11033 )	2020-12-04 21:54:24 -08:00
Klaus Post	a896125490	Add crawler delay config + dynamic config values (#11018 )	2020-12-04 09:32:35 -08:00
Harshavardhana	e083471ec4	use argon2 with sync.Pool for better memory management (#11019 )	2020-12-03 19:23:19 -08:00
Harshavardhana	80d31113e5	fix: etcd import paths again depend on v3.4.14 release (#11020 ) Due to botched upstream renames of project repositories and incomplete migration to go.mod support, our current dependency version of `go.mod` had bugs i.e it was using commits from master branch which didn't have the required fixes present in release-3.4 branches which leads to some rare bugs https://github.com/etcd-io/etcd/pull/11477 provides a workaround for now and we should migrate to this. release-3.5 eventually claims to fix all of this properly until then we cannot use /v3 import right now	2020-12-03 11:35:18 -08:00
Ritesh H Shukla	7e2b79984e	Stream bucket bandwidth measurements (#11014 )	2020-12-03 11:34:42 -08:00
Harshavardhana	951b6b203b	skip metacache entries healing to speed up startup	2020-12-02 21:30:54 -08:00
Harshavardhana	44e23b7f4f	fix: startup being slow - wait only if IOCount > 0	2020-12-02 21:06:17 -08:00
Harshavardhana	96c0ce1f0c	add support for tuning healing to make healing more aggressive (#11003 ) supports `mc admin config set <alias> heal sleep=100ms` to enable more aggressive healing under certain times. also optimize some areas that were doing extra checks than necessary when bitrotscan was enabled, avoid double sleeps make healing more predictable. fixes #10497	2020-12-02 11:12:00 -08:00
ebozduman	303be1866d	Adds "x-amz-usr-agent" and "x-id" params to be used in authentication of presignedURL (#10792 )	2020-12-02 02:02:49 -08:00
Harshavardhana	4ec45753e6	rename server sets to server pools	2020-12-01 13:50:33 -08:00
Klaus Post	e6ea5c2703	crawler: Missing folder heal check per set (#10876 )	2020-12-01 12:07:39 -08:00
Harshavardhana	790833f3b2	Revert "Support variable server sets (#10314 )" This reverts commit `aabf053d2f`.	2020-12-01 12:02:29 -08:00
Harshavardhana	7cbca43eb1	fix: allow admins to create users (#11005 ) PR #10978 introduced a regression, root credential should be allowed to create users	2020-11-30 21:53:23 -08:00
Poorna Krishnamoorthy	2f564437ae	Disallow writeback caching with cache_after (#11002 ) fixes #10974	2020-11-30 20:53:27 -08:00
Harshavardhana	bdd094bc39	fix: avoid sending errors on missing objects on locked buckets (#10994 ) make sure multi-object delete returned errors that are AWS S3 compatible	2020-11-28 21:15:45 -08:00
Harshavardhana	e6fa410778	fix: allow accountInfo, addUser and getUserInfo implicit (#10978 ) - accountInfo API that returns information about user, access to buckets and the size per bucket - addUser - user is allowed to change their secretKey - getUserInfo - returns user info if the incoming is the same user requesting their information	2020-11-27 17:23:57 -08:00
Harshavardhana	aabf053d2f	Support variable server sets (#10314 )	2020-11-25 16:28:47 -08:00
Anis Elleuch	91130e884b	Avoid sending errors in gob in storage requests (#10977 )	2020-11-25 12:42:48 -08:00
Poorna Krishnamoorthy	2ff655a745	Refactor replication, ILM handling in DELETE API (#10945 )	2020-11-25 11:24:50 -08:00
Klaus Post	0422eda6a2	metacache: Always close block writer (#10973 ) In some cases a writer could be left behind unclosed, leaking compression blocks. Always close and set compression concurrency to 2 which should be fine to keep up.	2020-11-25 09:37:30 -08:00
Harshavardhana	31e6f60847	fix: improve error handling in metacache (#10965 )	2020-11-25 01:11:22 -08:00
Poorna Krishnamoorthy	3ad41fe89d	Add admin API to edit remote bucket target credentials (#10848 )	2020-11-24 19:09:05 -08:00
Klaus Post	a75fafdbe2	Remove msgp workaround (#10964 ) The error in `github.com/philhofer/fwd` was quickly fixed through https://github.com/philhofer/fwd/pull/22 - update the dependency and remove the workaround.	2020-11-24 11:58:10 -08:00
Klaus Post	a58b7874ef	Temporary workaround for msgp skipping (#10960 ) Due to https://github.com/philhofer/fwd/issues/20 when skipping a metadata entry that is >2048 bytes and the buffer is full (2048 bytes) the skip will fail with `io.ErrNoProgress`. Enlarge the buffer so we temporarily make this much more unlikely. If it still happens we will have to rewrite the skips to reads. Fixes #10959	2020-11-23 18:51:59 -08:00
Harshavardhana	6990de9c94	fix: dangling object delete shall return object doesn't exist (#10961 ) dangling object when deleted means object doesn't exist anymore, so we should return appropriate errors, this allows crawler heal to ensure that it removes the tracker for dangling objects.	2020-11-23 18:50:53 -08:00
Anis Elleuch	75a8e81f8f	azure: Specify different Azure storage in the shell env (#10943 ) AZURE_STORAGE_ACCOUNT and AZURE_STORAGE_KEY are used in azure CLI to specify the azure blob storage access & secret keys. With this commit, it is possible to set them if you want the gateway's own credentials to be different from the Azure blob credentials. Co-authored-by: Harshavardhana <harsha@minio.io>	2020-11-23 16:45:56 -08:00
Harshavardhana	519c0077a9	fix: do not return an error for successfully deleted dangling objects (#10938 ) dangling objects when removed `mc admin heal -r` or crawler auto heal would incorrectly return error - this can interfere with usage calculation as the entry size for this would be returned as `0`, instead upon success use the resultant object size to calculate the final size for the object and avoid reporting this in the log messages Also do not set ObjectSize in healResultItem to be '-1' this has an effect on crawler metrics calculating 1 byte less for objects which seem to be missing their `xl.meta`	2020-11-23 09:12:17 -08:00
Harshavardhana	734d07a532	fix: all hosts local and port same should be local erasure setup (#10951 ) this is needed to avoid initializing notification peers that can lead to races in many sub-systems fixes #10950	2020-11-23 09:07:50 -08:00
Harshavardhana	df93102235	fix: unwrapping issues with os.Is* functions (#10949 ) reduces 3 stat calls, reducing the overall startup time significantly.	2020-11-23 08:36:49 -08:00
Poorna Krishnamoorthy	39f3d5493b	Show Delete replication status header (#10946 ) X-Minio-Replication-Delete-Status header shows the status of the replication of a permanent delete of a version. All GETs are disallowed and return 405 on this object version. In the case of replicating delete markers. X-Minio-Replication-DeleteMarker-Status shows the status of replication, and would similarly return 405. Additionally, this PR adds reporting of delete marker event completion and updates documentation	2020-11-21 23:48:50 -08:00
Klaus Post	692ff41ef7	Unwrap network errors (#10934 ) Alternative to #10927 Instead of having an upstream fix, do unwrap when checking network errors. 'As' will also work when destination is an interface as checked by the tests.	2020-11-20 22:55:35 -08:00
Harshavardhana	86409fa93d	add audit/admin trace support for browser requests (#10947 ) To support this functionality we had to fork the gorilla/rpc package with relevant changes	2020-11-20 22:52:17 -08:00
Shireesh Anjal	7bc47a14cc	Rename OBD to Health (#10842 ) Also, Remove thread stats and openfds from the health report as we already have process stats and numfds	2020-11-20 12:52:53 -08:00
Harshavardhana	73e308079a	fix: handle errors appropriately as they are wrapped (#10917 )	2020-11-20 10:43:07 -08:00
Poorna Krishnamoorthy	08b24620c0	Display storage-class of transitioned object in HEAD	2020-11-20 09:17:31 -08:00
Harshavardhana	95675b0c9a	fix: do not crash PutObjectTags when node is down (#10940 ) fixes #10939	2020-11-20 09:10:48 -08:00
Poorna Krishnamoorthy	251c1ef6da	Add support for replication of object tags, retention metadata (#10880 )	2020-11-19 18:56:09 -08:00
Poorna Krishnamoorthy	0fa430c1da	validate service type of target in replication/ilm transition config (#10928 )	2020-11-19 18:47:33 -08:00
Poorna Krishnamoorthy	f60b6eb82e	fix validation for deletemarker replication on object locked bucket (#10892 )	2020-11-19 18:47:19 -08:00
Poorna Krishnamoorthy	1ebf6f146a	Add support for ILM transition (#10565 ) This PR adds transition support for ILM to transition data to another MinIO target represented by a storage class ARN. Subsequent GET or HEAD for that object will be streamed from the transition tier. If PostRestoreObject API is invoked, the transitioned object can be restored for duration specified to the source cluster.	2020-11-19 18:47:17 -08:00
Harshavardhana	8f7fe0405e	fix: delete marker replication should support directories (#10878 ) allow directories to be replicated as well, along with their delete markers in replication. Bonus fix to fix bloom filter updates for directories to be preserved.	2020-11-19 18:47:12 -08:00
Harshavardhana	9a34fd5c4a	Revert "Revert "Add delete marker replication support (#10396 )"" This reverts commit `267d7bf0a9`.	2020-11-19 18:43:58 -08:00
Harshavardhana	f794fe79e3	fix: network shutdown was not handle properly (#10927 ) fixes a regression introduced in #10859, due to the error returned by rest.Client being typed i.e *rest.NetworkError - IsNetworkHostDown function didn't work as expected to detect network issues. This in-turn aggravated the situations when nodes are disconnected leading to performance loss.	2020-11-19 13:53:49 -08:00
Harshavardhana	0f9e125cf3	fix: check for gateway backend online without http request (#10924 ) fixes #10921	2020-11-19 10:38:02 -08:00
Harshavardhana	d778d9493f	remove MinIO release tag as part of HTTP Server string (#10929 )	2020-11-19 09:16:02 -08:00
Harshavardhana	70d2c2ccc9	skip files that are not erasure objects or directories (#10926 ) without this change WalkDir reports errors while trying to read `format.json/xl.meta` which is a replicated file	2020-11-19 09:15:09 -08:00
Harshavardhana	9dea7020f0	allow prefix filtering for WalkDir to be optional (#10923 )	2020-11-18 12:03:16 -08:00
Klaus Post	990d074f7d	metacache: Allow prefix filtering (#10920 ) Do listings with prefix filter when bloom filter is dirty. This will forward the prefix filter to the lister which will make it only scan the folders/objects with the specified prefix. If we have a clean bloom filter we try to build a more generally useful cache so in that case, we will list all objects/folders.	2020-11-18 10:44:18 -08:00
Klaus Post	e413f05397	Save listing error async (#10922 ) Since the RPC call may have to time out save an error state async to not hold up the listing returning. Fixes #10919	2020-11-18 10:28:22 -08:00
Harshavardhana	d1b1fee080	fix: save healing tracker right before healing (#10915 ) this change avoids a situation where accidentally if the user deleted the healing tracker or drives were replaced again within the 10sec window.	2020-11-18 09:34:46 -08:00
Harshavardhana	9738d605e4	increase readdir per block memory to facilitate faster WalkDir (#10908 )	2020-11-18 09:21:02 -08:00
Klaus Post	10099357b6	listcache: Wrap returned errors (#10882 ) To give an indication of where they happen	2020-11-17 09:11:59 -08:00
Harshavardhana	80b8ce89a4	remove context deadline from Delete calls (#10901 )	2020-11-17 09:09:45 -08:00
Poorna Krishnamoorthy	0b766288ef	fix: send replication completed event notification (#10902 )	2020-11-15 22:16:41 -08:00
Rafael Bodill	598ca0569c	fix: global in-place update boolean check (#10900 )	2020-11-15 13:34:12 -08:00
Poorna Krishnamoorthy	d295ce5708	Fix disk cache usage percent for prometheus (#10898 ) Fixes: #10895 Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io>	2020-11-14 19:18:00 -08:00
Klaus Post	b5a3d79bce	listobjectversions: Add shortcut for Veeam blocks (#10893 ) Add shortcut for `APN/1.0 Veeam/1.0 Backup/10.0` It requests unique blocks with a specific prefix. We skip scanning the parent directory for more objects matching the prefix.	2020-11-13 16:58:20 -08:00
Harshavardhana	17a5ff51ff	fix: move context timeout closer to network for Delete calls (#10897 ) allowing for disconnects to be limited to the drive themselves instead of disconnecting all drives.	2020-11-13 16:56:45 -08:00
Harshavardhana	0bcb1b679d	fix: disallow update if dates are same (#10890 ) fixes #10889	2020-11-12 14:18:59 -08:00
Klaus Post	a3017c724e	Sort directory objects correctly (#10886 ) Decode dir objects when listing and sort them correctly.	2020-11-12 13:09:34 -08:00
Harshavardhana	267d7bf0a9	Revert "Add delete marker replication support (#10396 )" This reverts commit `50c10a5087`. PR is moved to origin/dev branch	2020-11-12 11:43:14 -08:00
cksac	be83dfc52a	fix: HDFS list bucket when subpath is provided (#10884 )	2020-11-12 11:26:51 -08:00
Harshavardhana	ca88ca753c	ignore typed errors correctly in list cache layer (#10879 ) bonus write bucket metadata cache with enough quorum Possible fix for #10868	2020-11-12 09:28:56 -08:00
Klaus Post	f86d3538f6	Allow deeper sleep (#10883 ) Allow each crawler operation to sleep up to 10 seconds on very heavily loaded systems. This will of course make minimum crawler speed less, but should be more effective at stopping.	2020-11-12 09:17:56 -08:00
Klaus Post	1c3590078d	Skip 0 byte stream writes (#10875 ) Don't send a packet when receiving 0 bytes or there is an error recorded	2020-11-11 18:07:40 -08:00
Harshavardhana	aa158228f9	fix: simplify healing metadata objects per set (#10867 )	2020-11-11 10:58:16 -08:00
Klaus Post	8747834c69	DeletedObjects: Return objects on lock failure (#10874 ) Return objects when locking fails. <details> <summary>Panic</summary> ``` : 2020/11/10 04:15:55 http: panic serving 10.10.62.153:44858: runtime error: index out of range [0] with length 0 : goroutine 363537270 [running]: : net/http.(conn).serve.func1(0xc019232780) : net/http/server.go:1801 +0x147 : panic(0x1cadd60, 0xc001719260) : runtime/panic.go:975 +0x47a : github.com/minio/minio/cmd.criticalErrorHandler.ServeHTTP.func1(0xc0121d1200, 0x210cda0, 0xc0141940e0) : github.com/minio/minio/cmd/generic-handlers.go:781 +0x1a8 : panic(0x1cadd60, 0xc001719260) : runtime/panic.go:969 +0x1b9 : github.com/minio/minio/cmd.objectAPIHandlers.DeleteMultipleObjectsHandler(0x1e71ce8, 0x1e71cc8, 0x2108420, 0xc0192328c0, 0xc0121d1400) : github.com/minio/minio/cmd/bucket-handlers.go:465 +0x2490 : net/http.HandlerFunc.ServeHTTP(...) : net/http/server.go:2042 : github.com/minio/minio/cmd.httpTraceAll.func1(0x2108420, 0xc0192328c0, 0xc0121d1400) : github.com/minio/minio/cmd/handler-utils.go:353 +0x158 : net/http.HandlerFunc.ServeHTTP(...) : net/http/server.go:2042 : github.com/minio/minio/cmd.collectAPIStats.func1(0x2108420, 0xc019232820, 0xc0121d1400) : github.com/minio/minio/cmd/handler-utils.go:380 +0xed : net/http.HandlerFunc.ServeHTTP(...) : net/http/server.go:2042 : github.com/minio/minio/cmd.maxClients.func1(0x2108420, 0xc019232820, 0xc0121d1400) : github.com/minio/minio/cmd/handler-api.go:132 +0x33b : net/http.HandlerFunc.ServeHTTP(0xc00271d590, 0x2108420, 0xc019232820, 0xc0121d1400) : net/http/server.go:2042 +0x44 : github.com/minio/minio/cmd.redirectHandler.ServeHTTP(0x20e2180, 0xc00271d590, 0x2108420, 0xc019232820, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:192 +0x156 : github.com/minio/minio/cmd.customHeaderHandler.ServeHTTP(0x20e1060, 0xc0141a22b0, 0x21083e0, 0xc01814d2e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:751 +0x162 : github.com/minio/minio/cmd.securityHeaderHandler.ServeHTTP(0x20e0fc0, 0xc0141a22c0, 0x21083e0, 0xc01814d2e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:766 +0x1d6 : github.com/minio/minio/cmd.bucketForwardingHandler.ServeHTTP(0xc0121c7a40, 0x20e1120, 0xc0141a22d0, 0x21083e0, 0xc01814d2e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:624 +0xbf : github.com/minio/minio/cmd.requestValidityHandler.ServeHTTP(0x20e0f20, 0xc01814d280, 0x21083e0, 0xc01814d2e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:608 +0x42a : github.com/minio/minio/cmd.httpStatsHandler.ServeHTTP(0x20e10c0, 0xc0141a2300, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:536 +0xe4 : github.com/minio/minio/cmd.requestSizeLimitHandler.ServeHTTP(0x20e0fe0, 0xc0141a2310, 0x50004000000, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:68 +0xd4 : github.com/minio/minio/cmd.requestHeaderSizeLimitHandler.ServeHTTP(0x20e10a0, 0xc01814d2a0, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:93 +0x1b7 : github.com/minio/minio/cmd.crossDomainPolicy.ServeHTTP(0x20e1080, 0xc0141a2320, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/crossdomain-xml-handler.go:51 +0x82 : github.com/minio/minio/cmd.browserRedirectHandler.ServeHTTP(0x20e0fa0, 0xc0141a2330, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:276 +0x68 : github.com/minio/minio/cmd.minioReservedBucketHandler.ServeHTTP(0x20e0f00, 0xc0141a2340, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:344 +0xb8 : github.com/minio/minio/cmd.cacheControlHandler.ServeHTTP(0x20e1020, 0xc0141a2350, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:303 +0x1ce : github.com/minio/minio/cmd.timeValidityHandler.ServeHTTP(0x20e0f40, 0xc0141a2360, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:414 +0x3ca : github.com/minio/minio/cmd.resourceHandler.ServeHTTP(0x20e1160, 0xc0141a2370, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:516 +0xab : github.com/minio/minio/cmd.authHandler.ServeHTTP(0x20e1100, 0xc0141a2380, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/auth-handler.go:502 +0x2e7 : github.com/minio/minio/cmd.sseTLSHandler.ServeHTTP(0x20e0ee0, 0xc0141a2390, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:802 +0x79 : github.com/minio/minio/cmd.reservedMetadataHandler.ServeHTTP(0x20e1140, 0xc0141a23a0, 0x210cda0, 0xc0141940e0, 0xc0121d1400) : github.com/minio/minio/cmd/generic-handlers.go:139 +0x1b7 : github.com/gorilla/mux.(Router).ServeHTTP(0xc00073fb00, 0x210cda0, 0xc0141940e0, 0xc0121d1200) : github.com/gorilla/mux@v1.8.0/mux.go:210 +0xd3 : github.com/rs/cors.(Cors).Handler.func1(0x210cda0, 0xc0141940e0, 0xc0121d1200) : github.com/rs/cors@v1.7.0/cors.go:219 +0x1b9 : net/http.HandlerFunc.ServeHTTP(0xc0009aece0, 0x210cda0, 0xc0141940e0, 0xc0121d1200) : net/http/server.go:2042 +0x44 : github.com/minio/minio/cmd.criticalErrorHandler.ServeHTTP(0x20e2180, 0xc0009aece0, 0x210cda0, 0xc0141940e0, 0xc0121d1200) : github.com/minio/minio/cmd/generic-handlers.go:784 +0x85 : github.com/minio/minio/cmd/http.(Server).Start.func1(0x210cda0, 0xc0141940e0, 0xc0121d1200) : github.com/minio/minio/cmd/http/server.go:101 +0x258 : net/http.HandlerFunc.ServeHTTP(0xc000dc4080, 0x210cda0, 0xc0141940e0, 0xc0121d1200) : net/http/server.go:2042 +0x44 : net/http.serverHandler.ServeHTTP(0xc000764c60, 0x210cda0, 0xc0141940e0, 0xc0121d1200) : net/http/server.go:2843 +0xa3 : net/http.(conn).serve(0xc019232780, 0x2114720, 0xc03381f6c0) : net/http/server.go:1925 +0x8ad : created by net/http.(Server).Serve : net/http/server.go:2969 +0x36c ``` </details>	2020-11-11 09:14:32 -08:00
Poorna Krishnamoorthy	50c10a5087	Add delete marker replication support (#10396 ) Delete marker replication is implemented for V2 configuration specified in AWS spec (though AWS allows it only in the V1 configuration). This PR also brings in a MinIO only extension of replicating permanent deletes, i.e. deletes specifying version id are replicated to target cluster.	2020-11-10 15:24:14 -08:00
Steven Reitsma	4683a623dc	fix: negative STS IAM token TTL value (#10866 )	2020-11-10 12:24:01 -08:00
Klaus Post	06899210a7	Reduce health check output (#10859 ) This will make the health check clients 'silent'. Use `IsNetworkOrHostDown` determine if network is ok so it mimics the functionality in the actual client.	2020-11-10 09:28:23 -08:00
Harshavardhana	cbdab62c1e	fix: heal user/metadata right away upon server startup (#10863 ) this is needed such that we make sure to heal the users, policies and bucket metadata right away as we do listing based on list cache which only lists '3' sufficiently good drives, to avoid possibly losing access to these users upon upgrade make sure to heal them.	2020-11-10 09:02:06 -08:00
Harshavardhana	8df6112204	fix: avoid divide by zero error single node distributed setup (#10862 )	2020-11-09 20:40:39 -08:00
Harshavardhana	97692bc772	re-route requests if IAM is not initialized (#10850 )	2020-11-07 21:03:06 -08:00
Steven Reitsma	54120107ce	fix: infinite loop in cleanupStaleUploads of encrypted MPUs (#10845 ) fixes #10588	2020-11-06 11:53:42 -08:00
Klaus Post	9bf5990ea9	metadata: Invalidate cache if unreadable and not updating (#10844 ) If a scanning server shuts down unexpectedly we may have "successful" caches that are incomplete on a set. In this case mark the cache with an error so it will no longer be handed out.	2020-11-06 08:54:09 -08:00
Steven Reitsma	74f7cf24ae	fix: s3 gateway SSE pagination (#10840 ) Fixes #10838	2020-11-05 15:04:03 -08:00
Harshavardhana	fb28aa847b	fix: add missing deleted key element in multiObjectDelete (#10839 ) fixes #10832	2020-11-05 12:47:46 -08:00
Klaus Post	0724205f35	metacache: Add option for life extension (#10837 ) Add `MINIO_API_EXTEND_LIST_CACHE_LIFE` that will extend the life of generated caches for a while. This changes caches to remain valid until no updates have been received for the specified time plus a fixed margin. This also changes the caches from being invalidated when the first set finishes until the last set has finished plus the specified time has passed.	2020-11-05 11:49:56 -08:00
Harshavardhana	b72cac4cf3	fix: dangling objects on actual namespace (#10822 )	2020-11-05 11:48:55 -08:00
Klaus Post	bd77f29fc4	Don't replace caches that are receiving updates (#10834 ) Keep caches while they are receiving updates. Move update code to separate function.	2020-11-05 07:34:08 -08:00
Klaus Post	d1e1205036	metacache: Always close the s2 writer (#10836 ) The s2 writer could be leaked if there was an error. Make sure it is always closed.	2020-11-05 07:30:14 -08:00
Harshavardhana	71753e21e0	add missing TTL for STS credentials on etcd (#10828 )	2020-11-04 13:06:05 -08:00
Harshavardhana	fde3299bf3	re-use optimized readdir for isDirEmpty() (#10829 ) reduces effective memory usage by an order of magnitude, also increases performance for small objects	2020-11-04 13:05:21 -08:00
Harshavardhana	1a1f00fa15	fix: use internode data for DisksInfo, VolsInfo in message pack (#10821 ) Similar to #10775 for fewer memory allocations, since we use getOnlineDisks() extensively for listing we should optimize it further. Additionally, remove all unused walkers from the storage layer	2020-11-04 10:10:54 -08:00
Bill Thorp	4a1efabda4	Context based AccessKey passing (#10615 ) A new field called AccessKey is added to the ReqInfo struct and populated. Because ReqInfo is added to the context, this allows the AccessKey to be accessed from 3rd-party code, such as a custom ObjectLayer. Co-authored-by: Harshavardhana <harsha@minio.io> Co-authored-by: Kaloyan Raev <kaloyan@storj.io>	2020-11-04 09:13:34 -08:00
Klaus Post	3b88a646ec	Add remote online/offline information (#10825 ) Log information about remote clients being marked offline. This will help to identify root causes of failures.	2020-11-04 08:27:32 -08:00
Klaus Post	2294e53a0b	Don't retain context in locker (#10515 ) Use the context for internal timeouts, but disconnect it from outgoing calls so we always receive the results and cancel it remotely.	2020-11-04 08:25:42 -08:00
Klaus Post	f0819cce75	Keep transient lists while they are updating (#10826 ) On extremely long running listings keep the transient list 15 minutes after last update instead of using start time. Also don't do overlap checks on transient lists.	2020-11-04 08:01:33 -08:00
Klaus Post	1e11b4629f	Add remote Diskinfo caching (#10824 ) Add 1 second remote disk info cache. Should decrease need for remote calls a great deal due to how actively it is used now.	2020-11-04 08:00:18 -08:00
Harshavardhana	5c72a34fa8	fix: honor delimiter as per AWS S3 spec (#10823 )	2020-11-04 07:56:58 -08:00
Klaus Post	b9277c8030	metacache: Add trashcan (#10820 ) Add trashcan that keeps recently updated lists after bucket deletion. All caches were deleted once a bucket was deleted, so caches still running would report errors. Now they are canceled. Fix `.minio.sys` not being transient.	2020-11-03 12:47:52 -08:00
Harshavardhana	8c76e1353e	initialize IAM after etcd has initialized (#10819 )	2020-11-03 12:12:30 -08:00
Harshavardhana	ad382799b1	use list cache for Walk() with webUI and quota (#10814 ) bring list cache optimizations for web UI object listing, also FIFO quota enforcement through list cache as well.	2020-11-03 08:53:48 -08:00
Harshavardhana	68de5a6f6a	fix: IAM store fallback to list users and policies from disk (#10787 ) Bonus fixes, remove package retry it is harder to get it right, also manage context remove it such that we don't have to rely on it anymore instead use a simple Jitter retry.	2020-11-02 17:52:13 -08:00
Harshavardhana	4ea31da889	fix: move list quorum ENV to config (#10804 )	2020-11-02 17:21:56 -08:00
Klaus Post	0a796505c1	metacache: Check only one disk for updates (#10809 ) Check only one disk for updates. This will reduce IO while waiting for lists to finish.	2020-11-02 17:20:27 -08:00
Klaus Post	37749f4623	Optimize FileInfo(Version) transfer (#10775 ) File Info decoding, in particular, is showing up as a major allocator and time consumer for internode data transfers Switch to message pack for cross-server transfers: ``` MSGP: Size: 945 bytes BenchmarkEncodeFileInfoMsgp-32 1558444 866 ns/op 1.16 MB/s 0 B/op 0 allocs/op BenchmarkDecodeFileInfoMsgp-32 479968 2487 ns/op 0.40 MB/s 848 B/op 18 allocs/op GOB: Size: 1409 bytes BenchmarkEncodeFileInfoGOB-32 333339 3237 ns/op 0.31 MB/s 576 B/op 19 allocs/op BenchmarkDecodeFileInfoGOB-32 20869 57837 ns/op 0.02 MB/s 16439 B/op 428 allocs/op ```	2020-11-02 17:07:52 -08:00
Klaus Post	86e0d272f3	Reduce WriteAll allocs (#10810 ) WriteAll saw 127GB allocs in a 5 minute timeframe for 4MiB buffers used by `io.CopyBuffer` even if they are pooled. Since all writers appear to write byte buffers, just send those instead and write directly. The files are opened through the `os` package so they have no special properties anyway. This removes the alloc and copy for each operation. REST sends content length so a precise alloc can be made.	2020-11-02 16:14:31 -08:00
Harshavardhana	8527f22df1	optimize request URL encoding for internode (#10811 ) this reduces allocations in order of magnitude Also, revert "erasure: delete dangling objects automatically (#10765)" affects list caching should be investigated.	2020-11-02 15:15:12 -08:00
Anis Elleuch	b456292295	erasure: delete dangling objects automatically (#10765 )	2020-11-02 10:49:30 -08:00
Poorna Krishnamoorthy	03fdbc3ec2	Add async caching commit option in diskcache (#10742 ) Add store and a forward option for a single part uploads when an async mode is enabled with env MINIO_CACHE_COMMIT=writeback It defaults to `writethrough` if unspecified.	2020-11-02 10:00:45 -08:00
Harshavardhana	4c773f7068	re-use remote transports in Peer,Storage,Locker clients (#10788 ) use one transport for internode communication	2020-11-02 07:43:11 -08:00
Harshavardhana	5412d730c1	simplify monitoring doesn't need to be canceled (#10803 ) connect disks monitoring doesn't need to be canceled upon drive replacement, since we only need to replace the newly replaced drive.	2020-10-31 14:10:12 -07:00
Klaus Post	fe9f23e632	Recreate bucket metacache if corrupted (#10800 ) If bucket metadata cannot be read, clean up existing and create a new.	2020-10-31 10:26:16 -07:00
Klaus Post	422898d9b3	Clean up metadata cache when deleting bucket (#10802 ) Metadata caches were left behind when deleting a bucket.	2020-10-31 09:46:18 -07:00
Harshavardhana	b686bb9c83	fix: replaced drive properly by healing the entire drive (#10799 ) Bonus fixes, we do not need reload format anymore as the replaced drive is healed locally we only need to ensure that drive heal reloads the drive properly. We preserve the UUID of the original order, this means that the replacement in `format.json` doesn't mean that the drive needs to be reloaded into memory anymore. fixes #10791	2020-10-31 01:34:48 -07:00
Harshavardhana	5e5cdc581d	remove unnecessary logging and move to log once (#10798 ) the current master logs way too much when a node is down, instead log once and move on.	2020-10-30 14:55:50 -07:00
Harshavardhana	02cfa774be	allow requests to be proxied when server is booting up (#10790 ) when server is booting up there is a possibility that users might see '503' because object layer when not initialized, then the request is proxied to neighboring peers first one which is online.	2020-10-30 12:20:28 -07:00
Krishna Srinivas	3a2f89b3c0	fix: add support for O_DIRECT reads for erasure backends (#10718 )	2020-10-30 11:04:29 -07:00
Klaus Post	6135f072d2	Fix invalidated metacaches (#10784 ) * Fix caches having EOF marked as a failure. * Simplify cache updates. * Provide context for checkMetacacheState failures. * Log 499 when the client disconnects.	2020-10-30 09:33:16 -07:00
Klaus Post	e63a44b734	rest client: Expect context timeouts for locks (#10782 ) Add option for rest clients to not mark a remote offline for context timeouts. This can be used if context timeouts are expected on the call.	2020-10-29 09:52:11 -07:00
Klaus Post	6b14c4ab1e	Optimize decryptObjectInfo (#10726 ) `decryptObjectInfo` is a significant bottleneck when listing objects. Reduce the allocations for a significant speedup. https://github.com/minio/sio/pull/40 ``` λ benchcmp before.txt after.txt benchmark old ns/op new ns/op delta Benchmark_decryptObjectInfo-32 24260928 808656 -96.67% benchmark old MB/s new MB/s speedup Benchmark_decryptObjectInfo-32 0.04 1.24 31.00x benchmark old allocs new allocs delta Benchmark_decryptObjectInfo-32 75112 48996 -34.77% benchmark old bytes new bytes delta Benchmark_decryptObjectInfo-32 287694772 4228076 -98.53% ```	2020-10-29 09:34:20 -07:00
Harshavardhana	4bf90ca67f	fix: handle a crash when AskDisks is set to -1 (#10777 )	2020-10-29 09:25:43 -07:00
Harshavardhana	e0655e24f2	fix: A possible crash when fi.Erasure.Distribution is empty (#10779 )	2020-10-28 19:24:01 -07:00
Klaus Post	bfc36aed89	Add update retry limit and compare error by string instead (#10776 )	2020-10-28 13:19:53 -07:00
Kaloyan Raev	be7f67268d	fix: Do not cleanup range files in cache SaveMetadata when total hits are false (#10728 )	2020-10-28 09:23:17 -07:00
Klaus Post	a982baff27	ListObjects Metadata Caching (#10648 ) Design: https://gist.github.com/klauspost/025c09b48ed4a1293c917cecfabdf21c Gist of improvements: * Cross-server caching and listing will use the same data across servers and requests. * Lists can be arbitrarily resumed at a constant speed. * Metadata for all files scanned is stored for streaming retrieval. * The existing bloom filters controlled by the crawler is used for validating caches. * Concurrent requests for the same data (or parts of it) will not spawn additional walkers. * Listing a subdirectory of an existing recursive cache will use the cache. * All listing operations are fully streamable so the number of objects in a bucket no longer dictates the amount of memory. * Listings can be handled by any server within the cluster. * Caches are cleaned up when out of date or superseded by a more recent one.	2020-10-28 09:18:35 -07:00
Krishna Srinivas	f53c5a020e	fix: heal object shards with ec.index and ec.distribution mismatches (#10773 ) Co-authored-by: Harshavardhana <harsha@minio.io>	2020-10-28 00:10:20 -07:00
Harshavardhana	5b30bbda92	fix: add more protection distribution to match EcIndex (#10772 ) allows for more stricter validation in picking up the right set of disks for reconstruction.	2020-10-28 00:09:15 -07:00
Shireesh Anjal	858e2a43df	Remove logging info from OBDInfoHandler (#10727 ) A lot of logging data is counterproductive. A better implementation with precise useful log data can be introduced later.	2020-10-27 17:41:48 -07:00
Kaloyan Raev	df9894e275	avoid caching http ranges in background goroutine (#10724 )	2020-10-26 23:04:48 -07:00
Krishna Srinivas	592f2f23a3	fix: heal rejects objects with disk re-ordering issue (#10766 )	2020-10-26 18:48:47 -07:00
Krishna Srinivas	c49a80db41	fix: use meta.Erasure.Index for GetObject() to reconstruct object (#10764 )	2020-10-26 16:19:42 -07:00
Poorna Krishnamoorthy	46275c6547	cache: rename function declarations (#10763 )	2020-10-26 15:41:24 -07:00
Poorna Krishnamoorthy	0994ed9783	cache: fix call in GetObjectNInfo (#10762 ) Fixes: #10751	2020-10-26 12:30:40 -07:00
Anis Elleuch	eb95353cb1	fix: Get/HeadObject return 404 on non quorum objects (#10753 )	2020-10-26 10:30:46 -07:00
Harshavardhana	029758cb20	fix: retain the previous UUID for newly replaced drives (#10759 ) only newly replaced drives get the new `format.json`, this avoids disks reloading their in-memory reference format, ensures that drives are online without reloading the in-memory reference format. keeping reference format in-tact means UUIDs never change once they are formatted.	2020-10-26 10:29:29 -07:00
Harshavardhana	646d6917ed	turn-off checking for updates completely if MINIO_UPDATE=off (#10752 )	2020-10-24 22:39:44 -07:00
Harshavardhana	d9db7f3308	expire lockers if lockers are offline (#10749 ) lockers currently might leave stale lockers, in unknown ways waiting for downed lockers. locker check interval is high enough to safely cleanup stale locks.	2020-10-24 13:23:16 -07:00
Harshavardhana	6a8c62f9fd	make sure to preserve UUID from reference format (#10748 ) reference format should be source of truth for inconsistent drives which reconnect, add them back to their original position remove automatic fix for existing offline disk uuids	2020-10-24 13:23:08 -07:00
Anis Elleuch	00124c56d9	erasure: Commit data before xl.meta in RenameData() (#10734 ) This will reduce the chance to have updated xl.meta without data.	2020-10-23 21:54:58 -07:00
Anis Elleuch	2c32c2149e	tests: Avoid running TestNSRace in short test mode (#10735 )	2020-10-23 21:23:12 -07:00
Harshavardhana	734f258878	fix: slow down auto healing more aggressively (#10730 ) Bonus fixes - logging improvements to ensure that we don't use `go logger.LogIf` to avoid runtime.Caller missing the function name. log where necessary. - remove unused code at erasure sets	2020-10-22 13:36:24 -07:00
Anis Elleuch	0e0c53bba4	tests: Lower expectation in addr selection in rand cache dialer (#10739 ) Test TestDialContextWithDNSCacheRand was failing sometimes because it depends on a random selection of addresses when testing random DNS resolution from cache. Lower addr selection exception to 10%	2020-10-22 09:35:32 -07:00
Poorna Krishnamoorthy	5cc23ae052	validate if iam store is initialized (#10719 ) Fixes panic - regression from `d6d770c1b1`	2020-10-20 21:28:24 -07:00
Harshavardhana	d6d770c1b1	initialize object layer right after config has loaded	2020-10-19 22:04:59 -07:00
Harshavardhana	b07df5cae1	initialize IAM as soon as object layer is initialized (#10700 ) Allow requests to come in for users as soon as object layer and config are initialized, this allows users to be authenticated sooner and would succeed automatically on servers which are yet to fully initialize.	2020-10-19 09:54:40 -07:00
Harshavardhana	c107728676	fix: s3 gateway DNS cache initialization (#10706 ) fixes #10705	2020-10-19 01:34:23 -07:00
Anis Elleuch	284a2b9021	ilm: Send delete marker creation event when appropriate (#10696 ) Before this commit, the crawler ILM will always send object delete event notification though this is wrong.	2020-10-16 21:22:12 -07:00
Ritesh H Shukla	0b53e30ecb	Clean up monitor on delete bucket (#10698 )	2020-10-16 17:59:31 -07:00
Harshavardhana	bd2131ba34	add DNS cache support to avoid DNS flooding (#10693 ) Go stdlib resolver doesn't support caching DNS resolutions, since we compile with CGO disabled we are more probe to DNS flooding for all network calls to resolve for DNS from the DNS server. Under various containerized environments such as VMWare this becomes a problem because there are no DNS caches available and we may end up overloading the kube-dns resolver under concurrent I/O. To circumvent this issue implement a DNSCache resolver which resolves DNS and caches them for around 10secs with every 3sec invalidation attempted.	2020-10-16 14:49:05 -07:00
ebozduman	1aec168c84	fix: azure gateway should reject bucket names with "." (#10635 )	2020-10-16 09:30:18 -07:00
Klaus Post	21a549a83b	fix: keep MRF channel open to avoid random CI crash (#10686 ) There doesn't seem to be any benefit to closing the channel, so just keep it open and let it die with the server.	2020-10-16 09:08:51 -07:00
Ritesh H Shukla	8a16a1a1a9	fix: misc fixes for bandwidth reporting amd monitoring (#10683 ) * Set peer for fetch bandwidth * Fix the limit for bandwidth that is reported. * Reduce CPU burn from bandwidth management.	2020-10-16 09:07:50 -07:00
Harshavardhana	ad726b49b4	rename zones to serverSets to avoid terminology conflict (#10679 ) we are bringing in availability zones, we should avoid zones as per server expansion concept.	2020-10-15 14:28:50 -07:00
Anis Elleuch	db2241066b	heal: Enable removing dangling delete markers (#10688 )	2020-10-15 13:06:40 -07:00
Harshavardhana	f1cc16e788	fix: background heal rely on getOnlineDisks() (#10687 )	2020-10-15 13:06:23 -07:00
Klaus Post	3820a905e0	in getOnlineDisks wait for disks to be populated (#10685 )	2020-10-15 06:37:10 -07:00
Harshavardhana	2042d4873c	rename crawler config option to heal (#10678 )	2020-10-14 13:51:51 -07:00
Harshavardhana	f9be783f3e	fix: allow crawler to crawl on disks without usage constraints (#10677 ) additionally also change the resolution usage wise return of disks, allows to small byte level differences to be masked.	2020-10-14 12:12:10 -07:00
Harshavardhana	71b97fd3ac	fix: connect disks pre-emptively during startup (#10669 ) connect disks pre-emptively upon startup, to ensure we have enough disks are connected at startup rather than wait for them. we need to do this to avoid long wait times for server to be online when we have servers come up in rolling upgrade fashion	2020-10-13 18:28:42 -07:00
Klaus Post	03991c5d41	crawler: Remove waitForLowActiveIO (#10667 ) Only use dynamic delays for the crawler. Even though the max wait was 1 second the number of waits could severely impact crawler speed. Instead of relying on a global metric, we use the stateless local delays to keep the crawler running at a speed more adjusted to current conditions. The only case we keep it is before bitrot checks when enabled.	2020-10-13 13:45:08 -07:00
飞雪无情	614060764d	fix: use the correct Action type for policy.Args and iampolicy.Args (#10650 )	2020-10-12 15:18:22 -07:00
Harshavardhana	a3ba8188d7	fix: allow locker to be niladic	2020-10-12 14:23:44 -07:00
Harshavardhana	2760fc86af	Bump default idleConnsPerHost to control conns in time_wait (#10653 ) This PR fixes a hang which occurs quite commonly at higher concurrency by allowing following changes - allowing lower connections in time_wait allows faster socket open's - lower idle connection timeout to ensure that we let kernel reclaim the time_wait connections quickly - increase somaxconn to 4096 instead of 2048 to allow larger tcp syn backlogs. fixes #10413	2020-10-12 14:19:46 -07:00
Ritesh H Shukla	8ceb2a93fd	fix: peer replication bandwidth monitoring in distributed setup (#10652 )	2020-10-12 09:04:55 -07:00
Ritesh H Shukla	c2f16ee846	Add basic bandwidth monitoring for replication. (#10501 ) This change tracks bandwidth for a bucket and object - [x] Add Admin API - [x] Add Peer API - [x] Add BW throttling - [x] Admin APIs to set replication limit - [x] Admin APIs for fetch bandwidth	2020-10-09 20:36:00 -07:00
Harshavardhana	6484453fc6	optionally allow strict quorum listing (#10649 ) ``` export MINIO_API_LIST_STRICT_QUORUM=on ``` would enable listing in quorum if necessary	2020-10-09 15:40:46 -07:00
Harshavardhana	a0d0645128	remove safeMode behavior in startup (#10645 ) In almost all scenarios MinIO now is mostly ready for all sub-systems independently, safe-mode is not useful anymore and do not serve its original intended purpose. allow server to be fully functional even with config partially configured, this is to cater for availability of actual I/O v/s manually fixing the server. In k8s like environments it will never make sense to take pod into safe-mode state, because there is no real access to perform any remote operation on them.	2020-10-09 09:59:52 -07:00
Harshavardhana	253194e491	do not hold write locks - if objects don't exist (#10644 )	2020-10-08 17:47:21 -07:00
Harshavardhana	736e58dd68	fix: handle concurrent lockers with multiple optimizations (#10640 ) - select lockers which are non-local and online to have affinity towards remote servers for lock contention - optimize lock retry interval to avoid sending too many messages during lock contention, reduces average CPU usage as well - if bucket is not set, when deleteObject fails make sure setPutObjHeaders() honors lifecycle only if bucket name is set. - fix top locks to list out always the oldest lockers always, avoid getting bogged down into map's unordered nature.	2020-10-08 12:32:32 -07:00
Poorna Krishnamoorthy	907a171edd	Generalize error messages for remote targets (#10638 ) This is to allow remote targets to be generalized for replication/ILM transition Also adding a field in BucketTarget to identify a remote target with a label.	2020-10-08 10:54:11 -07:00
Andreas Auernhammer	ed6d2a100f	logger: avoid writing audit log response header twice (#10642 ) This commit fixes a misuse of the `http.ResponseWriter.WriteHeader`. A caller should either call `WriteHeader` exactly once or write to the response writer and causing an implicit 200 OK. Writing the response headers more than once causes a `http: superfluous response.WriteHeader call` log message. This commit fixes this by preventing a 2nd `WriteHeader` call being forwarded to the underlying `ResponseWriter`. Updates #10587	2020-10-08 09:29:10 -07:00
Harshavardhana	effe131090	fix: allow read unlocks to be defensive about split brains (#10637 )	2020-10-07 09:15:01 -07:00
Harshavardhana	18063bf25c	fix: cleanup old directory handling code (#10633 ) we don't need them anymore, remove legacy code.	2020-10-06 12:03:57 -07:00
Poorna Krishnamoorthy	dbbed6f7f0	update minio-go dependency (#10634 )	2020-10-06 08:37:09 -07:00
Poorna Krishnamoorthy	7fbfdceba3	Fix replication slowness (#10632 ) - Increase channel buffer length - Avoid blocking wait on replicaCh	2020-10-05 14:45:42 -07:00
Shireesh Anjal	f1418a50f0	add NVMe drive info [model num, serial num, drive temp. etc.] (#10613 ) * add NVMe drive info [model num, serial num, drive temp. etc.] * Ignore fuse partitions * Add the nvme logic only for linux * Move smart/nvme structs to a separate file Co-authored-by: wlan0 <sidharthamn@gmail.com>	2020-10-04 10:18:46 -07:00
Krishna Srinivas	045e30f2c1	Set LastModified time from source for bucket replication (#10627 )	2020-10-02 18:32:22 -07:00
Harshavardhana	c6a9a94f94	fix: optimize ServerInfo() handler to avoid reading config (#10626 ) fixes #10620	2020-10-02 16:19:44 -07:00
Harshavardhana	8e7c00f3d4	add missing request-id from DeleteObject events (#10623 ) fixes #10621	2020-10-02 13:36:13 -07:00
Harshavardhana	23e8390997	fix: Allow Walk to honor load balanced drives (#10610 )	2020-10-01 20:24:34 -07:00
Anis Elleuch	71403be912	fix: consider partNumber in GET/HEAD requests (#10618 )	2020-10-01 15:41:12 -07:00
Harshavardhana	f28d02b7f2	fix: simplify obd how we calculate transferred bytes (#10617 )	2020-10-01 14:34:51 -07:00
Harshavardhana	e0cb814f3f	fail if port is not accessible (#10616 ) throw proper error when port is not accessible for the regular user, this is possibly a regression. ``` ERROR Unable to start the server: Insufficient permissions to use specified port > Please ensure MinIO binary has 'cap_net_bind_service=+ep' permissions HINT: Use 'sudo setcap cap_net_bind_service=+ep /path/to/minio' to provide sufficient permissions ```	2020-10-01 13:23:31 -07:00
Harshavardhana	98a08e1644	fix: protect updating latencies/throughput slices in obd (#10611 ) Additionally close the transferChan upon function exit.	2020-10-01 09:50:08 -07:00
Klaus Post	3047121255	dataupdate: Bump to force rescan (#10609 ) After #10594 let's invalidate the bloom filters to force the next cycles to go through all data. There is a small chance that the linked PR could have caused missing bloom filter data. This will invalidate the current bloom filters and make the crawler go through everything.	2020-09-30 16:10:40 -07:00
Ritesh H Shukla	5a7f92481e	fix: client errors for DNS service creation errors (#10584 )	2020-09-30 14:09:41 -07:00
Anis Elleuch	0d45c38782	List v1/versions routes based on source IP if found (#10603 ) Routing using on source IP if found. This should distribute the listing load for V1 and versioning on multiple nodes evenly between different clients. If source IP is not found from the http request header, then falls back to bucket name instead.	2020-09-30 13:38:27 -07:00
Poorna Krishnamoorthy	56d1b227cf	Handle changes to versioning config for replication (#10598 ) Disallow versioning suspension on a bucket with pre-existing replication configuration If versioning is suspended on the target,replication should fail.	2020-09-30 13:36:37 -07:00
Lenin Alevski	bea87a5a20	fix: reading multiple TLS certificates when deployed in K8S (#10601 ) Ignore all regular files, CAs directory and any directory that starts with `..` inside the `.minio/certs` folder	2020-09-30 08:21:30 -07:00
Harshavardhana	2b4eb87d77	pick disks which are common maximally used (#10600 ) further optimization to ensure that good disks are always used for listing, other than healing we only use disks that are maximally used.	2020-09-29 22:54:02 -07:00
Harshavardhana	1f9abbee4d	make sure to release locks upon timeout (#10596 ) fixes #10418	2020-09-29 15:18:34 -07:00
Klaus Post	fdf0ae9167	exit data update tracker only upon context completion (#10594 ) The data update tracker saver would exit if data wasn't updated for between cycles.	2020-09-29 13:23:53 -07:00
Harshavardhana	00eb6f6bc9	cache DiskInfo at storage layer for performance (#10586 ) `mc admin info` on busy setups will not move HDD heads unnecessarily for repeated calls, provides a better responsiveness for the call overall. Bonus change allow listTolerancePerSet be N-1 for good entries, to avoid skipping entries for some reason one of the disk went offline.	2020-09-29 09:54:41 -07:00
Harshavardhana	66174692a2	add '.healing.bin' for tracking currently healing disk (#10573 ) add a hint on the disk to allow for tracking fresh disk being healed, to allow for restartable heals, and also use this as a way to track and remove disks. There are more pending changes where we should move all the disk formatting logic to backend drives, this PR doesn't deal with this refactor instead makes it easier to track healing in the future.	2020-09-28 19:39:32 -07:00
飞雪无情	209680e89f	Remove redundant http.HandlerFunc type conversion. (#10576 )	2020-09-28 13:33:49 -07:00
飞雪无情	27d9bd04e5	Handling unhandled errors in the InfoCannedPolicy method. (#10575 )	2020-09-27 10:24:04 -07:00
Harshavardhana	bebcf4f004	unlock() only if locking was successful	2020-09-25 19:36:47 -07:00
Harshavardhana	eafa775952	fix: add lock ownership to expire locks (#10571 ) - Add owner information for expiry, locking, unlocking a resource - TopLocks returns now locks in quorum by default, provides a way to capture stale locks as well with `?stale=true` - Simplify the quorum handling for locks to avoid from storage class, because there were challenges to make it consistent across all situations. - And other tiny simplifications to reset locks.	2020-09-25 19:21:52 -07:00
Harshavardhana	66b4a862e0	fix: network failure err check should ignore context canceled errors (#10567 ) context canceled errors bubbling up from the network layer has the potential to be misconstrued as network errors, taking prematurely a server offline and triggering a health check routine avoid this potential occurrence.	2020-09-25 14:35:47 -07:00
Anis Elleuch	9603489dd3	federation: Honor range with UploadObjectPart to a different cluster (#10570 ) Use gr & length instead of srcInfo.Reader & srcInfo.Size because they don't honor range header	2020-09-25 12:06:42 -07:00
Anis Elleuch	b302c8a5f4	heal: Fix periodic healing cleanup (#10569 ) isEnded() was incorrectly calculating if the current healing sequence is ended or not. h.currentStatus.Items could be empty if healing is very slow and mc admin heal consumed all items.	2020-09-25 10:29:00 -07:00
Praveen raj Mani	b880796aef	Set the maximum open connections limit in PG and MySQL target configs (#10558 ) As the bulk/recursive delete will require multiple connections to open at an instance, The default open connections limit will be reached which results in the following error ```FATAL: sorry, too many clients already``` By setting the open connections to a reasonable value - `2`, We ensure that the max open connections will not be exhausted and lie under bounds. The queries are simple inserts/updates/deletes which is operational and sufficient with the the maximum open connection limit is 2. Fixes #10553 Allow user configuration for MaxOpenConnections	2020-09-24 22:20:30 -07:00
Harshavardhana	37a5d5d7a0	reduce timeouts between servers for faster disconnects (#10562 )	2020-09-24 20:10:07 -07:00
Harshavardhana	3cac262dd1	report heal drives properly, also from global state (#10561 ) It is possible the heal drives are not reported from the maintenance check because the background heal state simply relied on the `format.json` for capturing unformatted drives. It is possible that drives might be still healing - make sure that applications which rely on cluster health check respond back this detail.	2020-09-24 15:36:47 -07:00
poornas	e6ab4db6b8	Fix minimum replication workers started (#10560 ) This PR also fixes GetReplicationConfiguration permission in web-handlers.go to use bucket as resource	2020-09-24 12:25:41 -07:00
Harshavardhana	ca989eb0b3	avoid ListBuckets returning quorum errors when node is down (#10555 ) Also, revamp the way ListBuckets work make few portions of the healing logic parallel - walk objects for healing disks in parallel - collect the list of buckets in parallel across drives - provide consistent view for listBuckets()	2020-09-24 09:53:38 -07:00
飞雪无情	d778d034e7	Remove redundant mgmtQueryKey type. (#10557 ) Remove redundant type conversion.	2020-09-24 08:40:21 -07:00
Harshavardhana	f7f9517b6a	fix: host extraction without port	2020-09-23 12:10:14 -07:00
Harshavardhana	90cff10e2b	avoid crash if disks are not initialized	2020-09-23 12:00:29 -07:00
Harshavardhana	81caf35926	fix: reduce healthcheck interval for storage rest client (#10544 )	2020-09-23 10:43:42 -07:00
poornas	5726cef3ca	validate bucket exists in ListRemoteTargets api (#10552 )	2020-09-23 10:37:54 -07:00
Harshavardhana	8b74a72b21	fix: rename READY deadline to CLUSTER deadline ENV (#10535 )	2020-09-23 09:14:33 -07:00
Klaus Post	eec69d6796	Fix stale context for bucket retrieval (#10551 ) The provided context gets captured by the closure making all subsequent calls fail.	2020-09-23 08:30:31 -07:00
Harshavardhana	0537a21b79	avoid concurrenct use of rand.NewSource (#10543 )	2020-09-22 15:34:27 -07:00
poornas	4c54ed8748	Close replica channel only once (#10542 ) Also enforce s3:GetReplicationConfiguration permission check as a bucket level resource.	2020-09-22 12:47:24 -07:00
Anis Elleuch	4c81201f95	fix: healing delete marker on versioned buckets (#10530 ) Healing was not working correctly in the distributed mode because errFileVersionNotFound was not properly converted in storage rest client. Besides, fixing the healing delete marker is not working as expected.	2020-09-21 15:16:16 -07:00
Harshavardhana	cd8d511d3d	move versionsOrder struct to xl-storage-utils	2020-09-21 14:24:42 -07:00
Harshavardhana	17e17da00d	add parallel workers to perform replication in parallel (#10525 ) set the concurrency for replication be to runtime.NumCPU()/2	2020-09-21 13:43:29 -07:00
Harshavardhana	a5da9120f3	fix: [fs] an error upon rwPool.Write() just attempt rwPool.Create() (#10533 ) On some NFS clients looks like errno is incorrectly set, which leads to incorrect errors thrown upwards.	2020-09-21 12:54:23 -07:00
poornas	aa12d75d75	fix crawler to detect lifecycle on bucket even if filter nil (#10532 )	2020-09-21 11:41:07 -07:00
Harshavardhana	6fcbdd5607	remove unused putObjectDir code (#10528 )	2020-09-21 09:41:39 -07:00
Harshavardhana	3831cc9e3b	fix: [fs] CompleteMultipart use trie structure for partMatch (#10522 ) performance improves by around 100x or more ``` go test -v -run NONE -bench BenchmarkGetPartFile goos: linux goarch: amd64 pkg: github.com/minio/minio/cmd BenchmarkGetPartFileWithTrie BenchmarkGetPartFileWithTrie-4 1000000000 0.140 ns/op 0 B/op 0 allocs/op PASS ok github.com/minio/minio/cmd 1.737s ``` fixes #10520	2020-09-21 01:18:13 -07:00
Krishna Srinivas	230fc0d186	Support for "directory" objects (#10499 )	2020-09-19 08:39:41 -07:00
Harshavardhana	7f9498f43f	fix: ignore faulty drives and continue (#10511 ) drives might return different types of errors handle them individually, and for some errors just log an error and continue	2020-09-18 12:09:05 -07:00
Harshavardhana	1cf322b7d4	change leader locker only for crawler (#10509 )	2020-09-18 11:15:54 -07:00
Klaus Post	0b1c824618	Fix incorrect request start time (#10516 ) Log request start time BEFORE starting processing the request	2020-09-18 09:30:52 -07:00
Klaus Post	c851e022b7	Tweaks to dynamic locks (#10508 ) * Fix cases where minimum timeout > default timeout. * Add defensive code for too small/negative timeouts. * Never set timeout below the maximum value of a request. * Protect against (unlikely) int64 wraps. * Decrease timeout slower. * Don't re-lock before copying.	2020-09-18 09:18:18 -07:00
Klaus Post	5ad032826a	Add a reasonable if unable to get total RAM (#10506 ) Though unlikely we shouldn't skip initializing the API if we cannot get RAM. Add 16GiB as a default and log the error.	2020-09-18 02:03:02 -07:00
Harshavardhana	84bf4624a4	fix: make sure to preserve metadata during overwrite in FS mode (#10512 ) This bug was introduced in `14f0047295` almost 3yrs ago, as a side affect of removing stale `fs.json` but we in-fact end up removing existing good `fs.json` for an existing object, leading to some form of a data loss. fixes #10496	2020-09-18 00:16:16 -07:00
Harshavardhana	4a36cd7035	fix: improve performance ListObjectParts in FS mode (#10510 ) from 20s for 10000 parts to less than 1sec Without the patch ``` ~ time aws --endpoint-url=http://localhost:9000 --profile minio s3api \ list-parts --bucket testbucket --key test \ --upload-id c1cd1f50-ea9a-4824-881c-63b5de95315a real 0m20.394s user 0m0.589s sys 0m0.174s ``` With the patch ``` ~ time aws --endpoint-url=http://localhost:9000 --profile minio s3api \ list-parts --bucket testbucket --key test \ --upload-id c1cd1f50-ea9a-4824-881c-63b5de95315a real 0m0.891s user 0m0.624s sys 0m0.182s ``` fixes #10503	2020-09-17 18:51:16 -07:00
Klaus Post	03490c811b	Fix obd goroutine leak (#10504 ) The gouroutine collecting transfer stats never exits. Add missing channel close.	2020-09-17 10:10:20 -07:00
Harshavardhana	ed78854cea	fix: list across all drives to avoid stale disks	2020-09-16 21:17:10 -07:00
Harshavardhana	e60834838f	fix: background disk heal, to reload format consistently (#10502 ) It was observed in VMware vsphere environment during a pod replacement, `mc admin info` might report incorrect offline nodes for the replaced drive. This issue eventually goes away but requires quite a lot of time for all servers to be in sync. This PR fixes this behavior properly.	2020-09-16 21:14:35 -07:00
Harshavardhana	d616d8a857	serialize replication and feed it through task model (#10500 ) this allows for eventually controlling the concurrency of replication and overally control of throughput	2020-09-16 16:04:55 -07:00
Anis Elleuch	24cab7f9df	ilm: Remove a 'null' version if not latest (#10494 ) If the ILM document requires removing noncurrent versions, the the server should be able to remove 'null' versions as well. 'null' versions are created when versioning is not enabled or suspended.	2020-09-16 10:21:50 -07:00
Harshavardhana	02c1a08a5b	fix: make sure to lock CopyObject for in-place updates (#10492 )	2020-09-15 20:44:48 -07:00
Ritesh H Shukla	5c47ce456e	Run replication in the background (#10491 )	2020-09-15 18:44:58 -07:00
Anis Elleuch	8ea55f9dba	obd: Add console log to OBD output (#10372 )	2020-09-15 18:02:54 -07:00
poornas	80e3dce631	azure: update content-md5 to metadata after upload (#10482 ) Fixes #10453	2020-09-15 16:31:47 -07:00
Harshavardhana	80fab03b63	fix: S3 gateway doesn't support full passthrough for encryption (#10484 ) The entire encryption layer is dependent on the fact that KMS should be configured for S3 encryption to work properly and we only support passing the headers as is to the backend for encryption only if KMS is configured. Make sure that this predictability is maintained, currently the code was allowing encryption to go through and fail at later to indicate that KMS was not configured. We should simply reply "NotImplemented" if KMS is not configured, this allows clients to simply proceed with their tests.	2020-09-15 13:57:15 -07:00
Harshavardhana	730d2dc7be	fix: allow CopyObject/PutObjecTags on pre-existing content (#10485 ) fixes #10475	2020-09-15 09:18:41 -07:00
Harshavardhana	0ee9678190	fix: add missing delete marker created filter (#10481 )	2020-09-14 21:32:52 -07:00
Klaus Post	34859c6d4b	Preallocate (safe) slices when we know the size (#10459 )	2020-09-14 20:44:18 -07:00
Klaus Post	b1c99e88ac	reduce CPU usage upto 50% in readdir (#10466 )	2020-09-14 17:19:54 -07:00
Harshavardhana	0104af6bcc	delayed locks until we have started reading the body (#10474 ) This is to ensure that Go contexts work properly, after some interesting experiments I found that Go net/http doesn't cancel the context when Body is non-zero and hasn't been read till EOF. The following gist explains this, this can lead to pile up of go-routines on the server which will never be canceled and will die at a really later point in time, which can simply overwhelm the server. https://gist.github.com/harshavardhana/c51dcfd055780eaeb71db54f9c589150 To avoid this refactor the locking such that we take locks after we have started reading from the body and only take locks when needed. Also, remove contextReader as it's not useful, doesn't work as expected context is not canceled until the body reaches EOF so there is no point in wrapping it with context and putting a `select {` on it which can unnecessarily increase the CPU overhead. We will still use the context to cancel the lockers etc. Additional simplification in the locker code to avoid timers as re-using them is a complicated ordeal avoid them in the hot path, since locking is very common this may avoid lots of allocations.	2020-09-14 15:57:13 -07:00
Harshavardhana	34ea1d2167	fix: return correct error code for MetadataTooLarge (#10470 ) fixes #10469	2020-09-13 21:26:35 -07:00
Harshavardhana	9d95937018	update KMS docs indicating deprecation of AUTO_ENCRYPTION env	2020-09-13 16:23:28 -07:00
Klaus Post	fa01e640f5	Continous healing: add optional bitrot check (#10417 )	2020-09-12 00:08:12 -07:00
Harshavardhana	f355374962	add support for configurable remote transport deadline (#10447 ) configurable remote transport timeouts for some special cases where this value needs to be bumped to a higher value when transferring large data between federated instances.	2020-09-11 23:03:08 -07:00
Harshavardhana	bda0fe3150	fix: allow LDAP identity to support form body POST (#10468 ) similar to other STS APIs	2020-09-11 23:02:32 -07:00
Harshavardhana	b70995dd60	Revert "ilm: Remove null version if not latest with proper config (#10467 )" This reverts commit `4b6264da7d`.	2020-09-11 18:15:49 -07:00
Anis Elleuch	4b6264da7d	ilm: Remove null version if not latest with proper config (#10467 )	2020-09-11 14:20:09 -07:00
Harshavardhana	48919de301	fix: for defer'ed deleteObject use internal context (#10463 )	2020-09-11 06:39:19 -07:00
Harshavardhana	eb2934f0c1	simplify webhook DNS further generalize for gateway (#10448 ) continuation of the changes from `eaaf05a7cc` this further simplifies, enables this for gateway deployments as well	2020-09-10 14:19:32 -07:00
Klaus Post	b7438fe4e6	Copy metadata before spawning goroutine + prealloc maps (#10458 ) In `(*cacheObjects).GetObjectNInfo` copy the metadata before spawning a goroutine. Clean up a few map[string]string copies as well, reducing allocs and simplifying the code. Fixes #10426	2020-09-10 11:37:22 -07:00
Anis Elleuch	ce6cef6855	erasure: Call Walk() from all disks (#10445 ) It does not make sense to call Walk() in only N/2 disks and then requires N/2 quorum, just keep it N/2+1 The commit fixes this behavior.	2020-09-10 09:27:52 -07:00
Klaus Post	493c714663	Remove erasureSets and erasureObjects from ObjectLayer (#10442 )	2020-09-10 09:18:19 -07:00
Harshavardhana	e959c5d71c	fix: server panic in FS mode (#10455 ) fixes #10454	2020-09-10 09:16:26 -07:00
Harshavardhana	4a2928eb49	generate missing object delete bucket notifications (#10449 ) fixes #10381	2020-09-09 18:23:08 -07:00
Anis Elleuch	af88772a78	lifecycle: NoncurrentVersionExpiration considers noncurrent version age (#10444 ) From https://docs.aws.amazon.com/AmazonS3/latest/dev/intro-lifecycle-rules.html#intro-lifecycle-rules-actions ``` When specifying the number of days in the NoncurrentVersionTransition and NoncurrentVersionExpiration actions in a Lifecycle configuration, note the following: It is the number of days from when the version of the object becomes noncurrent (that is, when the object is overwritten or deleted), that Amazon S3 will perform the action on the specified object or objects. Amazon S3 calculates the time by adding the number of days specified in the rule to the time when the new successor version of the object is created and rounding the resulting time to the next day midnight UTC. For example, in your bucket, suppose that you have a current version of an object that was created at 1/1/2014 10:30 AM UTC. If the new version of the object that replaces the current version is created at 1/15/2014 10:30 AM UTC, and you specify 3 days in a transition rule, the transition date of the object is calculated as 1/19/2014 00:00 UTC. ```	2020-09-09 18:11:24 -07:00
Harshavardhana	9109148474	add support for new UA values for update an check (#10451 )	2020-09-09 17:21:39 -07:00
Nitish Tiwari	eaaf05a7cc	Add Kubernetes operator webook server as DNS target (#10404 ) This PR adds a DNS target that ensures to update an entry into Kubernetes operator when a bucket is created or deleted. See minio/operator#264 for details. Co-authored-by: Harshavardhana <harsha@minio.io>	2020-09-09 12:20:49 -07:00
Harshavardhana	958661cbb5	skip subdomain from bucket DNS which start with `minio.domain` (#10390 ) extend host matcher to reject the host match	2020-09-09 09:57:37 -07:00
Harshavardhana	6a0372be6c	cleanup tmpDir any older entries automatically just like multipart (#10439 ) also consider multipart uploads, temporary files in `.minio.sys/tmp` as stale beyond 24hrs and clean them up automatically	2020-09-08 15:55:40 -07:00
Harshavardhana	c13afd56e8	Remove MaxConnsPerHost settings to avoid potential hangs (#10438 ) MaxConnsPerHost can potentially hang a call without any way to timeout, we do not need this setting for our proxy and gateway implementations instead IdleConn settings are good enough. Also ensure to use NewRequestWithContext and make sure to take the disks offline only for network errors. Fixes #10304	2020-09-08 14:22:04 -07:00
Harshavardhana	96997d2b21	allow ctrl+c to be consistent at early startup (#10435 ) fixes #10431	2020-09-08 09:10:55 -07:00
Klaus Post	86a3319d41	Ignore config values from unknown subsystems (#10432 )	2020-09-08 08:57:04 -07:00
Harshavardhana	9f60e84ce1	always copy UserDefined metadata map (#10427 ) fixes #10426	2020-09-07 09:25:28 -07:00
Harshavardhana	572b1721b2	set max API requests automatically based on RAM (#10421 )	2020-09-04 19:37:37 -07:00
Harshavardhana	b0e1d4ce78	re-attach offline drive after new drive replacement (#10416 ) inconsistent drive healing when one of the drive is offline while a new drive was replaced, this change is to ensure that we can add the offline drive back into the mix by healing it again.	2020-09-04 17:09:02 -07:00
Harshavardhana	eb19c8af40	Bump response header timeout for proxying list request (#10420 )	2020-09-04 16:07:40 -07:00
Klaus Post	2d58a8d861	Add storage layer contexts (#10321 ) Add context to all (non-trivial) calls to the storage layer. Contexts are propagated through the REST client. - `context.TODO()` is left in place for the places where it needs to be added to the caller. - `endWalkCh` could probably be removed from the walkers, but no changes so far. The "dangerous" part is that now a caller disconnecting will propagate down, so a "delete" operation will now be interrupted. In some cases we might want to disconnect this functionality so the operation completes if it has started, leaving the system in a cleaner state.	2020-09-04 09:45:06 -07:00
poornas	0037951b6e	improve error message when remote target missing (#10412 )	2020-09-04 08:48:38 -07:00
Andreas Auernhammer	fbd1c5f51a	certs: refactor cert manager to support multiple certificates (#10207 ) This commit refactors the certificate management implementation in the `certs` package such that multiple certificates can be specified at the same time. Therefore, the following layout of the `certs/` directory is expected: ``` certs/ │ ├─ public.crt ├─ private.key ├─ CAs/ // CAs directory is ignored │ │ │ ... │ ├─ example.com/ │ │ │ ├─ public.crt │ └─ private.key └─ foobar.org/ │ ├─ public.crt └─ private.key ... ``` However, directory names like `example.com` are just for human readability/organization and don't have any meaning w.r.t whether a particular certificate is served or not. This decision is made based on the SNI sent by the client and the SAN of the certificate. *** The `Manager` will pick a certificate based on the client trying to establish a TLS connection. In particular, it looks at the client hello (i.e. SNI) to determine which host the client tries to access. If the manager can find a certificate that matches the SNI it returns this certificate to the client. However, the client may choose to not send an SNI or tries to access a server directly via IP (`https://<ip>:<port>`). In this case, we cannot use the SNI to determine which certificate to serve. However, we also should not pick "the first" certificate that would be accepted by the client (based on crypto. parameters - like a signature algorithm) because it may be an internal certificate that contains internal hostnames. We would disclose internal infrastructure details doing so. Therefore, the `Manager` returns the "default" certificate when the client does not specify an SNI. The default certificate the top-level `public.crt` - i.e. `certs/public.crt`. This approach has some consequences: - It's the operator's responsibility to ensure that the top-level `public.crt` does not disclose any information (i.e. hostnames) that are not publicly visible. However, this was the case in the past already. - Any other `public.crt` - except for the top-level one - must not contain any IP SAN. The reason for this restriction is that the Manager cannot match a SNI to an IP b/c the SNI is the server host name. The entire purpose of SNI is to indicate which host the client tries to connect to when multiple hosts run on the same IP. So, a client will not set the SNI to an IP. If we would allow IP SANs in a lower-level `public.crt` a user would expect that it is possible to connect to MinIO directly via IP address and that the MinIO server would pick "the right" certificate. However, the MinIO server cannot determine which certificate to serve, and therefore always picks the "default" one. This may lead to all sorts of confusing errors like: "It works if I use `https:instance.minio.local` but not when I use `https://10.0.2.1`. These consequences/limitations should be pointed out / explained in our docs in an appropriate way. However, the support for multiple certificates should not have any impact on how deployment with a single certificate function today. Co-authored-by: Harshavardhana <harsha@minio.io>	2020-09-03 23:33:37 -07:00
Harshavardhana	1c6781757c	add missing ListBucketVersions from policy actions (#10414 )	2020-09-03 18:25:06 -07:00
Harshavardhana	b4e3956e69	update KES docs to talk about 'mc encrypt' command (#10400 ) add a deprecation notice for KMS_AUTO_ENCRYPTION	2020-09-03 12:43:45 -07:00
Harshavardhana	8a291e1dc0	Cluster healthcheck improvements (#10408 ) - do not fail the healthcheck if heal status was not obtained from one of the nodes, if many nodes fail then report this as a catastrophic error. - add "x-minio-write-quorum" value to match the write tolerance supported by server. - admin info now states if a drive is healing where madmin.Disk.Healing is set to true and madmin.Disk.State is "ok"	2020-09-02 22:54:56 -07:00
Klaus Post	650dccfa9e	cache: Only start at high watermark (#10403 ) Currently, cache purges are triggered as soon as the low watermark is exceeded. To reduce IO this should only be done when reaching the high watermark. This simplifies checks and reduces all calls for a GC to go through `dcache.diskSpaceAvailable(size)`. While a comment claims that `dcache.triggerGC <- struct{}{}` was non-blocking I don't see how that was possible. Instead, we add a 1 size to the queue channel and use channel semantics to avoid blocking when a GC has already been requested. `bytesToClear` now takes the high watermark into account to it will not request any bytes to be cleared until that is reached.	2020-09-02 17:48:44 -07:00
Andreas Auernhammer	9a703befe6	crypto: reduce retry delay when retrying KES requests (#10394 ) This commit reduces the retry delay when retrying a request to a KES server by: - reducing the max. jitter delay from 3s to 1.5s - skipping the random delay when there are more KES endpoints available. If there are more KES endpoints we can directly retry to the request by sending it to the next endpoint - as pointed out by @krishnasrinivas	2020-09-02 11:04:10 -07:00
Klaus Post	9a1615768d	Fix flaky TestXLStorageVerifyFile (#10398 ) `TestXLStorageVerifyFile` would fail 1 in 256 if the first random character was 'a'. Instead write 256 bytes which has 1 in 256^256 probability.	2020-09-02 09:42:24 -07:00
Harshavardhana	37da0c647e	fix: delete marker compatibility behavior for suspended bucket (#10395 ) - delete-marker should be created on a suspended bucket as `null` - delete-marker should delete any pre-existing `null` versioned object and create an entry `null`	2020-09-02 00:19:03 -07:00
Harshavardhana	2acb530ccd	update rulesguard with new rules (#10392 ) Co-authored-by: Nitish Tiwari <nitish@minio.io> Co-authored-by: Praveen raj Mani <praveen@minio.io>	2020-09-01 16:58:13 -07:00
Klaus Post	3e1fb17b70	heal: Check for truncated files (#10399 ) When checking parts we already do a stat for each part. Since we have the on disk size check if it is at least what we expect. When checking metadata check if metadata is 0 bytes.	2020-09-01 12:06:45 -07:00
Klaus Post	a89d6b8e3d	Fix common Windows failure (#10397 ) The `getNonLoopBackIP` may grab an IP from an interface that doesn't allow binding (on Windows), so this test consistently fails. We exclude that specific error.	2020-09-01 10:11:15 -07:00
Klaus Post	1c085f7d1a	Fix crash on Windows when crawling (#10385 ) * readDirN: Check if file is directory `syscall.FindNextFile` crashes if the handle is a file. `errFileNotFound` matches 'unix' functionality: `d19b434ffc/cmd/os-readdir_unix.go (L106)` Fixes #10384	2020-09-01 09:33:16 -07:00
Harshavardhana	4b6585d249	support 'ldap:user' variable replacement properly (#10391 ) also update `ldap.go` examples with latest minio-go changes Fixes #10367	2020-09-01 12:26:22 +05:30
Harshavardhana	9ffad7fceb	discard empty endpoint in crypto kes introduced in `18725679c4`	2020-08-31 19:35:43 -07:00
Andreas Auernhammer	18725679c4	crypto: allow multiple KES endpoints (#10383 ) This commit addresses a maintenance / automation problem when MinIO-KES is deployed on bare-metal. In orchestrated env. the orchestrator (K8S) will make sure that `n` KES servers (IPs) are available via the same DNS name. There it is sufficient to provide just one endpoint.	2020-08-31 18:10:52 -07:00
Anis Elleuch	ba8a8ad818	ListObjectsV1 requests unnecessarily fail with offline nodes (#10386 ) ListObjectsV1 requests are actually redirected to a specific node, depending on the bucket name. The purpose of this behavior was to optimize listing. However, the current code sends a Bad Gateway error if the target node is offline, which is a bad behavior because it means that the list request will fail, although this is unnecessary since we can still use the current node to list as well (the default behavior without using proxying optimization) Currently, you can see mint fails when there is one offline node, after this PR, mint will always succeed.	2020-08-31 12:37:31 -07:00
Harshavardhana	102ad60dee	simplify removing temporary files (#10389 )	2020-08-31 12:35:40 -07:00
Gaige B Paulsen	859ef52886	update for smartos build (solaris too) (#10378 )	2020-08-31 10:19:25 -07:00
Harshavardhana	e730da1438	fix: referesh JWKS public keys upon failure (#10368 ) fixes #10359	2020-08-28 08:15:12 -07:00
Anis Elleuch	46ee8659b4	fix write quorum calculation for bucket operations (#10364 ) When the number of disks is odd, the calculation of quorum for bucket operations were not correct, fix it.	2020-08-27 12:55:32 -07:00
Harshavardhana	a359e36e35	tolerate listing with only readQuorum disks (#10357 ) We can reduce this further in the future, but this is a good value to keep around. With the advent of continuous healing, we can be assured that namespace will eventually be consistent so we are okay to avoid the necessity to a list across all drives on all sets. Bonus Pop()'s in parallel seem to have the potential to wait too on large drive setups and cause more slowness instead of gaining any performance remove it for now. Also, implement load balanced reply for local disks, ensuring that local disks have an affinity for - cleanupStaleMultipartUploads()	2020-08-26 19:29:35 -07:00
Jorge Israel Peña	0a2e6d58a5	hdfs gateway handle listing single files (#10362 )	2020-08-26 16:03:53 -07:00
Klaus Post	1b119557c2	getDisksInfo: Attribute failed disks to correct endpoint (#10360 ) If DiskInfo calls failed the information returned was used anyway resulting in no endpoint being set. This would make the drive be attributed to the local system since `disk.Endpoint == disk.DrivePath` in that case. Instead, if the call fails record the endpoint and the error only.	2020-08-26 10:11:26 -07:00
Harshavardhana	7778fef6bb	update continous heal metrics appropriately for scanned items (#10352 ) bonus make sure to ignore objectNotFound, and versionNotFound errors properly at all layers, since HealObjects() returns objectNotFound error if the bucket or prefix is empty.	2020-08-26 08:53:33 -07:00
飞雪无情	ea1803417f	Use constants for gateway names to avoid bugs caused by spelling. (#10355 )	2020-08-26 08:52:46 -07:00
Harshavardhana	d19b434ffc	fix: bring back delayed leaf detection in listing (#10346 )	2020-08-25 12:26:48 -07:00
Klaus Post	17a1eda702	Disregard healing disks in crawling (#10349 ) When crawling never use a disk we know is healing. Most of the change involves keeping track of the original endpoint on xlStorage and this also fixes DiskInfo.Endpoint never being populated. Heal master will print `data-crawl: Disk "http://localhost:9001/data/mindev/data2/xl1" is Healing, skipping` once on a cycle (no more often than every 5m).	2020-08-25 10:55:15 -07:00
Daniel Valdivia	7d1734d033	indicate through HTTP header cluster healing in progress (#10342 )	2020-08-24 15:20:50 -07:00
Harshavardhana	03ec6adfd0	fix: KES http2.0 communication support (#10341 )	2020-08-24 14:37:53 -07:00
Harshavardhana	309b10f201	keep crawler cycle at 5 minutes	2020-08-24 14:05:16 -07:00
Klaus Post	c097ce9c32	continous healing based on crawler (#10103 ) Design: https://gist.github.com/klauspost/792fe25c315caf1dd15c8e79df124914	2020-08-24 13:47:01 -07:00
Harshavardhana	caad314faa	add ruleguard support, fix all the reported issues (#10335 )	2020-08-24 12:11:20 -07:00
Klaus Post	bc2ebe0021	Only enforce quota on success (#10339 ) We should only enforce quotas if no error has been returned. firstErr is safe to access since all goroutines have exited at this point. If `firstErr` hasn't been set by something else return the context error if cancelled.	2020-08-24 10:15:46 -07:00
Harshavardhana	11aa393ba7	Allow region errors to be dynamic (#10323 ) remove other FIXMEs as we are not planning to fix these, instead we will add dynamism case by case basis. fixes #10250	2020-08-23 22:06:22 -07:00
Praveen raj Mani	d0c910a6f3	Support https and basic-auth for elasticsearch notification target (#10332 )	2020-08-23 09:43:48 -07:00
kannappanr	d15a5ad4cc	S3 Gateway: Check for encryption headers properly (#10309 )	2020-08-22 11:41:49 -07:00
Harshavardhana	95411228db	add missing cleanupStaleMultipartUploads (#10325 ) fixes #10319	2020-08-21 21:39:54 -07:00
ebozduman	23774353b7	get_object() returns NoSuchKey error when object is a prefix (#10315 )	2020-08-21 13:08:01 -07:00
poornas	a2a5ec93d3	fix: use global context for filling cache in the background (#10308 )	2020-08-20 14:23:24 -07:00
Harshavardhana	27a774cbe9	fix: FS mode should reject putBucketVersioning (#10307 )	2020-08-20 13:18:06 -07:00
Klaus Post	8e6787a302	Fix TestDataUpdateTracker hanging (#10302 ) Keep dataUpdateTracker while goroutine is starting. This will ensure the object is updated one `start` returns Tested with ``` λ go test -cpu=1,2,4,8 -test.run TestDataUpdateTracker -count=1000 PASS ok github.com/minio/minio/cmd 8.913s ``` Fixes #10295	2020-08-20 13:17:42 -07:00
Harshavardhana	59352d0ac2	load all blocking metadata in background (#10298 ) most of this metadata already has fallbacks and there is no good reason to load them in blocking fashion	2020-08-20 10:38:53 -07:00
Harshavardhana	75d44b3bae	add disk for more context in bitrot errors (#10296 )	2020-08-20 09:41:15 -07:00
Klaus Post	95ae6c4b49	Fix missing unlock in *healSequence.hasEnded() (#10305 ) The background healing sequence would always hang when this function is called.	2020-08-20 08:48:09 -07:00
KevinSmile	0ebb73ee2e	use const instead of literals (#10292 )	2020-08-19 16:43:52 -07:00
Harshavardhana	c8b84a0e9e	Add nancy vulnerability scanner (#10289 )	2020-08-19 14:25:21 -07:00
Ritesh H Shukla	3acb5cff45	Update code comment (#10287 )	2020-08-19 14:24:58 -07:00
Harshavardhana	74116204ce	handle fresh setup with mixed drives (#10273 ) fresh drive setups when one of the drive is a root drive, we should ignore such a root drive and not proceed to format. This PR handles this properly by marking the disks which are root disk and they are taken offline.	2020-08-18 14:37:26 -07:00
Harshavardhana	e4a44f6224	fix: commonPrefixes behavior in ListObjectVersions (#10286 ) ``` $ aws s3api --profile minio --endpoint-url http://localhost:9003 \ list-object-versions --bucket testbucket \ --delimiter / --prefix Veeam/Archive/ { "CommonPrefixes": [ { "Prefix": "Veeam/Archive/003/" } ] } ``` Also add coverage tests similar to ListObjects to catch errors in future, skip these tests in FS mode	2020-08-18 12:19:44 -07:00
poornas	0272973175	Fix regression in web ui for retention (#10285 ) Fixes: #10283 regression from PR #9259	2020-08-18 12:09:42 -07:00
Harshavardhana	d2a3f92452	fix: health handler for lockers (#10280 )	2020-08-18 07:27:41 -07:00
Harshavardhana	ede86845e5	docs: Add policy variables for resource and conditions (#10278 ) Bonus fix adds LDAP policy variable and clarifies the usage of policy variables for temporary credentials. fixes #10197	2020-08-17 17:39:55 -07:00
Harshavardhana	e57c742674	use single dynamic timeout for most locked API/heal ops (#10275 ) newDynamicTimeout should be allocated once, in-case of temporary locks in config and IAM we should have allocated timeout once before the `for loop` This PR doesn't fix any issue as such, but provides enough dynamism for the timeout as per expectation.	2020-08-17 11:29:58 -07:00
Klaus Post	bb5976d727	healbucket: Send object version ID (#10263 ) Based on our previous conversations I assume we should send the version id when healing an object. Maybe we should even list object versions and heal all?	2020-08-17 08:25:44 -07:00
Harshavardhana	f7c1a59de1	add validation logs for configured Logger/Audit HTTP targets (#10274 ) extra logs in-case of misconfiguration of audit/logger targets	2020-08-16 10:25:00 -07:00
Anis Elleuch	51ba1dac49	listing: Fix result when prefix is an object with a slash (#10267 ) In a non recursive mode, issuing a list request where prefix is an existing object with a slash and delimiter is a slash will return entries in the object directory (data dir IDs) ``` $ aws s3api --profile minioadmin --endpoint-url http://localhost:9000 \ list-objects-v2 --bucket testbucket --prefix code_of_conduct.md/ --delimiter '/' { "CommonPrefixes": [ { "Prefix": "code_of_conduct.md/ec750fe0-ea7e-4b87-bbec-1e32407e5e47/" } ] } ``` This commit adds a fast exit track in Walk() in this specific case.	2020-08-14 20:13:24 -07:00
Harshavardhana	a4463dd40f	fix: storageClass shouldn't set the value upon failure (#10271 )	2020-08-14 19:48:04 -07:00
Harshavardhana	83a82d818e	allow lock tolerance to match storage-class drive tolerance (#10270 )	2020-08-14 18:17:14 -07:00
Harshavardhana	1d1c4430b2	decrypt ETags in parallel around 500 at a time (#10261 ) Listing speed-up gained from 10secs for just 400 entries to 2secs for 400 entries	2020-08-14 11:56:35 -07:00
Harshavardhana	43e6d1ce2d	fix: missing proxy request by bucket for ListVersions (#10260 )	2020-08-13 16:31:58 -07:00
Harshavardhana	30da442a85	rootDisk on containers can have different device Id (#10259 ) use `/etc/hosts` instead of `/` to check for common device id, if the device is same for `/etc/hosts` and the --bind mount to detect root disks. Bonus enhance healthcheck logging by adding maintenance tags, for all messages.	2020-08-13 15:21:20 -07:00
Harshavardhana	038d91feaa	fix: add public certs automatically as part of global CAs (#10256 )	2020-08-13 09:46:50 -07:00
Harshavardhana	e7ba78beee	use GlobalContext instead of context.Background when possible (#10254 )	2020-08-13 09:16:01 -07:00
Harshavardhana	b32d0a5b60	use the correct endpoints for offline drives	2020-08-12 19:17:49 -07:00
poornas	79e21601b0	fix: web handlers to enforce replication (#10249 ) This PR also preserves source ETag for replication	2020-08-12 17:32:24 -07:00
Harshavardhana	34253aa595	feat: cache env value in-case network is not reachable (#10251 )	2020-08-12 16:53:15 -07:00
Harshavardhana	79ed7ce451	fs: listObjects shouldn't take FS locks while listing (#10248 )	2020-08-12 15:23:14 +05:30
Harshavardhana	0dd3a08169	move the certPool loader function into pkg/certs (#10239 )	2020-08-11 08:29:50 -07:00
Klaus Post	f8f290e848	security: Remove insecure custom headers (#10244 ) Background: https://github.com/google/security-research/security/advisories/GHSA-76wf-9vgp-pj7w Remove these custom headers from incoming and outgoing requests.	2020-08-11 08:29:29 -07:00
Harshavardhana	1e2ebc9945	feat: time to bring back http2.0 support (#10230 ) Bonus move our CI/CD to go1.14	2020-08-10 09:02:29 -07:00
Harshavardhana	2a9819aff8	fix: refactor background heal for cluster health (#10225 )	2020-08-07 19:43:06 -07:00
Harshavardhana	6c6137b2e7	add cluster maintenance healthcheck drive heal affinity (#10218 )	2020-08-07 13:22:53 -07:00
Anis Elleuch	9138b2b503	Avoid duplicate headers when proxying S3 listing requests (#10220 )	2020-08-07 04:10:16 -07:00
Harshavardhana	77509ce391	Support looking up environment remotely (#10215 ) adds a feature where we can fetch the MinIO command-line remotely, this is primarily meant to add some stateless nature to the MinIO deployment in k8s environments, MinIO operator would run a webhook service endpoint which can be used to fetch any environment value in a generalized approach.	2020-08-06 18:03:16 -07:00
poornas	adcaa6f9de	fix: Change ListBucketTargets handler (#10217 ) to list all targets across a tenant. Also fixing some validations.	2020-08-06 17:10:21 -07:00
poornas	121164db56	fix: relax some replication validations (#10210 ) Also inherit storage class from source object if replication configuration does not have a storage class specified for destination bucket.	2020-08-05 20:01:20 -07:00
Harshavardhana	a20d4568a2	fix: make sure to use uniform drive count calculation (#10208 ) It is possible in situations when server was deployed in asymmetric configuration in the past such as ``` minio server ~/fs{1...4}/disk{1...5} ``` Results in setDriveCount of 10 in older releases but with fairly recent releases we have moved to having server affinity which means that a set drive count ascertained from above config will be now '4' While the object layer make sure that we honor `format.json` the storageClass configuration however was by mistake was using the global value obtained by heuristics. Which leads to prematurely using lower parity without being requested by the an administrator. This PR fixes this behavior.	2020-08-05 13:31:12 -07:00
Harshavardhana	e656beb915	feat: allow service accounts to be generated with OpenID STS (#10184 ) Bonus also fix a bug where we did not purge relevant service accounts generated by rotating credentials appropriately, service accounts should become invalid as soon as its corresponding parent user becomes invalid. Since service account themselves carry parent claim always we would never reach this problem, as the access get rejected at IAM policy layer.	2020-08-05 13:08:40 -07:00
poornas	88daaef76b	Validate object lock when setting replication config. (#10200 ) Check if object lock is enabled on destination bucket while setting replication configuration on a object lock enabled bucket.	2020-08-04 23:02:27 -07:00
Harshavardhana	0b8255529a	fix: proxies set keep-alive timeouts to be system dependent (#10199 ) Split the DialContext's one for internode and another for all other external communications especially proxy forwarders, gateway transport etc.	2020-08-04 14:55:53 -07:00
Harshavardhana	019fe69a57	fix: reduce an extra system call for writes instead fail later (#10187 )	2020-08-04 12:09:41 -07:00
Anis Elleuch	6ae30b21c9	fix ILM should not remove a protected version (#10189 )	2020-08-03 23:04:40 -07:00
Harshavardhana	b16781846e	allow server to start even with corrupted/faulty disks (#10175 )	2020-08-03 18:17:48 -07:00
Harshavardhana	5ce82b45da	add CopyObject optimization when source and destination are same (#10170 ) when source and destination are same and versioning is enabled on the destination bucket - we do not need to re-create the entire object once again to optimize on space utilization. Cases this PR is not supporting - any pre-existing legacy object will not be preserved in this manner, meaning a new dataDir will be created. - key-rotation and storage class changes of course will never re-use the dataDir	2020-08-03 16:21:10 -07:00
Harshavardhana	e99bc177c0	fix: allow FS mode situations when conflicting files exist (#10185 ) conflicting files can exist on FS at `.minio.sys/buckets/testbucket/policy.json/`, this is an expected valid scenario for FS mode allow it to work, i.e ignore and move forward	2020-08-03 13:20:49 -07:00
Harshavardhana	b68bc75dad	fix: quorum calculation mistake with reduced parity (#10186 ) With reduced parity our write quorum should be same as read quorum, but code was still assuming ``` readQuorum+1 ``` In all situations which is not necessary.	2020-08-03 12:15:08 -07:00
Harshavardhana	d61eac080b	fix: connection_string should override other params (#10180 ) closes #9965	2020-08-03 09:16:00 -07:00
poornas	a8dd7b3eda	Refactor replication target management. (#10154 ) Generalize replication target management so that remote targets for a bucket can be managed with ARNs. `mc admin bucket remote` command will be used to manage targets.	2020-07-30 19:55:22 -07:00
Harshavardhana	25a55bae6f	fix: avoid buffering of server sent events by proxies (#10164 )	2020-07-30 19:45:12 -07:00
Harshavardhana	fe157166ca	fix: Pass context all the way down to the network call in lockers (#10161 ) Context timeout might race on each other when timeouts are lower i.e when two lock attempts happened very quickly on the same resource and the servers were yet trying to establish quorum. This situation can lead to locks held which wouldn't be unlocked and subsequent lock attempts would fail. This would require a complete server restart. A potential of this issue happening is when server is booting up and we are trying to hold a 'transaction.lock' in quick bursts of timeout.	2020-07-29 23:15:34 -07:00
Adam Brown	f7259adf83	Update LastUpdate timestamp before save (#10152 )	2020-07-28 13:20:50 -07:00
Harshavardhana	6669560cb9	turn-off bucket usage metrics in gateway mode (#10150 ) closes #10147	2020-07-28 13:04:26 -07:00
poornas	b46ab7e921	Rename replication target handler (#10142 ) Rename replication target handler to a generic bucket target handler	2020-07-28 11:50:47 -07:00
Harshavardhana	27266f8a54	fix: if OPA set do not enforce policy claim (#10149 )	2020-07-28 11:47:57 -07:00
poornas	1b6ba0d062	Add validation in cache for offline drives (#10146 ) closes #10144	2020-07-28 10:06:52 -07:00
Harshavardhana	f200a7fb6a	fix: speed up OBD tests avoid unnecessary memory allocation (#10141 ) replace dummy buffer with nullReader{} instead, to avoid large memory allocations in memory constrainted environments. allows running obd tests in such environments.	2020-07-27 14:51:59 -07:00
Harshavardhana	47e304d03c	fix: add missing content-disposition from CORS handler (#10137 )	2020-07-27 09:03:38 -07:00
Harshavardhana	9108abf204	fix: allow shareable URLs with rotating creds (#10135 ) closes #8935	2020-07-27 09:02:53 -07:00
Harshavardhana	6529dcb3b5	fix: gateway Walk() implementation to list correct contents (#10131 ) closes #10122	2020-07-26 22:56:05 -07:00
Harshavardhana	abbf6ce6cc	simplify JWKS decoding in OpenID and more tests (#10119 ) add tests for non-compliant Azure AD behavior with "nonce" to fail properly and treat it as expected behavior for non-standard JWT tokens.	2020-07-25 08:42:41 -07:00
Harshavardhana	5ffc733eec	fix: enforce bucket quota from browser uploads (#10129 )	2020-07-24 21:16:54 -07:00
Harshavardhana	35212b673e	add unformatted disk as part of the error list (#10128 ) these errors should be ignored for quorum error calculation to ensure that we don't prematurely return unformatted disk error as part of API calls	2020-07-24 13:16:11 -07:00
Harshavardhana	57ff9abca2	Apply quota usage cache invalidation per second (#10127 ) Allow faster lookups for quota check enforcement	2020-07-24 12:24:21 -07:00
Jorge Israel Peña	4752323e1c	Use hdfs.Readdir() to optimize HDFS directory listings (#10121 ) Currently, listing directories on HDFS incurs a per-entry remote Stat() call penalty, the cost of which can really blow up on directories with many entries (+1,000) especially when considered in addition to peripheral calls (such as validation) and the fact that minio is an intermediary to the client (whereas other clients listed below can query HDFS directly). Because listing directories this way is expensive, the Golang HDFS library provides the [`Client.Open()`] function which creates a [`FileReader`] that is able to batch multiple calls together through the [`Readdir()`] function. This is substantially more efficient for very large directories. In one case we were witnessing about +20 seconds to list a directory with 1,500 entries, admittedly large, but the Java hdfs ls utility as well as the HDFS library sample ls utility were much faster. Hadoop HDFS DFS (4.02s): λ ~/code/minio → use-readdir » time hdfs dfs -ls /directory/with/1500/entries/ … hdfs dfs -ls 5.81s user 0.49s system 156% cpu 4.020 total Golang HDFS library (0.47s): λ ~/code/hdfs → master » time ./hdfs ls -lh /directory/with/1500/entries/ … ./hdfs ls -lh 0.13s user 0.14s system 56% cpu 0.478 total mc and minio without optimization (16.96s): λ ~/code/minio → master » time mc ls myhdfs/directory/with/1500/entries/ … ./mc ls 0.22s user 0.29s system 3% cpu 16.968 total mc and minio with optimization (0.40s): λ ~/code/minio → use-readdir » time mc ls myhdfs/directory/with/1500/entries/ … ./mc ls 0.13s user 0.28s system 102% cpu 0.403 total [`Client.Open()`]: https://godoc.org/github.com/colinmarc/hdfs#Client.Open [`FileReader`]: https://godoc.org/github.com/colinmarc/hdfs#FileReader [`Readdir()`]: https://godoc.org/github.com/colinmarc/hdfs#FileReader.Readdir	2020-07-24 11:31:51 -07:00
Klaus Post	11593c6cc4	Usage: Reset merged info when updating (#10126 ) When merging multiple buckets reset between each update. Avoids merging the same usage metrics multiple times resulting in duplicate data entries.	2020-07-24 11:02:10 -07:00
Harshavardhana	10025bda45	fix: add missing response headers to CORS handler (#10124 )	2020-07-24 00:46:51 -07:00
Harshavardhana	3a73f1ead5	refactor server update behavior (#10107 )	2020-07-23 08:03:31 -07:00
poornas	b9be841fd2	Add missing validation for replication API conditions (#10114 )	2020-07-22 17:39:40 -07:00
Anis Elleuch	456b2ef6eb	Avoid healing to be stuck with many concurrent event listeners (#10111 ) If there are many listeners to bucket notifications or to the trace subsystem, healing fails to work properly since it suspends itself when the number of concurrent connections is above a certain threshold. These connections are also continuous and not costly (no disk access), it is okay to just ignore them in waitForLowHTTPReq().	2020-07-22 13:16:55 -07:00
poornas	c43da3005a	Add support for server side bucket replication (#9882 )	2020-07-21 17:49:56 -07:00
Harshavardhana	a880283593	Send the lower level error directly from GetDiskID() (#10095 ) this is to detect situations of corruption disk format etc errors quickly and keep the disk online in such scenarios for requests to fail appropriately.	2020-07-21 13:54:06 -07:00
Harshavardhana	eb6bf454f1	fix: copyObject encryption from unencrypted object (#10102 ) This is a continuation of #10085	2020-07-21 12:25:01 -07:00
Harshavardhana	ec06089eda	fix: re-implement cluster healthcheck (#10101 )	2020-07-20 18:31:22 -07:00
Harshavardhana	0c4be55936	fix: fix lockup in merge-walk pool (#10098 ) Fixes two different types of problems - continuation of the problem seen in FS #9992 as not fixed for erasure coded deployments, reproduced this issue with spark and its fixed now - another issue was leaking walk go-routines which would lead to high memory usage and crash the system this is simply because all the walks which were purged at the top limit had leaking end walkers which would consume memory endlessly. closes #9966 closes #10088	2020-07-20 17:28:26 -07:00
Harshavardhana	11d21d5d1b	fix: pass around the correct drives per set (#10097 ) this is a precursor change before adding parity based SLA across zones instead of same stripe size	2020-07-20 16:38:40 -07:00
Harshavardhana	2955aae8e4	feat: Add notification support for bucketCreates and removal (#10075 )	2020-07-20 12:52:49 -07:00
Harshavardhana	9fd836e51f	add dnsStore interface for upcoming operator webhook (#10077 )	2020-07-20 12:28:48 -07:00
Anis Elleuch	518f44908c	fs: Close object fs.json before deletion (#10092 ) NFS fails when deleting a file while it is already opened. The reason is that the object fs.json meta file is opened but not closed before removal.	2020-07-20 08:52:24 -07:00
Harshavardhana	e2c71717f8	add different TCP timeouts for internal and incoming (#10090 ) closes #10086	2020-07-19 17:16:12 -07:00
Harshavardhana	7764c542f2	allow claims to be optional in STS (#10078 ) not all claims need to be present for the JWT claim, let the policies not exist and only apply which are present when generating the credentials once credentials are generated then those policies should exist, otherwise the request will fail.	2020-07-19 15:34:01 -07:00
Harshavardhana	d53e560ce0	fix: copyObject key rotation issue (#10085 ) - copyObject in-place decryption failed due to incorrect verification of headers - do not decode ETag when object is encrypted with SSE-C, so that pre-conditions don't fail prematurely.	2020-07-18 17:36:32 -07:00
Harshavardhana	17747db93f	fix: support healing older content (#10076 ) This PR adds support for healing older content i.e from 2yrs, 1yr. Also handles other situations where our config was not encrypted yet. This PR also ensures that our Listing is consistent and quorum friendly, such that we don't list partial objects	2020-07-17 17:41:29 -07:00
Harshavardhana	3fe27c8411	fix: In federated setup dial all hosts to figure out online host (#10074 ) In federated NAS gateway setups, multiple hosts in srvRecords was picked at random which could mean that if one of the host was down the request can indeed fail and if client retries it would succeed. Instead allow server to figure out the current online host quickly such that we can exclude the host which is down. At the max the attempt to look for a downed node is to 300 millisecond, if the node is taking longer to respond than this value we simply ignore and move to the node, total attempts are equal to number of srvRecords if no server is online we simply fallback to last dialed host.	2020-07-17 14:25:47 -07:00
Harshavardhana	14b1c9f8e4	fix: return Range errors after If-Matches (#10045 ) closes #7292	2020-07-17 13:01:22 -07:00
Klaus Post	d84fc58cac	fix: CheckParts endpoint call to correct API (#10073 ) CheckParts is calling the wrong endpoint, so instead of checking parts, it is writing metadata.	2020-07-17 10:17:59 -07:00
Harshavardhana	187c3f62df	fix: heal replaced drives properly (#10069 ) healing was not working properly when drives were replaced, due to the error check in root disk calculation this PR fixes this behavior This PR also adds additional fix for missing metadata entries from .minio.sys as part of disk healing as well. Added code to ignore and print more context sensitive errors for better debugging. This PR is continuation of fix in `7b14e9b660`	2020-07-17 10:08:04 -07:00
Harshavardhana	4bfc50411c	fix: return versionId in tagging APIs (#10068 )	2020-07-16 22:38:58 -07:00
Harshavardhana	d3c81a6e93	add missing available space from metrics (#10065 )	2020-07-16 14:43:48 -07:00
Harshavardhana	7342b5355f	fix: obtain correct location string with DNS style buckets (#10060 ) closes #10054	2020-07-16 13:28:29 -07:00
Harshavardhana	7b14e9b660	fix: diskInfo should check diskID only if disk is online (#10058 ) closes #10057	2020-07-16 07:30:05 -07:00
Harshavardhana	cd849bc2ff	update STS docs with new values (#10055 ) Co-authored-by: Poorna <poornas@users.noreply.github.com>	2020-07-15 14:36:14 -07:00
Klaus Post	00d3cc4b69	Enforce quota checks after crawl (#10036 ) Enforce bucket quotas when crawling has finished. This ensures that we will not do quota enforcement on old data. Additionally, delete less if we are closer to quota than we thought.	2020-07-14 18:59:05 -07:00
Harshavardhana	14ff7f5fcf	add hdfs sub-path support (#10046 ) for users who don't have access to HDFS rootPath '/' can optionally specify `minio gateway hdfs hdfs://namenode:8200/path` for which they have access to, allowing all writes to be performed at `/path`. NOTE: once configured in this manner you need to make sure command line is correctly specified, otherwise your data might not be visible closes #10011	2020-07-14 15:49:10 -07:00
Harshavardhana	369a876ebe	fix: handle array policies in JWT claim (#10041 ) PR #10014 was not complete as only handled policy claims partially.	2020-07-14 10:26:47 -07:00
Anis Elleuch	778e9c864f	Move dependency from minio-go v6 to v7 (#10042 )	2020-07-14 09:38:05 -07:00
Harshavardhana	a2616b8227	allow turning off secure ciphers (#10038 ) this PR to allow legacy support for big-data applications which run older Java versions which do not support the secure ciphers currently defaulted in MinIO. This option allows optionally to turn them off such that client and server can negotiate the best ciphers themselves. This env is purposefully not documented, meant as a last resort when client application cannot be changed easily.	2020-07-13 14:20:21 -07:00
Harshavardhana	e7d7d5232c	fix: admin info output and improve overall performance (#10015 ) - admin info node offline check is now quicker - admin info now doesn't duplicate the code across doing the same checks for disks - rely on StorageInfo to return appropriate errors instead of calling locally. - diskID checks now return proper errors when disk not found v/s format.json missing. - add more disk states for more clarity on the underlying disk errors.	2020-07-13 09:51:07 -07:00
Harshavardhana	1d65ef3201	fix: deletes on older format properly (#10029 ) while we handle all situations for writes and reads on older format, what we didn't cater for properly yet was delete where we only ended up deleting just `xl.meta` - instead we should allow all the deletes to go through for older format without versioning enabled buckets.	2020-07-13 09:01:17 -07:00
Harshavardhana	37c14207d6	fix: cors handling again for not just OPTIONS request (#10025 ) CORS is notorious requires specific headers to be handled appropriately in request and response, using cors package as part of handlerFunc() for options method lacks the necessary control this package needs to add headers.	2020-07-12 10:56:57 -07:00
Harshavardhana	3b9fbf80ad	fix: make sure to use new restClient for healthcheck (#10026 ) Without instantiating a new rest client we can have a recursive error which can lead to healthcheck returning always offline, this can prematurely take the servers offline.	2020-07-11 22:19:38 -07:00
Harshavardhana	143f9371c6	fix: loading users regression additionally also move to latest gorilla/mux master to fix the DNS style bucket routing regression resolves #10022 resolves #10023	2020-07-11 14:03:27 -07:00
Harshavardhana	3f1902face	fix: cors should be available on all paths (#10020 )	2020-07-11 13:49:24 -07:00
Harshavardhana	c0adb52213	sync to disk only upon successful legacy metadata rename (#10018 )	2020-07-11 09:37:34 -07:00
Harshavardhana	2d17c16d93	fix: make sure to honor versioning from browser UI deletes (#10016 )	2020-07-10 22:21:04 -07:00
Harshavardhana	36d36fab0b	fix: add virtual host style workaround for gorilla/mux issue (#10010 ) gorilla/mux broke their recent release 1.7.4 which we upgraded to, we need the current workaround to ensure that our regex matches appropriately. An upstream PR is sent, we should remove the workaround once we have a new release.	2020-07-10 15:21:32 -07:00
Harshavardhana	ba756cf366	fix: extract array type for policy claim if present (#10014 )	2020-07-10 14:48:44 -07:00
Benjamin Sodenkamp	c00d410e61	Added bucket name param to ToJSONError call (#9961 ) when called with InvalidBucketName error. The user is shown a more specific error when the param is present.	2020-07-10 12:10:39 -07:00
Klaus Post	968342c732	Remove usage of go-ieproxy for windows (#10009 ) There is a potential for deadlock on Windows 10 refer https://github.com/mattn/go-ieproxy/issues/17 remove this dependency for now.	2020-07-10 12:08:14 -07:00
Harshavardhana	5c15656c55	support bootstrap client to use healthcheck restClient (#10004 ) - reduce locker timeout for early transaction lock for more eagerness to timeout - reduce leader lock timeout to range from 30sec to 1minute - add additional log message during bootstrap phase	2020-07-10 09:26:21 -07:00
kannappanr	efe9fe6124	azure: Return success when deleting non-existent object (#9981 )	2020-07-10 08:30:23 -07:00
Klaus Post	c850905e43	fix: threadwalk lockup under high load (#9992 ) Main issue is that `t.pool[params]` should be `t.pool[oldest]`. We add a bit more safety features for the code. * Make writes to the endTimerCh non-blocking in all cases so multiple releases cannot lock up. * Double check expectations. * Shift down deletes with copy instead of truncating slice. * Actually delete the oldest if we are above total limit. * Actually delete the oldest found and not the current. * Unexport the mutex so nobody from the outside can meddle with it.	2020-07-09 07:02:18 -07:00
Andreas Auernhammer	a317a2531c	admin: new API for creating KMS master keys (#9982 ) This commit adds a new admin API for creating master keys. An admin client can send a POST request to: ``` /minio/admin/v3/kms/key/create?key-id=<keyID> ``` The name / ID of the new key is specified as request query parameter `key-id=<ID>`. Creating new master keys requires KES - it does not work with the native Vault KMS (deprecated) nor with a static master key (deprecated). Further, this commit removes the `UpdateKey` method from the `KMS` interface. This method is not needed and not used anymore.	2020-07-08 18:50:43 -07:00
Harshavardhana	2743d4ca87	fix: Add support for preserving mtime for replication (#9995 ) This PR is needed for bucket replication support	2020-07-08 17:36:56 -07:00
Harshavardhana	6136a963c8	fix: bump the response header timeout for forwarder as well (#9994 ) continuation of #9986, add more place where the lower timeout comes into effect.	2020-07-08 10:55:24 -07:00
Anis Elleuch	fa211f6a10	heal: Fix healing delete markers (#9989 )	2020-07-07 20:54:09 -07:00
Harshavardhana	72e0745e2f	fix: migrate to go.etcd.io import path (#9987 ) with the merge of https://github.com/etcd-io/etcd/pull/11823 etcd v3.5.0 will now have a properly imported versioned path this fixes our pending migration to newer repo	2020-07-07 19:04:29 -07:00
Klaus Post	aa4d1021eb	Remove timeout from putobject and listobjects (#9986 ) Use a separate client for these calls that can take a long time. Add request context to these so they are canceled when the client disconnects instead except for ListObject which doesn't have any equivalent.	2020-07-07 12:19:57 -07:00
Harshavardhana	93e7e4a0e5	fix: cors handling after gorilla mux update (#9980 ) fixes #9979	2020-07-06 20:55:19 -07:00
Anis Elleuch	c2f7cd1104	Consider errFileVersionNotFound during healing assessment (#9977 ) Healing an object which has multiple versions was not working because the healing code forgot to consider errFileVersionNotFound error as a use case that needs healing	2020-07-06 08:09:48 -07:00
Anis Elleuch	4cf80f96ad	fix: lifecycle XML parsing errors with Versioning (#9974 )	2020-07-05 09:08:42 -07:00
Anis Elleuch	d4af132fc4	lifecycle: Expiry should not delete versions (#9972 ) Currently, lifecycle expiry is deleting all object versions which is not correct, unless noncurrent versions field is specified. Also, only delete the delete marker if it is the only version of the given object.	2020-07-04 20:56:02 -07:00
Harshavardhana	c087a05b43	fix: simplify data structure before release (#9968 ) - additionally upgrade to msgp@v1.1.2 - change StatModTime,StatSize fields as simple Size/ModTime - reduce 50000 entries per List batch to 10000 as client needs to wait too long to see the first batch some times which is not desired and it is worth we write the data as soon as we have it.	2020-07-04 12:25:53 -07:00
Harshavardhana	cdb0e6ffed	support proper values for listMultipartUploads/listParts (#9970 ) object KMS is configured with auto-encryption, there were issues when using docker registry - this has been left unnoticed for a while. This PR fixes an issue with compatibility. Additionally also fix the continuation-token implementation infinite loop issue which was missed as part of #9939 Also fix the heal token to be generated as a client facing value instead of what is remembered by the server, this allows for the server to be stateless regarding the token's behavior.	2020-07-03 19:27:13 -07:00
Harshavardhana	03b84091fc	auto enable versioning with object locking (#9967 ) this is to preserve versioning for object-locked buckets from current release code.	2020-07-03 15:30:06 -07:00
Anis Elleuch	2be20588bf	Reroute requests based token heal/listing (#9939 ) When manual healing is triggered, one node in a cluster will become the authority to heal. mc regularly sends new requests to fetch the status of the ongoing healing process, but a load balancer could land the healing request to a node that is not doing the healing request. This PR will redirect a request to the node based on the node index found described as part of the client token. A similar technique is also used to proxy ListObjectsV2 requests by encoding this information in continuation-token	2020-07-03 11:53:03 -07:00
Harshavardhana	e59ee14f40	Tune tcp keep-alives with new kernel timeout options (#9963 ) For more deeper understanding https://blog.cloudflare.com/when-tcp-sockets-refuse-to-die/	2020-07-03 10:03:41 -07:00
Anis Elleuch	21a37e3393	fix: ListObjectVersions should return ordered Version & DeleteMarker (#9959 ) The S3 specification says that versions are ordered in the response of list object versions. mc snapshot needs this to know which version comes first especially when two versions have the same exact last-modified field.	2020-07-03 09:15:44 -07:00
Harshavardhana	810a4f0723	fix: return proper errors Get/HeadObject for deleteMarkers (#9957 )	2020-07-02 16:17:27 -07:00
Krishna Srinivas	4c266df863	fix: proxy ListObjects request to one of the server based on hash(bucket) (#9881 )	2020-07-02 10:56:22 -07:00
Klaus Post	abd999f64a	fix: list object versions in distributed setup (#9958 ) Remove calls to `WalkVersions` was calling the wrong endpoint, so unless quorum could be reached with local disks no results would ever be returned.	2020-07-02 10:29:50 -07:00
Benjamin Sodenkamp	648cb13e02	Added 'close' to results channel in Walk() (#9956 )	2020-07-01 14:29:45 -07:00
Harshavardhana	174f428571	add additional fdatasync before close() on writes (#9947 )	2020-07-01 10:57:23 -07:00
Harshavardhana	5388ae4acb	make sure to delete data-usage cache upon bucket deletes (#9952 )	2020-07-01 10:55:28 -07:00
kannappanr	5089a7167d	Handle empty retention in get/put object retention (#9948 ) Fixes #9943	2020-06-30 16:44:24 -07:00
Harshavardhana	c0ac25bfff	fix: readiness needs to be like liveness (#9941 ) Readiness as no reasoning to be cluster scope because that is not how the k8s networking works for pods, all the pods to a deployment are not sharing the network in a singleton. Instead they are run as local scopes to themselves, with readiness failures the pod is potentially taken out of the network to be resolvable - this affects the distributed setup in myriad of different ways. Instead readiness should behave like liveness with local scope alone, and should be a dummy implementation. This PR all the startup times and overal k8s startup time dramatically improves. Added another handler called as `/minio/health/cluster` to understand the cluster scope health.	2020-06-30 11:28:27 -07:00
Klaus Post	27a1f3ed2b	fs: Check if cache root was added (#9945 ) Fixes #9942	2020-06-30 09:32:36 -07:00
Harshavardhana	91817d0d1a	fix: implement generic Walk for gateway (#9938 ) Walk() functionality was missing on gateway implementations leading to missing functionality for the browser UI such as remove multiple objects, download as zip file etc. This PR brings a generic implementation across all gateway's, it is not required to repeat the same code in all gateway's	2020-06-29 17:07:23 -07:00
poornas	55a3b071ea	Allow optionally to disable range caching. (#9908 ) The default behavior is to cache each range requested to cache drive. Add an environment variable `MINIO_RANGE_CACHE` - when set to off, it disables range caching and instead downloads entire object in the background. Fixes #9870	2020-06-29 13:25:29 -07:00
Harshavardhana	a38ce29137	fix: simplify background heal and trigger heal items early (#9928 ) Bonus fix during versioning merge one of the PR was missing the offline/online disk count fix from #9801 port it correctly over to the master branch from release. Additionally, add versionID support for MRF Fixes #9910 Fixes #9931	2020-06-29 13:07:26 -07:00
Harshavardhana	4bba2cd034	fix: disallow versioning to be suspended with object lock (#9930 )	2020-06-28 08:15:15 -07:00

... 18 19 20 21 22 ...

4618 Commits