minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	157272dc5b	fix: use optimized json.NewEncoder instead for metrics (#15648 )	2022-09-05 08:06:35 -07:00
yudoutingle	f4c56026a2	fix: potential deadLock caused by unlocking a non-existing lock (#15635 )	2022-09-02 14:24:32 -07:00
Harshavardhana	37e3f5de10	do not print object not found errors in MRF healing (#15646 )	2022-09-02 14:22:40 -07:00
Harshavardhana	5ea629beb2	avoid printing io.ErrUnexpectedEOF for .metacache objects (#15642 )	2022-09-02 12:47:17 -07:00
Anis Elleuch	cf52691959	Save resync status in the backend using a last update timestamp (#15638 ) Currently, there is a short time window where the code is allowed to save the status of a replication resync. Currently, the window is `now.Sub(st.EndTime) <= resyncTimeInterval`. Also, any failure to write in the backend disks is not retried. Refactor the code a little bit to rely on the last timestamp of a successful write of the resync status of any given bucket in the backend disks.	2022-09-01 16:53:36 -07:00
Anis Elleuch	10e75116ef	Avoid replicating dirs in listing with replication enabled (#15641 ) When replication is enabled in a particular bucket, the listing will send objects to bucket replication, but it is also sending prefixes for non recursive listing which is useless and shows a lot of error logs. This commit will ignore prefixes.	2022-09-01 15:22:11 -07:00
Harshavardhana	f649968c69	tier: avoid stats infinite loop in forwardTo method (#15640 ) Under some sequences of events, the following code would reach an infinite loop. ``` idx1, idx2 := 0, 1 for ; idx2 != idx1; idx2++ { fmt.Println(idx2) } ``` Fixes #15639	2022-09-01 13:51:06 -07:00
Harshavardhana	bcedc2b0d9	fix: add healing metric type for heal tracing (#15631 ) changes the `heal.checkBucket` to `heal.Bucket` instead since the latter is more meaningful.	2022-08-31 12:28:03 -07:00
Klaus Post	8e4a45ec41	fix: encrypt checksums in metadata (#15620 )	2022-08-31 08:13:23 -07:00
Klaus Post	dec942beb6	feat: Add healing trace (#15616 )	2022-08-31 01:56:12 -07:00
Abirdcfly	d4e0f13bb3	chore: remove duplicate word in comments (#15607 ) Signed-off-by: Abirdcfly <fp544037857@gmail.com>	2022-08-30 08:26:43 -07:00
Anis Elleuch	1f28a3bb80	Avoid messages from go test output (#15601 ) A lot of warning messages are printed in CI/CD failures generated by go test. Avoid that by requiring at least Error level for logging when doing go test.	2022-08-30 08:23:40 -07:00
Krishnan Parthasarathi	3a1d3a7952	audit-log: Add time to get/restore object from remote-tier (#15602 )	2022-08-29 21:33:59 -07:00
Klaus Post	a9f1ad7924	Add extended checksum support (#15433 )	2022-08-29 16:57:16 -07:00
Poorna	929b9e164e	site replication: Avoid returning root svcacct info in sr metadata (#15608 ) Service accounts of root users should not be replicated.	2022-08-29 11:19:51 -07:00
Harshavardhana	97376f6e8f	improve performance for inlined data (#15603 ) inlined data often is bigger than the allowed O_DIRECT alignment, so potentially we can write 'xl.meta' without O_DSYNC instead we can rely on O_DIRECT + fdatasync() instead. This PR allows O_DIRECT on inlined data that would gain the benefits of performing O_DIRECT, eventually performing an fdatasync() at the end. Performance boost can be observed here for small objects < 128KiB. The performance boost is mainly seen on HDD, and marginal on NVMe setups.	2022-08-29 11:19:29 -07:00
Febriananda Wida Pramudita	1f22a16b15	fix: endpoints for single local disks must retain port info (#15585 )	2022-08-26 12:53:15 -07:00
Harshavardhana	433b6fa8fe	upgrade golang-lint to the latest (#15600 )	2022-08-26 12:52:29 -07:00
Krishnan Parthasarathi	99fbfe2421	Add concurrency to healing objects on a fresh disk (#15575 )	2022-08-25 13:07:15 -07:00
Poorna	b1b6264bea	fix: validate deployment id when adding peer clusters (#15591 ) Fixes: #15573	2022-08-25 11:30:52 -07:00
Aditya Manthramurthy	18dffb26e7	Allow querying a single target in config get API (#15587 )	2022-08-25 00:17:05 -07:00
Harshavardhana	edba7c987b	fix: objects matching prefixes should not leave delete markers (#15586 ) This is needed to ensure that we do not leave prefixes where version is suspended, instead we never leave versions on these paths.	2022-08-24 13:46:29 -07:00
Anis Elleuch	b737c83a66	Ensure that only one node performs site replication healing (#15584 ) When a node finds a change in the other replication cluster and applies to itself will already notify other peers. No need for all nodes in a given cluster to do site replication healing, only one node is sufficient.	2022-08-24 13:46:09 -07:00
Anis Elleuch	97a6322de1	Fix regression in notifying peers about new policy mapping (#15583 ) Switch from mux.Vars() to r.Form to avoid the issue of missing arguments passed to LoadPolicyMappingHandler.	2022-08-24 12:34:52 -07:00
Klaus Post	037fe4afdc	Add listing block reuse (#15579 ) When streaming results, pool metadata slices when sent.	2022-08-24 09:11:16 -07:00
Aditya Manthramurthy	afbb63a197	Factor out external event notification funcs (#15574 ) This change moves external event notification functionality into `event-notification.go`. This simplifies notification related code.	2022-08-24 06:42:36 -07:00
Harshavardhana	8902561f3c	use new xxml for XML responses to support rare control characters (#15511 ) use new xxml/XML responses to support rare control characters fixes #15023	2022-08-23 17:04:11 -07:00
Anis Elleuch	b8cdf060c8	Properly replicate policy mapping for virtual users (#15558 ) Currently, replicating policy mapping for STS users does not work. Fix it by passing user type to PolicyDBSet.	2022-08-23 11:11:45 -07:00
Poorna	4155c5b695	replication: improve MRF healing. (#15556 ) This PR improves the replication failure healing by persisting most recent failures to disk and re-queuing them until the replication is successful. While this does not eliminate the need for healing during a full scan, queuing MRF vastly improves the ETA to keeping replicated buckets in sync as it does not wait for the scanner visit to detect unreplicated object versions.	2022-08-22 16:53:06 -07:00
Poorna	471467d310	fix: ensure metadata update happens after deletemarker replication (#15564 ) Fixes regression caused by #15521	2022-08-22 15:59:06 -07:00
Harshavardhana	ae4ee95d25	change default lock retry interval to 50ms (#15560 ) Competing mutating calls on the same object in a versioned bucket may have unexpectedly higher delays. This can be reproduced with a replicated bucket by overwriting and deleting the same object repeatedly. For longer locks, such as the scanner, keep the 1sec interval.	2022-08-19 16:21:05 -07:00
Harshavardhana	e9055e9ef7	fix: walk() should cancel itself upon context cancellation (#15553 ) This PR fixes possible leaks that may emanate from not listening on context cancelation or timeouts. ``` goroutine 60957610 [chan send, 16 minutes]: github.com/minio/minio/cmd.(erasureServerPools).Walk.func1.1.1(...) github.com/minio/minio/cmd/erasure-server-pool.go:1724 +0x368 github.com/minio/minio/cmd.listPathRaw({0x4a9a740, 0xc0666dffc0},... github.com/minio/minio/cmd/metacache-set.go:1022 +0xfc4 github.com/minio/minio/cmd.(erasureServerPools).Walk.func1.1() github.com/minio/minio/cmd/erasure-server-pool.go:1764 +0x528 created by github.com/minio/minio/cmd.(*erasureServerPools).Walk.func1 github.com/minio/minio/cmd/erasure-server-pool.go:1697 +0x1b7 ```	2022-08-18 17:49:08 -07:00
Harshavardhana	d350b666ff	feat: add idempotent delete marker support (#15521 ) The bottom line is delete markers are a nuisance, most applications are not version aware and this has simply complicated the version management. AWS S3 gave an unnecessary complication overhead for customers, they need to now manage these markers by applying ILM settings and clean them up on a regular basis. To make matters worse all these delete markers get replicated as well in a replicated setup, requiring two ILM settings on each site. This PR is an attempt to address this inferior implementation by deviating MinIO towards an idempotent delete marker implementation i.e MinIO will never create any more than single consecutive delete markers. This significantly reduces operational overhead by making versioning more useful for real data. This is an S3 spec deviation for pragmatic reasons.	2022-08-18 16:41:59 -07:00
Harshavardhana	895357607a	avoid using errors.As for 'errors.New' use errors.Is (#15549 ) Bonus: ignore coredns CVE, for now, there is no fix yet https://github.com/coredns/coredns/issues/5574	2022-08-18 11:10:49 -07:00
Harshavardhana	bf38c0c0d1	fix: increase concurrency of DeleteObjects() to N/10th (#15546 ) instead of keeping the value 10 and static, make the concurrency a function of incoming number of objects being deleted.	2022-08-18 09:33:56 -07:00
Poorna	21fe14201f	replication: centralize healthcheck for remote targets (#15516 ) This PR moves health check from minio-go client to being managed on the server. Additionally integrating health check into site replication	2022-08-16 17:46:22 -07:00
Harshavardhana	48640b1de2	fix: trim arn:aws:kms from incoming SSE aws-kms-key-id (#15540 )	2022-08-16 11:28:30 -07:00
Anis Elleuch	5682685c80	Introduce disk io stats metrics (#15512 )	2022-08-16 07:13:49 -07:00
Harshavardhana	c7d535c648	init console after IAM init() (#15531 ) fixes #15527	2022-08-13 12:54:41 -07:00
Aditya Manthramurthy	9986e103cf	Fix env var output in config get/export APIs (#15528 ) Fix a bug where env vars are not output when the config for the subsystem is specified solely via env vars.	2022-08-13 10:39:01 -07:00
Krishnan Parthasarathi	91e6af4470	Add trace support for decommissioning (#15502 ) * Add trace support for decommissioning * Add support for tracing errors during decommission	2022-08-10 12:46:45 -07:00
Shireesh Anjal	316c492842	Upgrade madmin-go to latest version (v1.4.15) (#15510 )	2022-08-10 07:36:13 -07:00
Harshavardhana	74418b542a	fix: incorrect context timeout during listPath() (#15509 ) This PR cleans up the listing code for single drive to ensure that we do not add an incorrect context timeout, while resuming the listing. fixes #15508	2022-08-10 07:35:29 -07:00
Poorna	172e63dbb6	fix: site replication group updates to set status correctly (#15507 ) Fixes: #15486	2022-08-09 15:17:43 -07:00
Poorna	21bf5b4db7	replication: heal proactively upon access (#15501 ) Queue failed/pending replication for healing during listing and GET/HEAD API calls. This includes healing of existing objects that were never replicated or those in the middle of a resync operation. This PR also fixes a bug in ListObjectVersions where lifecycle filtering should be done.	2022-08-09 15:00:24 -07:00
Harshavardhana	a406bb0288	restrict number of disks used for scanning buckets upto GOMAXPROCS (#15492 ) control scanner parallelism to avoid higher CPU usage on nodes that have more drives but an old CPU.	2022-08-08 16:16:44 -07:00
Harshavardhana	1823ab6808	LDAP/OpenID must be initialized IAM Init() (#15491 ) This allows for LDAP/OpenID to be non-blocking, allowing for unreachable Identity targets to be initialized in IAM.	2022-08-08 16:16:27 -07:00
Harshavardhana	8eec49304d	use logger.Info instead of logger.LogIf	2022-08-08 16:13:58 -07:00
Harshavardhana	ecdc2f2f5f	fix: maxConcurrent '0' is an invalid value (#15500 ) log and continue with defaults instead of crashing the service.	2022-08-08 15:18:45 -07:00
Harshavardhana	e178c55bc3	remove non-working GetRawData() from FS mode (#15498 )	2022-08-08 11:34:09 -07:00
Poorna	2c137c0d04	fix: handle invalid endpoint errors in site replication(#15499 ) fixes #15497	2022-08-08 11:12:05 -07:00
Harshavardhana	638c57e466	revert changes in FS implementation for umask fixes #15494	2022-08-08 09:48:24 -07:00
Harshavardhana	5e4213b3be	fix: keep writing previous speedtest result (#15484 ) when object speedtest is running keep writing previous speedtest result back to client until we have a new result - this avoids sending back blank entries in between the speedtest when it is running in 'autotune' mode.	2022-08-07 23:04:03 -07:00
Harshavardhana	e0b0a351c6	remove IAM old migration code (#15476 ) ``` commit `7bdaf9bc50` Author: Aditya Manthramurthy <donatello@users.noreply.github.com> Date: Wed Jul 24 17:34:23 2019 -0700 Update on-disk storage format for users system (#7949) ``` Bonus: fixes a bug when etcd keys were being re-encrypted.	2022-08-05 17:53:23 -07:00
Anis Elleuch	1d2ff46a89	Ensure lock/versioning permissions when creating a bucket (#15432 ) Currently, the code doesn't check if the user creating a bucket with locking feature has bucket locking and versioning permissions enabled, adding it in accordance with S3 spec. https://docs.aws.amazon.com/AmazonS3/latest/API/API_CreateBucket.html Object Lock - If ObjectLockEnabledForBucket is set to true in your CreateBucket request, s3:PutBucketObjectLockConfiguration and s3:PutBucketVersioning permissions are required.	2022-08-05 16:27:09 -07:00
Harshavardhana	8f7c739328	feat: add SpeedTest ResponseTimes and TTFB (#15479 ) Capture average, p50, p99, p999 response times and ttfb values. These are needed for latency measurements and overall understanding of our speedtest results.	2022-08-05 09:40:03 -07:00
Poorna	1beea3daba	fix: import bucket metadata import to return a summary (#15462 )	2022-08-05 01:52:50 -07:00
Aditya Manthramurthy	3d94c38ec4	Add env variables to configuration APIs output (#15465 ) Config export and config get APIs now include environment variables set on the server	2022-08-04 22:21:52 -07:00
Harshavardhana	f4af2d3cdc	fix: decodeDirObject() in single drive DeleteObjects() call (#15477 ) Thanks to @bh4t for reproducing this issue.	2022-08-04 18:57:43 -07:00
ebozduman	b57e7321e7	Replaces 'disk'=>'drive' visible to end user (#15464 )	2022-08-04 16:10:08 -07:00
Anis Elleuch	e93867488b	actively cancel listIAMConfigItems to avoid goroutine leak (#15471 ) listConfigItems creates a goroutine but sometimes callers will exit without properly asking listAllIAMConfigItems() to stop sending results, hence a goroutine leak. Create a new context and cancel it for each listAllIAMConfigItems call.	2022-08-04 13:20:43 -07:00
Harshavardhana	3bd9615d0e	fix: log if there is readDir() failure with ListBuckets (#15461 ) This is actionable and must be logged. Bonus: also honor umask by using 0o666 for all Open() syscalls.	2022-08-04 07:23:05 -07:00
Harshavardhana	a6e0ec4e6f	Add support converting non-inlined to inlined (#15444 ) This is a feature to allow for inode compaction on large clusters that use a lot of small files spread across a large hierarchy.	2022-08-02 23:10:22 -07:00
Andreas Auernhammer	d774a3309b	kes: automatically reload KES client certificate (#15450 ) This commit adds support for automatically reloading the MinIO client certificate for authentication to KES. The client certificate will now be reloaded: - when the private key / certificate file changes - when a SIGHUP signal is received - every 15 minutes Fixes #14869 Signed-off-by: Andreas Auernhammer <hi@aead.dev>	2022-08-02 16:58:09 -07:00
Anis Elleuch	b3edb25377	bloom: healObject to mark a path dirty only for dangling objects (#15458 ) The path is marked dirty automatically when healObject() is called, which is wrong. HealObject() is called during self-healing and this will lead to an increase in the false positive result of the bloom filter. Also move NSUpdated() from renameData() and call it directly in CompleteMultipart and PutObject, this is not a functional change but it will make it less prone to errors in the future.	2022-08-02 16:57:39 -07:00
Harshavardhana	53a816b17a	fix: readdir fallback on root of the drive (#15457 ) fixes #15452	2022-08-02 14:57:36 -07:00
Harshavardhana	043aaa792d	fix: instrument os.OpenFile differently for Reads and Writes (#15449 ) allows us to trace latency for READs or WRITEs	2022-08-01 13:22:43 -07:00
Shireesh Anjal	e6eab2091f	fix: Incorrect ServersCount in cluster.info (#15431 ) The `ServersCount` field in cluster.info is expected to contain the number of nodes, and not number of endpoints.	2022-07-29 22:21:40 -07:00
Harshavardhana	3cdb609cca	allow root users to return appropriate policy in AccountInfo (#15437 ) fixes #15436 This fixes a regression caused after the removal of "consoleAdmin" policy usage for 'root users' in PR #15402	2022-07-29 20:58:03 -07:00
Harshavardhana	aa874010e2	fix: regression in resolving the right versions (#15430 ) fix: regression in resolving right versions commit `d480022711` caused a regression in real resolver, by picking up incorrect versionID.	2022-07-29 10:03:53 -07:00
Cesar Celis Hernandez	8ec888d13d	feat: update binary once and push it to other servers (#15407 )	2022-07-29 08:34:30 -07:00
Harshavardhana	916f274c83	choose starting concurrency based on number of local disks (#15428 ) smaller setups may have less drives per server choosing the concurrency based on number of local drives, and let the MinIO server change the overall concurrency as necessary.	2022-07-29 00:00:06 -07:00
Aditya Manthramurthy	7ac53c07af	fix: passing application configuration to console (#15409 ) This is an update to MinIO server after swagger codegen related build fixes added after issues introduced in `39fd7b0b3b`	2022-07-28 18:30:24 -07:00
Harshavardhana	bc72e4226e	do not allow filesystem fallback in server download (#15429 ) It is possible for anyone with admin access to get the content of any OS location relatively easily by simply providing the file with `mc admin update alias/ /etc/passwd`. Workaround is to disable 'admin:ServiceUpdate' action. Everyone is advised to upgrade to this patch. Thanks to @alevsk for finding this bug.	2022-07-28 17:44:21 -07:00
Poorna	5e0776e96a	replication: Include replica object versions for resync (#15427 )	2022-07-28 13:43:02 -07:00
Anis Elleuch	2f1ef02d35	Do not update directory access time (#15426 ) Most setups will have relatime, which only updates the access time following a change in the directory.	2022-07-28 12:40:48 -07:00
Harshavardhana	aff236e20e	fix: cluster healthcheck for single drive setups (#15415 ) single drive setups must return '200 OK' if drive is accessible, current master returns '503'	2022-07-27 16:46:34 -07:00
Harshavardhana	cbd70d26b5	optimize speedtest for smaller setups (#15414 ) this has been observed in multiple environments where the setups are small `speedtest` naturally fails with default '10s' and the concurrency of '32' is big for such clusters. choose a smaller value i.e equal to number of drives in such clusters and let 'autotune' increase the concurrency instead.	2022-07-27 14:41:59 -07:00
Harshavardhana	5e763b71dc	use logger.LogOnce to reduce printing disconnection logs (#15408 ) fixes #15334 - re-use net/url parsed value for http.Request{} - remove gosimple, structcheck and unused due to https://github.com/golangci/golangci-lint/issues/2649 - unwrapErrs up to leafErr to ensure that we store exactly the correct errors	2022-07-27 09:44:59 -07:00
Aditya Manthramurthy	7e4e7a66af	Remove internal usage of consoleAdmin (#15402 ) "consoleAdmin" was used as the policy for root derived accounts, but this led to unexpected bugs when an administrator modified the consoleAdmin policy. This change avoids evaluating a policy for root derived accounts as by default no policy is mapped to the root user. If a session policy is attached to a root derived account, it will be evaluated as expected.	2022-07-26 19:06:55 -07:00
Shireesh Anjal	906947a285	fix: typo in json key ClusterInfo DeploymentID (#15406 ) deployement_id -> deployment_id	2022-07-26 19:05:33 -07:00
Poorna	426c902b87	site replication: fix healing of bucket deletes. (#15377 ) This PR changes the handling of bucket deletes for site replicated setups to hold on to deleted bucket state until it syncs to all the clusters participating in site replication.	2022-07-25 17:51:32 -07:00
Anis Elleuch	e4b51235f8	upgrade: Split in two steps to ensure a stable retry (#15396 ) Currently, if one server in a distributed setup fails to upgrade due to any reasons, it is not possible to upgrade again unless nodes are restarted. To fix this, split the upgrade process into two steps : - download the new binary on all servers - If successful, overwrite the old binary with the new one	2022-07-25 17:49:47 -07:00
Eng Zer Jun	0a3b1ad4eb	test: use `T.TempDir` to create temporary test directory (#15400 ) This commit replaces `ioutil.TempDir` with `t.TempDir` in tests. The directory created by `t.TempDir` is automatically removed when the test and all its subtests complete. Prior to this commit, temporary directory created using `ioutil.TempDir` needs to be removed manually by calling `os.RemoveAll`, which is omitted in some tests. The error handling boilerplate e.g. defer func() { if err := os.RemoveAll(dir); err != nil { t.Fatal(err) } } is also tedious, but `t.TempDir` handles this for us nicely. Reference: https://pkg.go.dev/testing#T.TempDir Signed-off-by: Eng Zer Jun <engzerjun@gmail.com>	2022-07-25 12:37:26 -07:00
Anis Elleuch	f23f442d33	Add cluster info to inspect/profiling archive (#15360 ) Add cluster info to inspect and profiling archive. In addition to the existing data generation for both inspect and profiling, cluster.info file is added. This latter contains some info of the cluster. The generation of cluster.info is done as the last step and it can fail if it exceeds 10 seconds.	2022-07-25 09:11:35 -07:00
Klaus Post	3795b2c8ba	Add compression scheme to header (#15395 ) For easier debugging. We still do not return compressed size for security reasons.	2022-07-24 07:15:49 -07:00
Harshavardhana	7725425e05	fix: fork os.MkdirAll to optimize cases where parent exists (#15379 ) a/b/c/d/ where `a/b/c/` exists results in additional syscalls such as an Lstat() call to verify if the `a/b/c/` exists and its a directory. We do not need to do this on MinIO since the parent prefixes if exist, we can simply return success without spending additional syscalls. Also this implementation attempts to simply use Access() calls to avoid os.Stat() calls since the latter does memory allocation for things we do not need to use. Access() is simpler since we have a predictable structure on the backend and we know exactly how our path structures are.	2022-07-24 00:43:11 -07:00
Aditya Manthramurthy	39fd7b0b3b	Pass multiple IDP config to console (#15270 ) This change passes multiple IDP config via a struct rather than env variables.	2022-07-22 15:28:02 -07:00
Harshavardhana	b0d70a0e5e	support additional claim info in Auditing STS calls (#15381 ) Bonus: Adds a missing AuditLog from AssumeRoleWithCertificate API Fixes #9529	2022-07-22 11:12:03 -07:00
Poorna	7d8c8de827	single drive: Remove bucket metadata on DeleteBucket (#15378 ) from disk and in-memory map	2022-07-21 19:51:53 -07:00
jiuker	3faef829c5	expect full quorum for writing 'format.json' everywhere (#15362 )	2022-07-21 18:04:17 -07:00
Poorna	7560fb6f9a	save IAM export assets relative at a folder prefix (#15355 )	2022-07-21 17:51:33 -07:00
Klaus Post	69bf39f42e	fix: make complete multipart uploads faster encrypted/compressed backends (#15375 ) - Only fetch the parts we need and abort as soon as one is missing. - Only fetch the number of parts requested by "ListObjectParts".	2022-07-21 16:47:58 -07:00
Minio Trusted	564a0afae1	Revert "tests: Add context cancelation (#15374 )" This reverts commit `1e332f0eb1`. Reverting this as tests are failing randomly.	2022-07-21 13:58:56 -07:00
Klaus Post	1e332f0eb1	tests: Add context cancelation (#15374 ) A huge number of goroutines would build up from various monitors When creating test filesystems provide a context so they can shut down when no longer needed.	2022-07-21 11:52:18 -07:00
Poorna	cab8d3d568	feat: add API to return list of objects waiting to be replicated (#15091 )	2022-07-21 11:05:44 -07:00
Klaus Post	be8c4cb24a	fix: support multiple validateAdminReq actions (#15372 ) handle multiple validateAdminReq actions and remove duplicate error responses.	2022-07-21 10:26:59 -07:00
Harshavardhana	65166e4ce4	fix: readQuorum calculation when defaultParityCount is 0 (#15363 ) when parity is '0' the readQuorum must be equal to the number of data disks.	2022-07-21 07:25:54 -07:00
Harshavardhana	d3f89fa6e3	remove unnecessary logs in IAM store (#15356 )	2022-07-20 08:19:12 -07:00
Harshavardhana	ce8397f7d9	use partInfo only for intermediate part.x.meta (#15353 )	2022-07-19 18:56:24 -07:00
Klaus Post	cae9aeca00	fix: reused field crash in PartIndices (#15351 ) `PartIndices` may be set if xlMetaV2Version is reused. Clear before unmarshaling and add sanity check when reading.	2022-07-19 16:49:46 -07:00
Klaus Post	f939d1c183	Independent Multipart Uploads (#15346 ) Do completely independent multipart uploads. In distributed mode, a lock was held to merge each multipart upload as it was added. This lock was highly contested and retries are expensive (timewise) in distributed mode. Instead, each part adds its metadata information uniquely. This eliminates the per object lock required for each to merge. The metadata is read back and merged by "CompleteMultipartUpload" without locks when constructing final object. Co-authored-by: Harshavardhana <harsha@minio.io>	2022-07-19 08:35:29 -07:00
Andreas Auernhammer	242d06274a	kms: add `context.Context` to KMS API calls (#15327 ) This commit adds a `context.Context` to the KMS `{Stat, CreateKey, GenerateKey}` API calls. The context will be used to terminate external calls as soon as the client request gets canceled. A follow-up PR will add a `context.Context` to the remaining `DecryptKey` API call. Signed-off-by: Andreas Auernhammer <hi@aead.dev>	2022-07-18 18:54:27 -07:00
Poorna	957e3ed729	export IAM: include site replicator svcacct (#15339 )	2022-07-18 17:38:53 -07:00
Harshavardhana	b6eb8dff64	Add decommission compression+encryption enabled tests (#15322 ) update compression environment variables to follow the expected sub-system style, however support fallback mode.	2022-07-17 08:43:14 -07:00
Harshavardhana	7da9e3a6f8	support encrypted/compressed objects properly during decommission (#15320 ) fixes #15314	2022-07-16 19:35:24 -07:00
Anis Elleuch	876970baea	Exclude upload-ids with incomplete part upload in multipart listing (#15318 ) Uploading a part object can leave an inconsistent state inside .minio.sys/multipart where data are uploaded but xl.meta is not committed yet. Do not list upload-ids that have this state in the multipart listing.	2022-07-16 13:25:58 -07:00
LHHDZ	e68e76e143	fix: data race, which caused tests execution to fail (#15313 )	2022-07-16 07:57:55 -07:00
Harshavardhana	e7ac1ea54c	allow decommission to continue when healing (#15312 ) Bonus: - heal buckets in-case during startup the new pools have bucket missing.	2022-07-15 21:03:23 -07:00
Harshavardhana	5ac6d91525	support 'admin update' for hotfix versions (#15308 ) hotfixed versions are rejected as invalid, allow `mc admin update` from hotfix repos.	2022-07-15 16:00:34 -07:00
Harshavardhana	1cd6713e24	copy query values before update to preserve the expected keys (#15310 ) in success_action_redirect we were missing required query params as per S3 spec - updated tests.	2022-07-15 15:04:48 -07:00
Harshavardhana	1b339ea062	allow force delete on decom pool (#15302 ) Bonus: - skip suspended pool from being considered for multipart uploads - add more context for decomErrors()	2022-07-14 20:44:22 -07:00
Harshavardhana	236ef03dbd	fix: skip objects expired via lifecycle rules during decommission (#15300 )	2022-07-14 16:47:09 -07:00
Poorna	7e32a17742	fix: site replication healing of missing buckets (#15298 ) fixes a regression from #15186 - Adding tests to cover healing of buckets. - Also dereference quota in SiteReplicationStatus only when non-nil	2022-07-14 14:27:47 -07:00
Krishnan Parthasarathi	1d42133d44	listing: Expire object versions past expiry (#15287 ) We skip object versions which are past their ILM expiry. This change schedules them for expiry while at it.	2022-07-14 07:21:26 -07:00
Poorna	b4f6901903	resync: Avoid concurrent access/write on map (#15286 ) fixes a crash ``` fatal error: concurrent map iteration and map write minio[19309]: goroutine 18640 [running]: minio[19309]: runtime.throw({0x27a3399?, 0x1785?}) minio[19309]: runtime/panic.go:992 +0x71 fp=0xc0062f1c80 sp=0xc0062f1c50 pc=0x438671 minio[19309]: runtime.mapiternext(0xc0062f1e90?) minio[19309]: runtime/map.go:871 +0x4eb fp=0xc0062f1cf0 sp=0xc0062f1c80 pc=0x41002b minio[19309]: github.com/minio/minio/cmd.(*ReplicationPool).periodicResyncMetaSave(0xc0056c00c0, {0x4d06a48, 0xc0005b2480}, {0x4d22fc0, 0xc0015ea0 ```	2022-07-13 16:29:10 -07:00
Klaus Post	0149382cdc	Add padding to compressed+encrypted files (#15282 ) Add up to 256 bytes of padding for compressed+encrypted files. This will obscure the obvious cases of extremely compressible content and leave a similar output size for a very wide variety of inputs. This does not mean the compression ratio doesn't leak information about the content, but the outcome space is much smaller, so often less information is leaked.	2022-07-13 07:52:15 -07:00
Klaus Post	697c9973a7	Upgrade compression package (#15284 ) Includes mitigation for CVE-2022-30631 (Go should still be updated) Remove functions now available upstream.	2022-07-13 07:48:14 -07:00
Harshavardhana	788fd3df81	preserve incoming query params in success_action_redirect (#15280 ) fixes #15274	2022-07-13 07:46:44 -07:00
Anis Elleuch	996cac5fed	Avoid listing buckets from a suspended pool (#15283 ) Buckets requested by make bucket calls sent after decommissioning has started are not created in a suspended pool. Therefore listing buckets should avoid suspended pools as well.	2022-07-13 07:44:50 -07:00
Harshavardhana	0a8b78cb84	fix: simplify passing auditLog eventType (#15278 ) Rename Trigger -> Event to be a more appropriate name for the audit event. Bonus: fixes a bug in AddMRFWorker() it did not cancel the waitgroup, leading to waitgroup leaks.	2022-07-12 10:43:32 -07:00
Harshavardhana	b4eb74f5ff	allow custom speedtest bucket (#15271 ) this allows for specifying existing buckets with - object replication enabled - object encryption enabled - object versioning enabled - object locking enabled	2022-07-12 10:12:47 -07:00
Anis Elleuch	57d1f31054	Do not log erasure read failure when disk goes offline (#15277 ) Avoid printing the following log ``` API: SYSTEM Time: Fri Jul 08 2022 11:48:40 GMT+0100 Error: Error(disk not found) reading erasure shards at... Backtrace: 0: internal/logger/logger.go:278:logger.LogIf() 1: cmd/bitrot-streaming.go:156:cmd.(streamingBitrotReader).ReadAt() 2: cmd/erasure-decode.go:165:cmd.(parallelReader).Read.func1() ```	2022-07-12 09:56:56 -07:00
Klaus Post	9f02f51b87	Add 4K minimum compressed size (#15273 ) There is no point in compressing very small files. Typically the effective size on disk will be the same due to disk blocks. So don't waste resources on extremely small files. We don't check on multipart. 1) because we don't know and 2) this is very likely a big object anyway.	2022-07-12 07:42:04 -07:00
Klaus Post	911a17b149	Add compressed file index (#15247 )	2022-07-11 17:30:56 -07:00
Poorna	3d969bd2b4	fix: ignore missing targets/replication config during site removal (#15269 )	2022-07-11 14:11:46 -07:00
Andreas Auernhammer	f800cee4fa	metric: add KMS-related metrics (#15258 ) This commit adds a minimal set of KMS-related metrics: ``` # HELP minio_cluster_kms_online Reports whether the KMS is online (1) or offline (0) # TYPE minio_cluster_kms_online gauge minio_cluster_kms_online{server="127.0.0.1:9000"} 1 # HELP minio_cluster_kms_request_error Number of KMS requests that failed with a well-defined error # TYPE minio_cluster_kms_request_error counter minio_cluster_kms_request_error{server="127.0.0.1:9000"} 16790 # HELP minio_cluster_kms_request_success Number of KMS requests that succeeded # TYPE minio_cluster_kms_request_success counter minio_cluster_kms_request_success{server="127.0.0.1:9000"} 348031 ``` Currently, we report whether the KMS is available and how many requests succeeded/failed. However, KES exposes much more metrics that can be exposed if necessary. See: https://pkg.go.dev/github.com/minio/kes#Metric Signed-off-by: Andreas Auernhammer <hi@aead.dev>	2022-07-11 09:17:28 -07:00
Praveen raj Mani	b49fc33cb3	purge objects immediately with `x-minio-force-delete` in DeleteObject and DeleteBucket API (#15148 )	2022-07-11 09:15:54 -07:00
Klaus Post	37a6b2da67	Allow compaction at bucket top level. (#15266 ) If more than 1M folders (objects or prefixes) are found at the top level in a bucket, allow it to be compacted. While this is a very suboptimal structure, we should limit memory usage at some point.	2022-07-11 07:59:03 -07:00
Harshavardhana	913e977c8d	remove auto-port warning for console-address (#15260 )	2022-07-08 13:36:41 -07:00
Harshavardhana	c2ddcb3b40	do not recreate deprecated delete-journal.bin, only read it (#15185 ) simplify deprecated code, re-enable hot-swap replace disks	2022-07-08 12:17:02 -07:00
Anis Elleuch	ed0cbfb31e	fix: rootdisk detection by not using cached value when GetDiskInfo() errors out (#15249 ) GetDiskInfo() uses timedValue to cache the disk info for one second. timedValue behavior was recently changed to return an old cached value when calculating a new value returns an error. When a mount point is empty, GetDiskInfo() will return errUnformattedDisk, timedValue will return cached disk info with unexpected IsRootDisk value, e.g. false if the mount point belongs to a root disk. Therefore, the mount point will be considered a valid disk and will be formatted as well. This commit will also add more defensive code when marking root disks: always mark a disk offline for any GetDiskInfo() error except errUnformattedDisk. The server will try anyway to reconnect to those disks every 10 seconds.	2022-07-07 17:05:23 -07:00
Harshavardhana	32b2f6117e	fix: do not pass around sync.Map (#15250 ) it is not safe to pass around sync.Map through pointers, as it may be concurrently updated by different callers. this PR simplifies by avoiding sync.Map altogether, we do not need sync.Map to keep object->erasureMap association. This PR fixes a crash when concurrently using this value when audit logs are configured. ``` fatal error: concurrent map iteration and map write goroutine 247651580 [running]: runtime.throw({0x277a6c1?, 0xc002381400?}) runtime/panic.go:992 +0x71 fp=0xc004d29b20 sp=0xc004d29af0 pc=0x438671 runtime.mapiternext(0xc0d6e87f18?) runtime/map.go:871 +0x4eb fp=0xc004d29b90 sp=0xc004d29b20 pc=0x41002b ```	2022-07-07 17:04:25 -07:00
Harshavardhana	ae92521310	remove unnecessary nAgreed value in partial() func (#15242 )	2022-07-07 13:45:34 -07:00
Harshavardhana	5802df4365	retry and resume decom operation upon retriable failures (#15244 ) in a k8s-like system, reading pool.bin might not have quorum during startup; add a way to retry after this failure.	2022-07-07 12:31:44 -07:00
Anis Elleuch	8d98282afd	Better reporting of total/free usable capacity of the cluster (#15230 ) The current code uses an approximation based on a ratio. The approximation can skew if we have multiple pools with different disk capacities. Replace the algorithm with a simpler one which counts data disks and ignores parity disks.	2022-07-06 13:29:49 -07:00
Harshavardhana	3af6073576	no 'replicate status' without replication config (#15233 ) 'replicate status' shouldn't be displaying historic values unless replication config is present on the relevant bucket.	2022-07-06 09:53:33 -07:00
Harshavardhana	2518af5f9e	fix: allow certain mutations on objects during decommissioning (#15231 ) currently, deletion of objects was skipped by mistake if the object resided on the pool being decommissioned. deletes are okay to be allowed since decommission is designed to run on a cluster with active I/O.	2022-07-06 09:53:16 -07:00
Harshavardhana	7b793d84c8	fix: calculate scanner metric paths for single drive (#15232 ) Additionally use pathJoin() to avoid double `//` in path names.	2022-07-06 07:48:38 -07:00
Aditya Manthramurthy	af9bc7ea7d	Add external IDP management Admin API for OpenID (#15152 )	2022-07-05 18:18:04 -07:00
Klaus Post	ac055b09e9	Add detailed scanner metrics (#15161 )	2022-07-05 14:45:49 -07:00
haslersn	df42914da6	Fix missing whitespace in error message for IncompleteBody (#15227 )	2022-07-05 12:19:57 -07:00
Klaus Post	2471bdda00	fix: for DiskInfo call cache disk metrics (#15229 ) Small uploads spend a significant amount of time (~5%) fetching disk info metrics. Also maps are allocated for each call. Add a 100ms cache to disk metrics.	2022-07-05 11:02:30 -07:00
Harshavardhana	9d80ff5a05	fix: decommission delete markers for non-current objects (#15225 ) versioned buckets were not creating the delete markers present in the versioned stack of an object, which would essentially prevent decommission from succeeding. This PR fixes creating such delete markers properly during a decommissioning process, and adds tests as well.	2022-07-05 07:37:24 -07:00
Harshavardhana	b311abed31	decom IAM, Bucket metadata properly (#15220 ) Current code incorrectly passed the config asset object name while decommissioning; make sure that we pass the right object name to be hashed on the newer set of pools. This PR fixes situations where, after a successful decommission, the users and policies might go missing due to a wrongly hashed set.	2022-07-04 14:02:54 -07:00
Harshavardhana	ce667ddae0	do not print errFileNotFound in entries.resolve() (#15216 )	2022-07-04 06:40:46 -07:00
Harshavardhana	0fee993a4b	return appropriate error under 'decom status' (#15213 ) fixes #15208	2022-07-01 16:21:23 -07:00
Poorna	0ea5c9d8e8	site healing: Skip stale iam asset updates from peer. (#15203 ) Allow healing to apply IAM change only when peer gave the most recent update.	2022-07-01 13:19:13 -07:00
Harshavardhana	63ac260bd5	Simplify Prometheus metrics gather (#15210 )	2022-07-01 13:18:39 -07:00
Harshavardhana	f9a4ad7904	update banner with version+runtime (#15206 )	2022-06-30 13:58:09 -07:00
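
Commit 8d98282afd above describes replacing a ratio-based estimate of usable capacity with one that counts data disks and ignores parity disks. The following is a minimal sketch of that idea, not the code from the commit: the `disk` type, the `usableCapacity` function, and the `dataPerPool` parameter are hypothetical names chosen for illustration, and the sketch assumes a drive counts as a data drive when its index within its erasure set is below the pool's data-drive count.

```go
package main

import "fmt"

// disk is a hypothetical, simplified view of one drive; it is not MinIO's type.
type disk struct {
	PoolIndex  int    // pool this drive belongs to
	DiskIndex  int    // position of the drive within its erasure set
	TotalSpace uint64 // bytes
	FreeSpace  uint64 // bytes
}

// usableCapacity sums total and free space over data drives only.
// dataPerPool[i] is the assumed number of data drives per erasure set in
// pool i (set size minus parity). Drives whose index falls in the parity
// range, or whose pool is unknown, are skipped.
func usableCapacity(disks []disk, dataPerPool []int) (total, free uint64) {
	for _, d := range disks {
		if d.PoolIndex < 0 || d.PoolIndex >= len(dataPerPool) {
			continue
		}
		if d.DiskIndex < dataPerPool[d.PoolIndex] {
			total += d.TotalSpace
			free += d.FreeSpace
		}
	}
	return total, free
}

func main() {
	// Pool 0: 4 drives of 100 bytes each, 2 data + 2 parity.
	// Pool 1: 4 drives of 1000 bytes each, 3 data + 1 parity.
	var disks []disk
	for i := 0; i < 4; i++ {
		disks = append(disks, disk{0, i, 100, 50})
		disks = append(disks, disk{1, i, 1000, 400})
	}
	total, free := usableCapacity(disks, []int{2, 3})
	fmt.Println(total, free) // prints "3200 1300"
}
```

Because each pool contributes its own data-drive count and its own drive sizes, pools with different disk capacities do not skew the result the way a single cluster-wide ratio can.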

4777 Commits