minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	ae4ee95d25	change default lock retry interval to 50ms (#15560 ) competing calls on the same object on versioned bucket mutating calls on the same object may unexpected have higher delays. This can be reproduced with a replicated bucket overwriting the same object writes, deletes repeatedly. For longer locks like scanner keep the 1sec interval	2022-08-19 16:21:05 -07:00
Harshavardhana	e9055e9ef7	fix: walk() should cancel itself upon context cancellation (#15553 ) This PR fixes possible leaks that may emanate from not listening on context cancelation or timeouts. ``` goroutine 60957610 [chan send, 16 minutes]: github.com/minio/minio/cmd.(erasureServerPools).Walk.func1.1.1(...) github.com/minio/minio/cmd/erasure-server-pool.go:1724 +0x368 github.com/minio/minio/cmd.listPathRaw({0x4a9a740, 0xc0666dffc0},... github.com/minio/minio/cmd/metacache-set.go:1022 +0xfc4 github.com/minio/minio/cmd.(erasureServerPools).Walk.func1.1() github.com/minio/minio/cmd/erasure-server-pool.go:1764 +0x528 created by github.com/minio/minio/cmd.(*erasureServerPools).Walk.func1 github.com/minio/minio/cmd/erasure-server-pool.go:1697 +0x1b7 ```	2022-08-18 17:49:08 -07:00
Harshavardhana	d350b666ff	feat: add idempotent delete marker support (#15521 ) The bottom line is delete markers are a nuisance, most applications are not version aware and this has simply complicated the version management. AWS S3 gave an unnecessary complication overhead for customers, they need to now manage these markers by applying ILM settings and clean them up on a regular basis. To make matters worse all these delete markers get replicated as well in a replicated setup, requiring two ILM settings on each site. This PR is an attempt to address this inferior implementation by deviating MinIO towards an idempotent delete marker implementation i.e MinIO will never create any more than single consecutive delete markers. This significantly reduces operational overhead by making versioning more useful for real data. This is an S3 spec deviation for pragmatic reasons.	2022-08-18 16:41:59 -07:00
Harshavardhana	895357607a	avoid using errors.As for 'errors.New' use errors.Is (#15549 ) Bonus: ignore coredns CVE, for now, there is no fix yet https://github.com/coredns/coredns/issues/5574	2022-08-18 11:10:49 -07:00
Harshavardhana	bf38c0c0d1	fix: increase concurrency of DeleteObjects() to N/10th (#15546 ) instead of keeping the value 10 and static, make the concurrency a function of incoming number of objects being deleted.	2022-08-18 09:33:56 -07:00
Poorna	21fe14201f	replication: centralize healthcheck for remote targets (#15516 ) This PR moves health check from minio-go client to being managed on the server. Additionally integrating health check into site replication	2022-08-16 17:46:22 -07:00
Harshavardhana	48640b1de2	fix: trim arn:aws:kms from incoming SSE aws-kms-key-id (#15540 )	2022-08-16 11:28:30 -07:00
Anis Elleuch	5682685c80	Introduce disk io stats metrics (#15512 )	2022-08-16 07:13:49 -07:00
Harshavardhana	c7d535c648	init console after IAM init() (#15531 ) fixes #15527	2022-08-13 12:54:41 -07:00
Aditya Manthramurthy	9986e103cf	Fix env var output in config get/export APIs (#15528 ) Fix a bug where env vars are not output when the config for the subsystem is specified solely via env vars.	2022-08-13 10:39:01 -07:00
Krishnan Parthasarathi	91e6af4470	Add trace support for decommissioning (#15502 ) * Add trace support for decommissioning * Add support for tracing errors during decommission	2022-08-10 12:46:45 -07:00
Shireesh Anjal	316c492842	Upgrade madmin-go to latest version (v1.4.15) (#15510 )	2022-08-10 07:36:13 -07:00
Harshavardhana	74418b542a	fix: incorrect context timeout during listPath() (#15509 ) This PR cleans up the listing code for single drive to ensure that we do not add an incorrect context timeout, while resuming the listing. fixes #15508	2022-08-10 07:35:29 -07:00
Poorna	172e63dbb6	fix: site replication group updates to set status correctly (#15507 ) Fixes: #15486	2022-08-09 15:17:43 -07:00
Poorna	21bf5b4db7	replication: heal proactively upon access (#15501 ) Queue failed/pending replication for healing during listing and GET/HEAD API calls. This includes healing of existing objects that were never replicated or those in the middle of a resync operation. This PR also fixes a bug in ListObjectVersions where lifecycle filtering should be done.	2022-08-09 15:00:24 -07:00
Harshavardhana	a406bb0288	restrict number of disks used for scanning buckets upto GOMAXPROCS (#15492 ) control scanner parallelism to avoid higher CPU usage on nodes that have more drives but an old CPU.	2022-08-08 16:16:44 -07:00
Harshavardhana	1823ab6808	LDAP/OpenID must be initialized IAM Init() (#15491 ) This allows for LDAP/OpenID to be non-blocking, allowing for unreachable Identity targets to be initialized in IAM.	2022-08-08 16:16:27 -07:00
Harshavardhana	8eec49304d	use logger.Info instead of logger.LogIf	2022-08-08 16:13:58 -07:00
Harshavardhana	ecdc2f2f5f	fix: maxConcurrent '0' is an invalid value (#15500 ) log and continue with defaults instead of crashing the service.	2022-08-08 15:18:45 -07:00
Harshavardhana	e178c55bc3	remove non-working GetRawData() from FS mode (#15498 )	2022-08-08 11:34:09 -07:00
Poorna	2c137c0d04	fix: handle invalid endpoint errors in site replication(#15499 ) fixes #15497	2022-08-08 11:12:05 -07:00
Harshavardhana	638c57e466	revert changes in FS implementation for umask fixes #15494	2022-08-08 09:48:24 -07:00
Harshavardhana	5e4213b3be	fix: keep writing previous speedtest result (#15484 ) when object speedtest is running keep writing previous speedtest result back to client until we have a new result - this avoids sending back blank entries in between the speedtest when it is running in 'autotune' mode.	2022-08-07 23:04:03 -07:00
Harshavardhana	e0b0a351c6	remove IAM old migration code (#15476 ) ``` commit `7bdaf9bc50` Author: Aditya Manthramurthy <donatello@users.noreply.github.com> Date: Wed Jul 24 17:34:23 2019 -0700 Update on-disk storage format for users system (#7949) ``` Bonus: fixes a bug when etcd keys were being re-encrypted.	2022-08-05 17:53:23 -07:00
Anis Elleuch	1d2ff46a89	Ensure lock/versioning permissions when creating a bucket (#15432 ) Currently, the code doesn't check if the user creating a bucket with locking feature has bucket locking and versioning permissions enabled, adding it in accordance with S3 spec. https://docs.aws.amazon.com/AmazonS3/latest/API/API_CreateBucket.html Object Lock - If ObjectLockEnabledForBucket is set to true in your CreateBucket request, s3:PutBucketObjectLockConfiguration and s3:PutBucketVersioning permissions are required.	2022-08-05 16:27:09 -07:00
Harshavardhana	8f7c739328	feat: add SpeedTest ResponseTimes and TTFB (#15479 ) Capture average, p50, p99, p999 response times and ttfb values. These are needed for latency measurements and overall understanding of our speedtest results.	2022-08-05 09:40:03 -07:00
Poorna	1beea3daba	fix: import bucket metadata import to return a summary (#15462 )	2022-08-05 01:52:50 -07:00
Aditya Manthramurthy	3d94c38ec4	Add env variables to configuration APIs output (#15465 ) Config export and config get APIs now include environment variables set on the server	2022-08-04 22:21:52 -07:00
Harshavardhana	f4af2d3cdc	fix: decodeDirObject() in single drive DeleteObjects() call (#15477 ) Thanks to @bh4t for reproducing this issue.	2022-08-04 18:57:43 -07:00
ebozduman	b57e7321e7	Replaces 'disk'=>'drive' visible to end user (#15464 )	2022-08-04 16:10:08 -07:00
Anis Elleuch	e93867488b	actively cancel listIAMConfigItems to avoid goroutine leak (#15471 ) listConfigItems creates a goroutine but sometimes callers will exit without properly asking listAllIAMConfigItems() to stop sending results, hence a goroutine leak. Create a new context and cancel it for each listAllIAMConfigItems call.	2022-08-04 13:20:43 -07:00
Harshavardhana	3bd9615d0e	fix: log if there is readDir() failure with ListBuckets (#15461 ) This is actionable and must be logged. Bonus: also honor umask by using 0o666 for all Open() syscalls.	2022-08-04 07:23:05 -07:00
Harshavardhana	a6e0ec4e6f	Add support converting non-inlined to inlined (#15444 ) This is a feature to allow for inode compaction on large clusters that use a lot of small files spread across a large heirarchy.	2022-08-02 23:10:22 -07:00
Andreas Auernhammer	d774a3309b	kes: automatically reload KES client certificate (#15450 ) This commit adds support for automatically reloading the MinIO client certificate for authentication to KES. The client certificate will now be reloaded: - when the private key / certificate file changes - when a SIGHUP signal is received - every 15 minutes Fixes #14869 Signed-off-by: Andreas Auernhammer <hi@aead.dev>	2022-08-02 16:58:09 -07:00
Anis Elleuch	b3edb25377	bloom: healObject to mark a path dirty only for dangling objects (#15458 ) The path is marked dirty automatically when healObject() is called, which is wrong. HealObject() is called during self-healing and this will lead to an increase in the false positive result of the bloom filter. Also move NSUpdated() from renameData() and call it directly in CompleteMultipart and PutObject, this is not a functional change but it will make it less prone to errors in the future.	2022-08-02 16:57:39 -07:00
Harshavardhana	53a816b17a	fix: readdir fallback on root of the drive (#15457 ) fixes #15452	2022-08-02 14:57:36 -07:00
Harshavardhana	043aaa792d	fix: intrument os.OpenFile differently for Reads and Writes (#15449 ) allows us to trace latency for READs or WRITEs	2022-08-01 13:22:43 -07:00
Shireesh Anjal	e6eab2091f	fix: Incorrect ServersCount in cluster.info (#15431 ) The `ServersCount` field in cluster.info is expected to contain the number of nodes, and not number of endpoints.	2022-07-29 22:21:40 -07:00
Harshavardhana	3cdb609cca	allow root users to return appropriate policy in AccountInfo (#15437 ) fixes #15436 This fixes a regression caused after the removal of "consoleAdmin" policy usage for 'root users' in PR #15402	2022-07-29 20:58:03 -07:00
Harshavardhana	aa874010e2	fix: regression in resolving the right versions (#15430 ) fix: regression in resolving right versions commit `d480022711` caused a regression in real resolver, by picking up incorrect versionID.	2022-07-29 10:03:53 -07:00
Cesar Celis Hernandez	8ec888d13d	feat: update binary once and push it to other servers (#15407 )	2022-07-29 08:34:30 -07:00
Harshavardhana	916f274c83	choose starting concurrency based on number of local disks (#15428 ) smaller setups may have less drives per server choosing the concurrency based on number of local drives, and let the MinIO server change the overall concurrency as necessary.	2022-07-29 00:00:06 -07:00
Aditya Manthramurthy	7ac53c07af	fix: passing application configuration to console (#15409 ) This is an update to MinIO server after swagger codegen related build fixes added after issues introduced in `39fd7b0b3b`	2022-07-28 18:30:24 -07:00
Harshavardhana	bc72e4226e	do not allow filesystem fallback in server download (#15429 ) It is possible for anyone with admin access to relatively to get any content of any random OS location by simply providing the file with 'mc admin update alias/ /etc/passwd`. Workaround is to disable 'admin:ServiceUpdate' action. Everyone is advised to upgrade to this patch. Thanks to @alevsk for finding this bug.	2022-07-28 17:44:21 -07:00
Poorna	5e0776e96a	replication: Include replica object versions for resync (#15427 )	2022-07-28 13:43:02 -07:00
Anis Elleuch	2f1ef02d35	Do not update directory access time (#15426 ) Most setups will have relatime it only updates the access time following a change in the directory.	2022-07-28 12:40:48 -07:00
Harshavardhana	aff236e20e	fix: cluster healthcheck for single drive setups (#15415 ) single drive setups must return '200 OK' if drive is accessible, current master returns '503'	2022-07-27 16:46:34 -07:00
Harshavardhana	cbd70d26b5	optimize speedtest for smaller setups (#15414 ) this has been observed in multiple environments where the setups are small `speedtest` naturally fails with default '10s' and the concurrency of '32' is big for such clusters. choose a smaller value i.e equal to number of drives in such clusters and let 'autotune' increase the concurrency instead.	2022-07-27 14:41:59 -07:00
Harshavardhana	5e763b71dc	use logger.LogOnce to reduce printing disconnection logs (#15408 ) fixes #15334 - re-use net/url parsed value for http.Request{} - remove gosimple, structcheck and unusued due to https://github.com/golangci/golangci-lint/issues/2649 - unwrapErrs upto leafErr to ensure that we store exactly the correct errors	2022-07-27 09:44:59 -07:00
Aditya Manthramurthy	7e4e7a66af	Remove internal usage of consoleAdmin (#15402 ) "consoleAdmin" was used as the policy for root derived accounts, but this lead to unexpected bugs when an administrator modified the consoleAdmin policy This change avoids evaluating a policy for root derived accounts as by default no policy is mapped to the root user. If a session policy is attached to a root derived account, it will be evaluated as expected.	2022-07-26 19:06:55 -07:00
Shireesh Anjal	906947a285	fix: typo in json key ClusterInfo DeploymentID (#15406 ) deployement_id -> deployment_id	2022-07-26 19:05:33 -07:00
Poorna	426c902b87	site replication: fix healing of bucket deletes. (#15377 ) This PR changes the handling of bucket deletes for site replicated setups to hold on to deleted bucket state until it syncs to all the clusters participating in site replication.	2022-07-25 17:51:32 -07:00
Anis Elleuch	e4b51235f8	upgrade: Split in two steps to ensure a stable retry (#15396 ) Currently, if one server in a distributed setup fails to upgrade due to any reasons, it is not possible to upgrade again unless nodes are restarted. To fix this, split the upgrade process into two steps : - download the new binary on all servers - If successful, overwrite the old binary with the new one	2022-07-25 17:49:47 -07:00
Eng Zer Jun	0a3b1ad4eb	test: use `T.TempDir` to create temporary test directory (#15400 ) This commit replaces `ioutil.TempDir` with `t.TempDir` in tests. The directory created by `t.TempDir` is automatically removed when the test and all its subtests complete. Prior to this commit, temporary directory created using `ioutil.TempDir` needs to be removed manually by calling `os.RemoveAll`, which is omitted in some tests. The error handling boilerplate e.g. defer func() { if err := os.RemoveAll(dir); err != nil { t.Fatal(err) } } is also tedious, but `t.TempDir` handles this for us nicely. Reference: https://pkg.go.dev/testing#T.TempDir Signed-off-by: Eng Zer Jun <engzerjun@gmail.com>	2022-07-25 12:37:26 -07:00
Anis Elleuch	f23f442d33	Add cluster info to inspect/profiling archive (#15360 ) Add cluster info to inspect and profiling archive. In addition to the existing data generation for both inspect and profiling, cluster.info file is added. This latter contains some info of the cluster. The generation of cluster.info is is done as the last step and it can fail if it exceed 10 seconds.	2022-07-25 09:11:35 -07:00
Klaus Post	3795b2c8ba	Add compression scheme to header (#15395 ) For easier debugging. We still do not return compressed size for security reasons.	2022-07-24 07:15:49 -07:00
Harshavardhana	7725425e05	fix: fork os.MkdirAll to optimize cases where parent exists (#15379 ) a/b/c/d/ where `a/b/c/` exists results in additional syscalls such as an Lstat() call to verify if the `a/b/c/` exists and its a directory. We do not need to do this on MinIO since the parent prefixes if exist, we can simply return success without spending additional syscalls. Also this implementation attempts to simply use Access() calls to avoid os.Stat() calls since the latter does memory allocation for things we do not need to use. Access() is simpler since we have a predictable structure on the backend and we know exactly how our path structures are.	2022-07-24 00:43:11 -07:00
Aditya Manthramurthy	39fd7b0b3b	Pass multiple IDP config to console (#15270 ) This change passes multiple IDP config via a struct rather than env variables.	2022-07-22 15:28:02 -07:00
Harshavardhana	b0d70a0e5e	support additional claim info in Auditing STS calls (#15381 ) Bonus: Adds a missing AuditLog from AssumeRoleWithCertificate API Fixes #9529	2022-07-22 11:12:03 -07:00
Poorna	7d8c8de827	single drive: Remove bucket metadata on DeleteBucket (#15378 ) from disk and in-memory map	2022-07-21 19:51:53 -07:00
jiuker	3faef829c5	expect full quorum for writing 'format.json' everywhere (#15362 )	2022-07-21 18:04:17 -07:00
Poorna	7560fb6f9a	save IAM export assets relative at a folder prefix (#15355 )	2022-07-21 17:51:33 -07:00
Klaus Post	69bf39f42e	fix: make complete multipart uploads faster encrypted/compressed backends (#15375 ) - Only fetch the parts we need and abort as soon as one is missing. - Only fetch the number of parts requested by "ListObjectParts".	2022-07-21 16:47:58 -07:00
Minio Trusted	564a0afae1	Revert "tests: Add context cancelation (#15374 )" This reverts commit `1e332f0eb1`. Reverting this as tests are failing randomly.	2022-07-21 13:58:56 -07:00
Klaus Post	1e332f0eb1	tests: Add context cancelation (#15374 ) A huge number of goroutines would build up from various monitors When creating test filesystems provide a context so they can shut down when no longer needed.	2022-07-21 11:52:18 -07:00
Poorna	cab8d3d568	feat: add API to return list of objects waiting to be replicated (#15091 )	2022-07-21 11:05:44 -07:00
Klaus Post	be8c4cb24a	fix: support multiple validateAdminReq actions (#15372 ) handle multiple validateAdminReq actions and remove duplicate error responses.	2022-07-21 10:26:59 -07:00
Harshavardhana	65166e4ce4	fix: readQuorum calculation when defaultParityCount is 0 (#15363 ) when parity is '0' the readQuorum must be equal to the number of data disks.	2022-07-21 07:25:54 -07:00
Harshavardhana	d3f89fa6e3	remove unnecessary logs in IAM store (#15356 )	2022-07-20 08:19:12 -07:00
Harshavardhana	ce8397f7d9	use partInfo only for intermediate part.x.meta (#15353 )	2022-07-19 18:56:24 -07:00
Klaus Post	cae9aeca00	fix: reused field crash in PartIndices (#15351 ) `PartIndices` may be set if xlMetaV2Version is reused. Clear before unmarshaling and add sanity check when reading.	2022-07-19 16:49:46 -07:00
Klaus Post	f939d1c183	Independent Multipart Uploads (#15346 ) Do completely independent multipart uploads. In distributed mode, a lock was held to merge each multipart upload as it was added. This lock was highly contested and retries are expensive (timewise) in distributed mode. Instead, each part adds its metadata information uniquely. This eliminates the per object lock required for each to merge. The metadata is read back and merged by "CompleteMultipartUpload" without locks when constructing final object. Co-authored-by: Harshavardhana <harsha@minio.io>	2022-07-19 08:35:29 -07:00
Andreas Auernhammer	242d06274a	kms: add `context.Context` to KMS API calls (#15327 ) This commit adds a `context.Context` to the the KMS `{Stat, CreateKey, GenerateKey}` API calls. The context will be used to terminate external calls as soon as the client requests gets canceled. A follow-up PR will add a `context.Context` to the remaining `DecryptKey` API call. Signed-off-by: Andreas Auernhammer <hi@aead.dev>	2022-07-18 18:54:27 -07:00
Poorna	957e3ed729	export IAM: include site replicator svcacct (#15339 )	2022-07-18 17:38:53 -07:00
Harshavardhana	b6eb8dff64	Add decommission compression+encryption enabled tests (#15322 ) update compression environment variables to follow the expected sub-system style, however support fallback mode.	2022-07-17 08:43:14 -07:00
Harshavardhana	7da9e3a6f8	support encrypted/compressed objects properly during decommission (#15320 ) fixes #15314	2022-07-16 19:35:24 -07:00
Anis Elleuch	876970baea	Exclude upload-ids with incomplete part upload in multipart listing (#15318 ) Uploading a part object can leave an inconsistent state inside .minio.sys/multipart where data are uploaded but xl.meta is not committed yet. Do not list upload-ids that have this state in the multipart listing.	2022-07-16 13:25:58 -07:00
LHHDZ	e68e76e143	fix: data race, which caused tests execution to fail (#15313 )	2022-07-16 07:57:55 -07:00
Harshavardhana	e7ac1ea54c	allow decommission to continue when healing (#15312 ) Bonus: - heal buckets in-case during startup the new pools have bucket missing.	2022-07-15 21:03:23 -07:00
Harshavardhana	5ac6d91525	support 'admin update' for hotfix versions (#15308 ) hotfixed versions are rejected as invalid, allow `mc admin update` from hotfix repos.	2022-07-15 16:00:34 -07:00
Harshavardhana	1cd6713e24	copy query values before update to preserve the expected keys (#15310 ) in success_action_redirect we were missing required query params as per S3 spec - updated tests.	2022-07-15 15:04:48 -07:00
Harshavardhana	1b339ea062	allow force delete on decom pool (#15302 ) Bonus: - skip suspended pool from being considered for multipart uploads - add more context for decomErrors()	2022-07-14 20:44:22 -07:00
Harshavardhana	236ef03dbd	fix: skip objects expired via lifecycle rules during decommission (#15300 )	2022-07-14 16:47:09 -07:00
Poorna	7e32a17742	fix: site replication healing of missing buckets (#15298 ) fixes a regression from #15186 - Adding tests to cover healing of buckets. - Also dereference quota in SiteReplicationStatus only when non-nil	2022-07-14 14:27:47 -07:00
Krishnan Parthasarathi	1d42133d44	listing: Expire object versions past expiry (#15287 ) We skip object versions which are past their ILM expiry. This change schedules them for expiry while at it.	2022-07-14 07:21:26 -07:00
Poorna	b4f6901903	resync: Avoid concurrent access/write on map (#15286 ) fixes a crash ``` fatal error: concurrent map iteration and map write minio[19309]: goroutine 18640 [running]: minio[19309]: runtime.throw({0x27a3399?, 0x1785?}) minio[19309]: runtime/panic.go:992 +0x71 fp=0xc0062f1c80 sp=0xc0062f1c50 pc=0x438671 minio[19309]: runtime.mapiternext(0xc0062f1e90?) minio[19309]: runtime/map.go:871 +0x4eb fp=0xc0062f1cf0 sp=0xc0062f1c80 pc=0x41002b minio[19309]: github.com/minio/minio/cmd.(*ReplicationPool).periodicResyncMetaSave(0xc0056c00c0, {0x4d06a48, 0xc0005b2480}, {0x4d22fc0, 0xc0015ea0 ```	2022-07-13 16:29:10 -07:00
Klaus Post	0149382cdc	Add padding to compressed+encrypted files (#15282 ) Add up to 256 bytes of padding for compressed+encrypted files. This will obscure the obvious cases of extremely compressible content and leave a similar output size for a very wide variety of inputs. This does not mean the compression ratio doesn't leak information about the content, but the outcome space is much smaller, so often less information is leaked.	2022-07-13 07:52:15 -07:00
Klaus Post	697c9973a7	Upgrade compression package (#15284 ) Includes mitigation for CVE-2022-30631 (Go should still be updated) Remove functions now available upstream.	2022-07-13 07:48:14 -07:00
Harshavardhana	788fd3df81	preserve incoming query params in success_action_redirect (#15280 ) fixes #15274	2022-07-13 07:46:44 -07:00
Anis Elleuch	996cac5fed	Avoid listing buckets from a suspended pool (#15283 ) Make bucket requests sent after decommissioning is started are not created in a suspended pool. Therefore listing buckets should avoid suspended pools as well.	2022-07-13 07:44:50 -07:00
Harshavardhana	0a8b78cb84	fix: simplify passing auditLog eventType (#15278 ) Rename Trigger -> Event to be a more appropriate name for the audit event. Bonus: fixes a bug in AddMRFWorker() it did not cancel the waitgroup, leading to waitgroup leaks.	2022-07-12 10:43:32 -07:00
Harshavardhana	b4eb74f5ff	allow custom speedtest bucket (#15271 ) this allows for specifying existing buckets with - object replication enabled - object encryption enabled - object versioning enabled - object locking enabled	2022-07-12 10:12:47 -07:00
Anis Elleuch	57d1f31054	Do not log erasure read failure when disk goes offline (#15277 ) Avoid printing the following log ``` API: SYSTEM Time: Fri Jul 08 2022 11:48:40 GMT+0100 Error: Error(disk not found) reading erasure shards at... Backtrace: 0: internal/logger/logger.go:278:logger.LogIf() 1: cmd/bitrot-streaming.go:156:cmd.(streamingBitrotReader).ReadAt() 2: cmd/erasure-decode.go:165:cmd.(parallelReader).Read.func1() ```	2022-07-12 09:56:56 -07:00
Klaus Post	9f02f51b87	Add 4K minimum compressed size (#15273 ) There is no point in compressing very small files. Typically the effective size on disk will be the same due to disk blocks. So don't waste resources on extremely small files. We don't check on multipart. 1) because we don't know and 2) this is very likely a big object anyway.	2022-07-12 07:42:04 -07:00
Klaus Post	911a17b149	Add compressed file index (#15247 )	2022-07-11 17:30:56 -07:00
Poorna	3d969bd2b4	fix: ignore missing targets/replication config during site removal (#15269 )	2022-07-11 14:11:46 -07:00
Andreas Auernhammer	f800cee4fa	metric: add KMS-related metrics (#15258 ) This commit adds a minimal set of KMS-related metrics: ``` # HELP minio_cluster_kms_online Reports whether the KMS is online (1) or offline (0) # TYPE minio_cluster_kms_online gauge minio_cluster_kms_online{server="127.0.0.1:9000"} 1 # HELP minio_cluster_kms_request_error Number of KMS requests that failed with a well-defined error # TYPE minio_cluster_kms_request_error counter minio_cluster_kms_request_error{server="127.0.0.1:9000"} 16790 # HELP minio_cluster_kms_request_success Number of KMS requests that succeeded # TYPE minio_cluster_kms_request_success counter minio_cluster_kms_request_success{server="127.0.0.1:9000"} 348031 ``` Currently, we report whether the KMS is available and how many requests succeeded/failed. However, KES exposes much more metrics that can be exposed if necessary. See: https://pkg.go.dev/github.com/minio/kes#Metric Signed-off-by: Andreas Auernhammer <hi@aead.dev>	2022-07-11 09:17:28 -07:00
Praveen raj Mani	b49fc33cb3	purge objects immediately with `x-minio-force-delete` in DeleteObject and DeleteBucket API (#15148 )	2022-07-11 09:15:54 -07:00
Klaus Post	37a6b2da67	Allow compaction at bucket top level. (#15266 ) If more than 1M folders (objects or prefixes) are found at the top level in a bucket allow it to be compacted. While very suboptimal structure we should limit memory usage at some point.	2022-07-11 07:59:03 -07:00
Harshavardhana	913e977c8d	remove auto-port warning for console-address (#15260 )	2022-07-08 13:36:41 -07:00
Harshavardhana	c2ddcb3b40	do not recreate deprecated delete-journal.bin, only read it (#15185 ) simplify deprecated code, re-enable hot-swap replace disks	2022-07-08 12:17:02 -07:00
Anis Elleuch	ed0cbfb31e	fix: rootdisk detection by not using cached value when GetDiskInfo() errors out (#15249 ) GetDiskInfo() uses timedValue to cache the disk info for one second. timedValue behavior was recently changed to return an old cached value when calculating a new value returns an error. When a mount point is empty, GetDiskInfo() will return errUnformattedDisk, timedValue will return cached disk info with unexpected IsRootDisk value, e.g. false if the mount point belongs to a root disk. Therefore, the mount point will be considered a valid disk and will be formatted as well. This commit will also add more defensive code when marking root disks: always mark a disk offline for any GetDiskInfo() error except errUnformattedDisk. The server will try anyway to reconnect to those disks every 10 seconds.	2022-07-07 17:05:23 -07:00
Harshavardhana	32b2f6117e	fix: do not pass around sync.Map (#15250 ) it is not safe to pass around sync.Map through pointers, as it may be concurrently updated by different callers. this PR simplifies by avoiding sync.Map altogether, we do not need sync.Map to keep object->erasureMap association. This PR fixes a crash when concurrently using this value when audit logs are configured. ``` fatal error: concurrent map iteration and map write goroutine 247651580 [running]: runtime.throw({0x277a6c1?, 0xc002381400?}) runtime/panic.go:992 +0x71 fp=0xc004d29b20 sp=0xc004d29af0 pc=0x438671 runtime.mapiternext(0xc0d6e87f18?) runtime/map.go:871 +0x4eb fp=0xc004d29b90 sp=0xc004d29b20 pc=0x41002b ```	2022-07-07 17:04:25 -07:00
Harshavardhana	ae92521310	remove unnecessary nAgreed value in partial() func (#15242 )	2022-07-07 13:45:34 -07:00
Harshavardhana	5802df4365	retry and resume decom operation upon retriable failures (#15244 ) it is possible in a k8s-like system reading pool.bin might not have quorum during startup, however, add a way to retry after this failure.	2022-07-07 12:31:44 -07:00
Anis Elleuch	8d98282afd	Better reporting of total/free usable capacity of the cluster (#15230 ) The current code uses approximation using a ratio. The approximation can skew if we have multiple pools with different disk capacities. Replace the algorithm with a simpler one which counts data disks and ignore parity disks.	2022-07-06 13:29:49 -07:00
Harshavardhana	3af6073576	no 'replicate status' without replication config (#15233 ) 'replicate status' shouldn't be displaying historic values unless replication config is present on the relevant bucket.	2022-07-06 09:53:33 -07:00
Harshavardhana	2518af5f9e	fix: allow certain mutations on objects during decommissioning (#15231 ) fix: allow certain mutation on objects during decommission currently by mistake deletion of objects was skipped, if the object resided on the pool being decommissioned. delete's are okay to be allowed since decommission is designed to run on a cluster with active I/O.	2022-07-06 09:53:16 -07:00
Harshavardhana	7b793d84c8	fix: calculate scanner metric paths for single drive (#15232 ) Additionally use pathJoin() to avoid double `//` in path names.	2022-07-06 07:48:38 -07:00
Aditya Manthramurthy	af9bc7ea7d	Add external IDP management Admin API for OpenID (#15152 )	2022-07-05 18:18:04 -07:00
Klaus Post	ac055b09e9	Add detailed scanner metrics (#15161 )	2022-07-05 14:45:49 -07:00
haslersn	df42914da6	Fix missing whitespace in error message for IncompleteBody (#15227 )	2022-07-05 12:19:57 -07:00
Klaus Post	2471bdda00	fix: for DiskInfo call cache disk metrics (#15229 ) Small uploads spend a significant amount of time (~5%) fetching disk info metrics. Also maps are allocated for each call. Add a 100ms cache to disk metrics.	2022-07-05 11:02:30 -07:00
Harshavardhana	9d80ff5a05	fix: decommission delete markers for non-current objects (#15225 ) versioned buckets were not creating the delete markers present in the versioned stack of an object, this essentially would stop decommission to succeed. This PR fixes creating such delete markers properly during a decommissioning process, adds tests as well.	2022-07-05 07:37:24 -07:00
Harshavardhana	b311abed31	decom IAM, Bucket metadata properly (#15220 ) Current code incorrectly passed the config asset object name while decommissioning, make sure that we pass the right object name to be hashed on the newer set of pools. This PR fixes situations after a successful decommission, the users and policies might go missing due to wrong hashed set.	2022-07-04 14:02:54 -07:00
Harshavardhana	ce667ddae0	do not print errFileNotFound in entries.resolve() (#15216 )	2022-07-04 06:40:46 -07:00
Harshavardhana	0fee993a4b	return appropriate error under 'decom status' (#15213 ) fixes #15208	2022-07-01 16:21:23 -07:00
Poorna	0ea5c9d8e8	site healing: Skip stale iam asset updates from peer. (#15203 ) Allow healing to apply IAM change only when peer gave the most recent update.	2022-07-01 13:19:13 -07:00
Harshavardhana	63ac260bd5	Simplify Prometheus metrics gather (#15210 )	2022-07-01 13:18:39 -07:00
Harshavardhana	f9a4ad7904	update banner with version+runtime (#15206 )	2022-06-30 13:58:09 -07:00
Minio Trusted	e60b67d246	Revert "Tighten enforcement of object retention (#14993 )" This reverts commit `5e3010d455`. This commit causes regression on object locked buckets causine delete-markers to be not created.	2022-06-30 13:06:32 -07:00
Klaus Post	9004d69c6f	Make ReqInfo concurrency safe (#15204 ) Some read/writes of ReqInfo did not get appropriate locks, leading to races. Make sure reading and writing holds appropriate locks.	2022-06-30 10:48:50 -07:00
Harshavardhana	8856a2d77b	finalize startup-banner and remove unnecessary logs (#15202 )	2022-06-29 16:32:04 -07:00
Anis Elleuch	54a061bdda	Save minio version information centrally (#15181 )	2022-06-29 14:45:49 -07:00
Poorna	7cc9286e0f	site healing: Skip stale bucket metadata updates from peer (#15186 ) Allow healing to apply bucket metadata change only when peer gave the most recent update.	2022-06-28 18:09:20 -07:00
Harshavardhana	2f25639ea0	update banner to reflect the final agreed UI (#15192 )	2022-06-28 16:37:40 -07:00
Harshavardhana	2070c215a2	handle missing funcNames for handlers (#15188 ) also use designated names for internal calls - storageREST calls are storageR - lockREST calls are lockR - peerREST calls are just peer Named in this fashion to facilitate wildcard matches by having prefixes of the same name. Additionally, also enable funcNames for generic handlers that return errors, currently we disable '<unknown>'	2022-06-28 05:04:10 -07:00
Harshavardhana	9c605ad153	allow support for parity '0', '1' enabling support for 2,3 drive setups (#15171 ) allows for further granular setups - 2 drives (1 parity, 1 data) - 3 drives (1 parity, 2 data) Bonus: allows '0' parity as well.	2022-06-27 20:22:18 -07:00
Anis Elleuch	b7c7e59dac	Revert proxying requests with precondition errors (#15180 ) In a replicated setup, when an object is updated in one cluster but still waiting to be replicated to the other cluster, GET requests with if-match, and range headers will likely fail. It is better to proxy requests instead. Also, this commit avoids printing verbose logs about precondition & range errors.	2022-06-27 14:03:44 -07:00
Harshavardhana	699cf6ff45	perform object sweep after equeue the latest CopyObject() (#15183 ) keep it similar to PutObject/CompleteMultipart	2022-06-27 12:11:33 -07:00
Anis Elleuch	9201870f6c	Remove unnecessary code in WalkDir() (#15168 ) Recalculating forward is useless. It is never used and it will be computed again when calling scanDir() again.	2022-06-27 10:26:56 -07:00
Harshavardhana	6722f58668	save MinIO version with each version (8-bytes extra) (#15170 ) store MinIO version along with each version in 'xl.meta' for future purposes, can be used as ways to add specific code for bug fixes if any.	2022-06-27 03:59:41 -07:00
Harshavardhana	7b9b7cef11	add license banner for GNU AGPLv3 (#15178 ) Bonus: rewrite subnet re-use of Transport	2022-06-27 03:58:25 -07:00
Harshavardhana	bd099f5e71	fix: change timedValue to return the previously cached value (#15169 ) fix: change timedvalue to return previous cached value caller can interpret the underlying error and decide accordingly, places where we do not interpret the errors upon timedValue.Get() - we should simply use the previously cached value instead of returning "empty". Bonus: remove some unused code	2022-06-25 08:50:16 -07:00
Klaus Post	baf257adcb	fix: health client leak when calling UpdateAllTargets (#15167 ) When `LoadBucketMetadataHandler` is called and `UpdateAllTargets` gets called. Since targets are rebuilt we cancel all.	2022-06-24 11:12:52 -07:00
Anis Elleuch	4fd1986885	Trace all http requests (#15064 ) Add a generic handler that adds a new tracing context to the request if tracing is enabled. Other handlers are free to modify the tracing context to update information on the fly, such as, func name, enable body logging etc.. With this commit, requests like this ``` curl -H "Host: ::1:3000" http://localhost:9000/ ``` will be traced as well.	2022-06-23 23:19:24 -07:00
Harshavardhana	e1afac9439	reduce sha256 CPU usage by turning it off for speedtests (#15154 ) continuation of the PR #15151, keeping signature v4 for the headers however avoiding sha256 for the body.	2022-06-23 11:26:53 -07:00
Poorna	580d9db85e	Add APIs to import/export IAM data (#15014 )	2022-06-23 09:25:15 -07:00
Anis Elleuch	42e2fd35d8	heal: Include dir markers when healing a fresh disk (#15158 ) Directories markers are not healed when healing a new fresh disk. A a proper fix would be moving object names encoding/decoding to erasure object level but it is too late now since the object to set distribution is calculated at a higher level.	2022-06-23 06:47:33 -07:00
Harshavardhana	1a40c7c27c	use signature-v2 for 'object perf' tests to avoid CPU using sha256 (#15151 ) It is observed in a local 8 drive system the CPU seems to be bottlenecked at ``` (pprof) top Showing nodes accounting for 1385.31s, 88.47% of 1565.88s total Dropped 1304 nodes (cum <= 7.83s) Showing top 10 nodes out of 159 flat flat% sum% cum cum% 724s 46.24% 46.24% 724s 46.24% crypto/sha256.block 219.04s 13.99% 60.22% 226.63s 14.47% syscall.Syscall 158.04s 10.09% 70.32% 158.04s 10.09% runtime.memmove 127.58s 8.15% 78.46% 127.58s 8.15% crypto/md5.block 58.67s 3.75% 82.21% 58.67s 3.75% github.com/minio/highwayhash.updateAVX2 40.07s 2.56% 84.77% 40.07s 2.56% runtime.epollwait 33.76s 2.16% 86.93% 33.76s 2.16% github.com/klauspost/reedsolomon._galMulAVX512Parallel84 8.88s 0.57% 87.49% 11.56s 0.74% runtime.step 7.84s 0.5% 87.99% 7.84s 0.5% runtime.memclrNoHeapPointers 7.43s 0.47% 88.47% 22.18s 1.42% runtime.pcvalue ``` Bonus changes: - re-use transport for bucket replication clients, also site replication clients. - use 32KiB buffer for all read and writes at transport layer seems to help TLS read connections. - Do not have 'MaxConnsPerHost' this is problematic to be used with net/http connection pooling 'MaxIdleConnsPerHost' is enough.	2022-06-22 16:28:25 -07:00
Poorna	cb097e6b0a	CopyObject: fix read/write err on closed pipe (#15135 ) Fixes: #15128 Regression from PR#14971	2022-06-21 19:20:11 -07:00
Poorna	1cfb03fb74	replication: Avoid proxying when precondition failed (#15134 ) Proxying is not required when content is on this cluster and does not meet pre-conditions specified in the request. Fixes #15124	2022-06-21 14:11:35 -07:00
Harshavardhana	f293df647c	s3/zip: extract metadata properly for Zipped objects (#15123 ) s3/zip: extra metadata properly for Zipped objects fixes #15121	2022-06-21 14:11:12 -07:00
sota	e2e5bd6f19	fix: cant parse comment without '=' in environment file (#15130 )	2022-06-21 10:37:15 -07:00
Andreas Auernhammer	cd7a0a9757	fips: simplify TLS configuration (#15127 ) This commit simplifies the TLS configuration. It inlines the FIPS / non-FIPS code. Signed-off-by: Andreas Auernhammer <hi@aead.dev>	2022-06-21 07:54:48 -07:00
Anis Elleuch	b3eda248a3	Parallelize new disks healing of different erasure sets (#15112 ) - Always reformat all disks when a new disk is detected, this will ensure new uploads to be written in new fresh disks - Always heal all buckets first when an erasure set started to be healed - Use a lock to prevent two disks belonging to different nodes but in the same erasure set to be healed in parallel - Heal different sets in parallel Bonus: - Avoid logging errUnformattedDisk when a new fresh disk is inserted but not detected by healing mechanism yet (10 seconds lag)	2022-06-21 07:53:55 -07:00
Harshavardhana	486888f595	remove gateway banner and some other TODO loggers (#15125 )	2022-06-21 05:25:40 -07:00
Poorna	b3ebc69034	improve error message for bucket metadata export/import API (#15120 )	2022-06-20 16:13:45 -07:00
Harshavardhana	761dde2f1b	fix: add 'mc support inspect' support for single drive deployment (#15122 )	2022-06-20 16:11:19 -07:00
Harshavardhana	2bb6a3f4d0	cleanup site replication error handling (#15113 ) site replication errors were printed at various random locations, repeatedly - this PR attempts to remove double logging and capture all of them at a common place. This PR also enhances the code to show partial success and errors as well.	2022-06-20 10:48:11 -07:00

1 2 3 4 5 ...

4747 Commits