minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	2518af5f9e	fix: allow certain mutations on objects during decommissioning (#15231 ) fix: allow certain mutation on objects during decommission currently by mistake deletion of objects was skipped, if the object resided on the pool being decommissioned. delete's are okay to be allowed since decommission is designed to run on a cluster with active I/O.	2022-07-06 09:53:16 -07:00
Harshavardhana	7b793d84c8	fix: calculate scanner metric paths for single drive (#15232 ) Additionally use pathJoin() to avoid double `//` in path names.	2022-07-06 07:48:38 -07:00
Aditya Manthramurthy	af9bc7ea7d	Add external IDP management Admin API for OpenID (#15152 )	2022-07-05 18:18:04 -07:00
Klaus Post	ac055b09e9	Add detailed scanner metrics (#15161 )	2022-07-05 14:45:49 -07:00
haslersn	df42914da6	Fix missing whitespace in error message for IncompleteBody (#15227 )	2022-07-05 12:19:57 -07:00
Klaus Post	2471bdda00	fix: for DiskInfo call cache disk metrics (#15229 ) Small uploads spend a significant amount of time (~5%) fetching disk info metrics. Also maps are allocated for each call. Add a 100ms cache to disk metrics.	2022-07-05 11:02:30 -07:00
Harshavardhana	9d80ff5a05	fix: decommission delete markers for non-current objects (#15225 ) versioned buckets were not creating the delete markers present in the versioned stack of an object, this essentially would stop decommission to succeed. This PR fixes creating such delete markers properly during a decommissioning process, adds tests as well.	2022-07-05 07:37:24 -07:00
Harshavardhana	b311abed31	decom IAM, Bucket metadata properly (#15220 ) Current code incorrectly passed the config asset object name while decommissioning, make sure that we pass the right object name to be hashed on the newer set of pools. This PR fixes situations after a successful decommission, the users and policies might go missing due to wrong hashed set.	2022-07-04 14:02:54 -07:00
Harshavardhana	ce667ddae0	do not print errFileNotFound in entries.resolve() (#15216 )	2022-07-04 06:40:46 -07:00
Harshavardhana	0fee993a4b	return appropriate error under 'decom status' (#15213 ) fixes #15208	2022-07-01 16:21:23 -07:00
Poorna	0ea5c9d8e8	site healing: Skip stale iam asset updates from peer. (#15203 ) Allow healing to apply IAM change only when peer gave the most recent update.	2022-07-01 13:19:13 -07:00
Harshavardhana	63ac260bd5	Simplify Prometheus metrics gather (#15210 )	2022-07-01 13:18:39 -07:00
Harshavardhana	f9a4ad7904	update banner with version+runtime (#15206 )	2022-06-30 13:58:09 -07:00
Minio Trusted	e60b67d246	Revert "Tighten enforcement of object retention (#14993 )" This reverts commit `5e3010d455`. This commit causes regression on object locked buckets causine delete-markers to be not created.	2022-06-30 13:06:32 -07:00
Klaus Post	9004d69c6f	Make ReqInfo concurrency safe (#15204 ) Some read/writes of ReqInfo did not get appropriate locks, leading to races. Make sure reading and writing holds appropriate locks.	2022-06-30 10:48:50 -07:00
Harshavardhana	8856a2d77b	finalize startup-banner and remove unnecessary logs (#15202 )	2022-06-29 16:32:04 -07:00
Anis Elleuch	54a061bdda	Save minio version information centrally (#15181 )	2022-06-29 14:45:49 -07:00
Poorna	7cc9286e0f	site healing: Skip stale bucket metadata updates from peer (#15186 ) Allow healing to apply bucket metadata change only when peer gave the most recent update.	2022-06-28 18:09:20 -07:00
Harshavardhana	2f25639ea0	update banner to reflect the final agreed UI (#15192 )	2022-06-28 16:37:40 -07:00
Harshavardhana	2070c215a2	handle missing funcNames for handlers (#15188 ) also use designated names for internal calls - storageREST calls are storageR - lockREST calls are lockR - peerREST calls are just peer Named in this fashion to facilitate wildcard matches by having prefixes of the same name. Additionally, also enable funcNames for generic handlers that return errors, currently we disable '<unknown>'	2022-06-28 05:04:10 -07:00
Harshavardhana	9c605ad153	allow support for parity '0', '1' enabling support for 2,3 drive setups (#15171 ) allows for further granular setups - 2 drives (1 parity, 1 data) - 3 drives (1 parity, 2 data) Bonus: allows '0' parity as well.	2022-06-27 20:22:18 -07:00
Anis Elleuch	b7c7e59dac	Revert proxying requests with precondition errors (#15180 ) In a replicated setup, when an object is updated in one cluster but still waiting to be replicated to the other cluster, GET requests with if-match, and range headers will likely fail. It is better to proxy requests instead. Also, this commit avoids printing verbose logs about precondition & range errors.	2022-06-27 14:03:44 -07:00
Harshavardhana	699cf6ff45	perform object sweep after equeue the latest CopyObject() (#15183 ) keep it similar to PutObject/CompleteMultipart	2022-06-27 12:11:33 -07:00
Anis Elleuch	9201870f6c	Remove unnecessary code in WalkDir() (#15168 ) Recalculating forward is useless. It is never used and it will be computed again when calling scanDir() again.	2022-06-27 10:26:56 -07:00
Harshavardhana	6722f58668	save MinIO version with each version (8-bytes extra) (#15170 ) store MinIO version along with each version in 'xl.meta' for future purposes, can be used as ways to add specific code for bug fixes if any.	2022-06-27 03:59:41 -07:00
Harshavardhana	7b9b7cef11	add license banner for GNU AGPLv3 (#15178 ) Bonus: rewrite subnet re-use of Transport	2022-06-27 03:58:25 -07:00
Harshavardhana	bd099f5e71	fix: change timedValue to return the previously cached value (#15169 ) fix: change timedvalue to return previous cached value caller can interpret the underlying error and decide accordingly, places where we do not interpret the errors upon timedValue.Get() - we should simply use the previously cached value instead of returning "empty". Bonus: remove some unused code	2022-06-25 08:50:16 -07:00
Klaus Post	baf257adcb	fix: health client leak when calling UpdateAllTargets (#15167 ) When `LoadBucketMetadataHandler` is called and `UpdateAllTargets` gets called. Since targets are rebuilt we cancel all.	2022-06-24 11:12:52 -07:00
Anis Elleuch	4fd1986885	Trace all http requests (#15064 ) Add a generic handler that adds a new tracing context to the request if tracing is enabled. Other handlers are free to modify the tracing context to update information on the fly, such as, func name, enable body logging etc.. With this commit, requests like this ``` curl -H "Host: ::1:3000" http://localhost:9000/ ``` will be traced as well.	2022-06-23 23:19:24 -07:00
Harshavardhana	e1afac9439	reduce sha256 CPU usage by turning it off for speedtests (#15154 ) continuation of the PR #15151, keeping signature v4 for the headers however avoiding sha256 for the body.	2022-06-23 11:26:53 -07:00
Poorna	580d9db85e	Add APIs to import/export IAM data (#15014 )	2022-06-23 09:25:15 -07:00
Anis Elleuch	42e2fd35d8	heal: Include dir markers when healing a fresh disk (#15158 ) Directories markers are not healed when healing a new fresh disk. A a proper fix would be moving object names encoding/decoding to erasure object level but it is too late now since the object to set distribution is calculated at a higher level.	2022-06-23 06:47:33 -07:00
Harshavardhana	1a40c7c27c	use signature-v2 for 'object perf' tests to avoid CPU using sha256 (#15151 ) It is observed in a local 8 drive system the CPU seems to be bottlenecked at ``` (pprof) top Showing nodes accounting for 1385.31s, 88.47% of 1565.88s total Dropped 1304 nodes (cum <= 7.83s) Showing top 10 nodes out of 159 flat flat% sum% cum cum% 724s 46.24% 46.24% 724s 46.24% crypto/sha256.block 219.04s 13.99% 60.22% 226.63s 14.47% syscall.Syscall 158.04s 10.09% 70.32% 158.04s 10.09% runtime.memmove 127.58s 8.15% 78.46% 127.58s 8.15% crypto/md5.block 58.67s 3.75% 82.21% 58.67s 3.75% github.com/minio/highwayhash.updateAVX2 40.07s 2.56% 84.77% 40.07s 2.56% runtime.epollwait 33.76s 2.16% 86.93% 33.76s 2.16% github.com/klauspost/reedsolomon._galMulAVX512Parallel84 8.88s 0.57% 87.49% 11.56s 0.74% runtime.step 7.84s 0.5% 87.99% 7.84s 0.5% runtime.memclrNoHeapPointers 7.43s 0.47% 88.47% 22.18s 1.42% runtime.pcvalue ``` Bonus changes: - re-use transport for bucket replication clients, also site replication clients. - use 32KiB buffer for all read and writes at transport layer seems to help TLS read connections. - Do not have 'MaxConnsPerHost' this is problematic to be used with net/http connection pooling 'MaxIdleConnsPerHost' is enough.	2022-06-22 16:28:25 -07:00
Poorna	cb097e6b0a	CopyObject: fix read/write err on closed pipe (#15135 ) Fixes: #15128 Regression from PR#14971	2022-06-21 19:20:11 -07:00
Poorna	1cfb03fb74	replication: Avoid proxying when precondition failed (#15134 ) Proxying is not required when content is on this cluster and does not meet pre-conditions specified in the request. Fixes #15124	2022-06-21 14:11:35 -07:00
Harshavardhana	f293df647c	s3/zip: extract metadata properly for Zipped objects (#15123 ) s3/zip: extra metadata properly for Zipped objects fixes #15121	2022-06-21 14:11:12 -07:00
sota	e2e5bd6f19	fix: cant parse comment without '=' in environment file (#15130 )	2022-06-21 10:37:15 -07:00
Andreas Auernhammer	cd7a0a9757	fips: simplify TLS configuration (#15127 ) This commit simplifies the TLS configuration. It inlines the FIPS / non-FIPS code. Signed-off-by: Andreas Auernhammer <hi@aead.dev>	2022-06-21 07:54:48 -07:00
Anis Elleuch	b3eda248a3	Parallelize new disks healing of different erasure sets (#15112 ) - Always reformat all disks when a new disk is detected, this will ensure new uploads to be written in new fresh disks - Always heal all buckets first when an erasure set started to be healed - Use a lock to prevent two disks belonging to different nodes but in the same erasure set to be healed in parallel - Heal different sets in parallel Bonus: - Avoid logging errUnformattedDisk when a new fresh disk is inserted but not detected by healing mechanism yet (10 seconds lag)	2022-06-21 07:53:55 -07:00
Harshavardhana	486888f595	remove gateway banner and some other TODO loggers (#15125 )	2022-06-21 05:25:40 -07:00
Poorna	b3ebc69034	improve error message for bucket metadata export/import API (#15120 )	2022-06-20 16:13:45 -07:00
Harshavardhana	761dde2f1b	fix: add 'mc support inspect' support for single drive deployment (#15122 )	2022-06-20 16:11:19 -07:00
Harshavardhana	2bb6a3f4d0	cleanup site replication error handling (#15113 ) site replication errors were printed at various random locations, repeatedly - this PR attempts to remove double logging and capture all of them at a common place. This PR also enhances the code to show partial success and errors as well.	2022-06-20 10:48:11 -07:00
Anis Elleuch	73733a8fb9	heal: Report correctly in multip-pools setup (#15117 ) `mc admin heal -r <alias>` in a multi setup pools returns incorrectly grey objects. The reason is that erasure-server-pools.HealObject() runs HealObject in all pools and returns the result of the first nil error. However, in the lower erasureObject level, HealObject() returns nil if an object does not exist + missing error in each disk of the object in that pool, therefore confusing mc. Make erasureObject.HealObject() to return not found error in the lower level, so at least erasureServerPools will know what pools to ignore.	2022-06-20 08:07:45 -07:00
Poorna	2fa1d8ac48	Add import/export APIs to migrate bucket metadata (#14929 )	2022-06-18 06:55:39 -07:00
Poorna	8b9a19eef1	fix: typo in site replication version healing (#15103 )	2022-06-17 16:43:24 -07:00
Aditya Manthramurthy	7f629df4d5	Add generic function to retrieve config value with metadata (#15083 ) `config.ResolveConfigParam` returns the value of a configuration for any subsystem based on checking env, config store, and default value. Also returns info about which config source returned the value. This is useful to return info about config params overridden via env in the user APIs. Currently implemented only for OpenID subsystem, but will be extended for others subsequently.	2022-06-17 11:39:21 -07:00
Anis Elleuch	98ddc3596c	Avoid CompleteMultipart freeze with unexpected network issue (#15102 ) If sending a white space during a long S3 handler call fails, the whitespace goroutine forgets to return a result to the caller. Therefore, the complete multipart handler will be blocked. Remember to send the header written result to the caller or/and close the channel.	2022-06-17 10:41:25 -07:00
Harshavardhana	5d23be6242	fix: ignore printing io.EOF during WalkDir() on concurrently modified objects (#15100 ) fix: ignore print io.EOF during WalkDir() on concurrently modified objects	2022-06-17 08:23:47 -07:00
Poorna	55ee94bed0	initialize site replication subsys after loading metadata (#15099 )	2022-06-16 19:00:35 -07:00
Harshavardhana	d228d29944	update '-v' flag behavior to include copyRight and license (#15097 ) ``` ~ minio -v minio version DEVELOPMENT.2022-06-16T20-40-14Z (commit-id=e083228e2a06bfdcd006fee28d449cd2b47c542a) Runtime: go1.18.3 linux/amd64 Copyright (c) 2015-2022 MinIO, Inc. Licence AGPLv3 <https://www.gnu.org/licenses/agpl-3.0.html> ```	2022-06-16 16:10:48 -07:00
Harshavardhana	013cc66d8e	add dataErrs for healing debug log (#15092 )	2022-06-16 09:42:45 -07:00
Harshavardhana	c7ed6eee5e	fix: background local test also via channel (#15086 ) current implementation for `standalone` setups was blocking the `perf drive`. Bonus: remove all old unused complicated code.	2022-06-15 14:51:42 -07:00
Harshavardhana	8082d1fed6	add bucket level S3 received/sent bytes (#15084 ) adds bucket level metrics for bytes received and sent bytes on all S3 API calls.	2022-06-14 15:14:24 -07:00
Harshavardhana	d2a10dbe69	fix: simplify healthcheck code to freeze calls only once (#15082 ) - currently subnet health check was freezing and calling locks at multiple locations, avoid them. - throw errors if first attempt itself fails with no results	2022-06-14 11:22:07 -07:00
Anis Elleuch	14645142db	erasure-sd: Evaluate versioning Prefix in multi-delete objects (#15081 ) Erasure SD DeleteObjects() is only inheriting bucket versioning status from the handler layer. Add the missing versioning prefix evaluation for each object that will deleted.	2022-06-14 10:05:12 -07:00
Anis Elleuch	0d00f3a55b	kms: initialize after cli parsing (#15076 ) KMS depends on the --certs-dir flag. Ensure KMS is initialized after loading the flag.	2022-06-13 13:06:13 -07:00
Anis Elleuch	dd53b287f2	sts: Avoid printing all STS errors (#15065 ) Limit printing STS errors to - STS internal error - STS not initialized - STS upstream error	2022-06-11 12:55:32 -07:00
Harshavardhana	7413045f0e	fix: add missing minio_s3_requests_total (#15070 ) PR #15052 caused a regression, add the missing metrics back. Bonus: - internode information should be only for distributed setups - update the dashboard to include 4xx and 5xx error panels.	2022-06-11 00:50:31 -07:00
Harshavardhana	af1944f28d	support reading systemctl config automatically on baremetal setups (#15066 ) this allows for customers to use `mc admin service restart` directly even when performing RPM, DEB upgrades. Upon such 'restart' after upgrade MinIO will re-read the /etc/default/minio for any newer environment variables. As long as `MINIO_CONFIG_ENV_FILE=/etc/default/minio` is set, this is honored.	2022-06-10 09:59:15 -07:00
Harshavardhana	214ea14f29	fix: for frozen calls return if client disconnects (#15062 )	2022-06-09 05:06:47 -07:00
Anis Elleuch	5fb420c703	prometheus: Add S3 4xx and 5xx S3 monitoring (#15052 ) Currently minio_s3_requests_errors_total covers 4xx and 5xx S3 responses which can be confusing when s3 applications sent a lot of HEAD requests with obvious 404 responses or when the replication is enabled. Add - minio_s3_requests_4xx_errors_total - minio_s3_requests_5xx_errors_total to help users monitor 4xx and 5xx HTTP status codes separately.	2022-06-08 11:22:34 -07:00
Harshavardhana	2420f6c000	fix: make metrics endpoint responsive by reducing the chatter (#15055 ) peerOnlineCounter was making NxN calls to many peers, this can be really long and tedious if there are random servers that are going down. Instead we should calculate online peers from the point of view of "self" and return those online and offline appropriately by performing a healthcheck.	2022-06-08 02:43:13 -07:00
Harshavardhana	b0d7332a0c	healthcheck cluster endpoint should honor write/readQuorum per pool (#15053 )	2022-06-07 19:08:21 -07:00
Harshavardhana	d55efc791f	relax O_DIRECT in single drive mode if unsupported (#15045 )	2022-06-07 06:44:01 -07:00
Minio Trusted	e2d4d097e7	do not print errors upon 'nil' err	2022-06-06 17:33:41 -07:00
Shireesh Anjal	4ce81fd07f	Add periodic callhome functionality (#14918 ) * Add periodic callhome functionality Periodically (every 24hrs by default), fetch callhome information and upload it to SUBNET. New config keys under the `callhome` subsystem: enable - Set to `on` for enabling callhome. Default `off` frequency - Interval between callhome cycles. Default `24h` * Improvements based on review comments - Update `enableCallhome` safely - Rename pctx to ctx - Block during execution of callhome - Store parsed proxy URL in global subnet config - Store callhome URL(s) in constants - Use existing global transport - Pass auth token to subnetPostReq - Use `config.EnableOn` instead of `"on"` * Use atomic package instead of lock * Use uber atomic package * Use `Cancel` instead of `cancel` Co-authored-by: Harshavardhana <harsha@minio.io> Co-authored-by: Harshavardhana <harsha@minio.io> Co-authored-by: Aditya Manthramurthy <donatello@users.noreply.github.com>	2022-06-06 16:14:52 -07:00
Harshavardhana	df9eeb7f8f	fix: do not log concurrently when multiple disks return errors (#15044 ) since the values inside 'context' are mutated internally by logger, make sure to log serially upon errors not concurrently.	2022-06-06 15:15:11 -07:00
Harshavardhana	31c4fdbf79	fix: resyncing 'null' version on pre-existing content (#15043 ) PR #15041 fixed replicating 'null' version however due to a regression from #14994 caused the target versions for these 'null' versioned objects to have different 'versions', this may cause confusion with bi-directional replication and cause double replication. This PR fixes this properly by making sure we replicate the correct versions on the objects.	2022-06-06 15:14:56 -07:00
Harshavardhana	48e367ff7d	reject resync start on misconfigured replication rules (#15041 ) we expect resync to start on buckets with replication rule ExistingObjects enabled, if not we reject such calls.	2022-06-06 02:54:39 -07:00
Anis Elleuch	fd02492cb7	avoid limits on the number of parallel trace/bucket notifications listeners (#14799 ) Simplifies overall limits on the incoming listeners for notifications. Fixes #14566	2022-06-05 14:29:12 -07:00
Harshavardhana	5afdc56796	allow single drive mode to run on root disk (#15037 ) for practical reasons, allow root disk based installs for single drive mode.	2022-06-03 12:53:42 -07:00
Harshavardhana	c3e1da8e04	honor canceled context and do not leak on mergeChannels (#15034 ) mergeEntryChannels has the potential to perpetually wait on the results channel, context might be closed and we did not honor the caller context canceling.	2022-06-03 05:59:02 -07:00
Anis Elleuch	20a753e2e5	Fix a possible service freeze after perf object (#15036 ) The S3 service can be frozen indefinitely if a client or mc asks for object perf API but quits early or has some networking issues. The reason is that partialWrite() can block indefinitely. This commit makes partialWrite() listens to context cancellation as well. It also renames deadlinedCtx to healthCtx since it covers handler context cancellation and not only not only the speedtest deadline.	2022-06-03 05:58:45 -07:00
Aditya Manthramurthy	61a7434379	Update --version option behavior (#15032 ) - Add git commit ID - Add go version	2022-06-02 18:40:53 -07:00
Poorna	29edb4ccfe	fix: site replication bucket heal to not panic if replication config is missing (#15025 )	2022-06-02 12:34:03 -07:00
Anis Elleuch	d4e565e595	Add defensive check for one stream message size (#15029 ) In a streaming response, the client knows the size of a streamed message but never checks the message size. Add the check to error out if the response message is truncated.	2022-06-02 09:16:26 -07:00
Klaus Post	f7cecf0945	Make isIndexedMetaV2 return errors (#15012 ) Indexed streams would be decoded by the legacy loader if there was an error loading it. Return an error when the stream is indexed and it cannot be loaded. Fixes "unknown minor metadata version" on corrupted xl.meta files and returns an actual error.	2022-05-31 19:06:57 -07:00
Harshavardhana	52221db7ef	fix: for unexpected errors in reading versioning config panic (#14994 ) We need to make sure if we cannot read bucket metadata for some reason, and bucket metadata is not missing and returning corrupted information we should panic such handlers to disallow I/O to protect the overall state on the system. In-case of such corruption we have a mechanism now to force recreate the metadata on the bucket, using `x-minio-force-create` header with `PUT /bucket` API call. Additionally fix the versioning config updated state to be set properly for the site replication healing to trigger correctly.	2022-05-31 02:57:57 -07:00
Anis Elleuch	56a61bab56	test: Add GetObjectNInfo test with some outdated disks (#15004 ) Add a test reading an object which has some old data in some outdated disks, in a versioned and non-versioned bucket.	2022-05-30 17:52:59 -07:00
Harshavardhana	d480022711	fix: invalidate outdated disks appropriately during readAllXL (#15002 ) readAllXL would return inlined data for outdated disks causing "read" to return incorrect content to the client, this PR fixes this behavior by making sure we skip such outdated disks appropriately based on the latest ModTime on the disk.	2022-05-30 12:43:54 -07:00
Harshavardhana	f1abb92f0c	feat: Single drive XL implementation (#14970 ) Main motivation is move towards a common backend format for all different types of modes in MinIO, allowing for a simpler code and predictable behavior across all features. This PR also brings features such as versioning, replication, transitioning to single drive setups.	2022-05-30 10:58:37 -07:00
Harshavardhana	5792be71fa	fix: add timeouts to avoid goroutine leaks in net/http (#14995 ) Following code can reproduce an unending go-routine buildup, while keeping connections established due to lack of client not closing the connections. https://gist.github.com/harshavardhana/2d00e6f909054d2d2524c71485ad02e1 Without this PR all MinIO deployments can be put into denial of service attacks, causing entire service to be unavailable. We bring in two timeouts at this stage to control such go-routine build ups, new change - IdleTimeout (to kill off idle connections) - ReadHeaderTimeout (to kill off connections that are too slow) This new change also brings two hidden options to make any additional relevant changes if desired in some setups.	2022-05-30 06:24:51 -07:00
Poorna	5e3010d455	Tighten enforcement of object retention (#14993 ) Ref issue#14991 - in the rare case that object in bucket under retention has null version, make sure to enforce retention rules.	2022-05-28 02:21:19 -07:00
Anis Elleuch	ccbf65c8e8	site-repl: Fix deadlock after an IAM loading error (#14990 ) Fix forgotten IAM cache lock releases when reading some data from disk/etcd Co-authored-by: Anis Elleuch <anis@min.io>	2022-05-27 10:26:38 -07:00
Harshavardhana	9d07cde385	use crypto/sha256 only for FIPS 140-2 compliance (#14983 ) It would seem like the PR #11623 had chewed more than it wanted to, non-fips build shouldn't really be forced to use slower crypto/sha256 even for presumed "non-performance" codepaths. In MinIO there are really no "non-performance" codepaths. This assumption seems to have had an adverse effect in certain areas of CPU usage. This PR ensures that we stick to sha256-simd on all non-FIPS builds, our most common build to ensure we get the best out of the CPU at any given point in time.	2022-05-27 06:00:19 -07:00
Aditya Manthramurthy	464b9d7c80	Add support for Identity Management Plugin (#14913 ) - Adds an STS API `AssumeRoleWithCustomToken` that can be used to authenticate via the Id. Mgmt. Plugin. - Adds a sample identity manager plugin implementation - Add doc for plugin and STS API - Add an example program using go SDK for AssumeRoleWithCustomToken	2022-05-26 17:58:09 -07:00
Poorna	5c81d0d89a	site replication: heal missing/invalid replication config (#14979 ) Validate remote target ARNs and heal any stale rules in the replication config	2022-05-26 17:57:23 -07:00
Klaus Post	c0bf02b8b2	Ignore disks with 0 total space (#14981 ) Ignore disks with 0 total Mainly defensive to ensure no `/0` in percent calculation.	2022-05-26 06:01:50 -07:00
Harshavardhana	fd46a1c3b3	fix: some races when accessing ldap/openid config globally (#14978 )	2022-05-25 18:32:53 -07:00
Aditya Manthramurthy	5aae7178ad	Fix listing of service and sts accounts (#14977 ) Now returns user does not exist error if the user is not known to the system	2022-05-25 15:28:54 -07:00
Harshavardhana	dea8220eee	do not heal outdated disks > parityBlocks (#14976 ) this PR also fixes a situation where incorrect partsMetadata slice was used where fi.Data was re-used from a single drive causing duplication of the shards across all drives. This happens for situations where shouldHeal() returns true for all drives > parityBlocks. To avoid this we should never attempt to heal on all drives > parityBlocks, unless we are doing metadata migration from xl.json -> xl.meta	2022-05-25 15:17:10 -07:00
Klaus Post	a4be0b88f6	Add server pool reserved space (#14974 ) If one or more pools reach 85% usage in a set, we will only use pools that have more free space. In case all pools are above 85% we allow all of them to be used with the regular distribution.	2022-05-25 13:20:20 -07:00
Poorna	d8101573be	Disallow deletion of ARN when under active replication (#14972 ) fixes a regression from #12880	2022-05-24 19:40:45 -07:00
Klaus Post	41cdb357bb	Compensate for different server pool sizes (#14968 ) When a server pool with a different number of sets is added they are not compensated when choosing a destination pool for new objects. This leads to the unbalanced placement of objects with smaller pools getting a bigger number of objects since we only compare the destination sets directly. This change will compensate for differences in set sizes when choosing the destination pool. Different set sizes are already compensated by fewer disks.	2022-05-24 18:57:14 -07:00
Harshavardhana	38caddffe7	fix: copyObject on versioned bucket when updating metadata (#14971 ) updating metadata with CopyObject on a versioned bucket causes the latest version to be not readable, this PR fixes this properly by handling the inline data bug fix introduced in PR #14780. This bug affects only inlined data.	2022-05-24 17:27:45 -07:00
Poorna	0e26f983d6	site replication: Allow replication rule edit (#14969 ) Revert commit `b42cfcea60` as too restrictive	2022-05-24 13:27:33 -07:00
Anis Elleuch	77dc99e71d	Do not use inline data size in xl.meta quorum calculation (#14831 ) * Do not use inline data size in xl.meta quorum calculation Data shards of one object can different inline/not-inline decision in multiple disks. This happens with outdated disks when inline decision changes. For example, enabling bucket versioning configuration will change the small file threshold. When the parity of an object becomes low, GET object can return 503 because it is not unable to calculate the xl.meta quorum, just because some xl.meta has inline data and other are not. So this commit will be disable taking the size of the inline data into consideration when calculating the xl.meta quorum. * Add tests for simulatenous inline/notinline object Co-authored-by: Anis Elleuch <anis@min.io>	2022-05-24 06:26:38 -07:00
Anis Elleuch	5041bfcb5c	replication healing: Fix typo when healing bucket quota info (#14966 ) A typo is found in the replication healing code where an empty quota configuration is sent to peer sites instead of the correct one. .io>	2022-05-24 06:26:13 -07:00
Harshavardhana	f8650a3493	fetch bucket replication stats across peers in single call (#14956 ) current implementation relied on recursively calling one bucket at a time across all peers, this would be very slow and chatty when there are 100's of buckets which would mean 100*peerCount amount of network operations. This PR attempts to reduce this entire call into `peerCount` amount of network calls only. This functionality addresses also a concern where the Prometheus metrics would significantly slow down when one of the peers is offline.	2022-05-23 09:15:30 -07:00

1 2 3 4 5 ...

4590 Commits