minio

mirror of https://github.com/minio/minio.git synced 2025-05-02 07:23:59 -04:00

Author	SHA1	Message	Date
Harshavardhana	0559f46bbb	fix: make healObject() make non-blocking (#13071 ) healObject() should be non-blocking to ensure that scanner is not blocked for a long time, this adversely affects performance of the scanner and also affects the way usage is updated subsequently. This PR allows for a non-blocking behavior for healing, dropping operations that cannot be queued anymore.	2021-08-25 17:46:20 -07:00
Klaus Post	e1b0582859	fsOpenFile: Close on error (#13064 ) Close files on error.	2021-08-25 09:43:01 -07:00
Klaus Post	88d719689c	Synchronize bucket cycle numbers (#13058 ) Synchronize bucket cycles so it is much more likely that the same prefixes will be picked up for scanning. Use the global bloom filter cycle for that. Bump bloom filter versions to clear those.	2021-08-25 08:25:26 -07:00
Harshavardhana	200eb8dc0e	fix: remove any internal metadata keys from notification (#13062 )	2021-08-24 21:13:37 -07:00
Shireesh Anjal	ce05e67a0c	Add admin api to return sys config info (#12988 ) The intention is to list values of sys config that can potentially impact the performance of minio. At present, it will return max value configured for rlimit Signed-off-by: Shireesh Anjal <shireesh@minio.io> Co-authored-by: Harshavardhana <harsha@minio.io>	2021-08-24 17:09:37 -07:00
Poorna Krishnamoorthy	6a7e22386e	Use part sizes correctly in multipart replication (#13061 ) fixes #13057	2021-08-24 14:41:05 -07:00
Harshavardhana	85dfb4351c	fix: allow an entire set to be dropped (#13060 ) proceed to heal the cluster when all the drives in a set have failed, this is extremely rare occurrence but even if it happens we allow the cluster to be functional.	2021-08-24 12:43:57 -07:00
Harshavardhana	bbf3576f70	remove unecessary metadata structs in applyTransitionAction() (#13059 )	2021-08-24 12:24:00 -07:00
Harshavardhana	293d261cf9	use available memory to restrict API calls (#13047 ) also choose 90% of the available memory to calculate maximum API calls.	2021-08-24 09:14:46 -07:00
Anis Elleuch	f1cab828ee	fix: New disks healing should pick unformatted disks as well (#13054 ) A recent regression caused new disks not being re-formatted. In the old code, a disk needed be 'online' to be chosen to be formatted but the disk has to be already formatted for XL storage IsOnline() function to return true. It is enough to check if XL storage is nil or not if we want to avoid formatting root disks. Co-authored-by: Anis Elleuch <anis@min.io>	2021-08-24 07:40:56 -07:00
MoonJustry	6a8d0fb955	fix(Router): typo: completemutipartupload to completemultipartupload (#13051 )	2021-08-24 07:14:34 -07:00
Klaus Post	c8ca055935	Fix concurrent map read/write (#13052 ) Clones were not independent. Fixes race: ``` WARNING: DATA RACE Read at 0x00c002040cc0 by goroutine 50: runtime.mapiterinit() c:/go/src/runtime/map.go:802 +0x0 github.com/minio/minio/cmd.(dataUsageCache).flatten() d:/minio/minio/cmd/data-usage-cache.go:551 +0xad github.com/minio/minio/cmd.(dataUsageCache).dui() d:/minio/minio/cmd/data-usage-cache.go:352 +0x144 github.com/minio/minio/cmd.(erasureServerPools).NSScanner.func3.1() d:/minio/minio/cmd/erasure-server-pool.go:542 +0x2a4 github.com/minio/minio/cmd.(erasureServerPools).NSScanner.func3() d:/minio/minio/cmd/erasure-server-pool.go:561 +0x24b Previous write at 0x00c002040cc0 by goroutine 1391: runtime.mapassign_faststr() c:/go/src/runtime/map_faststr.go:202 +0x0 github.com/minio/minio/cmd.(dataUsageEntry).addChild() d:/minio/minio/cmd/data-usage-cache.go:231 +0x313 github.com/minio/minio/cmd.(dataUsageCache).replace() d:/minio/minio/cmd/data-usage-cache.go:383 +0x293 github.com/minio/minio/cmd.erasureObjects.nsScanner.func1() d:/minio/minio/cmd/erasure.go:428 +0x3a6 ```	2021-08-24 07:11:38 -07:00
Poorna Krishnamoorthy	674c6f7a7b	fix: resync of replication of delete markers (#12932 ) Fixes #12919	2021-08-23 14:48:22 -07:00
Krishnan Parthasarathi	db35bcf2ce	heal: Remove transitioned objects' parts from outdated disks (#13018 ) Bonus: check equality for replication and other metadata	2021-08-23 13:14:55 -07:00
Anis Elleuch	901d1314af	Fix formatting disks in a test environment (#13043 ) markRootDisksAsDown() relies on disk info even if the disk is unformatted. Therefore, we should always return DiskInfo data even when DiskInfo storage API returns errUnformattedDisk	2021-08-23 12:53:54 -07:00
Klaus Post	1080609c86	Reuse buffers when writing metadata (#13040 ) Simplify returning buffers. Tested using `warp mixed --duration=1m --obj.size=100K`: ``` Operation: DELETE Operations: 7148 -> 7642 * Average: +6.77% (+8.1) obj/s ------------------- Operation: GET Operations: 32200 -> 34403 * Average: +6.74% (+3.5 MiB/s) throughput, +6.74% (+36.2) obj/s * First Byte: Average: -105.403µs (-3%), Median: -309µs (-11%), Best: -2.7µs (-0%), Worst: +3.5637ms (+3%) ------------------- Operation: PUT Operations: 10741 -> 11475 * Average: +6.78% (+1.2 MiB/s) throughput, +6.78% (+12.1) obj/s ------------------- Operation: STAT Operations: 21465 -> 22927 * Average: +6.71% (+24.0) obj/s ```	2021-08-23 11:17:27 -07:00
Anis Elleuch	7fb9301c03	heal: Return parity for storage classes in heal info API (#13038 ) `mc admin heal` command will show servers/disks tolerance, for that purpose, you need to know the number of parity disks for each storage class. Parity is always the same in all pools.	2021-08-23 08:50:35 -07:00
Klaus Post	63f3e5c3fc	replication: Lock object while replicating (#13014 ) Introduce a replication lock that will ensure that only one replication operation will run for any given object at any time. Fixes #13013	2021-08-23 08:16:18 -07:00
Klaus Post	47de1d2e0e	Fix diskinfo race (#12857 ) Fixes share info struct. ``` WARNING: DATA RACE Read at 0x00c011780618 by goroutine 419: github.com/minio/minio/cmd.(DiskMetrics).DecodeMsg() c:/gopath/src/github.com/minio/minio/cmd/storage-datatypes_gen.go:331 +0x247 github.com/minio/minio/cmd.(DiskInfo).DecodeMsg() c:/gopath/src/github.com/minio/minio/cmd/storage-datatypes_gen.go:76 +0x5ec github.com/tinylib/msgp/msgp.Decode() c:/gopath/pkg/mod/github.com/tinylib/msgp@v1.1.6-0.20210521143832-0becd170c402/msgp/read.go:105 +0x70 github.com/minio/minio/cmd.(storageRESTClient).DiskInfo.func1.1() c:/gopath/src/github.com/minio/minio/cmd/storage-rest-client.go:288 +0x235 github.com/minio/minio/cmd.(timedValue).Get() c:/gopath/src/github.com/minio/minio/cmd/utils.go:886 +0x77 github.com/minio/minio/cmd.(storageRESTClient).DiskInfo() c:/gopath/src/github.com/minio/minio/cmd/storage-rest-client.go:297 +0xf9 github.com/minio/minio/cmd.getDiskInfos() c:/gopath/src/github.com/minio/minio/cmd/object-api-utils.go:962 +0x1a8 github.com/minio/minio/cmd.(erasureServerPools).getServerPoolsAvailableSpace.func1() c:/gopath/src/github.com/minio/minio/cmd/erasure-server-pool.go:241 +0x27c github.com/minio/minio/internal/sync/errgroup.(Group).Go.func1() c:/gopath/src/github.com/minio/minio/internal/sync/errgroup/errgroup.go:123 +0xd7 Previous write at 0x00c011780618 by goroutine 423: github.com/minio/minio/cmd.(DiskMetrics).DecodeMsg() c:/gopath/src/github.com/minio/minio/cmd/storage-datatypes_gen.go:332 +0x6e4 github.com/minio/minio/cmd.(DiskInfo).DecodeMsg() c:/gopath/src/github.com/minio/minio/cmd/storage-datatypes_gen.go:76 +0x5ec github.com/tinylib/msgp/msgp.Decode() c:/gopath/pkg/mod/github.com/tinylib/msgp@v1.1.6-0.20210521143832-0becd170c402/msgp/read.go:105 +0x70 github.com/minio/minio/cmd.(storageRESTClient).DiskInfo.func1.1() c:/gopath/src/github.com/minio/minio/cmd/storage-rest-client.go:288 +0x235 github.com/minio/minio/cmd.(timedValue).Get() c:/gopath/src/github.com/minio/minio/cmd/utils.go:886 +0x77 github.com/minio/minio/cmd.(storageRESTClient).DiskInfo() c:/gopath/src/github.com/minio/minio/cmd/storage-rest-client.go:297 +0xf9 github.com/minio/minio/cmd.getDiskInfos() c:/gopath/src/github.com/minio/minio/cmd/object-api-utils.go:962 +0x1a8 github.com/minio/minio/cmd.(erasureServerPools).getServerPoolsAvailableSpace.func1() c:/gopath/src/github.com/minio/minio/cmd/erasure-server-pool.go:241 +0x27c github.com/minio/minio/internal/sync/errgroup.(Group).Go.func1() c:/gopath/src/github.com/minio/minio/internal/sync/errgroup/errgroup.go:123 +0xd7 ```	2021-08-23 01:13:47 -07:00
Harshavardhana	14fe8ecb58	fix: decodeDirObject in prefix usage function (#13026 ) prefixes at top level create such as ``` ~ mc mb alias/bucket/prefix ``` The prefix/ incorrect appears as prefix__XL_DIR__/ in the accountInfo output, make sure to trim '__XL_DIR__'	2021-08-22 16:46:45 -07:00
Harshavardhana	0f01e7ef0f	fix: check for xl.meta as directory fallback (#13023 ) Objects uploaded in this format for example ``` mc cp /etc/hosts alias/bucket/foo/bar/xl.meta mc ls -r alias/bucket/foo/bar ``` Won't list the object, handle this scenario.	2021-08-21 00:12:29 -07:00
Harshavardhana	6d04c9c585	populate additional claims for prometheus endpoint (#13011 ) service accounts and STS provide additional claims for policy authorization which needs to be verified along with Prometheus issuer claim.	2021-08-20 11:32:01 -07:00
Krishnan Parthasarathi	e210cb3670	fix: use transition/replication fields in FileInfo quorum calculation (#13010 )	2021-08-19 14:55:42 -07:00
Klaus Post	47b577fcc0	Lock while creating buckets (#12999 ) Ensure that one call will succeed and others will serialize Example failure without code in place: ``` bucket-policy-handlers_test.go:120: unexpected error: cmd.InsufficientWriteQuorum: Storage resources are insufficient for the write operation doz2wjqaovp5kvlrv11fyacowgcvoziszmkmzzz9nk9au946qwhci4zkane5-1/ bucket-policy-handlers_test.go:120: unexpected error: cmd.InsufficientWriteQuorum: Storage resources are insufficient for the write operation doz2wjqaovp5kvlrv11fyacowgcvoziszmkmzzz9nk9au946qwhci4zkane5-1/ bucket-policy-handlers_test.go:135: want 1 ok, got 0 ```	2021-08-19 13:21:02 -07:00
Harshavardhana	e9d970154d	use renameAll instead of deleteAll for metacache-manager (#13005 ) renameAll is cheaper, rely on background deletes instead.	2021-08-19 09:16:14 -07:00
Harshavardhana	202d0b64eb	fix: enable go1.17 github ci/cd (#12997 )	2021-08-18 18:35:22 -07:00
Klaus Post	c25816eabc	xl walk: Limit walk concurrent IO (#12885 ) We are observing heavy system loads, potentially locking the system up for periods when concurrent listing operations are performed. We place a per-disk lock on walk IO operations. This will minimize the impact of concurrent listing operations on the entire system and de-prioritize them compared to other operations. Single list operations should remain largely unaffected.	2021-08-18 18:10:36 -07:00
Harshavardhana	ee028a4693	listObjects optimized to handle max-keys=1 when prefix is object (#13000 ) Some applications albeit poorly written rather than using headObject rely on listObjects to check for existence of object, this unusual request always has prefix=(to actual object) and max-keys=1 handle this situation specially such that we can avoid readdir() on the top level parent to avoid sorting and skipping, ensuring that such type of listObjects() always behaves similar to a headObject() call.	2021-08-18 18:05:05 -07:00
Harshavardhana	9c65168312	fix: all levels deep flat key match (#12996 ) this addresses a regression from #12984 which only addresses flat key from single level deep at bucket level. added extra tests as well to cover all these scenarios.	2021-08-18 07:40:53 -07:00
Harshavardhana	a690772cc5	add support to set subnet license for embedded console (#12993 )	2021-08-17 11:56:01 -07:00
Krishnan Parthasarathi	cf8abd8888	Add prometheus metrics for ILM tasks (#12933 )	2021-08-17 10:21:19 -07:00
Krishnan Parthasarathi	b7e3651d3c	Set free-version id in case of version/version-suspended buckets (#12982 ) This free-version id may be used to track tiered object contents of the object (version) being deleted.	2021-08-17 08:59:48 -07:00
Harshavardhana	ef4d023c85	fix: various performance improvements to tiering (#12965 ) - deletes should always Sweep() for tiering at the end and does not need an extra getObjectInfo() call - puts, copy and multipart writes should conditionally do getObjectInfo() when tiering targets are configured - introduce 'TransitionedObject' struct for ease of usage and understanding. - multiple-pools optimization deletes don't need to hold read locks verifying objects across namespace and pools.	2021-08-17 07:50:00 -07:00
Harshavardhana	654a6e9871	always set the filter to skip navigating baseDir (#12984 ) baseDir is empty if the top level prefix does not end with `/` this causes large recursive listings without any filtering, to fix this filtering make sure to set the filter prefix appropriately. also do not navigate folders at top level that do not match the filter prefix, entries don't need to match prefix since they are never prefixed with the prefix anyways.	2021-08-17 07:43:24 -07:00
Klaus Post	ad928f0078	Return list request when canceled (#12977 ) * Return list request when canceled * Cancel list if abandoned	2021-08-16 11:59:16 -07:00
Klaus Post	92bb2928e4	Compress better on amd64 (#12974 ) Since S2 has amd64 assembly, it now operates at a reasonable speed to use by default. Here are some examples of stream compression speed, 16 cores: ``` nyc-taxi-data-10M.csv s2 1 3325605752 -> 1095998837 312ms 10139.07MB/s 67.04% reduction nyc-taxi-data-10M.csv s2 2 3325605752 -> 917905514 428ms 7393.74MB/s 72.40% github-june-2days-2019.json s2 1 6273951764 -> 1043196283 391ms 15301.99 MB/s 83.37% github-june-2days-2019.json s2 2 6273951764 -> 955924506 519ms 11510.81MB/s 84.76% github-ranks-backup.bin s2 1 1862623243 -> 623911363 146ms 12133MB/s 66.50% github-ranks-backup.bin s2 2 1862623243 -> 563752759 230ms 7705.26MB/s 69.73% ``` We keep non-assembly platforms on the faster, but less efficient mode.	2021-08-16 11:55:07 -07:00
Anis Elleuch	47dfc1b1b0	ldap: Reevalute filter when searching for non eligible users (#12953 ) The previous code removes SVC/STS accounts for ldap users that do not exist anymore in LDAP server. This commit will actually re-evaluate filter as well if it is changed and remove all local SVC/STS accounts beloning to the ldap user if the latter is not eligible for the search filter anymore. For example: the filter selects enabled users among other criteras in the LDAP database, if one ldap user changes his status to disabled later, then associated SVC/STS accounts will be removed because that user does not meet the filter search anymore.	2021-08-13 11:40:04 -07:00
Klaus Post	7d8413a589	Reuse more metadata buffers (#12955 ) Reuse metadata buffers when no longer referenced. Takes care of most of the happy paths.	2021-08-13 11:39:27 -07:00
Klaus Post	24722ddd02	Remove inline data hack (#12946 ) move the code down to the storage layer, this logic decouples the inline data from the size parameter making it flexible and future proof.	2021-08-13 08:25:54 -07:00
Klaus Post	f31a00de01	fix: http stats race in traffic metering (#12956 ) Traffic metering was not protected against concurrent updates. ``` WARNING: DATA RACE Read at 0x00c02b0dace8 by goroutine 235: github.com/minio/minio/cmd.setHTTPStatsHandler.func1() d:/minio/minio/cmd/generic-handlers.go:360 +0x27d net/http.HandlerFunc.ServeHTTP() ... Previous write at 0x00c02b0dace8 by goroutine 994: github.com/minio/minio/internal/http/stats.(*IncomingTrafficMeter).Read() d:/minio/minio/internal/http/stats/http-traffic-recorder.go:34 +0xd2 ```	2021-08-13 07:30:03 -07:00
Shireesh Anjal	d44e4399e6	Add admin api to return sys services info (#12939 ) The intention is to provide status of any sys services that can potentially impact the performance of minio. At present, it will return information about the `selinux` service (not-installed/disabled/permissive/enforcing) Signed-off-by: Shireesh Anjal <shireesh@minio.io>	2021-08-12 18:58:40 -07:00
Harshavardhana	f9ae71fd17	fix: deleteMultiObjects performance regression (#12951 ) fixes performance regression found in deleteObjects(), putObject(), copyObject and completeMultipart calls.	2021-08-12 18:57:37 -07:00
Harshavardhana	ce28e904c9	pass the current credentials for claims	2021-08-12 18:24:04 -07:00
Harshavardhana	8f2a3efa85	disallow sub-credentials based on root credentials to gain priviledges (#12947 ) This happens because of a change added where any sub-credential with parentUser == rootCredential i.e (MINIO_ROOT_USER) will always be an owner, you cannot generate credentials with lower session policy to restrict their access. This doesn't affect user service accounts created with regular users, LDAP or OpenID	2021-08-12 18:07:08 -07:00
Klaus Post	89febdb3d6	Reuse small buffers (#12948 ) When reading metadata allow reuse of buffers in certain cases. Take the low-hanging fruit. Reduce GC overhead when listing.	2021-08-12 14:27:22 -07:00
Klaus Post	3eac02f676	Use metadata reader in ReadVersion (#12942 ) Use `readMetadata` when reading version information without data requested. Reduces IO on inlined data. Bonus: Inline compressed data as well when compression is enabled.	2021-08-12 10:05:24 -07:00
Krishnan Parthasarathi	65b6f4aa31	Add dynamic reconfiguration of number of transition workers (#12926 )	2021-08-11 22:23:56 -07:00
Harshavardhana	9e88941515	fix: skip disks that are offline when healing the drives (#12931 )	2021-08-11 12:57:18 -07:00
Harshavardhana	40a2fa8e81	fix: add more optimizations to putMetacacheObject() (#12916 ) - avoid extra lookup for 'xl.meta' since we are definitely sure that it doesn't exist. - use this in newMultipartUpload() as well - also additionally do not write with O_DSYNC to avoid loading the drives, instead create 'xl.meta' for listing operations without O_DSYNC since these are ephemeral objects. - do the same with newMultipartUpload() since it gets synced when the PutObjectPart() is attempted, we do not need to tax newMultipartUpload() instead.	2021-08-10 11:12:22 -07:00
Aditya Manthramurthy	59bb54ed6a	Use common function for authenticating admin requests (#12915 )	2021-08-09 18:14:38 -07:00

... 15 16 17 18 19 ...

4678 Commits