Almost all uses of NewDeadlineWorker that relied on secondary values were racy, which could lead to inconsistent errors/data being returned. Rewrite all of these to use a generic WithDeadline caller that can return an error alongside a value; it also propagates the deadline downstream.
Remove the stateful aspect of DeadlineWorker - it would have been racy if used, but AFAICT it wasn't.
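A minimal sketch of what such a generic deadline wrapper could look like; the name and signature here are illustrative, not necessarily the exact API that landed:
```
package ioutil

import (
	"context"
	"time"
)

// WithDeadline runs work with a timeout and returns its value and error.
// If the deadline expires first, ctx.Err() is returned and the result of
// the still-running work is discarded rather than shared with the caller.
func WithDeadline[T any](ctx context.Context, timeout time.Duration, work func(ctx context.Context) (T, error)) (T, error) {
	ctx, cancel := context.WithTimeout(ctx, timeout)
	defer cancel()

	type response struct {
		value T
		err   error
	}
	resCh := make(chan response, 1) // buffered: the worker never blocks on send
	go func() {
		var r response
		r.value, r.err = work(ctx)
		resCh <- r
	}()

	select {
	case r := <-resCh:
		return r.value, r.err
	case <-ctx.Done():
		var zero T
		return zero, ctx.Err()
	}
}
```
Because the result is returned by value instead of written into shared state, a timed-out call can never race with a late-finishing worker.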
Fixes races like:
```
WARNING: DATA RACE
Read at 0x00c130b29d10 by goroutine 470237:
github.com/minio/minio/cmd.(*xlStorageDiskIDCheck).ReadVersion()
github.com/minio/minio/cmd/xl-storage-disk-id-check.go:702 +0x611
github.com/minio/minio/cmd.readFileInfo()
github.com/minio/minio/cmd/erasure-metadata-utils.go:160 +0x122
github.com/minio/minio/cmd.erasureObjects.getObjectFileInfo.func1.1()
github.com/minio/minio/cmd/erasure-object.go:809 +0x27a
github.com/minio/minio/cmd.erasureObjects.getObjectFileInfo.func1.2()
github.com/minio/minio/cmd/erasure-object.go:828 +0x61
Previous write at 0x00c130b29d10 by goroutine 470298:
github.com/minio/minio/cmd.(*xlStorageDiskIDCheck).ReadVersion.func1()
github.com/minio/minio/cmd/xl-storage-disk-id-check.go:698 +0x244
github.com/minio/minio/internal/ioutil.(*DeadlineWorker).Run.func1()
github.com/minio/minio/internal/ioutil/ioutil.go:141 +0x33
WARNING: DATA RACE
Write at 0x00c0ba6e6c00 by goroutine 94507:
github.com/minio/minio/cmd.(*xlStorageDiskIDCheck).StatVol.func1()
github.com/minio/minio/cmd/xl-storage-disk-id-check.go:419 +0x104
github.com/minio/minio/internal/ioutil.(*DeadlineWorker).Run.func1()
github.com/minio/minio/internal/ioutil/ioutil.go:141 +0x33
Previous read at 0x00c0ba6e6c00 by goroutine 94463:
github.com/minio/minio/cmd.(*xlStorageDiskIDCheck).StatVol()
github.com/minio/minio/cmd/xl-storage-disk-id-check.go:422 +0x47e
github.com/minio/minio/cmd.getBucketInfoLocal.func1()
github.com/minio/minio/cmd/peer-s3-server.go:275 +0x122
github.com/minio/pkg/v2/sync/errgroup.(*Group).Go.func1()
```
This probably dates back to #17701.
Protection was in place; however, it covered only some areas, so we re-arranged the code to ensure we could hold locks properly.
Along with this, remove the DataShardFix code altogether; in deployments with many drive replacements, it can lead to quorum loss.
Also limit the amount of concurrency when sending binary updates to peers, to avoid high outbound (TX) network traffic that can cause disconnection events for the node sending the updates.
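A rough sketch of one way to bound that fan-out, using golang.org/x/sync/errgroup; the helper name and the limit of 4 are assumptions, not the actual implementation:
```
package main

import (
	"context"

	"golang.org/x/sync/errgroup"
)

// sendUpdate stands in for the real peer RPC that ships the new binary.
func sendUpdate(ctx context.Context, peer string, payload []byte) error {
	_, _ = peer, payload
	return nil
}

// pushUpdateToPeers sends the binary to all peers, but at most maxConcurrent
// at a time, so the sending node does not saturate its outbound (TX) bandwidth.
func pushUpdateToPeers(ctx context.Context, peers []string, payload []byte) error {
	const maxConcurrent = 4 // hypothetical limit

	g, ctx := errgroup.WithContext(ctx)
	g.SetLimit(maxConcurrent)
	for _, peer := range peers {
		peer := peer
		g.Go(func() error {
			return sendUpdate(ctx, peer, payload)
		})
	}
	return g.Wait()
}
```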
If site replication is enabled, we should still show the size and
version distribution histogram metrics at bucket level.
Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>
The new API now verifies any hung disks before a restart/stop and provides a per-node breakdown of the restart/stop results.
It also reports how many blocked syscalls are present on the drives and what users must do about them.
Adds options to run pre-flight checks that give the user information about any hung disks, plus a 'force' option to forcibly attempt a restart even with waiting syscalls on the drives.
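For illustration, a hedged sketch of what such a per-node breakdown might carry; every field name here is an assumption, not the actual admin API:
```
// ServiceResult is a hypothetical per-node view of a restart/stop attempt.
type ServiceResult struct {
	Node            string // host:port of the node reporting the result
	Err             string // empty when the restart/stop succeeded
	WaitingSyscalls int    // blocked syscalls still pending on the node's drives
	Forced          bool   // true when the action proceeded despite hung disks
}
```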
On a policy detach operation, if there are no policies remaining
attached to the user/group, remove the policy mapping file, instead of
leaving a file containing an empty list of policies.
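A minimal sketch of the idea; savePolicyMapping and deletePolicyMapping are hypothetical helpers, not the actual IAM store methods:
```
package main

// savePolicyMapping and deletePolicyMapping stand in for the real IAM
// store calls; they are illustrative stubs only.
func savePolicyMapping(name string, policies []string) error { return nil }
func deletePolicyMapping(name string) error                  { return nil }

// persistAfterDetach either persists the remaining policies or removes the
// mapping file entirely when nothing is left attached to the user/group.
func persistAfterDetach(name string, remaining []string) error {
	if len(remaining) == 0 {
		// Nothing left attached: delete the mapping file instead of
		// writing a file containing an empty list.
		return deletePolicyMapping(name)
	}
	return savePolicyMapping(name, remaining)
}
```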
Healing dangling buckets is conservative, and it is typical for removal of a dangling bucket to fail because it contains some data: the dangling-bucket healing code is not allowed to remove data; only healing of the dangling object is allowed to do so.
The reference format is constant for the lifetime of a MinIO cluster; we never have to replace it during HealFormat() as it will never change.
Additionally, we should simply reject reference formats that we do not understand early on.
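A hedged sketch of that early rejection, using a simplified stand-in type rather than the real format structs:
```
package main

import (
	"errors"
	"fmt"
)

// referenceFormat is a simplified stand-in for the on-disk format metadata.
type referenceFormat struct {
	Version string // e.g. "1"
	Format  string // e.g. "xl"
}

// validate rejects reference formats we do not understand, up front,
// before any HealFormat() work is attempted.
func (f *referenceFormat) validate() error {
	if f == nil {
		return errors.New("reference format is nil")
	}
	if f.Version != "1" || f.Format != "xl" {
		return fmt.Errorf("unsupported reference format: version=%q format=%q", f.Version, f.Format)
	}
	return nil
}
```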
GetActualSize() relied heavily on o.Parts being non-empty to figure out whether the object is multipart. However, we have many indicators of whether an object is multipart or not.
Blindly assuming that o.Parts == nil means the object is not multipart is incorrect; instead, multipart status must be determined via the following indicators (see the sketch after this list):
- a stored metadata value indicating this is a multipart encrypted object;
- the <meta>-actual-size metadata, which gives the object's actual size and is preserved for additional reasons such as this;
- an ETag whose length is not 32.
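A hedged sketch of combining those indicators; the metadata key names are assumptions for illustration, not the exact internal keys:
```
package main

// isMultipart reports whether an object was uploaded via multipart, using
// several independent indicators instead of relying on Parts alone.
func isMultipart(metadata map[string]string, etag string, parts int) bool {
	if parts > 0 {
		return true
	}
	// Encrypted multipart objects carry an explicit marker in metadata.
	if _, ok := metadata["X-Minio-Internal-Encrypted-Multipart"]; ok {
		return true
	}
	// The preserved actual-size metadata also implies multipart handling.
	if _, ok := metadata["X-Minio-Internal-Actual-Size"]; ok {
		return true
	}
	// A plain single-part ETag is a 32-character MD5 hex digest; anything
	// else suggests a multipart upload.
	return len(etag) != 32
}
```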
Support proxying of tagging requests in active-active replication.
Note: even if proxying is successful, PutObjectTagging/DeleteObjectTagging will continue to report a 404 since the object is not present locally.
New intervals:
[1024B, 64KiB)
[64KiB, 256KiB)
[256KiB, 512KiB)
[512KiB, 1MiB)
The new intervals help us see the object size distribution with higher
resolution for the interval [1024B, 1MiB).
- HealFormat() was leaking healthcheck goroutines for disks; we are only interested in enabling healthcheck for the newly formatted disk, not for existing disks.
- When a disk is a root-disk, a random disk monitor was leaking even though we ignored the drive.
- When loading the disks for each erasure set, we were leaking goroutines for the prepare-storage.go disks which were replaced via the globalLocalDrives slice.
- Avoid disk monitoring consuming the health tokens meant for incoming I/O, which could exhaust them prematurely. This is ensured by not writing an O_DIRECT-aligned buffer; instead write only 2048 bytes of content with O_DSYNC, which is sufficient (see the sketch below).
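A minimal sketch of that monitoring write, assuming a Unix-style O_DSYNC open; the path handling and error reporting in the real code will differ:
```
package main

import (
	"os"
	"syscall"
)

// monitorWrite performs the small health-check write: 2048 bytes written
// with O_DSYNC, avoiding an O_DIRECT aligned-buffer write (and the health
// tokens it would consume) entirely.
func monitorWrite(path string) error {
	f, err := os.OpenFile(path, os.O_CREATE|os.O_WRONLY|syscall.O_DSYNC, 0o644)
	if err != nil {
		return err
	}
	defer f.Close()

	buf := make([]byte, 2048) // 2 KiB is enough to verify the drive responds
	_, err = f.Write(buf)
	return err
}
```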
Add a hidden configuration under the scanner sub-section to configure whether the scanner should sleep between two object scans. The configuration only has an effect when there is no drive activity related to S3 requests or healing.
By default, the code keeps the current behavior, which is to sleep between objects.
To forcefully enable the full scan speed in idle mode, you can do this:
`mc admin config set myminio scanner idle_speed=full`
fixes #18724
A regression was introduced in #18547, which attempted to fix adding a missing `null` marker; however, we should not skip returning based on versionID. Instead, it must be based on whether we are being asked to create a DEL marker or not.
The PR also has a side-effect for replicating a `null` marker permanent delete, as it may end up adding a `null` marker while removing one.
This PR should address both scenarios.
NOTE: This feature is not retro-active; it will not cater to previous transactions
on existing setups.
To enable this feature, please set the `_MINIO_DRIVE_QUORUM=on` environment variable as part of the systemd service or k8s configmap.
Once this has been enabled, you need to also set `list_quorum`.
```
~ mc admin config set alias/ api list_quorum=auto
```
A new debugging tool is available to check for any missing counters.
The following policy, if present:
```
"Condition": {
"IpAddress": {
"aws:SourceIp": [
"54.240.143.0/24",
"2001:DB8:1234:5678::/64"
]
}
}
```
combined with a client making a request to MinIO via IPv6 can potentially crash the server.
The workaround is to turn off IPv6 and use only IPv4.
This PR also increases the per-node bpool memory from 1024 entries to 2048 entries; along with that, it also makes the byte pool central instead of per pool.
minio_node_tier_ttlb_seconds - Distribution of time to last byte for streaming objects from warm tier
minio_node_tier_requests_success - Number of requests to download object from warm tier that were successful
minio_node_tier_requests_failure - Number of requests to download object from warm tier that failed
SUBNET now has a v2 of the license that is returned in the new key `license_v2`. mc will start reading and storing the same. (The old key `license` is deprecated but is still available in the SUBNET response to ensure that the current released version of MinIO doesn't break.)
`(*xlStorageDiskIDCheck).CreateFile` wraps the incoming reader in `xioutil.NewDeadlineReader`.
The wrapped reader is handed to `(*xlStorage).CreateFile`. This performs a Read call via `writeAllDirect`,
which reads into an `ODirectPool` buffer.
`(*DeadlineReader).Read` spawns an async read into the buffer. If a timeout is hit while reading,
the read operation returns to `writeAllDirect`. The operation returns an error and the buffer is reused.
However, if the async `Read` call unblocks, it will write to the now recycled buffer.
Fix: Remove the `DeadlineReader` - it is inherently unsafe. Instead, rely on the network timeouts.
This is not a disk timeout, anyway.
Regression in https://github.com/minio/minio/pull/17745
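For illustration only, a stripped-down version of the unsafe pattern described above (not the actual DeadlineReader code):
```
package main

import (
	"context"
	"io"
	"sync"
	"time"
)

// racyRead shows why handing a pooled buffer to a read that may outlive the
// deadline is unsafe: on timeout the caller recycles buf, but the background
// goroutine can still write into it when the blocked Read finally unblocks.
// The pool is assumed to have New set to allocate []byte buffers.
func racyRead(r io.Reader, pool *sync.Pool, timeout time.Duration) (int, error) {
	buf := pool.Get().([]byte)

	type result struct {
		n   int
		err error
	}
	ch := make(chan result, 1)
	go func() {
		n, err := r.Read(buf) // may complete long after the timeout fires
		ch <- result{n, err}
	}()

	select {
	case res := <-ch:
		pool.Put(buf) // safe: the read has finished
		return res.n, res.err
	case <-time.After(timeout):
		pool.Put(buf) // UNSAFE: the goroutine above may still write into buf
		return 0, context.DeadlineExceeded
	}
}
```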
This patch adds the targetID to the existing notification target metrics and deprecates the current target metrics, which point to the overall event notification subsystem.
Historically, we have always kept the storage-rest-server and the local storage API separate without much trouble, since they can both operate independently, having no special state() between them.
However, over time we have added state(), such as:
- drive monitoring threads: now there will be "2" of them per drive instead of just 1.
- concurrent tokens available per drive are now doubled instead of a single shared set, allowing an unexpectedly high amount of I/O to go through.
- serialization via walkMutexes can now be adequately honored for both remote callers and local callers.
Regression from #18285. CopyObject options were inheriting the source MTime for metadata timestamps if unspecified; removing this prevented metadata updates from being applied on the target.
By default, the CPU load is cumulative across all cores. Capture the percentage load (load * 100 / cpu-count).
Also capture the percentage of memory used (used * 100 / total).
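Purely as a sketch of the arithmetic involved:
```
// loadPercent normalizes a cumulative load value across all cores.
func loadPercent(load float64, cpuCount int) float64 {
	return load * 100 / float64(cpuCount)
}

// memPercent reports used memory as a percentage of total memory.
func memPercent(used, total uint64) float64 {
	return float64(used) * 100 / float64(total)
}
```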
Use memory for async events when necessary and dequeue them as needed; for fully synchronous events, customers must enable
```
MINIO_API_SYNC_EVENTS=on
```
Async events can be lost, but it is up to the admin to decide what they want; we will not create a runaway number of goroutines per event, instead we queue them properly.
Currently the maximum number of async workers is set to runtime.GOMAXPROCS(0), which is more than sufficient in general; it could be made configurable in the future, but that may not be needed.
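A hedged sketch of that bounded queue and worker pool, sized by runtime.GOMAXPROCS(0); the types and names are illustrative, not the actual event store:
```
package main

import (
	"runtime"
)

// event is a stand-in for a bucket notification payload.
type event struct{ payload string }

// asyncQueue bounds both the number of queued events and the number of
// worker goroutines, instead of spawning one goroutine per event.
type asyncQueue struct {
	ch chan event
}

func newAsyncQueue(capacity int, send func(event)) *asyncQueue {
	q := &asyncQueue{ch: make(chan event, capacity)}
	for i := 0; i < runtime.GOMAXPROCS(0); i++ {
		go func() {
			for ev := range q.ch {
				send(ev)
			}
		}()
	}
	return q
}

// enqueue drops the event when the queue is full; async delivery is lossy
// by design, and the admin opts into it by not setting MINIO_API_SYNC_EVENTS=on.
func (q *asyncQueue) enqueue(ev event) bool {
	select {
	case q.ch <- ev:
		return true
	default:
		return false // queue full: event is lost, a counter could record this
	}
}
```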
There is potential for dangling writes when quorum fails and only some drives took a successful write; generally this is left to the healing routine to pick up. However, it is better to delete it right away to avoid potential quorum issues on the version signature when there are many versions of an object.
It is okay if the warm tier cannot keep up; we should continue to take I/O at the hot tier, and only fail or block the hot tier when the disk is full.
Bonus: add a metrics counter for these missed tasks, so we will know for sure if one of the nodes is lagging behind or losing too many tasks during transitioning.
A disk that is not able to initialize when an instance is started will never have a handler registered, which means a user would need to restart the node after fixing the disk.
This change also prevents showing the wrong 'upgrade is needed.' error message in that case.
When the disk is still failing, print an error every 30 minutes; disk reconnection will be retried every 30 seconds (see the sketch below).
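A minimal sketch of that retry and throttled-logging loop; the intervals come from the text, everything else is illustrative:
```
package main

import (
	"context"
	"log"
	"time"
)

// retryDiskInit keeps trying to bring a failed disk online every 30 seconds,
// but logs the failure at most once every 30 minutes to avoid log spam.
func retryDiskInit(ctx context.Context, connect func() error) {
	const (
		retryEvery = 30 * time.Second
		logEvery   = 30 * time.Minute
	)
	var lastLog time.Time
	t := time.NewTicker(retryEvery)
	defer t.Stop()
	for {
		select {
		case <-ctx.Done():
			return
		case <-t.C:
			err := connect()
			if err == nil {
				return // disk is back online, its handler can now be registered
			}
			if time.Since(lastLog) >= logEvery {
				log.Printf("disk still offline: %v", err)
				lastLog = time.Now()
			}
		}
	}
}
```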
Co-authored-by: Anis Elleuch <anis@min.io>
`OpMuxConnectError` was not handled correctly.
Remove local checks for single request handlers so they can
run before being registered locally.
Bonus: Only log IAM bootstrap on startup.