minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	f225ca3312	Add more advanced cases for dangling (#18968 )	2024-02-04 14:36:13 -08:00
Frank Wessels	8b68e0bfdc	Fix typo in api-router.go (#18955 )	2024-02-03 14:03:51 -08:00
Anis Eleuch	6ae97aedc9	xl: Disable rename2 in decommissioning/rebalance (#18964 ) Always disable rename2 optimization in decom/rebalance	2024-02-03 14:03:30 -08:00
Harshavardhana	960d604013	disconnected returns, an unexpected error to List() returning 500s (#18959 ) provide the error string appropriately so that the matching of error types works. Also add a string based fallback for the said error.	2024-02-03 01:04:33 -08:00
Harshavardhana	ff80cfd83d	move Make,Delete,Head,Heal bucket calls to websockets (#18951 )	2024-02-02 14:54:54 -08:00
Harshavardhana	99fde2ba85	deprecate disk tokens, instead rely on deadlines and active monitoring (#18947 ) disk tokens usage is not necessary anymore with the implementation of deadlines for storage calls and active monitoring of the drive for I/O timeouts. Functionality kicking off a bad drive is still supported, it's just that we do not have to serialize I/O in the manner tokens would do.	2024-02-02 10:10:54 -08:00
Frank Wessels	31743789dc	Fix some leftover issues from PR 18936 (#18946 )	2024-02-01 19:42:56 -08:00
Anis Eleuch	6fd63e920a	log: Use error log type instead of Application/MinIO type (#18930 ) * log: Use error log type instead of Application/MinIO type Also bump github.com/shirou/gopsutil version to address cross compilation issues. * Apply suggestions from code review Co-authored-by: Aditya Manthramurthy <donatello@users.noreply.github.com> --------- Co-authored-by: Anis Eleuch <anis@min.io> Co-authored-by: Harshavardhana <harsha@minio.io> Co-authored-by: Aditya Manthramurthy <donatello@users.noreply.github.com>	2024-02-01 16:13:57 -08:00
Aditya Manthramurthy	59cc3e93d6	fix: `null` inline policy handling for access keys (#18945 ) Interpret `null` inline policy for access keys as inheriting parent policy. Since MinIO Console currently sends this value, we need to honor it for now. A larger fix in Console and in the server are required. Fixes #18939.	2024-02-01 14:45:03 -08:00
Anis Eleuch	61a4bb38cd	batch: Fix a typo while validating smallerThan field (#18942 )	2024-02-01 13:53:26 -08:00
Klaus Post	b192bc348c	Improve object reuse for grid messages (#18940 ) Allow internal types to support a `Recycler` interface, which will allow for sharing of common types across handlers. This means that all `grid.MSS` (and similar) objects are shared across in a common pool instead of a per-handler pool. Add internal request reuse of internal types. Add for safe (pointerless) types explicitly. Only log params for internal types. Doing Sprint(obj) is just a bit too messy.	2024-02-01 12:41:20 -08:00
Harshavardhana	6440d0fbf3	move a collection of peer APIs to websockets (#18936 )	2024-02-01 10:47:20 -08:00
Anis Eleuch	24ecc44bac	Keep ServiceV1 admin stop/restart API and mark as deprecated (#18932 )	2024-01-31 12:20:33 -08:00
Aditya Manthramurthy	0ae4915a93	fix: permission checks for editing access keys (#18928 ) With this change, only a user with `UpdateServiceAccountAdminAction` permission is able to edit access keys. We would like to let a user edit their own access keys, however the feature needs to be re-designed for better security and integration with external systems like AD/LDAP and OpenID. This change prevents privilege escalation via service accounts.	2024-01-31 10:56:45 -08:00
Harshavardhana	caac9d216e	remove all the frivolous logs, that may or may not be actionable (#18922 ) for actionable, inspections we have `mc support inspect` we do not need double logging, healing will report relevant errors if any, in terms of quorum lost etc.	2024-01-30 18:11:45 -08:00
Harshavardhana	057192913c	add total usable capacity, free and used to DataUsageInfo() (#18921 )	2024-01-30 17:49:37 -08:00
Harshavardhana	f25cbdf43c	use all the available nr_requests for NVMe (#18920 )	2024-01-30 14:10:06 -08:00
Klaus Post	6da4a9c7bb	Improve tracing & notification scalability (#18903 ) * Perform JSON encoding on remote machines and only forward byte slices. * Migrate tracing & notification to WebSockets.	2024-01-30 12:49:02 -08:00
Harshavardhana	80ca120088	remove checkBucketExist check entirely to avoid fan-out calls (#18917 ) Each Put, List, Multipart operations heavily rely on making GetBucketInfo() call to verify if bucket exists or not on a regular basis. This has a large performance cost when there are tons of servers involved. We did optimize this part by vectorizing the bucket calls, however its not enough, beyond 100 nodes and this becomes fairly visible in terms of performance.	2024-01-30 12:43:25 -08:00
Anis Eleuch	a669946357	Add cgroup v2 support for memory limit (#18905 )	2024-01-30 11:13:27 -08:00
Poorna	7ffc162ea8	exclude veeam virtual objects from replication (#18918 ) Fixes: #18916	2024-01-30 10:43:58 -08:00
Poorna	bcfd7fbbcf	reuse transports for callhome and remote tgt validation (#18912 )	2024-01-29 23:05:39 -08:00
Harshavardhana	486e2e48ea	enable xattr capture by default (#18911 ) - healing must not set the write xattr because that is the job of active healing to update. what we need to preserve is permanent deletes. - remove older env for drive monitoring and enable it accordingly, as a global value.	2024-01-29 23:03:58 -08:00
Harshavardhana	2ddf2ca934	allow configuring maximum idle connections per host (#18908 )	2024-01-29 16:50:37 -08:00
Poorna	29b1a29044	fix metrics panic in node metrics endpoint (#18894 )	2024-01-29 12:32:44 -08:00
jiuker	b4ab8e095a	fix: preserve bucket metric of data usage for replication info (#18895 )	2024-01-29 08:54:20 -08:00
Harshavardhana	cff8235068	remove getReplicationNodeMetrics() from peer metrics groups	2024-01-28 18:45:20 -08:00
Harshavardhana	944f3c1477	remove local disk metrics from cluster metrics (#18886 ) local disk metrics were polluting cluster metrics Please remove them instead of adding relevant ones. - batch job metrics were incorrectly kept at bucket metrics endpoint, move it to cluster metrics. - add tier metrics to cluster peer metrics from the node. - fix missing set level cluster health metrics	2024-01-28 12:53:59 -08:00
Harshavardhana	1d3bd02089	avoid close 'nil' panics if any (#18890 ) brings a generic implementation that prints a stack trace for 'nil' channel closes(), if not safely closes it.	2024-01-28 10:04:17 -08:00
Harshavardhana	6347fb6636	add missing proper error return in WalkDir() (#18884 ) without this the caller might end up returning incorrect errors and not ignoring the drive properly.	2024-01-27 16:13:41 -08:00
Harshavardhana	32e668eb94	update() stale rebalance stats() object during pool expansion (#18882 ) it is entirely possible that a rebalance process which was running when it was asked to "stop" it failed to write its last statistics to the disk. After this a pool expansion can cause disruption and all S3 API calls would fail at IsPoolRebalancing() function. This PRs makes sure that we update rebalance.bin under such conditions to avoid any runtime crashes.	2024-01-27 10:14:03 -08:00
Harshavardhana	c88308cf0e	avoid 'panic' on mc admin update for single drive setup (#18876 )	2024-01-26 12:07:03 -08:00
Harshavardhana	88837fb753	add new update v2 that updates per node, allows idempotent behavior (#18859 ) add new update v2 that updates per node, allows idempotent behavior new API ensures that - binary is correct and can be downloaded checksummed verified - committed to actual path - restart returns back the relevant waiting drives	2024-01-26 08:40:13 -08:00
Harshavardhana	d0283ff354	remove unnecessary logs in HealBucket() (#18875 )	2024-01-26 08:39:57 -08:00
Harshavardhana	f449a7ae2c	allow bucket import to be idempotent (#18873 ) do not need to be defensive in our approach, we should simply override anything everything in import process, do not care about what currently exists on the disk - backup is the source of truth.	2024-01-25 17:20:54 -08:00
Klaus Post	a113b2c394	Fix inspect format.json exclusion (#18871 ) Right now the format.json is excluded if anything within `.minio.sys` is requested. I assume the check was meant to exclude only if it was actually requesting it.	2024-01-25 15:59:00 -08:00
Harshavardhana	74851834c0	further bootstrap/startup optimization for reading 'format.json' (#18868 ) - Move RenameFile to websockets - Move ReadAll that is primarily is used for reading 'format.json' to to websockets - Optimize DiskInfo calls, and provide a way to make a NoOp DiskInfo call.	2024-01-25 12:45:46 -08:00
Harshavardhana	e377bb949a	migrate bootstrap logic directly to websockets (#18855 ) improve performance for startup sequences by 2x for 300+ nodes.	2024-01-24 13:36:44 -08:00
Poorna	b6e9d235fe	fix replication error logs to include target endpoint (#18863 )	2024-01-24 13:05:43 -08:00
Klaus Post	4a6c97463f	Fix all racy use of NewDeadlineWorker (#18861 ) AlmosAll uses of NewDeadlineWorker, which relied on secondary values, were used in a racy fashion, which could lead to inconsistent errors/data being returned. It also propagates the deadline downstream. Rewrite all these to use a generic WithDeadline caller that can return an error alongside a value. Remove the stateful aspect of DeadlineWorker - it was racy if used - but it wasn't AFAICT. Fixes races like: ``` WARNING: DATA RACE Read at 0x00c130b29d10 by goroutine 470237: github.com/minio/minio/cmd.(xlStorageDiskIDCheck).ReadVersion() github.com/minio/minio/cmd/xl-storage-disk-id-check.go:702 +0x611 github.com/minio/minio/cmd.readFileInfo() github.com/minio/minio/cmd/erasure-metadata-utils.go:160 +0x122 github.com/minio/minio/cmd.erasureObjects.getObjectFileInfo.func1.1() github.com/minio/minio/cmd/erasure-object.go:809 +0x27a github.com/minio/minio/cmd.erasureObjects.getObjectFileInfo.func1.2() github.com/minio/minio/cmd/erasure-object.go:828 +0x61 Previous write at 0x00c130b29d10 by goroutine 470298: github.com/minio/minio/cmd.(xlStorageDiskIDCheck).ReadVersion.func1() github.com/minio/minio/cmd/xl-storage-disk-id-check.go:698 +0x244 github.com/minio/minio/internal/ioutil.(DeadlineWorker).Run.func1() github.com/minio/minio/internal/ioutil/ioutil.go:141 +0x33 WARNING: DATA RACE Write at 0x00c0ba6e6c00 by goroutine 94507: github.com/minio/minio/cmd.(xlStorageDiskIDCheck).StatVol.func1() github.com/minio/minio/cmd/xl-storage-disk-id-check.go:419 +0x104 github.com/minio/minio/internal/ioutil.(DeadlineWorker).Run.func1() github.com/minio/minio/internal/ioutil/ioutil.go:141 +0x33 Previous read at 0x00c0ba6e6c00 by goroutine 94463: github.com/minio/minio/cmd.(xlStorageDiskIDCheck).StatVol() github.com/minio/minio/cmd/xl-storage-disk-id-check.go:422 +0x47e github.com/minio/minio/cmd.getBucketInfoLocal.func1() github.com/minio/minio/cmd/peer-s3-server.go:275 +0x122 github.com/minio/pkg/v2/sync/errgroup.(*Group).Go.func1() ``` Probably back from #17701	2024-01-24 10:08:31 -08:00
Frank Wessels	6c912ac960	Fix startup message when using single path (#18856 )	2024-01-24 10:02:56 -08:00
Harshavardhana	708cebe7f0	add necessary protection err, fileInfo slice reads and writes (#18854 ) protection was in place. However, it covered only some areas, so we re-arranged the code to ensure we could hold locks properly. Along with this, remove the DataShardFix code altogether, in deployments with many drive replacements, this can affect and lead to quorum loss.	2024-01-24 01:08:23 -08:00
Harshavardhana	f78d677ab6	pre-allocate EC memory by default at startup (#18846 )	2024-01-23 20:41:11 -08:00
Poorna	e39e2306d6	site replication: remove extraneous log for missing group (#18785 )	2024-01-23 18:28:11 -08:00
Harshavardhana	52229a21cb	avoid reload of 'format.json' over the network under normal conditions (#18842 )	2024-01-23 14:11:46 -08:00
Harshavardhana	961f7dea82	compress binary while sending it to all the nodes (#18837 ) Also limit the amount of concurrency when sending binary updates to peers, avoid high network over TX that can cause disconnection events for the node sending updates.	2024-01-22 12:16:36 -08:00
Shubhendu	65c4d550cb	Distribution bucket metrics with site replication (#18841 ) If site replication is enabled, we should still show the size and version distribution histogram metrics at bucket level. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-01-22 08:45:36 -08:00
Harshavardhana	f9b4a8d6e8	improve server update behavior by re-using memory properly (#18831 )	2024-01-19 18:27:58 -08:00
Harshavardhana	e11d851aee	add new drive I/O waiting/tokens metric (#18836 ) Bonus: add virtual memory used as well part of the system resource metrics.	2024-01-19 14:51:36 -08:00
Harshavardhana	ac81f0248c	introduce new ServiceV2 API to handle guided restarts (#18826 ) New API now verifies any hung disks before restart/stop, provides a 'per node' break down of the restart/stop results. Provides also how many blocked syscalls are present on the drives and what users must do about them. Adds options to do pre-flight checks to provide information to the user regarding any hung disks. Provides 'force' option to forcibly attempt a restart() even with waiting syscalls on the drives.	2024-01-19 14:22:36 -08:00
Aditya Manthramurthy	cc960adbee	fix: remove policy mapping file when empty (#18828 ) On a policy detach operation, if there are no policies remaining attached to the user/group, remove the policy mapping file, instead of leaving a file containing an empty list of policies.	2024-01-19 10:31:40 -08:00
Shubhendu	19387cafab	Use +Inf label additionally for Histogram metrics (#18807 )	2024-01-18 14:51:28 -08:00
Harshavardhana	7c0673279b	capture I/O in waiting and total tokens in diskMetrics (#18819 ) This is needed for the subsequent changes in ServerUpdate(), ServerRestart() etc.	2024-01-18 11:17:43 -08:00
Anis Eleuch	7ce0d71a96	Do not log volume not empty when healing dangling buckets (#18822 ) Healing dangling buckets is conservative, and it is a typical use case to fail to remove a dangling bucket because it contains some data because healing danging bucket code is not allowed to remove data: only healing the dangling object is allowed to do so.	2024-01-18 10:39:27 -08:00
Harshavardhana	dd2542e96c	add codespell action (#18818 ) Original work here, #18474, refixed and updated.	2024-01-17 23:03:17 -08:00
Harshavardhana	21d60eab7c	remove all older unused APIs (#18769 )	2024-01-17 20:41:23 -08:00
Harshavardhana	a4a74e9844	re-init the worker group to ensure errs[] slice is fresh	2024-01-17 20:33:25 -08:00
Harshavardhana	9588978028	fix: HealBucket regression for empty buckets, simplify it (#18815 )	2024-01-17 15:19:09 -08:00
chienguo	8cd967803c	fix: a typo in storeDataUsageInBackend() comment (#18778 )	2024-01-16 15:48:54 -08:00
Harshavardhana	a0e1163fb6	reject reference format from a different deployment (#18800 ) reference format is constant for any lifetime of a minio cluster, we do not have to ever replace it during HealFormat() as it will never change. additionally we should simply reject reference formats that we do not understand early on.	2024-01-16 15:13:14 -08:00
Sveinn	30bd5e2669	adding a missing return case to fix GetObjectTagging (#18793 )	2024-01-15 16:11:06 -08:00
Harshavardhana	38637897ba	fix: listing SSE encrypted multipart objects (#18786 ) GetActualSize() was heavily relying on o.Parts() to be non-empty to figure out if the object is multipart or not, However, we have many indicators of whether an object is multipart or not. Blindly assuming that o.Parts == nil is not a multipart, is an incorrect expectation instead, multipart must be obtained via - Stored metadata value indicating this is a multipart encrypted object. - Rely on <meta>-actual-size metadata to get the object's actual size. This value is preserved for additional reasons such as these. - ETag != 32 length	2024-01-15 00:57:49 -08:00
Harshavardhana	993d96feef	treat all localhost endpoints as local setup with same port (#18784 ) fixes #18783 and avoids user mistakes	2024-01-12 23:53:03 -08:00
Poorna	b2b26d9c95	support proxying of tagging requests in replication (#18649 ) support proxying of tagging requests in active-active replication Note: even if proxying is successful, PutObjectTagging/DeleteObjectTagging will continue to report a 404 since the object is not present locally.	2024-01-12 23:51:33 -08:00
Krishnan Parthasarathi	cba3dd276b	Add more size intervals to obj size histogram (#18772 ) New intervals: [1024B, 64KiB) [64KiB, 256KiB) [256KiB, 512KiB) [512KiB, 1MiB) The new intervals helps us see object size distribution with higher resolution for the interval [1024B, 1MiB).	2024-01-12 23:51:08 -08:00
Anis Eleuch	a47fc75c26	xl: Remove wrong wording for errCorruptedFormat (#18775 ) Also add errCorruptedBackend to make it easier to differentiate between corrupted content or something else wrong in the backend drive	2024-01-12 14:48:44 -08:00
Harshavardhana	e5c8794b8b	avoid disk monitoring leaks under various conditions (#18777 ) - HealFormat() was leaking healthcheck goroutines for disks, we are only interested in enabling healthcheck for the newly formatted disk, not for existing disks. - When disk is a root-disk a random disk monitor was leaking while we ignored the drive. - When loading the disk for each erasure set, we were leaking goroutines for the prepare-storage.go disks which were replaced via the globalLocalDrives slice - avoid disk monitoring utilizing health tokens that would cause exhaustion in the tokens, prematurely which were meant for incoming I/O. This is ensured by avoiding writing O_DIRECT aligned buffer instead write 2048 worth of content only as O_DSYNC, which is sufficient.	2024-01-12 01:48:36 -08:00
Taran Pelkey	ac90a873eb	Verify that remote target bucket is on MinIO server for bucket replication (#18656 )	2024-01-11 14:56:16 -08:00
jiuker	c1a78224cf	fix: prevent queries from starting before initialization (#18766 )	2024-01-10 15:21:52 -08:00
Harshavardhana	39f9350697	optimize readdir() open calls to be dealt with directly via 'fd' (#18762 )	2024-01-10 08:48:50 -08:00
Shubhendu	e31081d79d	Heal buckets at node level (#18612 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-01-09 20:34:04 -08:00
Harshavardhana	f02d282754	avoid frivolous logs for expired credentials (#18767 )	2024-01-09 12:25:18 -08:00
Krishnan Parthasarathi	3a90af0bcd	Add line, col to types used in batch-expire (#18747 )	2024-01-08 15:22:28 -08:00
jiuker	53ceb0791f	fix: prevent queries from starting before initialization (#18756 ) Prevent queries from starting before initialization	2024-01-08 12:40:27 -08:00
jiuker	2cd98a0d21	remove outdated notes (#18755 )	2024-01-08 08:04:19 -08:00
Anis Eleuch	04135fa6cd	audit: Add the drives where the dangling object is removed (#18737 )	2024-01-05 14:17:24 -08:00
Harshavardhana	42dc6329e6	simplify success response for GetObjectAttributes() (#18746 )	2024-01-05 12:50:07 -08:00
Sveinn	9b8ba97f9f	feat: add support for GetObjectAttributes API (#18732 )	2024-01-05 10:43:06 -08:00
Anis Eleuch	7705605b5a	scanner: Add a config to disable short sleep between objects scan (#18734 ) Add a hidden configuration under the scanner sub section to configure if the scanner should sleep between two objects scan. The configuration has only effect when there is no drive activity related to s3 requests or healing. By default, the code will keep the current behavior which is doing sleep between objects. To forcefully enable the full scan speed in idle mode, you can do this: `mc admin config set myminio scanner idle_speed=full`	2024-01-04 15:07:17 -08:00
Anis Eleuch	414bcb0c73	prom: Add read quorum per erasure set metric (#18736 )	2024-01-04 15:05:13 -08:00
Harshavardhana	f4710948c4	fix: an odd crash when deleting `null` DEL markers (#18727 ) fixes #18724 A regression was introduced in #18547, that attempted to file adding a missing `null` marker however we should not skip returning based on versionID instead it must be based on if we are being asked to create a DEL marker or not. The PR also has a side-affect for replicating `null` marker permanent delete, as it may end up adding a `null` marker while removing one. This PR should address both scenarios.	2024-01-02 15:08:18 -08:00
Anis Eleuch	3f4488c589	scanner: Allow full throttle if there is no parallel disk ops (#18109 )	2024-01-02 13:51:24 -08:00
Pedro Juarez	8f13c8c3bf	Support to store browser config settings (#18631 ) * csp_policy * hsts_seconds * hsts_include_subdomains * hsts_preload * referrer_policy	2024-01-01 08:36:33 -08:00
Zhou Ting	31d16f6cc2	allow sha256 payload to be configurable for object perf test (#18712 ) Signed-off-by: Zhou Ting <ting.z.zhou@intel.com>	2023-12-29 23:56:50 -08:00
Harshavardhana	a50ea92c64	feat: introduce list_quorum="auto" to prefer quorum drives (#18084 ) NOTE: This feature is not retro-active; it will not cater to previous transactions on existing setups. To enable this feature, please set ` _MINIO_DRIVE_QUORUM=on` environment variable as part of systemd service or k8s configmap. Once this has been enabled, you need to also set `list_quorum`. ``` ~ mc admin config set alias/ api list_quorum=auto` ``` A new debugging tool is available to check for any missing counters.	2023-12-29 15:52:41 -08:00
Harshavardhana	5b2ced0119	re-use globalLocalDrives properly (#18721 )	2023-12-29 09:30:10 -08:00
Anis Eleuch	8a0ba093dd	audit: Fix merrs and derrs object dangling message (#18714 ) merrs and derrs are empty when a dangling object is deleted. Fix the bug and adds invalid-meta data for data blocks	2023-12-27 22:27:04 -08:00
Daniel Valdivia	5fc7da345d	Upgrade Console to v0.44.0 (#18717 ) Signed-off-by: Daniel Valdivia <18384552+dvaldivia@users.noreply.github.com>	2023-12-27 11:19:13 -08:00
Anis Eleuch	8bd4f6568b	server-info: Avoid initializing audit/log http/kafka targets (#18703 ) This can cause unnecessary ServerInfo() call delay.	2023-12-22 10:25:08 -08:00
Harshavardhana	da55499db0	fix: reject clients that do not send proper payload (#18701 )	2023-12-22 01:26:17 -08:00
Anis Eleuch	22f8e39b58	tier: Allow edit of the new Azure and AWS auth params (#18690 ) Allow editing for the service principal credentials from Azure and the web identity token for AWS; Also, more validation of input parameters.	2023-12-21 16:58:10 -08:00
Harshavardhana	eba23bbac4	rename object_size -> block_size for cache subsystem (#18694 )	2023-12-21 16:57:13 -08:00
Harshavardhana	4550535cbb	send proper IPv6 names avoid bracketing notation (#18699 ) Following policies if present ``` "Condition": { "IpAddress": { "aws:SourceIp": [ "54.240.143.0/24", "2001:DB8:1234:5678::/64" ] } } ``` And client is making a request to MinIO via IPv6 can potentially crash the server. Workarounds are turn-off IPv6 and use only IPv4	2023-12-21 16:56:55 -08:00
Anis Eleuch	8432fd5ac2	prom: Add online and healing drives metrics per erasure set (#18700 )	2023-12-21 16:56:43 -08:00
Harshavardhana	7c948adf88	allow pre-allocating buffers to reduce frequent GCs during growth (#18686 ) This PR also increases per node bpool memory from 1024 entries to 2048 entries; along with that, it also moves the byte pool centrally instead of being per pool.	2023-12-21 08:59:38 -08:00
Krishnan Parthasarathi	56b7045c20	Export tier metrics (#18678 ) minio_node_tier_ttlb_seconds - Distribution of time to last byte for streaming objects from warm tier minio_node_tier_requests_success - Number of requests to download object from warm tier that were successful minio_node_tier_requests_failure - Number of requests to download object from warm tier that failed	2023-12-20 20:13:40 -08:00
Poorna	d55b6b9909	Fix quota config replication for SR (#18684 ) Fixing regression introduced by PR #17988	2023-12-19 13:22:47 -08:00
Shireesh Anjal	7680e5f81d	Read new key license_v2 from SUBNET response (#18669 ) SUBNET now has a v2 of license that is returned in the new key `license_v2`. mc will start reading and storing the same. (The old key `license` is deprecated but is still available in SUBNET response to ensure that the current released version of minio doesn't break)	2023-12-18 08:21:44 -08:00
Taran Pelkey	ad8a34858f	Add APIs to create and list access keys for LDAP (#18402 )	2023-12-15 13:00:43 -08:00
Krishnan Parthasarathi	162eced7d2	Fix incorrect metric desc for bucketRequestsDuration (#18657 )	2023-12-14 19:02:11 -08:00
Krishnan Parthasarathi	bec1f7c26a	metrics: Refactor handling of histogram vectors (#18632 )	2023-12-14 14:02:52 -08:00
Anis Eleuch	8771617199	tier: Add support of AWS S3 tiering with web identity token file (#18648 )	2023-12-14 14:01:49 -08:00
Klaus Post	6c89a81af4	Fix CreateFile shared buffer corruption. (#18652 ) `(xlStorageDiskIDCheck).CreateFile` wraps the incoming reader in `xioutil.NewDeadlineReader`. The wrapped reader is handed to `(xlStorage).CreateFile`. This performs a Read call via `writeAllDirect`, which reads into an `ODirectPool` buffer. `(*DeadlineReader).Read` spawns an async read into the buffer. If a timeout is hit while reading, the read operation returns to `writeAllDirect`. The operation returns an error and the buffer is reused. However, if the async `Read` call unblocks, it will write to the now recycled buffer. Fix: Remove the `DeadlineReader` - it is inherently unsafe. Instead, rely on the network timeouts. This is not a disk timeout, anyway. Regression in https://github.com/minio/minio/pull/17745	2023-12-14 10:51:57 -08:00
Praveen raj Mani	10ca0a6936	Label the notification target metrics by their target IDs (#18633 ) This patch adds the targetID to the existing notification target metrics and deprecates the current target metrics which points to the overall event notification subsystem	2023-12-14 09:09:26 -08:00
Harshavardhana	b3314e97a6	re-use the same local drive used by remote-peer (#18645 ) historically, we have always kept storage-rest-server and a local storage API separate without much trouble, since they both can independently operate due to no special state() between them. however, over some time, we have added state() such as - drive monitoring threads now there will be "2" of them per drive instead of just 1. - concurrent tokens available per drive are now twice instead of just single shared, allowing unexpectedly high amount of I/O to go through. - applying serialization by using walkMutexes can now be adequately honored for both remote callers and local callers.	2023-12-13 19:27:55 -08:00
Poorna	3781a0f9ad	replication: Pass metadata timestamps in CopyObject call (#18647 ) Regression from #18285. CopyObject options were inheriting source MTime for metadata timestamps if unspecified, removing this prevented metadata updates from being applied on target.	2023-12-13 15:28:55 -08:00
Poorna	e79b289325	fix datadir missing check on HeadObject (#18646 ) versions pending purge in replication were seeing a errFileCorrupt that prevents permanent deletion after replication. Regression from PR#18477	2023-12-13 14:54:01 -08:00
Harshavardhana	3f72c7fcc7	healthcheck requests with user-agent mozilla do not need redirects (#18642 ) apparently, windows powershell curl has this abhorrent behavior	2023-12-12 16:16:26 -08:00
Harshavardhana	d521c84d55	reduce logging during permission denied errors (#18641 ) log them if any only once	2023-12-12 16:11:17 -08:00
Anis Eleuch	4a21dce2b5	tier: Add support of SP credentials with Azure (#18630 ) Co-authored-by: Anis Elleuch <anis@min.io>	2023-12-11 21:51:53 -08:00
Harshavardhana	65f34cd823	fix: remove ODirectReader entirely since we do not need it anymore (#18619 )	2023-12-09 10:17:51 -08:00
Harshavardhana	196e7e072b	allow bitrot files to be healed in MRF (#18618 ) bitrot scanMode was ignored in MRF, allow it to heal relevant content if needed when seen as an error.	2023-12-08 12:26:01 -08:00
Anis Eleuch	6f97663174	yml-config: Add support of rootUser and rootPassword (#18615 ) Users can define the root user and password in the yaml configuration file; Root credentials defined in the environment variable still take precedence	2023-12-08 12:04:54 -08:00
Anis Eleuch	aed7a1818a	info: Populate pool/set/disk indexes for offline disks (#18613 ) This can be calculated from the disk layout and some external applications would like to know the location of the offline disks.	2023-12-08 08:13:04 -08:00
Poorna	6b06da76cb	add configuration to limit replication workers (#18601 )	2023-12-07 16:22:00 -08:00
jiuker	6ca6788bb7	feat: add events_errors_total metric (#18610 )	2023-12-07 16:21:17 -08:00
Anis Eleuch	2e23e61a45	Add support of conf file to pass arguments and options (#18592 )	2023-12-07 01:33:56 -08:00
Harshavardhana	53ce92b9ca	fix: use the right channel to feed the data in (#18605 ) this PR fixes a regression in batch replication where we weren't sending any data from the Walk() results due to incorrect channels being used.	2023-12-06 18:17:03 -08:00
Shireesh Anjal	7350a29fec	Capture percentage of cpu load and memory used (#18596 ) By default the cpu load is the cumulative of all cores. Capture the percentage load (load * 100 / cpu-count) Also capture the percentage memory used (used * 100 / total)	2023-12-06 13:19:59 -08:00
jiuker	5cc2c62c66	fix: GetFreePort() will get the same port (#18604 )	2023-12-06 10:36:42 -08:00
Harshavardhana	4bc5ed6c76	support LDAP service accounts via SFTP, FTP logins (#18599 )	2023-12-06 04:31:35 -08:00
Harshavardhana	73dde66dbe	stick to go1.19 go.mod (#18600 )	2023-12-06 01:09:22 -08:00
Harshavardhana	e30c0e7ca3	Revert "Heal buckets at node level (#18504 )" This reverts commit `708296ae1b`.	2023-12-05 22:34:46 -08:00
Shubhendu	708296ae1b	Heal buckets at node level (#18504 )	2023-12-05 02:17:35 -08:00
Harshavardhana	fbb5e75e01	avoid run-away goroutine build-up in notification send, use channels (#18533 ) use memory for async events when necessary and dequeue them as needed, for all synchronous events customers must enable ``` MINIO_API_SYNC_EVENTS=on ``` Async events can be lost but is upto to the admin to decide what they want, we will not create run-away number of goroutines per event instead we will queue them properly. Currently the max async workers is set to runtime.GOMAXPROCS(0) which is more than sufficient in general, but it can be made configurable in future but may not be needed.	2023-12-05 02:16:33 -08:00
Harshavardhana	f327b21557	handle crashes with ILM expiry changes (#18590 )	2023-12-05 01:14:36 -08:00
Harshavardhana	45b7253f39	parallelize renameData() cleanup upon error (#18591 )	2023-12-04 14:54:34 -08:00
Harshavardhana	05bb655efc	avoid caching metrics for timeout errors per drive (#18584 ) Bonus: combine the loop for drive/REST registration.	2023-12-04 11:54:13 -08:00
Harshavardhana	8fdfcfb562	upon RenameData() quorum error delete any partial success (#18586 ) there is potential for danglingWrites when quorum failed, where only some drives took a successful write, generally this is left to the healing routine to pick it up. However it is better that we delete it right away to avoid potential for quorum issues on version signature when there are many versions of an object.	2023-12-04 11:33:39 -08:00
Harshavardhana	e7c144eeac	avoid double MRF heal when there is versions disparity (#18585 )	2023-12-04 11:13:50 -08:00
Harshavardhana	e98172d72d	avoid hot-tier SLA to be tied to warm-tier SLA (#18581 ) it is okay if the warm-tier cannot keep up, we should continue to take I/O at hot-tier, only fail hot-tier or block it when we are disk full. Bonus: add metrics counter for these missed tasks, we will know for sure if one of the node is lagging behind or is losing too many tasks during transitioning.	2023-12-02 13:02:12 -08:00
Krishnan Parthasarathi	a50f26b7f5	Implement batch-expiration for objects (#17946 ) Based on an initial PR from - https://github.com/minio/minio/pull/17792 But fully completes it with newer finalized YAML spec.	2023-12-02 02:51:33 -08:00
Klaus Post	69294cf98a	Disable DMA optimization on windows (#18575 ) It appears that Windows can lock up when errors occur. Use regular copy here.	2023-12-01 16:13:19 -08:00
Krishnan Parthasarathi	c397fb6c7a	Minor fixes to bucket replication (#18578 )	2023-12-01 16:13:08 -08:00
Klaus Post	961b0b524e	Do not require restart when a disk is unreachable during node boot (#18576 ) A disk that is not able to initialize when an instance is started will never have a handler registered, which means a user will need to restart the node after fixing the disk; This will also prevent showing the wrong 'upgrade is needed.' error message in that case. When the disk is still failing, print an error every 30 minutes; Disk reconnection will be retried every 30 seconds. Co-authored-by: Anis Elleuch <anis@min.io>	2023-12-01 12:01:14 -08:00
Harshavardhana	109a9e3f35	skip ILM expired objects from healing (#18569 )	2023-12-01 07:56:24 -08:00
Klaus Post	5f971fea6e	Fix Mux Connect Error (#18567 ) `OpMuxConnectError` was not handled correctly. Remove local checks for single request handlers so they can run before being registered locally. Bonus: Only log IAM bootstrap on startup.	2023-12-01 00:18:04 -08:00
Klaus Post	94fbcd8ebe	Add TLS cert checksum (#18557 ) It allows validation of whether all certs match across clusters.	2023-11-30 12:13:50 -08:00
Harshavardhana	879d5dd236	site replication must heal policy mappings with correct userType (#18563 )	2023-11-30 10:34:18 -08:00
Harshavardhana	0ee722f8c3	cleanup handling of STS isAllowed and simplifies the PolicyDBGet() (#18554 )	2023-11-29 16:07:35 -08:00
Anis Eleuch	b7d11141e1	rename Force to Immediate for clarity (#18540 )	2023-11-28 22:35:16 -08:00
Klaus Post	bea0b050cd	Improve env var config error reporting (#18549 ) Improve env var config error Env vars that were set on current server but not on remotes were not reported in errors. Add these.	2023-11-28 10:39:02 -08:00
Shubhendu	ce62980d4e	Fixed transition rules getting overwritten while healing (#18542 ) While healing the latest changes of expiry rules across sites if target had pre existing transition rules, they were getting overwritten as cloned latest expiry rules from remote site were getting written as is. Fixed the same and added test cases as well. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-11-28 10:38:35 -08:00
Klaus Post	dc88865908	fix: shadowed error in getObjectFileInfo() (#18548 ) This will result in `done <- err == nil` always returning true for this path, which seems unintentional.	2023-11-28 09:47:41 -08:00
Krishnan Parthasarathi	9fbd931058	Skip versions expired by DeleteAllVersionsAction (#18537 ) Object versions expired by DeleteAllVersionsAction must not be included toward data-usage accounting.	2023-11-28 08:39:21 -08:00
jiuker	b0264bdb90	preserve null version delete marker on suspended bucket version (#18547 )	2023-11-28 08:31:33 -08:00
bestgopher	95d6f43cc8	fix(cmd/notification.go): no error when retry successful (#18530 )	2023-11-27 22:41:03 -08:00
Anis Eleuch	9cb94eb4a9	cleaning up will delete instead of rename to trash with full disk err (#18534 ) moveToTrash() function moves a folder to .trash, for example, when doing some object deletions: a data dir that has many parts will be renamed to the trash folder; However, ENOSPC is a valid error from rename(), and it can cripple a user trying to free some space in an entire disk situation. Therefore, this commit will try to do a recursive delete in that case.	2023-11-27 17:36:02 -08:00
Harshavardhana	bd0819330d	avoid Walk() API listing objects without quorum (#18535 ) This allows batch replication to basically do not attempt to copy objects that do not have read quorum. This PR also allows walk() to provide custom values for quorum under batch replication, and key rotation.	2023-11-27 17:20:04 -08:00
Harshavardhana	8d9e83fd99	support passing signatureAge conditional (#18529 ) this PR allows following policy ``` { "Version": "2012-10-17", "Statement": [ { "Sid": "Deny a presigned URL request if the signature is more than 10 min old", "Effect": "Deny", "Action": "s3:", "Resource": "arn:aws:s3:::DOC-EXAMPLE-BUCKET1/", "Condition": { "NumericGreaterThan": { "s3:signatureAge": 600000 } } } ] } ``` This is to basically disable all pre-signed URLs that are older than 10 minutes.	2023-11-27 11:30:19 -08:00

1 2 3 4 5 ...

5885 Commits