minio

mirror of https://github.com/minio/minio.git synced 2025-11-20 18:06:10 -05:00

Author	SHA1	Message	Date
Harshavardhana	9693c382a8	make renameData() more defensive during overwrites (#19548 ) instead upon any error in renameData(), we still preserve the existing dataDir in some form for recoverability in strange situations such as out of disk space type errors. Bonus: avoid running list and heal() instead allow versions disparity to return the actual versions, uuid to heal. Currently limit this to 100 versions and lesser disparate objects. an undo now reverts back the xl.meta from xl.meta.bkp during overwrites on such flaky setups. Bonus: Save N depth syscalls via skipping the parents upon overwrites and versioned updates. Flaky setup examples are stretch clusters with regular packet drops etc, we need to add some defensive code around to avoid dangling objects.	2024-04-23 10:15:52 -07:00
jiuker	ee1047bd52	fix: can't get total disksize for `decom status` (#19585 )	2024-04-23 04:33:28 -07:00
Seiya	5ea5ab162b	Remove leading zero strings in return value of (*xlMetaV2)getDataDirs() (#19567 ) remove leading zero strings in return value of getDataDirs()	2024-04-22 22:07:37 -07:00
Klaus Post	b5a09ff96b	Fix RenameData data race (#19579 ) RenameData could start operating on inline data after timing out and the call returned due to WithDeadline. This could cause a buffer to write to the inline data being written. Since no writes are in `RenameData` and the call is canceled, this doesn't present a corruption issue. But a race is a race and should be fixed. Copy inline data to a fresh buffer.	2024-04-22 22:07:19 -07:00
Harshavardhana	95c65f4e8f	do not panic on rebalance during server restarts (#19563 ) This PR makes a feasible approach to handle all the scenarios that we must face to avoid returning "panic." Instead, we must return "errServerNotInitialized" when a bucketMetadataSys.Get() is called, allowing the caller to retry their operation and wait. Bonus fix the way data-usage-cache stores the object. Instead of storing usage-cache.bin with the bucket as `.minio.sys/buckets`, the `buckets` must be relative to the bucket `.minio.sys` as part of the object name. Otherwise, there is no way to decommission entries at `.minio.sys/buckets` and their final erasure set positions. A bucket must never have a `/` in it. Adds code to read() from existing data-usage.bin upon upgrade.	2024-04-22 10:49:30 -07:00
Harshavardhana	6bfff7532e	re-use transport and set stronger backwards compatible Ciphers (#19565 ) This PR fixes a few things - FIPS support for missing for remote transports, causing MinIO could end up using non-FIPS Ciphers in FIPS mode - Avoids too many transports, they all do the same thing to make connection pooling work properly re-use them. - globalTCPOptions must be set before setting transport to make sure the client conn deadlines are honored properly. - GCS warm tier must re-use our transport - Re-enable trailing headers support.	2024-04-21 04:43:18 -07:00
Harshavardhana	1aa8896ad6	Revert "cleanup: Simplify usage of MinIOSourceProxyRequest (#19553 )" This reverts commit `928c0181bf`. This change was not correct, reverting. We track 3 states with the ProxyRequest header - if replication process wants to know if object is already replicated with a HEAD, it shouldn't proxy back - Poorna	2024-04-20 02:05:54 -07:00
Krishnan Parthasarathi	3e32ceb39f	Disable trailing header support for MinIO tiers (#19561 ) AWS S3 trailing header support was recently enabled on the warm tier client connection to MinIO type remote tiers. With this enabled, we are seeing the following error message at http transport layer. > Unsolicited response received on idle HTTP channel starting with "HTTP/1.1 400 Bad Request\r\nContent-Type: text/plain; charset=utf-8\r\nConnection: close\r\n\r\n400 Bad Request"; err=<nil> This is an interim fix until we identify the root cause for this behaviour in the minio-go client package.	2024-04-19 19:32:25 -07:00
jiuker	9205434ed3	fix: ignore signaturev2 for policy header check (#19551 )	2024-04-19 09:45:54 -07:00
Harshavardhana	cd50e9b4bc	make LRU cache global for internode tokens (#19555 )	2024-04-19 09:45:14 -07:00
Klaus Post	ec816f3840	Reduce parallelReader allocs (#19558 )	2024-04-19 09:44:59 -07:00
Klaus Post	5f774951b1	Store object EC in metadata header (#19534 ) Keep the EC in header, so it can be retrieved easily for dynamic quorum calculations. To not force a full metadata decode on every read the value will be 0/0 for data written in previous versions. Size is expected to increase by 2 bytes per version, since all valid values can be represented with 1 byte each. Example: ``` λ xl-meta xl.meta { "Versions": [ { "Header": { "EcM": 4, "EcN": 8, "Flags": 6, "ModTime": "2024-04-17T11:46:25.325613+02:00", "Signature": "0a409875", "Type": 1, "VersionID": "8e03504e11234957b2727bc53eda0d55" }, ... ``` Not used for operations yet.	2024-04-19 09:43:43 -07:00
Harshavardhana	72f5cb577e	optimize ftp/sftp upload() implementations to avoid CPU load (#19552 )	2024-04-19 05:23:42 -07:00
Robert Lützner	928c0181bf	cleanup: Simplify usage of MinIOSourceProxyRequest (#19553 ) This replaces a convoluted condition that ultimately evaluated to "is this HTTP header present in the request or not?"	2024-04-19 05:23:31 -07:00
Harshavardhana	03767d26da	fix: get rid of large buffers (#19549 ) these lead to run-away usage of memory beyond which the Go's GC can handle, we have to re-visit this differently, remove this for now.	2024-04-19 04:26:59 -07:00
Sveinn	108e6f92d4	updating tests to use new mc --enc flags (#19508 )	2024-04-19 01:43:09 -07:00
Harshavardhana	d653a59fc0	fix: flaky getHostIP test	2024-04-18 19:09:56 -07:00
Aditya Manthramurthy	98f7821eb3	fix: ldap: avoid unnecessary import errors (#19547 ) Follow up for #19528 If there are multiple existing DN mappings for the same normalized DN, if they all have the same policy mapping value, we pick one of them of them instead of returning an import error.	2024-04-18 12:09:19 -07:00
Aditya Manthramurthy	ae46ce9937	ldap: Normalize DNs when importing (#19528 ) This is a change to IAM export/import functionality. For LDAP enabled setups, it performs additional validations: - for policy mappings on LDAP users and groups, it ensures that the corresponding user or group DN exists and if so uses a normalized form of these DNs for storage - for access keys (service accounts), it updates (i.e. validates existence and normalizes) the internally stored parent user DN and group DNs. This allows for a migration path for setups in which LDAP mappings have been stored in previous versions of the server, where the name of the mapping file stored on drives is not in a normalized form. An administrator needs to execute: `mc admin iam export ALIAS` followed by `mc admin iam import ALIAS /path/to/export/file` The validations are more strict and returns errors when multiple mappings are found for the same user/group DN. This is to ensure the mappings stored by the server are unambiguous and to reduce the potential for confusion. Bonus bug fix: IAM export of access keys (service accounts) did not export key name, description and expiration. This is fixed in this change too.	2024-04-18 08:15:02 -07:00
Anis Eleuch	dfc112c06b	list: Fix rare listing continuation freeze (#19524 ) Reading the list metacache is not protected by a lock; the code retries when it fails to read the metacache object, however, it forgot to re-read the metacache object from the drives, which is necessary, especially if the metacache object is inlined. This commit will ensure that we always re-read the metacache object from the drives when it is retrying.	2024-04-17 21:42:11 -07:00
Shireesh Anjal	ca5fab8656	Add cluster audit metrics in metrics-v3 (#19514 ) endpoint: /minio/metrics/v3/cluster/audit metrics: - failed_messages (counter) - total_messages (counter) - target_queue_length (gauge)	2024-04-17 02:18:02 -07:00
Shireesh Anjal	6df76ca73c	Add system memory metrics in v3 (#19486 ) Following memory metrics will be added under /system/memory - available - buffers - cache - free - shared - total - used - used_perc	2024-04-16 22:10:25 -07:00
Harshavardhana	f65dd3e5a2	reload from drive tier-config when in-memory cache is not found (#19527 ) avoid probing tier target while reloading() tier config	2024-04-16 22:09:58 -07:00
Harshavardhana	a8d601b64a	allow detaching any non-normalized DN (#19525 )	2024-04-16 17:36:43 -07:00
Klaus Post	e2709ea129	ftp: Return current time for prefixes/directories (#19519 )	2024-04-16 17:35:55 -07:00
Allan Roger Reid	740ec80819	At server init, use the correct context when creating the KMS Master Key (#19526 )	2024-04-16 17:34:45 -07:00
Allan Roger Reid	7c1f9667d1	Use GetDuration() helper for MINIO_KMS_KEY_CACHE_INTERVAL as time.Duration (#19512 ) Bonus: Use default duration of 10 seconds if invalid input < time.Second is specified	2024-04-16 08:43:39 -07:00
Klaus Post	9246990496	fix: ListObjectVersions returning duplicates when resuming with null version id (#19518 ) When resuming a versioned listing where `version-id-marker=null`, the `null` object would always be returned, causing duplicate entries to be returned. Add check against empty version	2024-04-16 08:41:27 -07:00
Harshavardhana	cb06aee5ac	convert multipart-cleanup from a blocking unlink() to a rename to trash (#19495 ) unlinking() at two different locations on a disk when there are lots to purge, this can lead to huge IOwaits, instead rely on rename() to .trash to avoid running multiple unlinks() in parallel.	2024-04-15 03:02:39 -07:00
Shubhendu	1c70e9ed1b	ILM expiry replication status only if enabled (#19503 ) Report ILM expiry replication status only if atleast one site has the feature enabled. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-04-15 02:40:39 -07:00
jiuker	f3d6a2dd37	code clean for dynamicSleeper (#19499 )	2024-04-15 02:40:19 -07:00
Harshavardhana	d1c58fc2eb	remove older deploymentID fix behavior to speed up startup (#19497 ) since mid 2018 we do not have any deployments without deployment-id, it is time to put this code to rest, this PR removes this old code as its no longer valuable. on setups with 1000's of drives these are all quite expensive operations.	2024-04-15 01:25:46 -07:00
Allan Roger Reid	b8f05b1471	Keep an up-to-date copy of the KMS master key (#19492 )	2024-04-15 00:42:50 -07:00
Klaus Post	e7baf78ee8	fix: list operations resuming when hitting different node (#19494 ) The rest of the peer clients were not consistent across nodes. So, meta cache requests would not go to the same server if a continuation happens on a different node.	2024-04-12 11:13:36 -07:00
Harshavardhana	7e3166475d	simplify common functions in replication (#19480 )	2024-04-11 17:27:32 -07:00
Klaus Post	5206c0e883	Inspect: Add error if no results (#19476 ) When no results match or another error occurs, add an error to the stream. Keep the "inspect-input.txt" as the only thing in the zip for reference. Example: ``` λ mc support inspect --airgap myminio/testbucket/fjghfjh/** mc: Using public key from C:\Users\klaus\mc\support_public.pem File data successfully downloaded as inspect-data.enc λ inspect inspect-data.enc Using private key from support_private.pem output written to inspect-data.zip 2024/04/11 14:10:51 next stream: GetRawData: No files matched the given pattern λ unzip -l inspect-data.zip Archive: inspect-data.zip Length Date Time Name --------- ---------- ----- ---- 222 2024-04-11 14:10 inspect-input.txt --------- ------- 222 1 file λ ``` Modifies inspect to read until end of stream to report the error. Bonus: Add legacy commandline params	2024-04-11 14:22:47 -07:00
Harshavardhana	41ec038523	remove permission denied error for being drive error (#19478 )	2024-04-11 14:22:15 -07:00
Shireesh Anjal	08d3d06a06	Add drive metrics in metrics-v3 (#19452 ) Add following metrics: - used_inodes - total_inodes - healing - online - reads_per_sec - reads_kb_per_sec - reads_await - writes_per_sec - writes_kb_per_sec - writes_await - perc_util To be able to calculate the `per_sec` values, we capture the IOStats-related data in the beginning (along with the time at which they were captured), and compare them against the current values subsequently. This is because dividing by "time since server uptime." doesn't work in k8s environments.	2024-04-11 10:46:34 -07:00
Harshavardhana	074febd9e1	remove SetDiskLoc() rely on the endpoint values instead (#19475 ) the disk location never changes in the lifetime of a MinIO cluster, even if it did validate this close to the disk instead at the higher layer. Return appropriate errors indicating an invalid drive, so that the drive is not recognized as part of a valid drive.	2024-04-11 10:45:28 -07:00
Poorna	ffa91f9794	fix CopyObject with replace overwriting inline status (#19468 ) Fixes #19450 - internal inline-data header can get overwritten during copy with replace before this fix.	2024-04-10 23:42:51 -07:00
Harshavardhana	0c31e61343	allow protection from invalid config values (#19460 ) we have had numerous reports on some config values not having default values, causing features misbehaving and not having default values set properly. This PR tries to address all these concerns once and for all. Each new sub-system that gets added - must check for invalid keys - must have default values set - must not "return err" when being saved into a global state() instead collate as part of other subsystem errors allow other sub-systems to independently initialize.	2024-04-10 18:10:30 -07:00
Harshavardhana	9b926f7dbe	avoid busy loops in bad path component (#19466 ) use it in places where we are looking for such bad path components.	2024-04-10 18:08:52 -07:00
Harshavardhana	35d8728990	handle missing LDAP normalization in SetPolicy() API (#19465 )	2024-04-10 15:37:42 -07:00
Allan Roger Reid	f7ed9a75ba	Allow specifying the local server with env variable _MINIO_SERVER_LOCAL (#19453 ) * Allow specifying the local server, with env variable _MINIO_SERVER_LOCAL, in systems where the hostname cannot be resolved to local IP * Limit scope of the _MINIO_SERVER_LOCAL solution to only containerized implementations	2024-04-10 09:34:59 -07:00
jiuker	ed64e91f06	fix: noHost for collectLocalMetric (#19457 )	2024-04-10 09:28:08 -07:00
jiuker	a481825ae1	fix: unknow contentType for ArchiveFileHandler (#19451 )	2024-04-09 03:41:25 -07:00
Harshavardhana	7bb0f32332	make if-none-match PUT/POST RFC compliant (#19448 ) fixes #19442	2024-04-09 01:17:49 -07:00
Anis Eleuch	c6f8dc431e	Add a warning when the total size of an object versions exceeds 1 TiB (#19435 )	2024-04-08 10:45:03 -07:00
Anis Eleuch	787c44c39d	batch-repl: Do not allow both source/target to be remote (#19434 ) Return an error when the user specifies endpoints for both source and target. This can generate many type of errors as the code considers a deployment remote if its endpoint is specified.	2024-04-08 07:11:38 -07:00
Anis Eleuch	f06fee0364	heal: Add more per disk healing result in the audit (#19427 ) HealObject() does not return an error in some cases, for example, when an object is successfully reconstructed in one disk but fails with other disks, another case is when a disk does not have the object is temporarily disconnected Add the After heal drives result in the audit output for better analysis.	2024-04-08 02:26:14 -07:00

1 2 3 4 5 ...

6056 Commits