minio

mirror of https://github.com/minio/minio.git synced 2025-11-24 19:46:16 -05:00

Author	SHA1	Message	Date
Harshavardhana	b0f0e53bba	fix: make sure to correctly initialize health checks (#17765 ) health checks were missing for drives replaced since - HealFormat() would replace the drives without a health check - disconnected drives when they reconnect via connectEndpoint() the loop also loses health checks for local disks and merges these into a single code. - other than this separate cleanUp, health check variables to avoid overloading them with similar requirements. - also ensure that we compete via context selector for disk monitoring such that the canceled disks don't linger around longer waiting for the ticker to trigger. - allow disabling active monitoring.	2023-08-01 10:54:26 -07:00
Klaus Post	004f1e2f66	Fix trailing header signature mismatch (#17774 ) Seems like clients may omit a newline at the end of the trailer chunk. Each header should end with a newline. Add that if missing. Fixes #17662	2023-08-01 08:45:57 -07:00
Harshavardhana	2fa561f22e	do not crash on invalid metric values (#17764 ) ``` minio[1032735]: panic: label value "\xc0.\xc0." is not valid UTF-8 minio[1032735]: goroutine 1781101 [running]: minio[1032735]: github.com/prometheus/client_golang/prometheus.MustNewConstMetric(...) ``` log such errors for investigation	2023-08-01 00:55:39 -07:00
Harshavardhana	81be718674	fix: optimize DiskInfo() call avoid metrics when not needed (#17763 )	2023-07-31 15:20:48 -07:00
Sho Ce	49a1e2f98e	update-notifier.go: misleading version age message (#17750 )	2023-07-31 08:36:19 -07:00
Klaus Post	684c46369c	Send events for extracted objects (#17760 ) Fixes #17759	2023-07-31 08:33:51 -07:00
Harshavardhana	73edd5b8fd	introduce 'mc admin config set alias/ api odirect=on' (#17753 ) change disable_odirect=off -> odirect=on to make it easier to understand, instead of making it double negative.	2023-07-31 00:12:53 -07:00
Harshavardhana	5e5bdf5432	capture total errors data availability and any timeout errors (#17748 )	2023-07-29 23:26:26 -07:00
Harshavardhana	f13cfcb83e	allow disabling O_DIRECT for write ops (#17751 ) on really slow systems, O_DIRECT simply kills the drives; allow for a way to disable it.	2023-07-29 15:17:56 -07:00
Harshavardhana	731e03fe5a	add ReadFileStream deadline for disk call (#17745 ) timeout the reader side if hung via disk max timeout	2023-07-28 15:37:53 -07:00
Anis Eleuch	7057d00a28	s3: Return invalid bucket name the first thing in all S3 calls (#17742 )	2023-07-28 10:49:20 -07:00
Harshavardhana	114fab4c70	export cluster health as prometheus metrics (#17741 )	2023-07-28 01:16:53 -07:00
ruspaul013	a92cb66468	Get the signed headers in the order they were signed (#17690 ) use pSignValues to get signed headers in order	2023-07-27 11:45:30 -07:00
ruspaul013	535f97ba61	check if metadata headers/url values are equal with signed headers (#17737 )	2023-07-27 11:44:56 -07:00
drivebyer	14ebd82dbd	fix: missing disk metrics when query metric api from peer (#17738 )	2023-07-27 11:44:13 -07:00
Harshavardhana	47dcfcbdd4	introduce deadlines on READ operations (#17724 )	2023-07-27 07:33:05 -07:00
Krishnan Parthasarathi	bf3901342c	Include SuccessorModTime for FileInfo quorum (#17732 )	2023-07-26 17:04:16 -07:00
Harshavardhana	b28bcad11b	avoid Access() calls on known bucket paths (#17719 )	2023-07-26 11:31:40 -07:00
Harshavardhana	a7c71e4c6b	protect disk monitoring to avoid busy loop configuration (#17723 )	2023-07-25 20:02:22 -07:00
Poorna	1a42693d68	replication: limit larger uploads to a subset of workers (#17687 ) Limit large uploads (> 128MiB) to a max of 10 workers, intent is to avoid larger uploads from using all replication bandwidth, giving room for smaller uploads to sync faster.	2023-07-25 20:02:02 -07:00
Harshavardhana	e7b60c4d65	Add slow drive timeouts to match with active disk monitoring (#17701 ) allow active disk-monitoring to be configurable, and use these add deadlines in various call layers for various syscalls.	2023-07-25 16:58:31 -07:00
Poorna	f95129894d	Use decrypted object size while computing object size summary (#17717 ) Corrects an issue with encrypted versioned objects being reported under `unversioned` bin in the object version histogram	2023-07-24 17:13:25 -07:00
Harshavardhana	c32c71c836	allow DNS cache TTL to be configurable (#17709 ) this is added for now as a hidden variable	2023-07-24 15:13:35 -07:00
Harshavardhana	14e1ace552	remove serializing WalkDir() across all buckets/prefixes on SSDs (#17707 ) slower drives get knocked off because they are too slow via active monitoring, we do not need to block calls arbitrarily. Serializing adds latencies for already slow calls, remove it for SSDs/NVMEs Also, add a selection with context when writing to `out <-` channel, to avoid any potential blocks.	2023-07-24 09:30:19 -07:00
drivebyer	a7fb3a3853	fix: Create metrics slice when necessary in getCacheMetrics() (#17711 )	2023-07-24 08:40:21 -07:00
Klaus Post	2da4bd5f1a	Revert "don't error when asked for 0-based range on empty objects (#17708)" (#17713 ) This reverts commit `7e76d66184`. There is no valid way to specify offsets in a 0-byte file. Blame it on the [RFC](https://datatracker.ietf.org/doc/html/rfc7233#section-4.4): "The 416 (Range Not Satisfiable) status code indicates that none of the ranges in the request's Range header field (Section 3.1) overlap the current extent of the selected resource..." A request for "bytes=0-" is a request for the first byte of a resource. If the resource is 0-length, the range [0,0] does not overlap the resource content and the server responds with an error.	2023-07-24 07:56:28 -07:00
flisk	7e76d66184	don't error when asked for 0-based range on empty objects (#17708 ) In a reverse proxying setup, a proxy in front of MinIO may attempt to request objects in slices for enhanced cache efficiency. Since such a proxy cannot have prior knowledge of how large a requested resource is, it usually sends a header of the form: Range: 0-$slice_size ... and, depending on the size of the resource, expects either: - an empty response, if $resource_size == 0 - a full response, if $resource_size <= $slice_size - a partial response, if $resource_size > $slice_size Prior to this change, MinIO would respond 416 Range Not Satisfiable if a client tried to request a range on an empty resource. This behavior is technically consistent with RFC9110[1] – However, it renders sliced reverse proxying, such as implemented in Nginx, broken in the case of empty files. Nginx itself seems to break this convention to enable "useful" responses in these cases, and MinIO should probably do that too. [1]: https://www.rfc-editor.org/rfc/rfc9110#byte.ranges	2023-07-23 00:10:03 -07:00
Harshavardhana	7764f4a8e3	return tags as part of Head/Get calls (#17635 ) AWS S3 only returns the tag count; along with that we must return the tags as well, to avoid another metadata call to the server.	2023-07-22 07:19:43 -07:00
Kaan Kabalak	6624f970c0	Fix spelling of 'already' across repository (#17703 )	2023-07-21 08:45:08 -07:00
Harshavardhana	331bdc2245	fix: remove CompleteMultipartUpload() 200 OK response for blocking calls (#17699 ) sending whitespace character with CompleteMultipartUpload() with 200 OK was an AWS S3 compatible implementation detail, and it was expected that the client SDK must look for both successful XML as well as error XML for 200 OK. But this is not useful anymore on MinIO, since we do not have any large delayed coalescing of parts anymore.	2023-07-20 22:14:38 -07:00
Harshavardhana	e12ab486a2	avoid using os.Getenv for internal code, use env.Get() instead (#17688 )	2023-07-20 07:52:49 -07:00
Krishnan Parthasarathi	9eeee92d36	Add deletemarker_total metric (#17689 )	2023-07-20 07:52:32 -07:00
Anis Eleuch	756d6aa729	fix: report correct pool/set/disk indexes for offline disks (#17695 )	2023-07-20 07:48:21 -07:00
Harshavardhana	bddd53d6d2	fix: retry listing in decommissioning if it fails perpetually (#17682 )	2023-07-19 13:09:37 -07:00
jiuker	a99cd825ab	fix: byHost realTime metrics API (#17681 )	2023-07-18 23:50:30 -07:00
Harshavardhana	6426b74770	move bucket centric metrics to /minio/v2/metrics/bucket handlers (#17663 ) users/customers do not have a reasonable number of buckets anymore, this is why we must avoid overpopulating cluster endpoints, instead move the bucket monitoring to a separate endpoint. this is a breaking change for a couple of metrics, but it is imperative that we do it to improve the responsiveness of our Prometheus cluster endpoint. Bonus: Added new cluster metrics for usage, objects and histograms	2023-07-18 22:25:12 -07:00
Harshavardhana	4f257bf1e6	pick internode interface properly via globalLocalNodeName (#17680 ) current code will not pick the right interface name if --address or --interface is not provided.	2023-07-18 19:18:11 -07:00
Krishnan Parthasarathi	0120ff93bc	admin-info: add DeleteMarkers count (#17659 )	2023-07-18 10:49:40 -07:00
Anis Eleuch	49638fa533	s3: Delete Bucket should not recreate bucket if it does not exist (#17676 ) Also return Bucket Not Found error in the same use case.	2023-07-18 09:32:19 -07:00
Shubhendu	7a3a7b19e5	Added a start script to inspect command output (#17591 ) Using this script, post decrypt we should be able to bring up the MinIO instance with same configuration. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-07-17 14:15:28 -07:00
Harshavardhana	24e86d0c59	avoid passing around poolIdx, setIdx instead pass the relevant disks (#17660 )	2023-07-17 09:52:05 -07:00
jiuker	d118031ed6	fix: when Origin: null is set return back '*' for allow origins (#17651 )	2023-07-15 12:15:06 -07:00
Anis Eleuch	341a89c00d	return a descriptive error when loading any IAM item fails (#17654 ) Sometimes IAM fails to load certain items, which could be a user, a service account or a policy but with not enough information for us to debug. This commit will create a more descriptive error to make it easier to debug in such situations.	2023-07-14 20:17:14 -07:00
Anis Eleuch	df29d25e6b	return different status code for internode communication (#17655 ) mc admin trace -a will be able to quickly show 401 Unauthorized header to pinpoint trivial issues between nodes, such as wrong root credentials and skewed time.	2023-07-14 18:34:55 -07:00
Harshavardhana	3e196fa7b3	fix: ILM newer noncurrent version limit must return correct versions (#17652 ) objects/versions that are not expired via NewerNoncurrentVersions must be properly returned to be applied under further ILM actions. this would cause legitimately expired objects to be missed from expiration.	2023-07-14 16:42:35 -07:00
drivebyer	04c792476f	fix: provide a possible slice cap for heal failed metrics items (#17647 ) Signed-off-by: Wu <yang.wu@daocloud.io>	2023-07-14 11:02:45 -07:00
Harshavardhana	005a4a275a	add more bootstrap messages to provide latency (#17650 ) - simplify refreshing bucket metadata, wait() to depend on how fast the bucket metadata can load. - simplify resync to start resync in single pass.	2023-07-14 04:00:29 -07:00
Harshavardhana	bdddf597f6	shuffle buckets randomly before being scanned (#17644 ) this randomness is needed to avoid scanning the same buckets across different erasure sets, in the same order. allow random buckets to be scanned instead allowing a wider spread of ILM, replication checks. Additionally do not loop over twice to fill the channel, fill the channel regardless of having bucket new or old.	2023-07-14 02:25:40 -07:00
Aditya Manthramurthy	bb6921bf9c	Send AuditLog via new middleware fn for admin APIs (#17632 ) A new middleware function is added for admin handlers, including options for modifying certain behaviors. This admin middleware: - sets the handler context via reflection in the request and sends AuditLog - checks for object API availability (skipping it if a flag is passed) - enables gzip compression (skipping it if a flag is passed) - enables header tracing (adding body tracing if a flag is passed) While the new function is a middleware, due to the flags used for conditional behavior modification, it is used in each route registration call. To try to ensure that no regressions are introduced, the following changes were done mechanically mostly with `sed` and regexp: - Remove defer logger.AuditLog in admin handlers - Replace newContext() calls with r.Context() - Update admin routes registration calls Bonus: remove unused NetSpeedtestHandler Since the new adminMiddleware function checks for object layer presence by default, we need to pass the `noObjLayerFlag` explicitly to admin handlers that should work even when it is not available. The following admin handlers do not require it: - ServerInfoHandler - StartProfilingHandler - DownloadProfilingHandler - ProfileHandler - SiteReplicationDevNull - SiteReplicationNetPerf - TraceHandler For these handlers adminMiddleware does not check for the object layer presence (disabled by passing the `noObjLayerFlag`), and for all other handlers, the pre-check ensures that the handler is not called when the object layer is not available - the client would get a ErrServerNotInitialized and can retry later. This `noObjLayerFlag` is added based on existing behavior for these handlers only.	2023-07-13 14:52:21 -07:00
Klaus Post	4f89e5bba9	Add active disk health checks (#17539 ) Add check every 2 minutes to see if a write+read operation can complete. If disk is unresponsive for 2 minutes or returns errFaultyDisk, take it offline.	2023-07-13 11:41:55 -07:00
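
Commits 7e76d66184 and 2da4bd5f1a above disagree on how a 0-based range request against an empty object should be answered. The following is a minimal sketch of the rule the revert restores, written in Python for brevity and not taken from MinIO's Go code. The function name `status_for_range` and the regular expression are hypothetical, and only a single `bytes=first-[last]` range is handled; suffix ranges and multi-range sets are deliberately out of scope.

```python
import re

# Hypothetical helper, not MinIO's implementation. It accepts one
# "bytes=first-" or "bytes=first-last" range and nothing else.
_RANGE_RE = re.compile(r"^bytes=(\d+)-(\d*)$")


def status_for_range(range_header: str, resource_size: int) -> int:
    """Return 206 if the range overlaps the resource, otherwise 416."""
    match = _RANGE_RE.match(range_header)
    if match is None:
        # Suffix ranges ("bytes=-N") and multi-range sets are not covered.
        raise ValueError(f"unsupported Range header: {range_header!r}")

    first = int(match.group(1))
    last = int(match.group(2)) if match.group(2) else None
    if last is not None and last < first:
        # Malformed spec; kept out of scope for this sketch.
        raise ValueError(f"last byte before first byte: {range_header!r}")

    # The range is satisfiable only if its first byte lies inside the
    # resource. For a 0-byte resource no first byte can, so even
    # "bytes=0-" yields 416 Range Not Satisfiable.
    if first >= resource_size:
        return 416
    return 206
```

Under this rule a slicing proxy that always sends `bytes=0-$slice_size` receives 416 for empty objects, which is the behavior 7e76d66184 tried to relax and 2da4bd5f1a reinstated.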


5389 Commits