Add combining of multiple parts.
Parts will be reconstructed and saved separately, and can manually be combined into the complete object.
Parts will be named `(version_id)-(filename).(partnum).(in)complete`.
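For illustration (these names are hypothetical, following the pattern above):

```
8c0a654f-8c61-4183-9869-eb7f6f8b2334-object.bin.1.complete
8c0a654f-8c61-4183-9869-eb7f6f8b2334-object.bin.2.incomplete
```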
This commit fixes a rare case where a multipart object that is readable
in theory caused the GetObject API to return an error.
It turned out that six-year-old code was marking a drive offline
whenever bitrot streaming failed to read a part from a disk, regardless of the error.
This could affect reading a subsequent part: even with enough shards available,
reconstruction would fail because one drive was marked offline earlier.
This commit removes the drive-offline marking code. It also
closes the bitrot streaming reader before marking it as nil.
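A minimal sketch of the close-before-nil part of the change, using illustrative names rather than MinIO's actual types:

```go
package bitrot

import "io"

// streamReader is an illustrative stand-in for the bitrot streaming reader.
type streamReader struct {
	rc io.ReadCloser
}

// release closes the underlying stream before dropping the reference,
// instead of marking the drive offline on a read error.
func (s *streamReader) release() {
	if s.rc != nil {
		s.rc.Close() // close first so the stream is not leaked
		s.rc = nil   // then mark it as nil
	}
}
```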
Adds `-xver`, which can be used with `-export` and `-combine` to attempt to combine files across versions if the data is suspected to be the same. Overlapping data is compared.
Bonus: Make `inspect` accept wildcards.
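A hypothetical invocation, assuming the tool's existing file-argument form:

```
# Export parts and attempt cross-version combining of suspected-identical data
xl-meta -export -combine -xver xl.meta

# Wildcards are now accepted, e.g. to inspect many files at once
xl-meta 'testdir/*/xl.meta'
```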
Currently, on enabling callhome (or restarting the server), the callhome
job gets scheduled. This means that one has to wait for 24 hours (the
default frequency) to see it in action and to figure out whether it
is working as expected.
It will be a better user experience to perform the first callhome
execution immediately after enabling it (or on server start if already
enabled).
Also, generate an audit event on callhome execution, setting the error
field in case the execution has failed.
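A minimal sketch of the new scheduling behavior; `executeCallhome`, `uploadReport`, and `auditLogCallhome` are illustrative placeholders, not MinIO's actual API:

```go
package callhome

import (
	"context"
	"time"
)

// runCallhome performs the first callhome execution immediately, then
// repeats at the configured frequency (24h by default).
func runCallhome(ctx context.Context, frequency time.Duration) {
	// Immediate first run: users can verify callhome is working
	// without waiting a full frequency interval.
	executeCallhome(ctx)

	t := time.NewTicker(frequency)
	defer t.Stop()
	for {
		select {
		case <-ctx.Done():
			return
		case <-t.C:
			executeCallhome(ctx)
		}
	}
}

// executeCallhome uploads the callhome report and emits an audit event,
// setting the error field when the execution fails.
func executeCallhome(ctx context.Context) {
	err := uploadReport(ctx) // placeholder for the actual upload
	auditLogCallhome(ctx, err)
}

func uploadReport(ctx context.Context) error          { return nil }
func auditLogCallhome(ctx context.Context, err error) {}
```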
* Store ModTime in the upload ID; return it when listing instead of the current time (see the sketch after this list).
* Use this ModTime to expire uploads and to skip reading the file info.
* Consistent upload sorting in listing (since it now has the ModTime).
* Exclude healing disks to avoid returning an empty list.
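A sketch of the idea, assuming (not verbatim from MinIO) that the upload ID is a base64 string with the creation time appended:

```go
package uploads

import (
	"encoding/base64"
	"fmt"
	"strconv"
	"strings"
	"time"
)

// newUploadID embeds the creation ModTime in the upload ID so listings
// can return and sort by it without reading per-upload metadata.
func newUploadID(uuid string, modTime time.Time) string {
	return base64.RawURLEncoding.EncodeToString(
		[]byte(fmt.Sprintf("%s.%d", uuid, modTime.UnixNano())))
}

// parseUploadID recovers the embedded ModTime; callers can use it for
// expiry checks and consistent sorting.
func parseUploadID(id string) (uuid string, modTime time.Time, err error) {
	b, err := base64.RawURLEncoding.DecodeString(id)
	if err != nil {
		return "", time.Time{}, err
	}
	parts := strings.SplitN(string(b), ".", 2)
	if len(parts) != 2 {
		return "", time.Time{}, fmt.Errorf("malformed upload id")
	}
	ns, err := strconv.ParseInt(parts[1], 10, 64)
	if err != nil {
		return "", time.Time{}, err
	}
	return parts[0], time.Unix(0, ns), nil
}
```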
```
==================
WARNING: DATA RACE
Read at 0x0000082be990 by goroutine 205:
github.com/minio/minio/cmd.setCommonHeaders()
Previous write at 0x0000082be990 by main goroutine:
github.com/minio/minio/cmd.lookupConfigs()
```
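The usual remedy for this kind of race, sketched with a hypothetical shared value (the actual variable involved is internal to `cmd`): move the value written by `lookupConfigs` and read by `setCommonHeaders` behind an atomic.

```go
package cmd

import "sync/atomic"

// sharedHeaderValue is a hypothetical stand-in for the global that
// lookupConfigs writes while request handlers concurrently read it.
var sharedHeaderValue atomic.Value

// setSharedHeaderValue is called from configuration loading.
func setSharedHeaderValue(v string) { sharedHeaderValue.Store(v) }

// getSharedHeaderValue is safe to call concurrently from handlers.
func getSharedHeaderValue() string {
	if v, ok := sharedHeaderValue.Load().(string); ok {
		return v
	}
	return ""
}
```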
Recent Veeam versions are very picky about storage class names. Add the `_MINIO_VEEAM_FORCE_SC` env var.
It overrides the storage class returned by the storage backend if it is non-standard
and we detect a Veeam client by checking the User-Agent.
Applies to HeadObject/GetObject/ListObject*.
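A minimal sketch of the override, assuming (not verbatim from MinIO) that Veeam detection is a User-Agent substring check:

```go
package veeam

import (
	"net/http"
	"os"
	"strings"
)

// maybeOverrideSC is a hypothetical helper: if _MINIO_VEEAM_FORCE_SC is
// set and the client looks like Veeam, report that storage class instead
// of a non-standard backend value.
func maybeOverrideSC(r *http.Request, backendSC string) string {
	forced := os.Getenv("_MINIO_VEEAM_FORCE_SC")
	if forced == "" {
		return backendSC
	}
	// Assumed detection heuristic: User-Agent mentions Veeam.
	if !strings.Contains(r.UserAgent(), "Veeam") {
		return backendSC
	}
	// Standard classes pass through unchanged; only non-standard
	// ones are overridden.
	switch backendSC {
	case "STANDARD", "REDUCED_REDUNDANCY":
		return backendSC
	}
	return forced
}
```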
Add deadlines that can be dynamically changed via
the drive max timeout values.
Bonus: optimize the "file not found" case and hung drives/network - circuit-break the check and return right
away instead of waiting.
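A sketch of the pattern under illustrative names (not MinIO's actual types): the timeout lives in an atomic so it can change at runtime, and a hung call fails fast instead of waiting forever.

```go
package drive

import (
	"context"
	"errors"
	"sync/atomic"
	"time"
)

// maxTimeout holds the dynamically configurable drive timeout in
// nanoseconds; it can be updated at runtime without a restart.
var maxTimeout atomic.Int64

func setDriveMaxTimeout(d time.Duration) { maxTimeout.Store(int64(d)) }

// withDriveDeadline wraps a disk call with the current deadline. On a
// hung drive or network the call returns right away once the deadline
// expires, circuit-breaking instead of blocking.
func withDriveDeadline(ctx context.Context, op func(context.Context) error) error {
	d := time.Duration(maxTimeout.Load())
	if d <= 0 {
		return op(ctx) // deadlines disabled
	}
	ctx, cancel := context.WithTimeout(ctx, d)
	defer cancel()

	done := make(chan error, 1) // buffered so the goroutine never leaks
	go func() { done <- op(ctx) }()
	select {
	case err := <-done:
		return err
	case <-ctx.Done():
		return errors.New("drive timed out")
	}
}
```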
Do not log errors on oneway streams when sending ping fails. Instead, cancel the stream.
This also makes sure pings are sent when blocked on sending responses.
I will do a separate PR that includes this and adds pings to two-way streams as well as tests for pings.
Capture the TTFB metric only for the one API where it is beneficial;
capturing it for all APIs significantly increases the response size in
large clusters.
Replace the `io.Pipe` from streamingBitrotWriter -> CreateFile with a fixed-size ring buffer.
This adds an output buffer for encoded shards to be written to disk - potentially via RPC.
This removes blocking when `(*streamingBitrotWriter).Write` is called to write hashes and data.
With current settings, the write looks like this:
```
Outbound
┌───────────────────┐ ┌────────────────┐ ┌───────────────┐ ┌────────────────┐
│ │ Parr. │ │ (http body) │ │ │ │
│ Bitrot Hash │ Write │ Pipe │ Read │ HTTP buffer │ Write (syscall) │ TCP Buffer │
│ Erasure Shard │ ──────────► │ (unbuffered) │ ────────────► │ (64K Max) │ ───────────────────► │ (4MB) │
│ │ │ │ │ (io.Copy) │ │ │
└───────────────────┘ └────────────────┘ └───────────────┘ └────────────────┘
```
We write a Hash (32 bytes). Since the pipe is unbuffered, it will block until the 32 bytes have
been delivered to the TCP buffer, and the next Read hits the Pipe.
Then we write the shard data. This will typically be bigger than 64KB, so it will block until two blocks
have been read from the pipe.
When we insert a ring buffer:
```
Outbound
┌───────────────────┐ ┌────────────────┐ ┌───────────────┐ ┌────────────────┐
│ │ │ │ (http body) │ │ │ │
│ Bitrot Hash │ Write │ Ring Buffer │ Read │ HTTP buffer │ Write (syscall) │ TCP Buffer │
│ Erasure Shard │ ──────────► │ (2MB) │ ────────────► │ (64K Max) │ ───────────────────► │ (4MB) │
│ │ │ │ │ (io.Copy) │ │ │
└───────────────────┘ └────────────────┘ └───────────────┘ └────────────────┘
```
The hash+shard will fit within the ring buffer, so writes will not block - but will complete after a
memcopy. Reads can fill the 64KB buffer if there is data for it.
If the network is congested, the ring buffer will become filled, and all syscalls will be on full buffers.
Only when the ring buffer is filled will erasure coding start blocking.
Since there is always "space" to write output data, we remove the parallel writing: we are
always writing to memory now, and the goroutine synchronization overhead is probably not worth it.
If the output were blocked in the existing implementation, we would still wait for it to unblock in the parallel write, so it makes
no difference there - except that now the ring buffer smooths out the load.
There are some micro-optimizations we could look at later. The biggest is that, in most cases,
we could encode directly into the ring buffer - if we are not at a boundary. Also, "force filling" the
Read requests (i.e., blocking until a full read can be completed) could be investigated, and maybe
allow concurrent memory access on reads and writes.
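For reference, a minimal sketch of such a blocking fixed-size ring buffer (illustrative, not the actual implementation), exposing `io.Writer`/`io.Reader` so it can slot in where the unbuffered `io.Pipe` was:

```go
package ring

import (
	"io"
	"sync"
)

type ringBuffer struct {
	mu       sync.Mutex
	notEmpty *sync.Cond
	notFull  *sync.Cond
	buf      []byte
	r, w     int  // read/write offsets
	full     bool // distinguishes full from empty when r == w
	closed   bool
}

func newRingBuffer(size int) *ringBuffer {
	rb := &ringBuffer{buf: make([]byte, size)}
	rb.notEmpty = sync.NewCond(&rb.mu)
	rb.notFull = sync.NewCond(&rb.mu)
	return rb
}

// Write copies p into the buffer; it only blocks while the buffer is
// full, so a hash+shard that fits completes after a memcopy.
func (rb *ringBuffer) Write(p []byte) (n int, err error) {
	rb.mu.Lock()
	defer rb.mu.Unlock()
	for len(p) > 0 {
		for rb.full && !rb.closed {
			rb.notFull.Wait()
		}
		if rb.closed {
			return n, io.ErrClosedPipe
		}
		end := len(rb.buf) // contiguous writable region ends at the wrap...
		if rb.w < rb.r {
			end = rb.r // ...or at the read offset
		}
		c := copy(rb.buf[rb.w:end], p)
		rb.w = (rb.w + c) % len(rb.buf)
		rb.full = rb.w == rb.r
		p, n = p[c:], n+c
		rb.notEmpty.Signal()
	}
	return n, nil
}

// Read copies buffered bytes into p; it blocks only while the buffer is empty.
func (rb *ringBuffer) Read(p []byte) (int, error) {
	if len(p) == 0 {
		return 0, nil
	}
	rb.mu.Lock()
	defer rb.mu.Unlock()
	for rb.r == rb.w && !rb.full {
		if rb.closed {
			return 0, io.EOF
		}
		rb.notEmpty.Wait()
	}
	end := len(rb.buf) // contiguous readable region ends at the wrap...
	if rb.r < rb.w {
		end = rb.w // ...or at the write offset
	}
	c := copy(p, rb.buf[rb.r:end])
	rb.r = (rb.r + c) % len(rb.buf)
	rb.full = false
	rb.notFull.Signal()
	return c, nil
}

// Close unblocks both sides: writers get io.ErrClosedPipe, readers drain
// the remaining data and then get io.EOF.
func (rb *ringBuffer) Close() error {
	rb.mu.Lock()
	rb.closed = true
	rb.notEmpty.Broadcast()
	rb.notFull.Broadcast()
	rb.mu.Unlock()
	return nil
}
```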
Metrics being added:
- read_tolerance: Number of drive failures that can be tolerated without disrupting read operations
- write_tolerance: Number of drive failures that can be tolerated without disrupting write operations
- read_health: Health of the erasure set in a pool for read operations (1=healthy, 0=unhealthy)
- write_health: Health of the erasure set in a pool for write operations (1=healthy, 0=unhealthy)
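MinIO has its own metrics framework, so treat this Prometheus-style sketch as illustrative only; the full metric names and `pool`/`set` labels are assumptions:

```go
package metrics

import "github.com/prometheus/client_golang/prometheus"

var (
	readTolerance = prometheus.NewGaugeVec(prometheus.GaugeOpts{
		Name: "minio_cluster_health_erasure_set_read_tolerance", // assumed name
		Help: "Number of drive failures that can be tolerated without disrupting read operations",
	}, []string{"pool", "set"})
	writeTolerance = prometheus.NewGaugeVec(prometheus.GaugeOpts{
		Name: "minio_cluster_health_erasure_set_write_tolerance", // assumed name
		Help: "Number of drive failures that can be tolerated without disrupting write operations",
	}, []string{"pool", "set"})
	readHealth = prometheus.NewGaugeVec(prometheus.GaugeOpts{
		Name: "minio_cluster_health_erasure_set_read_health", // assumed name
		Help: "Health of the erasure set in a pool for read operations (1=healthy, 0=unhealthy)",
	}, []string{"pool", "set"})
	writeHealth = prometheus.NewGaugeVec(prometheus.GaugeOpts{
		Name: "minio_cluster_health_erasure_set_write_health", // assumed name
		Help: "Health of the erasure set in a pool for write operations (1=healthy, 0=unhealthy)",
	}, []string{"pool", "set"})
)

func init() {
	prometheus.MustRegister(readTolerance, writeTolerance, readHealth, writeHealth)
}
```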
Adds a regression test for #19699.
Failures are somewhat luck-based, since the test requires objects to be placed on different sets.
However, this generates a failure prior to #19699.
* Revert "Revert "Fix incorrect merging of slash-suffixed objects (#19699)""
This reverts commit f30417d9a8.
* Don't override when the suffix doesn't match. Instead, rely on quorum for each.