minio

mirror of https://github.com/minio/minio.git synced 2025-11-21 18:26:04 -05:00

Author	SHA1	Message	Date
Shireesh Anjal	6d20ec3bea	Add support for resource metrics (#18057 ) Add a new endpoint for "resource" metrics `/v2/metrics/resource` This should return system metrics related to drives, network, CPU and memory. Except for drives, other metrics should have corresponding "avg" and "max" values also. Reuse the real-time feature to capture the required data, introducing CPU and memory metrics in it. Collect the data every minute and keep updating the average and max values accordingly, returning the latest values when the API is called.	2023-09-30 13:40:20 -07:00
Harshavardhana	822cbd4b43	add couple of missing things from #18027	2023-09-13 23:26:48 -07:00
Ravind Kumar	3c19a9308d	DOCS-987: Reorganizing list.md for better RST compatibility (#18027 )	2023-09-13 23:23:37 -07:00
Shubhendu	e47e625f73	Added replication graphs for site replication metrics (#17951 ) This dashboard graphs the metrics when site replication is enabled across MinIO instances. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-08-31 08:31:16 -07:00
Shubhendu	0ce9e00ffa	Added node scanner and node drives graphs (#17949 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-08-30 14:01:51 -07:00
Shubhendu	c778c381b5	Added new bucket replication graphs (#17947 ) This PR adds new bucket replication graphs for better and granular monitoring of bucket replication. Also arranged all replication graphs together. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-08-30 11:57:41 -07:00
Poorna	b48bbe08b2	Add additional info for replication metrics API (#17293 ) to track the replication transfer rate across different nodes, number of active workers in use and in-queue stats to get an idea of the current workload. This PR also adds replication metrics to the site replication status API. For site replication, prometheus metrics are no longer at the bucket level - but at the cluster level. Add prometheus metric to track credential errors since uptime	2023-08-30 01:00:59 -07:00
Shubhendu	c3c8441a1d	Corrected the count of buckets and objects graphs (#17883 ) In distributed setup with a load balancer, randmoly any server would report the metrics `minio_cluster_bucket_total` and `minio_cluster_usage_object_total` and while graphing it, we should take max of reported values. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-08-21 09:04:38 -07:00
Harshavardhana	8a9b886011	update grafana dashboard with disk -> drive rename (#17857 )	2023-08-15 16:04:20 -07:00
Harshavardhana	c4ca0a5a57	add two more drive metrics when metrics is available (#17854 )	2023-08-15 10:55:47 -07:00
Shubhendu	b6b6d6e8d8	Removed replication dashboard (#17815 ) As all replication metrics are moved at bucket level, all replication graphs as well are added under minio-bucket.json. Removing the independent replication dashboard. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-08-08 08:13:45 -07:00
Harshavardhana	114fab4c70	export cluster health as prometheus metrics (#17741 )	2023-07-28 01:16:53 -07:00
Shubhendu	e1731d9403	Added bucket specific grafana dashboard (#17727 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2023-07-26 15:10:11 -07:00
Harshavardhana	e1094dde08	update MinIO replication dashboard with latest metrics	2023-07-21 17:30:04 -07:00
Doom And Love	d004c45386	grafana-dashboard: Update scrape_jobs variable to be single select (#17696 ) Set `includeAll` and `mult` to be false since this dashboard only works with a single value being selected	2023-07-21 14:12:44 -07:00
Krishnan Parthasarathi	9eeee92d36	Add deletemarker_total metric (#17689 )	2023-07-20 07:52:32 -07:00
Harshavardhana	c0a5bdaed9	update grafana dashboard JSON with the new metrics (#17683 )	2023-07-19 08:16:04 -07:00
Harshavardhana	6426b74770	move bucket centric metrics to /minio/v2/metrics/bucket handlers (#17663 ) users/customers do not have a reasonable number of buckets anymore, this is why we must avoid overpopulating cluster endpoints, instead move the bucket monitoring to a separate endpoint. some of it's a breaking change here for a couple of metrics, but it is imperative that we do it to improve the responsiveness of our Prometheus cluster endpoint. Bonus: Added new cluster metrics for usage, objects and histograms	2023-07-18 22:25:12 -07:00
Harshavardhana	7605d07bb2	add support for bucket level request count per API (#17468 ) New metrics added to calculate API request count per bucket, per API. Captures errors, including 4xx, 5xx HTTP status codes separately.	2023-06-21 09:41:59 -07:00
Anis Eleuch	46d45a6923	grafana: Add TCP dial errors panel (#17101 )	2023-04-28 11:11:17 -07:00
Anis Eleuch	2448a9e047	grafana: Remove minio_s3_requests_errors_total metric (#17094 )	2023-04-27 10:55:30 -07:00
Jiffs Maverick	61101d82d9	Rename inodes metric in grafana dashboards (#17030 )	2023-04-21 11:07:30 -07:00
Harshavardhana	e47a31f9fc	fix: object size distribution in metrics for all objects (#16539 )	2023-02-04 21:10:10 -08:00
Anis Elleuch	e73894fa50	grafana: Show one metric for the total data growth (#16449 )	2023-01-20 09:39:28 -08:00
Anis Elleuch	b8943fdf19	doc: Update prometheus metrics list (#16329 )	2022-12-29 15:08:22 -08:00
Daryl White	d44f3526dc	Update links to documentation site (#15750 )	2022-09-28 21:28:45 -07:00
ebozduman	b57e7321e7	Replaces 'disk'=>'drive' visible to end user (#15464 )	2022-08-04 16:10:08 -07:00
MohammadReza	f4d5c861f3	update grafana dashboard (#15357 )	2022-07-21 15:17:44 -07:00
Harshavardhana	8082d1fed6	add bucket level S3 received/sent bytes (#15084 ) adds bucket level metrics for bytes received and sent bytes on all S3 API calls.	2022-06-14 15:14:24 -07:00
Minio Trusted	f34b2ef90b	update dashboard Data Usage Growth as time series	2022-06-13 22:05:36 -07:00
Harshavardhana	7413045f0e	fix: add missing minio_s3_requests_total (#15070 ) PR #15052 caused a regression, add the missing metrics back. Bonus: - internode information should be only for distributed setups - update the dashboard to include 4xx and 5xx error panels.	2022-06-11 00:50:31 -07:00
Anis Elleuch	5fb420c703	prometheus: Add S3 4xx and 5xx S3 monitoring (#15052 ) Currently minio_s3_requests_errors_total covers 4xx and 5xx S3 responses which can be confusing when s3 applications sent a lot of HEAD requests with obvious 404 responses or when the replication is enabled. Add - minio_s3_requests_4xx_errors_total - minio_s3_requests_5xx_errors_total to help users monitor 4xx and 5xx HTTP status codes separately.	2022-06-08 11:22:34 -07:00
Minio Trusted	f63645546d	update minimum goroutine threshold on dashboard	2022-06-06 22:13:54 -07:00
Harshavardhana	c2630bb3a3	add total usage pie chart based on total/free bytes	2022-05-28 09:53:53 -07:00
Eco	81d2b54dfd	doc: typo fix for ttfb entry in table (#14647 )	2022-03-29 09:42:02 -07:00
Harshavardhana	e3e0532613	cleanup markdown docs across multiple files (#14296 ) enable markdown-linter	2022-02-11 16:51:25 -08:00
Krishnan Parthasarathi	0ee2933234	Export tier metrics via Prometheus (#13413 ) e.g ``` minio_cluster_ilm_transitioned_bytes{server="minio3:9000",tier="S3TIER-1"} 1.36317772e+08 minio_cluster_ilm_transitioned_bytes{server="minio3:9000",tier="S3TIER-2"} 2892 minio_cluster_ilm_transitioned_bytes{server="minio3:9000",tier="STANDARD"} 1.3631488e+08 minio_cluster_ilm_transitioned_objects{server="minio3:9000",tier="S3TIER-1"} 1 minio_cluster_ilm_transitioned_objects{server="minio3:9000",tier="S3TIER-2"} 0 minio_cluster_ilm_transitioned_objects{server="minio3:9000",tier="STANDARD"} 1 minio_cluster_ilm_transitioned_versions{server="minio3:9000",tier="S3TIER-1"} 3 minio_cluster_ilm_transitioned_versions{server="minio3:9000",tier="S3TIER-2"} 2 minio_cluster_ilm_transitioned_versions{server="minio3:9000",tier="STANDARD"} 1 ```	2022-02-08 12:45:28 -08:00
Harshavardhana	74faed166a	Add quota usage as part of prometheus metrics (#14222 ) Bonus: pass caller context when needed to all bucket metadata handling calls.	2022-01-31 17:27:43 -08:00
chrisbecke	ef0b8367b5	Update minio-overview.json data source panel (#13730 ) Add missing datasource in `Healing` panel.	2021-11-23 09:01:07 -08:00
Mani	7b82411e6f	change the unit of measurement from TB to TiB (#13686 )	2021-11-18 20:06:37 -08:00
Ashish Kumar Sinha	3d2bc15e9a	Add grafana json file for replication metrics (#13678 )	2021-11-17 14:49:46 -08:00
jandres - moscardo	1aa08f594d	Update README.md prometheus (#13514 ) Modify the doc to warn users about Prometheus sending `domain:port`	2021-11-02 12:27:30 -07:00
Harshavardhana	90e505e58f	calculate API requests/error as increase() intervals not as rate()	2021-09-12 11:28:28 -07:00
Nitish Tiwari	60394ddf83	Add support for changing job name in Grafana dashboard (#13050 )	2021-08-24 09:51:09 -07:00
Krishnan Parthasarathi	30b77f59b1	doc: Add ilm prometheus metrics information (#12994 )	2021-08-17 12:19:36 -07:00
Nitish Tiwari	32017454ee	fix typo in Grafana dashboard json (#12471 )	2021-06-09 08:04:12 -07:00
Nitish Tiwari	00c5d7e1b3	Add healing related metrics in official dashboard (#12456 )	2021-06-07 12:46:54 -07:00
Poorna Krishnamoorthy	3690de0c6b	Drop Pending size and count from replication metrics (#12378 ) Real-time metrics calculated in-memory rely on the initial replication metrics saved with data usage. However, this can lag behind the actual state of the cluster at the time of server restart leading to inaccurate Pending size/counts reported to Prometheus. Dropping the Pending metrics as this can be more reliably monitored by applications with replication notifications. Signed-off-by: Poorna Krishnamoorthy <poorna@minio.io>	2021-05-31 20:26:52 -07:00
Nitish Tiwari	a592d3be19	fix the dashboard to use $rate_interval (#12277 ) refer https://grafana.com/blog/2020/09/28/new-in-grafana-7.2-__rate_interval-for-prometheus-rate-queries-that-just-work/ for further information	2021-05-12 08:06:47 -07:00
Harshavardhana	2fd9c13b50	rename minio-cluster to minio-job as per prometheus config	2021-05-06 12:39:58 -07:00

1 2

82 Commits