minio

mirror of https://github.com/minio/minio.git synced 2024-12-27 15:45:55 -05:00

Author	SHA1	Message	Date
Klaus Post	ac055b09e9	Add detailed scanner metrics (#15161 )	2022-07-05 14:45:49 -07:00
Harshavardhana	c7ed6eee5e	fix: background local test also via channel (#15086 ) current implementation for `standalone` setups was blocking the `perf drive`. Bonus: remove all old unused complicated code.	2022-06-15 14:51:42 -07:00
Harshavardhana	f8650a3493	fetch bucket replication stats across peers in single call (#14956 ) current implementation relied on recursively calling one bucket at a time across all peers, this would be very slow and chatty when there are 100's of buckets which would mean 100*peerCount amount of network operations. This PR attempts to reduce this entire call into `peerCount` amount of network calls only. This functionality addresses also a concern where the Prometheus metrics would significantly slow down when one of the peers is offline.	2022-05-23 09:15:30 -07:00
Harshavardhana	040ac5cad8	fix: when logger queue is full exit quickly upon doneCh (#14928 ) Additionally only reload requested sub-system not everything	2022-05-16 16:10:51 -07:00
Harshavardhana	35dea24ffd	fix: console log peer API from its broken implementation (#14873 ) console logging peer API was broken as it would timeout after 15minutes, this never really worked beyond this value and basically failed to provide the streaming "log" functionality that was expected from this implementation. also fix convoluted channel handling by keeping things simple, this is rewritten.	2022-05-06 12:39:58 -07:00
Harshavardhana	2a6a40e93b	enable go1.18.x builds (#14746 )	2022-04-13 14:21:55 -07:00
Harshavardhana	eda34423d7	update gofumpt -w - new changes	2022-04-13 12:00:11 -07:00
Krishna Srinivas	4d0715d226	Implement netperf for "mc support perf net" (#14397 ) Co-authored-by: Klaus Post <klauspost@gmail.com>	2022-03-08 09:54:38 -08:00
Sidhartha Mani	d7df6bc738	add support for speedtest drive (#14182 )	2022-02-01 22:38:05 -08:00
Krishnan Parthasarathi	d2e5f01542	feat: maintain in-memory tier stats for the last 24hrs (#13782 )	2022-01-26 14:33:10 -08:00
Harshavardhana	76b21de0c6	feat: decommission feature for pools (#14012 ) ``` λ mc admin decommission start alias/ http://minio{1...2}/data{1...4} ``` ``` λ mc admin decommission status alias/ ┌─────┬─────────────────────────────────┬──────────────────────────────────┬────────┐ │ ID │ Pools │ Capacity │ Status │ │ 1st │ http://minio{1...2}/data{1...4} │ 439 GiB (used) / 561 GiB (total) │ Active │ │ 2nd │ http://minio{3...4}/data{1...4} │ 329 GiB (used) / 421 GiB (total) │ Active │ └─────┴─────────────────────────────────┴──────────────────────────────────┴────────┘ ``` ``` λ mc admin decommission status alias/ http://minio{1...2}/data{1...4} Progress: ===================> [1GiB/sec] [15%] [4TiB/50TiB] Time Remaining: 4 hours (started 3 hours ago) ``` ``` λ mc admin decommission status alias/ http://minio{1...2}/data{1...4} ERROR: This pool is not scheduled for decommissioning currently. ``` ``` λ mc admin decommission cancel alias/ ┌─────┬─────────────────────────────────┬──────────────────────────────────┬──────────┐ │ ID │ Pools │ Capacity │ Status │ │ 1st │ http://minio{1...2}/data{1...4} │ 439 GiB (used) / 561 GiB (total) │ Draining │ └─────┴─────────────────────────────────┴──────────────────────────────────┴──────────┘ ``` > NOTE: Canceled decommission will not make the pool active again, since we might have > Potentially partial duplicate content on the other pools, to avoid this scenario be > very sure to start decommissioning as a planned activity. ``` λ mc admin decommission cancel alias/ http://minio{1...2}/data{1...4} ┌─────┬─────────────────────────────────┬──────────────────────────────────┬────────────────────┐ │ ID │ Pools │ Capacity │ Status │ │ 1st │ http://minio{1...2}/data{1...4} │ 439 GiB (used) / 561 GiB (total) │ Draining(Canceled) │ └─────┴─────────────────────────────────┴──────────────────────────────────┴────────────────────┘ ```	2022-01-10 09:07:49 -08:00
Shireesh Anjal	c27110e37d	Add timeinfo to health data (#14013 ) Capture RoundtripDuration to figure out NTP issues in subnet health analyzer.	2022-01-06 01:51:10 -08:00
Harshavardhana	f527c708f2	run gofumpt cleanup across code-base (#14015 )	2022-01-02 09:15:06 -08:00
Klaus Post	aca6dfbd60	Check for nil RPC in listing (#13917 ) Fixes #13915	2021-12-15 09:19:11 -08:00
Harshavardhana	99d87c5ca2	fix: totalDrives reported in speedTest for multiple-pools (#13770 ) totalDrives reported in speedTest result were wrong for multiple pools, this PR fixes this. Bonus: add support for configurable storage-class, this allows us to test REDUCED_REDUNDANCY to see further maximum throughputs across the cluster.	2021-11-29 09:05:46 -08:00
Klaus Post	142c6b11b3	Reduce JWT overhead for internode tokens (#13738 ) Since JWT tokens remain valid for up to 15 minutes, we don't have to regenerate tokens for every call. Cache tokens for matching access+secret+audience for up to 15 seconds. ``` BenchmarkAuthenticateNode/uncached-32 270567 4179 ns/op 2961 B/op 33 allocs/op BenchmarkAuthenticateNode/cached-32 7684824 157.5 ns/op 48 B/op 1 allocs/op ``` Reduces internode call allocations a great deal.	2021-11-23 09:51:53 -08:00
Sidhartha Mani	c57ff2640e	recognize slow networks to step down faster during netperf (#13473 )	2021-10-20 03:22:07 -07:00
Aditya Manthramurthy	3a7c79e2c7	Add new site replication feature (#13311 ) This change allows a set of MinIO sites (clusters) to be configured for mutual replication of all buckets (including bucket policies, tags, object-lock configuration and bucket encryption), IAM policies, LDAP service accounts and LDAP STS accounts.	2021-10-06 16:36:31 -07:00
Harshavardhana	0f7a51f461	fix: speedTest between peers keep the connection alive (#13120 ) for longer durations keep the speedTest alive instead of timing them out based on ResponseHeaderTimeout.	2021-08-31 14:08:23 -07:00
Shireesh Anjal	ce05e67a0c	Add admin api to return sys config info (#12988 ) The intention is to list values of sys config that can potentially impact the performance of minio. At present, it will return max value configured for rlimit Signed-off-by: Shireesh Anjal <shireesh@minio.io> Co-authored-by: Harshavardhana <harsha@minio.io>	2021-08-24 17:09:37 -07:00
Shireesh Anjal	d44e4399e6	Add admin api to return sys services info (#12939 ) The intention is to provide status of any sys services that can potentially impact the performance of minio. At present, it will return information about the `selinux` service (not-installed/disabled/permissive/enforcing) Signed-off-by: Shireesh Anjal <shireesh@minio.io>	2021-08-12 18:58:40 -07:00
Shireesh Anjal	56097dfca8	Include system errors in health report (#12776 ) Some examples of system errors: - audit is enabled - updatedb is installed	2021-07-29 23:05:34 -07:00
Krishna Srinivas	aa0c28809b	Server side speedtest implementation (#12750 )	2021-07-27 12:55:56 -07:00
Harshavardhana	da74e2f167	move internal/net to pkg/net package (#12505 )	2021-06-14 14:54:37 -07:00
Anis Elleuch	6c8be64cdb	rest: healthcheck should not update failure metrics (#12458 ) Otherwise, we can see high numbers of networking issues when a node is down.	2021-06-08 14:09:26 -07:00
Harshavardhana	1f262daf6f	rename all remaining packages to internal/ (#12418 ) This is to ensure that there are no projects that try to import `minio/minio/pkg` into their own repo. Any such common packages should go to `https://github.com/minio/pkg`	2021-06-01 14:59:40 -07:00
Bala FA	120951d9e9	Refactor health data structure (#11914 ) This feature comes with simplified data structures and versioning support. Signed-off-by: Bala.FA <bala.gluster@gmail.com>	2021-06-01 08:55:49 -07:00
Harshavardhana	1aa5858543	move madmin to github.com/minio/madmin-go (#12239 )	2021-05-06 08:52:02 -07:00
Krishnan Parthasarathi	c829e3a13b	Support for remote tier management (#12090 ) With this change, MinIO's ILM supports transitioning objects to a remote tier. This change includes support for Azure Blob Storage, AWS S3 compatible object storage incl. MinIO and Google Cloud Storage as remote tier storage backends. Some new additions include: - Admin APIs remote tier configuration management - Simple journal to track remote objects to be 'collected' This is used by object API handlers which 'mutate' object versions by overwriting/replacing content (Put/CopyObject) or removing the version itself (e.g DeleteObjectVersion). - Rework of previous ILM transition to fit the new model In the new model, a storage class (a.k.a remote tier) is defined by the 'remote' object storage type (one of s3, azure, GCS), bucket name and a prefix. * Fixed bugs, review comments, and more unit-tests - Leverage inline small object feature - Migrate legacy objects to the latest object format before transitioning - Fix restore to particular version if specified - Extend SharedDataDirCount to handle transitioned and restored objects - Restore-object should accept version-id for version-suspended bucket (#12091) - Check if remote tier creds have sufficient permissions - Bonus minor fixes to existing error messages Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io> Co-authored-by: Krishna Srinivas <krishna@minio.io> Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 11:58:53 -07:00
Harshavardhana	069432566f	update license change for MinIO Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 11:58:53 -07:00
Harshavardhana	641e564b65	fips build tag uses relevant binary link for updates (#12014 ) This code is necessary for `mc admin update` command to work with fips compiled binaries, with fips tags the releaseInfo will automatically point to fips specific binaries.	2021-04-08 09:51:11 -07:00
Harshavardhana	09ee303244	add cluster support for realtime bucket stats (#11963 ) implementation in #11949 only catered from single node, but we need cluster metrics by capturing from all peers. introduce bucket stats API that will be used for capturing in-line bucket usage as well eventually	2021-04-04 15:34:33 -07:00
Anis Elleuch	d8b5adfd10	trace: Add storage & OS tracing (#11889 )	2021-03-26 23:24:07 -07:00
Klaus Post	2680772d4b	Don't mark remotes online when shutting down (#11368 ) Shutting down will mark remotes online when the shutdown has started since the context is canceled. For example: ``` API: SYSTEM() Time: 16:21:31 CET 01/28/2021 DeploymentID: 313b0065-c5a1-4aa3-9233-07223e77a730 Error: Storage resources are insufficient for the write operation .minio.sys/tmp/ced455c4-3d27-4bdd-95fc-b4707a179b8a/fd934ef3-8fc8-4330-abc1-f039fbbb9700/part.1 (cmd.InsufficientWriteQuorum) 1: d:\minio\minio\cmd\data-usage.go:56:cmd.storeDataUsageInBackend() Exiting on signal: INTERRUPT Client http://127.0.0.1:9002/minio/lock/v5 online Client http://127.0.0.1:9002/minio/storage/data/distxl/s2/d3/v24 online Client http://127.0.0.1:9002/minio/storage/data/distxl/s2/d2/v24 online Client http://127.0.0.1:9002/minio/storage/data/distxl/s2/d1/v24 online Client http://127.0.0.1:9002/minio/peer/v12 online Client http://127.0.0.1:9002/minio/storage/data/distxl/s2/d4/v24 online ``` Use a fresh context for health checks.	2021-01-28 13:38:12 -08:00
Ritesh H Shukla	b4add82bb6	Updated Prometheus metrics (#11141 ) * Add metrics for nodes online and offline * Add cluster capacity metrics * Introduce v2 metrics	2021-01-18 20:35:38 -08:00
Harshavardhana	274bbad5cb	fix: select always online peers for remote listing (#11153 ) always find the right set of online peers for remote listing, this may have an effect on listing if the server is down - we should do this to avoid always performing transient operations on bucket->peerClient that is permanently or down for a long period.	2020-12-22 09:16:07 -08:00
Harshavardhana	5c451d1690	update x/net/http2 to address few bugs (#11144 ) additionally also configure http2 healthcheck values to quickly detect unstable connections and let them timeout. also use single transport for proxying requests	2020-12-21 21:42:38 -08:00
Harshavardhana	4550ac6fff	fix: refactor locks to apply them uniquely per node (#11052 ) This refactor is done for few reasons below - to avoid deadlocks in scenarios when number of nodes are smaller < actual erasure stripe count where in N participating local lockers can lead to deadlocks across systems. - avoids expiry routines to run 1000 of separate network operations and routes per disk where as each of them are still accessing one single local entity. - it is ideal to have since globalLockServer per instance. - In a 32node deployment however, each server group is still concentrated towards the same set of lockers that partipicate during the write/read phase, unlike previous minio/dsync implementation - this potentially avoids send 32 requests instead we will still send at max requests of unique nodes participating in a write/read phase. - reduces overall chattiness on smaller setups.	2020-12-10 07:28:37 -08:00
Harshavardhana	4ec45753e6	rename server sets to server pools	2020-12-01 13:50:33 -08:00
Shireesh Anjal	7bc47a14cc	Rename OBD to Health (#10842 ) Also, Remove thread stats and openfds from the health report as we already have process stats and numfds	2020-11-20 12:52:53 -08:00
Harshavardhana	f794fe79e3	fix: network shutdown was not handle properly (#10927 ) fixes a regression introduced in #10859, due to the error returned by rest.Client being typed i.e *rest.NetworkError - IsNetworkHostDown function didn't work as expected to detect network issues. This in-turn aggravated the situations when nodes are disconnected leading to performance loss.	2020-11-19 13:53:49 -08:00
Klaus Post	06899210a7	Reduce health check output (#10859 ) This will make the health check clients 'silent'. Use `IsNetworkOrHostDown` determine if network is ok so it mimics the functionality in the actual client.	2020-11-10 09:28:23 -08:00
Harshavardhana	4c773f7068	re-use remote transports in Peer,Storage,Locker clients (#10788 ) use one transport for internode communication	2020-11-02 07:43:11 -08:00
Harshavardhana	b686bb9c83	fix: replaced drive properly by healing the entire drive (#10799 ) Bonus fixes, we do not need reload format anymore as the replaced drive is healed locally we only need to ensure that drive heal reloads the drive properly. We preserve the UUID of the original order, this means that the replacement in `format.json` doesn't mean that the drive needs to be reloaded into memory anymore. fixes #10791	2020-10-31 01:34:48 -07:00
Harshavardhana	4bf90ca67f	fix: handle a crash when AskDisks is set to -1 (#10777 )	2020-10-29 09:25:43 -07:00
Klaus Post	a982baff27	ListObjects Metadata Caching (#10648 ) Design: https://gist.github.com/klauspost/025c09b48ed4a1293c917cecfabdf21c Gist of improvements: * Cross-server caching and listing will use the same data across servers and requests. * Lists can be arbitrarily resumed at a constant speed. * Metadata for all files scanned is stored for streaming retrieval. * The existing bloom filters controlled by the crawler is used for validating caches. * Concurrent requests for the same data (or parts of it) will not spawn additional walkers. * Listing a subdirectory of an existing recursive cache will use the cache. * All listing operations are fully streamable so the number of objects in a bucket no longer dictates the amount of memory. * Listings can be handled by any server within the cluster. * Caches are cleaned up when out of date or superseded by a more recent one.	2020-10-28 09:18:35 -07:00
Shireesh Anjal	858e2a43df	Remove logging info from OBDInfoHandler (#10727 ) A lot of logging data is counterproductive. A better implementation with precise useful log data can be introduced later.	2020-10-27 17:41:48 -07:00
Harshavardhana	ad726b49b4	rename zones to serverSets to avoid terminology conflict (#10679 ) we are bringing in availability zones, we should avoid zones as per server expansion concept.	2020-10-15 14:28:50 -07:00
Ritesh H Shukla	8ceb2a93fd	fix: peer replication bandwidth monitoring in distributed setup (#10652 )	2020-10-12 09:04:55 -07:00
Ritesh H Shukla	c2f16ee846	Add basic bandwidth monitoring for replication. (#10501 ) This change tracks bandwidth for a bucket and object - [x] Add Admin API - [x] Add Peer API - [x] Add BW throttling - [x] Admin APIs to set replication limit - [x] Admin APIs for fetch bandwidth	2020-10-09 20:36:00 -07:00
Harshavardhana	f28d02b7f2	fix: simplify obd how we calculate transferred bytes (#10617 )	2020-10-01 14:34:51 -07:00
Harshavardhana	98a08e1644	fix: protect updating latencies/throughput slices in obd (#10611 ) Additionally close the transferChan upon function exit.	2020-10-01 09:50:08 -07:00
Harshavardhana	1f9abbee4d	make sure to release locks upon timeout (#10596 ) fixes #10418	2020-09-29 15:18:34 -07:00
Harshavardhana	37a5d5d7a0	reduce timeouts between servers for faster disconnects (#10562 )	2020-09-24 20:10:07 -07:00
Klaus Post	03490c811b	Fix obd goroutine leak (#10504 ) The gouroutine collecting transfer stats never exits. Add missing channel close.	2020-09-17 10:10:20 -07:00
Harshavardhana	e60834838f	fix: background disk heal, to reload format consistently (#10502 ) It was observed in VMware vsphere environment during a pod replacement, `mc admin info` might report incorrect offline nodes for the replaced drive. This issue eventually goes away but requires quite a lot of time for all servers to be in sync. This PR fixes this behavior properly.	2020-09-16 21:14:35 -07:00
Anis Elleuch	8ea55f9dba	obd: Add console log to OBD output (#10372 )	2020-09-15 18:02:54 -07:00
Klaus Post	34859c6d4b	Preallocate (safe) slices when we know the size (#10459 )	2020-09-14 20:44:18 -07:00
Klaus Post	2d58a8d861	Add storage layer contexts (#10321 ) Add context to all (non-trivial) calls to the storage layer. Contexts are propagated through the REST client. - `context.TODO()` is left in place for the places where it needs to be added to the caller. - `endWalkCh` could probably be removed from the walkers, but no changes so far. The "dangerous" part is that now a caller disconnecting will propagate down, so a "delete" operation will now be interrupted. In some cases we might want to disconnect this functionality so the operation completes if it has started, leaving the system in a cleaner state.	2020-09-04 09:45:06 -07:00
Harshavardhana	caad314faa	add ruleguard support, fix all the reported issues (#10335 )	2020-08-24 12:11:20 -07:00
Harshavardhana	0b8255529a	fix: proxies set keep-alive timeouts to be system dependent (#10199 ) Split the DialContext's one for internode and another for all other external communications especially proxy forwarders, gateway transport etc.	2020-08-04 14:55:53 -07:00
Harshavardhana	f200a7fb6a	fix: speed up OBD tests avoid unnecessary memory allocation (#10141 ) replace dummy buffer with nullReader{} instead, to avoid large memory allocations in memory constrainted environments. allows running obd tests in such environments.	2020-07-27 14:51:59 -07:00
Harshavardhana	3a73f1ead5	refactor server update behavior (#10107 )	2020-07-23 08:03:31 -07:00
Harshavardhana	3b9fbf80ad	fix: make sure to use new restClient for healthcheck (#10026 ) Without instantiating a new rest client we can have a recursive error which can lead to healthcheck returning always offline, this can prematurely take the servers offline.	2020-07-11 22:19:38 -07:00
Harshavardhana	7ed1077879	Add a custom healthcheck function for online status (#9858 ) - Add changes to ensure remote disks are not incorrectly taken online if their order has changed or are incorrect disks. - Bring changes to peer to detect disconnection with separate Health handler, to avoid a rather expensive call GetLocakDiskIDs() - Follow up on the same changes for Lockers as well	2020-06-17 14:49:26 -07:00
Klaus Post	3ba4804d6c	Move online status to REST client (#9808 )	2020-06-16 18:59:32 -07:00
Harshavardhana	4790868878	allow background IAM load to speed up startup (#9796 ) Also fix healthcheck handler to run success only if object layer has initialized fully for S3 API access call.	2020-06-09 19:19:03 -07:00
Harshavardhana	5e529a1c96	simplify context timeout for readiness (#9772 ) additionally also add CORS support to restrict for specific origin, adds a new config and updated the documentation as well	2020-06-04 14:58:34 -07:00
Krishna Srinivas	7d19ab9f62	readiness returns error quickly if any of the set is down (#9662 ) This PR adds a new configuration parameter which allows readiness check to respond within 10secs, this can be reduced to a lower value if necessary using ``` mc admin config set api ready_deadline=5s ``` or ``` export MINIO_API_READY_DEADLINE=5s ```	2020-05-23 17:38:39 -07:00
Harshavardhana	bd032d13ff	migrate all bucket metadata into a single file (#9586 ) this is a major overhaul by migrating off all bucket metadata related configs into a single object '.metadata.bin' this allows us for faster bootups across 1000's of buckets and as well as keeps the code simple enough for future work and additions. Additionally also fixes #9396, #9394	2020-05-19 13:53:54 -07:00
Harshavardhana	1bc32215b9	enable full linter across the codebase (#9620 ) enable linter using golangci-lint across codebase to run a bunch of linters together, we shall enable new linters as we fix more things the codebase. This PR fixes the first stage of this cleanup.	2020-05-18 09:59:45 -07:00
Harshavardhana	9b3b04ecec	allow retries for bucket encryption/policy quorum reloads (#9513 ) We should allow quorum errors to be send upwards such that caller can retry while reading bucket encryption/policy configs when server is starting up, this allows distributed setups to load the configuration properly. Current code didn't facilitate this and would have never loaded the actual configs during rolling, server restarts.	2020-05-04 09:42:58 -07:00
poornas	9a547dcbfb	Add API's for managing bucket quota (#9379 ) This PR allows setting a "hard" or "fifo" quota restriction at the bucket level. Buckets that have reached the FIFO quota configured, will automatically be cleaned up in FIFO manner until bucket usage drops to configured quota. If a bucket is configured with a "hard" quota ceiling, all further writes are disallowed.	2020-04-30 15:55:54 -07:00
Klaus Post	073aac3d92	add data update tracking using bloom filter (#9208 ) By monitoring PUT/DELETE and heal operations it is possible to track changed paths and keep a bloom filter for this data. This can help prioritize paths to scan. The bloom filter can identify paths that have not changed, and the few collisions will only result in a marginal extra workload. This can be implemented on either a bucket+(1 prefix level) with reasonable performance. The bloom filter is set to have a false positive rate at 1% at 1M entries. A bloom table of this size is about ~2500 bytes when serialized. To not force a full scan of all paths that have changed cycle bloom filters would need to be kept, so we guarantee that dirty paths have been scanned within cycle runs. Until cycle bloom filters have been collected all paths are considered dirty.	2020-04-27 10:06:21 -07:00
Harshavardhana	f14bf25cb9	optimize Listen bucket notification implementation (#9444 ) this commit avoids lots of tiny allocations, repeated channel creates which are performed when filtering the incoming events, unescaping a key just for matching. also remove deprecated code which is not needed anymore, avoids unexpected data structure transformations from the map to slice.	2020-04-27 06:25:05 -07:00
Anis Elleuch	20766069a8	add list/delete API service accounts admin API (#9402 )	2020-04-24 12:10:09 -07:00
Sidhartha Mani	3e78ea8acc	improve obd tests and optimize network (#9378 ) - keep long running obd network tests alive - fix error - wrong number of parents in process OBD info - ensure that osinfo does not error out when inside containers - remove limit on max number of connections per client transport The generic client transport uses a default limit of 64 conns per transport. This could end up limiting and throttling usage, and artificially slowing down the performance of MinIO even on hardware capable of doing better.	2020-04-18 11:06:11 -07:00
Klaus Post	c4464e36c8	fix: limit HTTP transport tuables to affordable values (#9383 ) Close connections pro-actively in transient calls	2020-04-17 11:20:56 -07:00
Harshavardhana	69fb68ef0b	fix simplify code to start using context (#9350 )	2020-04-16 10:56:18 -07:00
Harshavardhana	4314ee1670	fix: remove unusued PerfInfoHandler code (#9328 ) - Removes PerfInfo admin API as its not OBDInfo - Keep the drive path without the metaBucket in OBD global latency map. - Remove all the unused code related to PerfInfo API - Do not redefined global mib,gib constants use humanize.MiByte and humanize.GiByte instead always	2020-04-12 19:37:09 -07:00
Harshavardhana	f44cfb2863	use GlobalContext whenever possible (#9280 ) This change is throughout the codebase to ensure that all codepaths honor GlobalContext	2020-04-09 09:30:02 -07:00
Harshavardhana	ac07df2985	start watcher after all creds have been loaded (#9301 ) start watcher after all creds have been loaded to avoid any conflicting locks that might get deadlocked. Deprecate unused peer calls for LoadUsers()	2020-04-08 19:00:39 -07:00
Sidhartha Mani	0c80bf45d0	Implement oboard diagnostics admin API (#9024 ) - Implement a graph algorithm to test network bandwidth from every node to every other node - Saturate any network bandwidth adaptively, accounting for slow and fast network capacity - Implement parallel drive OBD tests - Implement a paging mechanism for OBD test to provide periodic updates to client - Implement Sys, Process, Host, Mem OBD Infos	2020-03-26 21:07:39 -07:00
Harshavardhana	6f6a2214fc	Add rate limiter for S3 API layer (#9196 ) - total number of S3 API calls per server - maximum wait duration for any S3 API call This implementation is primarily meant for situations where HDDs are not capable enough to handle the incoming workload and there is no way to throttle the client. This feature allows MinIO server to throttle itself such that we do not overwhelm the HDDs.	2020-03-24 12:43:40 -07:00
Harshavardhana	cfc9cfd84a	fix: various optimizations, idiomatic changes (#9179 ) - acquire since leader lock for all background operations - healing, crawling and applying lifecycle policies. - simplify lifecyle to avoid network calls, which was a bug in implementation - we should hold a leader and do everything from there, we have access to entire name space. - make listing, walking not interfere by slowing itself down like the crawler. - effectively use global context everywhere to ensure proper shutdown, in cache, lifecycle, healing - don't read `format.json` for prometheus metrics in StorageInfo() call.	2020-03-22 12:16:36 -07:00
Krishnan Parthasarathi	026265f8f7	Add support for bucket encryption feature (#8890 ) - pkg/bucket/encryption provides support for handling bucket encryption configuration - changes under cmd/ provide support for AES256 algorithm only Co-Authored-By: Poorna <poornas@users.noreply.github.com> Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-05 15:12:34 +05:30
Harshavardhana	0cbebf0f57	Rename pkg/{tagging,lifecycle} to pkg/bucket sub-directory (#8892 ) Rename to allow for more such features to come in a more proper hierarchical manner.	2020-01-27 14:12:34 -08:00
poornas	60e60f68dd	Add support for object locking with legal hold. (#8634 )	2020-01-16 15:41:56 -08:00
Klaus Post	2bf6cf0e15	Enable multiple concurrent profile types (#8792 )	2020-01-10 17:19:58 -08:00
Harshavardhana	c8d82588c2	Fix crash in console logger and also handle bucket DNS updates (#8654 ) Also fix listenBucketNotification bugs seen by minio-js listen bucket notification API.	2019-12-16 20:30:57 -08:00
Harshavardhana	cc02bf0442	Remove old ListenBucketNotification API (#8645 )	2019-12-13 11:33:11 -08:00
Harshavardhana	f5abe4e1f1	Support ListenBucketNotificationV2 streaming (#8622 )	2019-12-12 10:01:23 -08:00
Ashish Kumar Sinha	24fb1bf258	New Admin Info (#8497 )	2019-12-11 14:27:03 -08:00
Krishna Srinivas	3b67f629a4	Retry peer notification of events (#8621 )	2019-12-09 05:29:37 -08:00
Harshavardhana	fb43d64dc3	Fix healing on multiple zones (#8555 ) It is expected in zone healing underlying callers should return appropriate errors	2019-11-21 13:18:32 -08:00
Harshavardhana	347b29d059	Implement bucket expansion (#8509 )	2019-11-19 17:42:27 -08:00
Harshavardhana	e9b2bf00ad	Support MinIO to be deployed on more than 32 nodes (#8492 ) This PR implements locking from a global entity into a more localized set level entity, allowing for locks to be held only on the resources which are writing to a collection of disks rather than a global level. In this process this PR also removes the top-level limit of 32 nodes to an unlimited number of nodes. This is a precursor change before bring in bucket expansion.	2019-11-13 12:17:45 -08:00
Bala FA	fb48ca5020	Add Get/Put Bucket Lock Configuration API support (#8120 ) This feature implements [PUT Bucket object lock configuration][1] and [GET Bucket object lock configuration][2]. After object lock configuration is set, existing and new objects are set to WORM for specified duration. Currently Governance mode works exactly like Compliance mode. Fixes #8101 [1] https://docs.aws.amazon.com/AmazonS3/latest/API/RESTBucketPUTObjectLockConfiguration.html [2] https://docs.aws.amazon.com/AmazonS3/latest/API/RESTBucketGETObjectLockConfiguration.html	2019-11-12 14:50:18 -08:00
Harshavardhana	822eb5ddc7	Bring in safe mode support (#8478 ) This PR refactors object layer handling such that upon failure in sub-system initialization server reaches a stage of safe-mode operation wherein only certain API operations are enabled and available. This allows for fixing many scenarios such as - incorrect configuration in vault, etcd, notification targets - missing files, incomplete config migrations unable to read encrypted content etc - any other issues related to notification, policies, lifecycle etc	2019-11-09 09:27:23 -08:00
Ashish Kumar Sinha	18cb15559d	Add network hardware info (#8358 ) peerRESTVersion changed to v6	2019-10-17 04:09:49 -07:00

1 2 3 4

173 Commits