minio

Commit Graph

Author	SHA1	Message	Date
Poorna Krishnamoorthy	690434514d	Avoid notification event for replicas (#11683 ) Creating notification events for replica creation is not particularly useful to send as the notification event generated at source already includes replication completion events. For applications using replica cluster as failover, avoiding duplicate notifications for replica event will allow seamless failover.	2021-03-03 11:13:31 -08:00
Harshavardhana	c6a120df0e	fix: Prometheus metrics to re-use storage disks (#11647 ) also re-use storage disks for all `mc admin server info` calls as well, implement a new LocalStorageInfo() API call at ObjectLayer to lookup local disks storageInfo also fixes bugs where there were double calls to StorageInfo()	2021-03-02 17:28:04 -08:00
Shireesh Anjal	289b22d911	fix: pool number not added for one server (#11670 ) The previous code was iterating over replies from peers and assigning pool numbers to them, thus missing to add it for the local server. Fixed by iterating over the server properties of all the servers including the local one.	2021-03-01 08:09:43 -08:00
Bala FA	23f7ab40b3	Add PoolNumber field to madmin.ServerProperties (#11327 )	2021-02-28 21:26:28 -08:00
Ritesh H Shukla	c8489a8f0c	fix: log notification errors only once (#11350 )	2021-01-28 13:40:31 -08:00
Klaus Post	4e6d717f39	Compress profiling data (#11313 ) Trace data can be rather large and compresses fine. Compress profile data in zip files: ``` 277.895.314 before.profiles.zip 152.800.318 after.profiles.zip ```	2021-01-20 15:49:53 -08:00
Poorna Krishnamoorthy	845e251fa9	fix: crash in notificationsys when peers online is 0 (#11307 ) Check if the number of peers online > 0 before using peerClient	2021-01-20 13:13:05 -08:00
Ritesh H Shukla	b4add82bb6	Updated Prometheus metrics (#11141 ) * Add metrics for nodes online and offline * Add cluster capacity metrics * Introduce v2 metrics	2021-01-18 20:35:38 -08:00
Anis Elleuch	153d4be032	tracing: NumSubscribers() to use atomic instead of mutex (#11219 ) globalSubscribers.NumSubscribers() is heavily used in S3 requests and it uses mutex, use atomic.Load instead since it is faster Co-authored-by: Anis Elleuch <anis@min.io>	2021-01-04 09:40:30 -08:00
Harshavardhana	c19e6ce773	avoid a crash in crawler when lifecycle is not initialized (#11170 ) Bonus for static buffers use bytes.NewReader instead of bytes.NewBuffer, to use a more reader friendly implementation	2020-12-26 22:58:06 -08:00
Harshavardhana	274bbad5cb	fix: select always online peers for remote listing (#11153 ) always find the right set of online peers for remote listing, this may have an effect on listing if the server is down - we should do this to avoid always performing transient operations on bucket->peerClient that is permanently or down for a long period.	2020-12-22 09:16:07 -08:00
Harshavardhana	4550ac6fff	fix: refactor locks to apply them uniquely per node (#11052 ) This refactor is done for a few reasons below - to avoid deadlocks in scenarios when number of nodes are smaller < actual erasure stripe count wherein N participating local lockers can lead to deadlocks across systems. - avoids expiry routines to run 1000s of separate network operations and routes per disk whereas each of them are still accessing one single local entity. - it is ideal to have a single globalLockServer per instance. - In a 32node deployment however, each server group is still concentrated towards the same set of lockers that participate during the write/read phase, unlike previous minio/dsync implementation - this potentially avoids sending 32 requests; instead we will still send at max requests of unique nodes participating in a write/read phase. - reduces overall chattiness on smaller setups.	2020-12-10 07:28:37 -08:00
Harshavardhana	4a564336fe	Revert "Add metrics for nodes online and offline (#11050 )" This reverts commit `f60bbdf86b`.	2020-12-08 09:23:35 -08:00
Ritesh H Shukla	f60bbdf86b	Add metrics for nodes online and offline (#11050 )	2020-12-08 01:06:27 -08:00
Harshavardhana	4ec45753e6	rename server sets to server pools	2020-12-01 13:50:33 -08:00
Shireesh Anjal	7bc47a14cc	Rename OBD to Health (#10842 ) Also, Remove thread stats and openfds from the health report as we already have process stats and numfds	2020-11-20 12:52:53 -08:00
Klaus Post	422898d9b3	Clean up metadata cache when deleting bucket (#10802 ) Metadata caches were left behind when deleting a bucket.	2020-10-31 09:46:18 -07:00
Harshavardhana	b686bb9c83	fix: replaced drive properly by healing the entire drive (#10799 ) Bonus fixes, we do not need reload format anymore as the replaced drive is healed locally we only need to ensure that drive heal reloads the drive properly. We preserve the UUID of the original order, this means that the replacement in `format.json` doesn't mean that the drive needs to be reloaded into memory anymore. fixes #10791	2020-10-31 01:34:48 -07:00
Klaus Post	a982baff27	ListObjects Metadata Caching (#10648 ) Design: https://gist.github.com/klauspost/025c09b48ed4a1293c917cecfabdf21c Gist of improvements: * Cross-server caching and listing will use the same data across servers and requests. * Lists can be arbitrarily resumed at a constant speed. * Metadata for all files scanned is stored for streaming retrieval. * The existing bloom filters controlled by the crawler is used for validating caches. * Concurrent requests for the same data (or parts of it) will not spawn additional walkers. * Listing a subdirectory of an existing recursive cache will use the cache. * All listing operations are fully streamable so the number of objects in a bucket no longer dictates the amount of memory. * Listings can be handled by any server within the cluster. * Caches are cleaned up when out of date or superseded by a more recent one.	2020-10-28 09:18:35 -07:00
Shireesh Anjal	858e2a43df	Remove logging info from OBDInfoHandler (#10727 ) A lot of logging data is counterproductive. A better implementation with precise useful log data can be introduced later.	2020-10-27 17:41:48 -07:00
Harshavardhana	734f258878	fix: slow down auto healing more aggressively (#10730 ) Bonus fixes - logging improvements to ensure that we don't use `go logger.LogIf` to avoid runtime.Caller missing the function name. log where necessary. - remove unused code at erasure sets	2020-10-22 13:36:24 -07:00
Harshavardhana	d6d770c1b1	initialize object layer right after config has loaded	2020-10-19 22:04:59 -07:00
Ritesh H Shukla	8a16a1a1a9	fix: misc fixes for bandwidth reporting and monitoring (#10683 ) * Set peer for fetch bandwidth * Fix the limit for bandwidth that is reported. * Reduce CPU burn from bandwidth management.	2020-10-16 09:07:50 -07:00
Harshavardhana	ad726b49b4	rename zones to serverSets to avoid terminology conflict (#10679 ) we are bringing in availability zones, we should avoid zones as per server expansion concept.	2020-10-15 14:28:50 -07:00
Ritesh H Shukla	8ceb2a93fd	fix: peer replication bandwidth monitoring in distributed setup (#10652 )	2020-10-12 09:04:55 -07:00
Harshavardhana	a0d0645128	remove safeMode behavior in startup (#10645 ) In almost all scenarios MinIO now is mostly ready for all sub-systems independently, safe-mode is not useful anymore and do not serve its original intended purpose. allow server to be fully functional even with config partially configured, this is to cater for availability of actual I/O v/s manually fixing the server. In k8s like environments it will never make sense to take pod into safe-mode state, because there is no real access to perform any remote operation on them.	2020-10-09 09:59:52 -07:00
Harshavardhana	736e58dd68	fix: handle concurrent lockers with multiple optimizations (#10640 ) - select lockers which are non-local and online to have affinity towards remote servers for lock contention - optimize lock retry interval to avoid sending too many messages during lock contention, reduces average CPU usage as well - if bucket is not set, when deleteObject fails make sure setPutObjHeaders() honors lifecycle only if bucket name is set. - fix top locks to list out always the oldest lockers always, avoid getting bogged down into map's unordered nature.	2020-10-08 12:32:32 -07:00
Harshavardhana	eafa775952	fix: add lock ownership to expire locks (#10571 ) - Add owner information for expiry, locking, unlocking a resource - TopLocks returns now locks in quorum by default, provides a way to capture stale locks as well with `?stale=true` - Simplify the quorum handling for locks to avoid from storage class, because there were challenges to make it consistent across all situations. - And other tiny simplifications to reset locks.	2020-09-25 19:21:52 -07:00
Anis Elleuch	8ea55f9dba	obd: Add console log to OBD output (#10372 )	2020-09-15 18:02:54 -07:00
Harshavardhana	0ee9678190	fix: add missing delete marker created filter (#10481 )	2020-09-14 21:32:52 -07:00
Harshavardhana	59352d0ac2	load all blocking metadata in background (#10298 ) most of this metadata already has fallbacks and there is no good reason to load them in blocking fashion	2020-08-20 10:38:53 -07:00
Ritesh H Shukla	3acb5cff45	Update code comment (#10287 )	2020-08-19 14:24:58 -07:00
Harshavardhana	2a9819aff8	fix: refactor background heal for cluster health (#10225 )	2020-08-07 19:43:06 -07:00
Harshavardhana	6c6137b2e7	add cluster maintenance healthcheck drive heal affinity (#10218 )	2020-08-07 13:22:53 -07:00
Harshavardhana	3a73f1ead5	refactor server update behavior (#10107 )	2020-07-23 08:03:31 -07:00
Harshavardhana	ec06089eda	fix: re-implement cluster healthcheck (#10101 )	2020-07-20 18:31:22 -07:00
Harshavardhana	2955aae8e4	feat: Add notification support for bucketCreates and removal (#10075 )	2020-07-20 12:52:49 -07:00
Anis Elleuch	778e9c864f	Move dependency from minio-go v6 to v7 (#10042 )	2020-07-14 09:38:05 -07:00
Harshavardhana	4915433bd2	Support bucket versioning (#9377 ) - Implement a new xl.json 2.0.0 format to support, this moves the entire marshaling logic to POSIX layer, top layer always consumes a common FileInfo construct which simplifies the metadata reads. - Implement list object versions - Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	2020-06-12 20:04:01 -07:00
Harshavardhana	5e529a1c96	simplify context timeout for readiness (#9772 ) additionally also add CORS support to restrict for specific origin, adds a new config and updated the documentation as well	2020-06-04 14:58:34 -07:00
Harshavardhana	0c71ce3398	fix size accounting for encrypted/compressed objects (#9690 ) size calculation in crawler was using the real size of the object instead of its actual size i.e either a decrypted or uncompressed size. this is needed to make sure all other accounting such as bucket quota and mcs UI to display the correct values.	2020-05-24 11:19:17 -07:00
Krishna Srinivas	7d19ab9f62	readiness returns error quickly if any of the set is down (#9662 ) This PR adds a new configuration parameter which allows readiness check to respond within 10secs, this can be reduced to a lower value if necessary using ``` mc admin config set api ready_deadline=5s ``` or ``` export MINIO_API_READY_DEADLINE=5s ```	2020-05-23 17:38:39 -07:00
Sidhartha Mani	c121d27f31	progressively report obd results (#9639 )	2020-05-22 17:56:45 -07:00
Harshavardhana	a546047c95	keep bucket metadata fields to be consistent (#9660 ) added bonus reload bucket metadata always after a successful MakeBucket, currently we were only doing it with object locking enabled.	2020-05-21 11:03:59 -07:00
Harshavardhana	6656fa3066	simplify further bucket configuration properly (#9650 ) This PR is a continuation from #9586, now the entire parsing logic is fully merged into bucket metadata sub-system, simplify the quota API further by reducing the remove quota handler implementation.	2020-05-20 10:18:15 -07:00
Harshavardhana	bd032d13ff	migrate all bucket metadata into a single file (#9586 ) this is a major overhaul by migrating off all bucket metadata related configs into a single object '.metadata.bin' this allows us for faster bootups across 1000's of buckets and as well as keeps the code simple enough for future work and additions. Additionally also fixes #9396, #9394	2020-05-19 13:53:54 -07:00
Harshavardhana	a1de9cec58	cleanup object-lock/bucket tagging for gateways (#9548 ) This PR is to ensure that we call the relevant object layer APIs for necessary S3 API level functionalities allowing gateway implementations to return proper errors as NotImplemented{} This allows for all our tests in mint to behave appropriately and can be handled appropriately as well.	2020-05-08 13:44:44 -07:00
Harshavardhana	9b3b04ecec	allow retries for bucket encryption/policy quorum reloads (#9513 ) We should allow quorum errors to be sent upwards such that caller can retry while reading bucket encryption/policy configs when server is starting up, this allows distributed setups to load the configuration properly. Current code didn't facilitate this and would have never loaded the actual configs during rolling server restarts.	2020-05-04 09:42:58 -07:00
poornas	9a547dcbfb	Add API's for managing bucket quota (#9379 ) This PR allows setting a "hard" or "fifo" quota restriction at the bucket level. Buckets that have reached the FIFO quota configured, will automatically be cleaned up in FIFO manner until bucket usage drops to configured quota. If a bucket is configured with a "hard" quota ceiling, all further writes are disallowed.	2020-04-30 15:55:54 -07:00
Klaus Post	073aac3d92	add data update tracking using bloom filter (#9208 ) By monitoring PUT/DELETE and heal operations it is possible to track changed paths and keep a bloom filter for this data. This can help prioritize paths to scan. The bloom filter can identify paths that have not changed, and the few collisions will only result in a marginal extra workload. This can be implemented on either a bucket+(1 prefix level) with reasonable performance. The bloom filter is set to have a false positive rate at 1% at 1M entries. A bloom table of this size is about ~2500 bytes when serialized. To not force a full scan of all paths that have changed cycle bloom filters would need to be kept, so we guarantee that dirty paths have been scanned within cycle runs. Until cycle bloom filters have been collected all paths are considered dirty.	2020-04-27 10:06:21 -07:00
Harshavardhana	f14bf25cb9	optimize Listen bucket notification implementation (#9444 ) this commit avoids lots of tiny allocations, repeated channel creates which are performed when filtering the incoming events, unescaping a key just for matching. also remove deprecated code which is not needed anymore, avoids unexpected data structure transformations from the map to slice.	2020-04-27 06:25:05 -07:00
Anis Elleuch	20766069a8	add list/delete API service accounts admin API (#9402 )	2020-04-24 12:10:09 -07:00
Praveen raj Mani	322385f1b6	fix: only show active/available ARNs in server startup banner (#9392 )	2020-04-21 09:38:32 -07:00
Klaus Post	c4464e36c8	fix: limit HTTP transport tunables to affordable values (#9383 ) Close connections pro-actively in transient calls	2020-04-17 11:20:56 -07:00
Harshavardhana	4314ee1670	fix: remove unused PerfInfoHandler code (#9328 ) - Removes PerfInfo admin API as its not OBDInfo - Keep the drive path without the metaBucket in OBD global latency map. - Remove all the unused code related to PerfInfo API - Do not redefine global mib,gib constants use humanize.MiByte and humanize.GiByte instead always	2020-04-12 19:37:09 -07:00
Harshavardhana	f44cfb2863	use GlobalContext whenever possible (#9280 ) This change is throughout the codebase to ensure that all codepaths honor GlobalContext	2020-04-09 09:30:02 -07:00
Harshavardhana	ac07df2985	start watcher after all creds have been loaded (#9301 ) start watcher after all creds have been loaded to avoid any conflicting locks that might get deadlocked. Deprecate unused peer calls for LoadUsers()	2020-04-08 19:00:39 -07:00
Sidhartha Mani	c8243706b4	Add Parallel NetOBD tests to saturate all nodes at once (#9241 )	2020-03-31 17:08:28 -07:00
Sidhartha Mani	7b732b566f	[Bugfix] Fix Net tests being omitted (#9234 )	2020-03-31 01:15:21 -07:00
Sidhartha Mani	0c80bf45d0	Implement onboard diagnostics admin API (#9024 ) - Implement a graph algorithm to test network bandwidth from every node to every other node - Saturate any network bandwidth adaptively, accounting for slow and fast network capacity - Implement parallel drive OBD tests - Implement a paging mechanism for OBD test to provide periodic updates to client - Implement Sys, Process, Host, Mem OBD Infos	2020-03-26 21:07:39 -07:00
Harshavardhana	cfc9cfd84a	fix: various optimizations, idiomatic changes (#9179 ) - acquire a single leader lock for all background operations - healing, crawling and applying lifecycle policies. - simplify lifecycle to avoid network calls, which was a bug in implementation - we should hold a leader and do everything from there, we have access to entire name space. - make listing, walking not interfere by slowing itself down like the crawler. - effectively use global context everywhere to ensure proper shutdown, in cache, lifecycle, healing - don't read `format.json` for prometheus metrics in StorageInfo() call.	2020-03-22 12:16:36 -07:00
Harshavardhana	3d3beb6a9d	Add response header timeouts (#9170 ) - Add conservative timeouts upto 3 minutes for internode communication - Add aggressive timeouts of 30 seconds for gateway communication Fixes #9105 Fixes #8732 Fixes #8881 Fixes #8376 Fixes #9028	2020-03-21 22:10:13 -07:00
Klaus Post	eeb5942b6b	fix: remote profile names and extension (#9145 ) Remote profiles are not formatted correctly: ``` profile-172.31.91.126_9000-cpu.pprof profile-172.31.91.126_9000-goroutines-before.txt profile-172.31.91.126_9000-goroutines.txt profiling-172.31.80.49_9000-cpu.pprof.pprof profiling-172.31.80.49_9000-goroutines-before.txt.pprof profiling-172.31.80.49_9000-goroutines.txt.pprof profiling-172.31.86.101_9000-cpu.pprof.pprof profiling-172.31.86.101_9000-goroutines-before.txt.pprof profiling-172.31.86.101_9000-goroutines.txt.pprof profiling-172.31.91.191_9000-cpu.pprof.pprof profiling-172.31.91.191_9000-goroutines-before.txt.pprof profiling-172.31.91.191_9000-goroutines.txt.pprof ``` `profiling` -> `profile`, remove extra extension.	2020-03-16 11:39:53 -07:00
poornas	10fd53d6bb	Fix: admin config set API for notifications (#9085 ) Filter out targets set via env when validating incoming config change against configured notification targets Fixes #9066	2020-03-14 00:01:15 -07:00
Klaus Post	f1b2462193	Add goroutine profiles (#9078 ) Allow downloading goroutine dump to help detect leaks or overuse of goroutines. Extensions are now type dependent. Change `profiling` -> `profile` prefix, since that is what they are, not the abstract concept.	2020-03-04 06:58:12 -08:00
Harshavardhana	dcd63b4146	fix: avoid double ListBuckets() loading object lock (#9031 )	2020-02-24 06:39:11 +05:30
Krishnan Parthasarathi	026265f8f7	Add support for bucket encryption feature (#8890 ) - pkg/bucket/encryption provides support for handling bucket encryption configuration - changes under cmd/ provide support for AES256 algorithm only Co-Authored-By: Poorna <poornas@users.noreply.github.com> Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-05 15:12:34 +05:30
poornas	1ea2449269	NAS gateway: fix notification initialization (#8920 ) Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-02 15:22:07 +05:30
Harshavardhana	d76160c245	Initialize only one retry timer for all sub-systems (#8913 ) Also make sure that we create buckets on all zones successfully, do not run quick heal buckets if not running with expansion.	2020-02-02 06:37:43 +05:30
Harshavardhana	0cbebf0f57	Rename pkg/{tagging,lifecycle} to pkg/bucket sub-directory (#8892 ) Rename to allow for more such features to come in a more proper hierarchical manner.	2020-01-27 14:12:34 -08:00
poornas	60e60f68dd	Add support for object locking with legal hold. (#8634 )	2020-01-16 15:41:56 -08:00
Klaus Post	2bf6cf0e15	Enable multiple concurrent profile types (#8792 )	2020-01-10 17:19:58 -08:00
Harshavardhana	99ad445260	Avoid double for loops in notification init (#8691 )	2019-12-24 13:49:48 -08:00
Harshavardhana	725172e13b	fix: Do not need safe-mode for unreachable targets upon restart (#8686 )	2019-12-21 22:35:50 -08:00
Harshavardhana	d140074773	fix: replica set deployment for multi tenants (#8673 ) Changes in IP underneath are dynamic in replica sets with multiple tenants, so deploying in that fashion will not work until we wait for at least one participatory server to be local. This PR also ensures that multi-tenant zone expansion also works in replica set k8s deployments. Introduces a new ENV `KUBERNETES_REPLICA_SET` check to call appropriate code paths.	2019-12-19 13:45:56 -08:00
Harshavardhana	471a3a650a	fix: Don't allow to set unconfigured notification ARNs (#8643 ) Fixes #8642	2019-12-13 12:36:45 -08:00
Harshavardhana	cc02bf0442	Remove old ListenBucketNotification API (#8645 )	2019-12-13 11:33:11 -08:00
Harshavardhana	f5abe4e1f1	Support ListenBucketNotificationV2 streaming (#8622 )	2019-12-12 10:01:23 -08:00
Harshavardhana	c364f0af6c	Start using custom HTTP transport for webhook endpoints (#8630 ) Use a more performant http transport for webhook endpoints with proper connection pooling, appropriate timeouts etc.	2019-12-12 06:53:50 -08:00
Ashish Kumar Sinha	24fb1bf258	New Admin Info (#8497 )	2019-12-11 14:27:03 -08:00
Harshavardhana	5d65428b29	Handle localhost distributed setups properly (#8577 ) Fixes an issue reported by @klauspost and @vadmeste This PR also allows users to expand their clusters from single node XL deployment to distributed mode.	2019-11-26 11:42:10 -08:00
Harshavardhana	f96e902f63	Do not rely on quorum for StorageInfo() (#8557 ) StorageInfo() call is supposed to give each server/disk information independently, rely on this appropriately so that `mc admin info server` gets correct information all the time.	2019-11-21 22:08:41 -08:00
Harshavardhana	fb43d64dc3	Fix healing on multiple zones (#8555 ) It is expected in zone healing underlying callers should return appropriate errors	2019-11-21 13:18:32 -08:00
poornas	ca96560d56	Add object retention at the per object (#8528 ) level - this PR builds on #8120 which added PutBucketObjectLockConfiguration and GetBucketObjectLockConfiguration APIS This PR implements PutObjectRetention, GetObjectRetention API and enhances PUT and GET API operations to display governance metadata if permissions allow.	2019-11-20 13:18:09 -08:00
Harshavardhana	347b29d059	Implement bucket expansion (#8509 )	2019-11-19 17:42:27 -08:00
Harshavardhana	e9b2bf00ad	Support MinIO to be deployed on more than 32 nodes (#8492 ) This PR implements locking from a global entity into a more localized set level entity, allowing for locks to be held only on the resources which are writing to a collection of disks rather than a global level. In this process this PR also removes the top-level limit of 32 nodes to an unlimited number of nodes. This is a precursor change before bring in bucket expansion.	2019-11-13 12:17:45 -08:00
Bala FA	fb48ca5020	Add Get/Put Bucket Lock Configuration API support (#8120 ) This feature implements [PUT Bucket object lock configuration][1] and [GET Bucket object lock configuration][2]. After object lock configuration is set, existing and new objects are set to WORM for specified duration. Currently Governance mode works exactly like Compliance mode. Fixes #8101 [1] https://docs.aws.amazon.com/AmazonS3/latest/API/RESTBucketPUTObjectLockConfiguration.html [2] https://docs.aws.amazon.com/AmazonS3/latest/API/RESTBucketGETObjectLockConfiguration.html	2019-11-12 14:50:18 -08:00
Harshavardhana	822eb5ddc7	Bring in safe mode support (#8478 ) This PR refactors object layer handling such that upon failure in sub-system initialization server reaches a stage of safe-mode operation wherein only certain API operations are enabled and available. This allows for fixing many scenarios such as - incorrect configuration in vault, etcd, notification targets - missing files, incomplete config migrations unable to read encrypted content etc - any other issues related to notification, policies, lifecycle etc	2019-11-09 09:27:23 -08:00
Harshavardhana	9e7a3e6adc	Extend further validation of config values (#8469 ) - This PR allows config KVS to be validated properly without being affected by ENV overrides, rejects invalid values during set operation - Expands unit tests and refactors the error handling for notification targets, returns error instead of ignoring targets for invalid KVS - Does all the prep-work for implementing safe-mode style operation for MinIO server, introduces a new global variable to toggle safe mode based operations NOTE: this PR itself doesn't provide safe mode operations	2019-10-30 23:39:09 -07:00
Harshavardhana	ee4a6a823d	Migrate config to KV data format (#8392 ) - adding oauth support to MinIO browser (#8400) by @kanagaraj - supports multi-line get/set/del for all config fields - add support for comments, allow toggle - add extensive validation of config before saving - support MinIO browser to support proper claims, using STS tokens - env support for all config parameters, legacy envs are also supported with all documentation now pointing to latest ENVs - preserve accessKey/secretKey from FS mode setups - add history support implements three APIs - ClearHistory - RestoreHistory - ListHistory - add help command support for each config parameters - all the bug fixes after migration to KV, and other bug fixes encountered during testing.	2019-10-22 22:59:13 -07:00
poornas	1b74ce3924	Ensure actual object size is sent in notification (#8418 ) Fixes: #8407	2019-10-20 23:48:19 -07:00
Ashish Kumar Sinha	18cb15559d	Add network hardware info (#8358 ) peerRESTVersion changed to v6	2019-10-17 04:09:49 -07:00
Harshavardhana	d48fd6fde9	Remove unused params and functions (#8399 )	2019-10-15 18:35:41 -07:00
Harshavardhana	68a519a468	Use errgroups instead of sync.WaitGroup as needed (#8354 )	2019-10-14 09:44:51 -07:00
Ashish Kumar Sinha	74008446fe	CPU hardware info (#8187 )	2019-10-03 20:18:38 +05:30
Harshavardhana	8b80eca184	List buckets only once per sub-system initialization (#8333 ) Current master repeatedly calls ListBuckets() during initialization of multiple sub-systems Use single ListBuckets() call for each sub-system as follows - LifeCycle - Policy - Notification	2019-10-02 05:35:02 +05:30
Klaus Post	ff726969aa	Switch to Snappy -> S2 compression (#8189 )	2019-09-25 23:08:24 -07:00
Krishnan Parthasarathi	6ba323b009	Add ability to test drive speeds on a MinIO setup (#7664 ) - Extends existing Admin API to measure disk performance	2019-09-13 03:22:30 +05:30
Harshavardhana	b52a3e523c	Avoid using fastjson parser pool, move back to jsoniter (#8190 ) It looks like from implementation point of view fastjson parser pool doesn't behave the same way as expected when dealing many `xl.json` from multiple disks. The fastjson parser pool usage ends up returning incorrect xl.json entries for checksums, with references pointing to older entries. This led to the subtle bug where checksum info is duplicated from a previous xl.json read of a different file from different disk.	2019-09-06 04:21:27 +05:30
Harshavardhana	83d4c5763c	Decouple ServiceUpdate to ServerUpdate to be more native (#8138 ) The change now is to ensure that we take custom URL as well for updating the deployment, this is required for hotfix deliveries for certain deployments - other than the community release. This commit changes the previous work `d65a2c6725` with newer set of requirements. Also deprecates PeerUptime()	2019-08-28 15:04:43 -07:00
Bala FA	60f52f461f	add network read performance collection support. (#8038 ) ReST API on /minio/admin/v1/performance?perfType=net[?size=N] returns ``` { "PEER-1": [ { "addr": ADDR, "readPerf": DURATION, "error": ERROR, }, ... ], ... ... "PEER-N": [ { "addr": ADDR, "readPerf": DURATION, "error": ERROR, }, ... ] } ```	2019-08-19 08:26:32 +05:30
Aditya Manthramurthy	bf9b619d86	Set the policy mapping for a user or group (#8036 ) Add API to set policy mapping for a user or group Contains a breaking Admin APIs change. - Also enforce all applicable policies - Removes the previous /set-user-policy API Bump up peerRESTVersion Add get user info API to show groups of a user	2019-08-13 13:41:06 -07:00
Anis Elleuch	1ce8d2c476	Add bucket lifecycle expiry feature (#7834 )	2019-08-09 10:02:41 -07:00
Aditya Manthramurthy	414a7eca83	Add IAM groups support (#7981 ) This change adds admin APIs and IAM subsystem APIs to: - add or remove members to a group (group addition and deletion is implicit on add and remove) - enable/disable a group - list and fetch group info	2019-08-02 14:25:00 -07:00
Krishnan Parthasarathi	559a59220e	Add initial support for bucket lifecycle (#7563 ) This PR is based off @sinhaashish's PR for object lifecycle management, which includes support only for, - Expiration of object - Filter using object prefix (_not_ object tags) N.B. the code for actual expiration of objects will be included in a subsequent PR.	2019-07-19 21:20:33 +01:00
Krishna Srinivas	338e9a9be9	Put object client disconnect (#7824 ) Fail putObject and postpolicy in case client prematurely disconnects Use request's context to cancel lock requests on client disconnects	2019-06-28 22:09:17 -07:00
Anis Elleuch	48f2c98052	admin: Add Background heal status info API (#7774 ) This API returns the information related to the self healing routine. For the moment, it returns: - The total number of objects that are scanned - The last time when an item was scanned	2019-06-25 16:42:24 -07:00
Harshavardhana	b30c436715	[notify] Make sure to return when quorum is missing (#7799 ) Fixes a regression introduced in `510ec153b9`	2019-06-18 09:23:33 -07:00
Praveen raj Mani	510ec153b9	Refreshing notification system should not erase the rules-map of other buckets (#7758 ) Fixes #7707	2019-06-15 03:14:27 -07:00
Harshavardhana	6d89435356	Reload a specific user or policy on peers (#7705 ) Fixes #7587	2019-06-06 17:46:22 -07:00
Praveen raj Mani	a73da7755e	Remove sensitive encryption entries from event data (#7719 ) Fixes #7716	2019-05-29 22:29:37 -07:00
kannappanr	d2f42d830f	Lock: Use REST API instead of RPC (#7469 ) In distributed mode, use REST API to acquire and manage locks instead of RPC. RPC has been completely removed from MinIO source. Since we are moving from RPC to REST, we cannot use rolling upgrades as the nodes that have not yet been upgraded cannot talk to the ones that have been upgraded. We expect all minio processes on all nodes to be stopped and then the upgrade process to be completed. Also force http1.1 for inter-node communication	2019-04-17 23:16:27 -07:00
kannappanr	5ecac91a55	Replace Minio refs in docs with MinIO and links (#7494 )	2019-04-09 11:39:42 -07:00
Harshavardhana	e0a87e96de	Populate host value from GetSourceIP directly (#7417 )	2019-03-25 11:45:42 -07:00
kannappanr	87cf51d5ab	unused code: Remove LoadCredentials function (#7369 ) It is required to set the environment variable in the case of distributed minio. LoadCredentials is used to notify peers of the change and will not work if environment variable is set. so, this function will never be called.	2019-03-20 18:09:57 -07:00
kannappanr	eb69c4f946	Use REST api for inter node communication (#7205 )	2019-03-14 16:27:31 -07:00
Harshavardhana	df35d7db9d	Introduce staticcheck for stricter builds (#7035 )	2019-02-13 18:29:36 +05:30
kannappanr	ce870466ff	Top Locks command implementation (#7052 ) API to list locks used in distributed XL mode	2019-01-24 07:22:14 -08:00
Harshavardhana	8757c963ba	Migrate all Peer communication to common Notification subsystem (#7031 ) Deprecate the use of Admin Peers concept and migrate all peer communication to Notification subsystem. This finally allows for a common subsystem for all peer notification in case of distributed server deployments.	2019-01-14 12:14:20 +05:30
Sidhartha Mani	f3f47d8cd3	Add ServerCPULoadInfo() and ServerMemUsageInfo() admin API (#7038 )	2019-01-09 19:04:19 -08:00
Nitish Tiwari	fcb56d864c	Add ServerDrivesPerfInfo() admin API (#6969 ) This is part of implementation for mc admin health command. The ServerDrivesPerfInfo() admin API returns read and write speed information for all the drives (local and remote) in a given Minio server deployment. Part of minio/mc#2606	2018-12-31 09:46:44 -08:00
Harshavardhana	4f31a9a33b	Reload users upon AddUser on peers (#6975 ) Also migrate ReloadFormat to notification subsystem, remove GetConfig() we do not use this API anymore	2018-12-18 14:39:21 -08:00
Harshavardhana	3be616de3f	Send deployment ID in notification event response elements (#6991 )	2018-12-18 10:05:26 -08:00
Kale Blankenship	79b9a9ce46	Provide actual size in events instead of compressed size. (#6950 ) Previous behavior did not check if the object was compressed and incorrectly reported the stored size rather than the actual object size.	2018-12-11 17:30:15 -08:00
Pontus Leitzler	f9779b24ad	Enable default vet flags (#6810 ) Enable default vet flags except experimental	2018-11-14 10:23:44 -08:00
Harshavardhana	bef0318c36	Support audit logs with additional fields (#6738 ) This PR adds support - Request query params - Request headers - Response headers AuditLogEntry is exported and versioned as well starting with this PR.	2018-11-02 18:40:08 -07:00
Harshavardhana	b0c9ae7490	Add audit logging for S3 and Web handlers (#6571 ) This PR brings an additional logger implementation called AuditLog which logs to http targets The intention is to use AuditLog to log all incoming requests, this is used as a mechanism by external log collection entities for processing Minio requests.	2018-10-12 12:25:59 -07:00
Harshavardhana	54ae364def	Introduce STS client grants API and OPA policy integration (#6168 ) This PR introduces two new features - AWS STS compatible STS API named AssumeRoleWithClientGrants ``` POST /?Action=AssumeRoleWithClientGrants&Token=<jwt> ``` This API endpoint returns temporary access credentials, access tokens signature types supported by this API - RSA keys - ECDSA keys Fetches the required public key from the JWKS endpoints, provides them as rsa or ecdsa public keys. - External policy engine support, in this case OPA policy engine - Credentials are stored on disks	2018-10-09 14:00:01 -07:00
Anis Elleuch	cbc5d78a09	Handle read/quorum errors when initializing all subsystems (#6585 ) - Only require len(disks)/2 to initialize the cluster - Fix checking of read/write quorum in subsystems init - Add retry mechanism in policy and notification to avoid aborting in case of read/write quorum errors	2018-10-08 15:47:13 -07:00
Harshavardhana	2211a5f1b8	Avoid ListenBucket targets to be listed in ServerInfo (#6340 ) In current master when you do `mc watch` you can see a dynamic ARN being listed which exposes the remote IP as well ``` mc watch play/airlines ``` On another terminal ``` mc admin info play ● play.minio.io:9000 Uptime : online since 11 hours ago Version : 2018-08-22T07:50:45Z Region : SQS ARNs : arn:minio:sqs::httpclient+51c39c3f-131d-42d9-b212-c5eb1450b9ee+73.222.245.195:33408 Stats : Incoming 30GiB, Outgoing 7.6GiB Storage : Used 7.7GiB ``` SQS ARNs listed as part of ServerInfo should be only external targets, since listing an ARN here is not useful and it cannot be re-purposed in any manner. This PR fixes this issue by filtering out httpclient from the ARN list. This is a regression introduced in #5294 `0e4431725c`	2018-08-23 23:31:14 -07:00
kannappanr	add57a6938	Add content-length as part of event notification structure (#6341 ) Fixes #6321	2018-08-23 14:40:54 -07:00
Praveen raj Mani	65e05a06fb	Remove notifications Fix (#6082 ) Remove all the notifications for an empty rulesMap Fixes #6053	2018-08-23 22:53:18 +05:30
Harshavardhana	0e02328c98	Migrate config.json from config-dir to backend (#6195 ) This PR is the first set of changes to move the config to the backend. The changes use the existing `config.json` and allow it to be migrated so that it can be saved on backend disks. In future releases, we will slowly migrate out of the current architecture. Fixes #6182	2018-08-15 10:11:47 +05:30
kannappanr	264cc4020f	Return 503 instead of 404 if more than half of disks are not found (#6207 ) Fixes #6163	2018-07-31 00:23:29 -07:00
Anis Elleuch	be1700f595	Avoid startup abort when a notify target is down (#6126 ) Minio server was preventing itself from starting when any notification target was down and not running. The PR changes the behavior by avoiding startup abort in that case, so the user will still be able to access Minio server using mc admin commands after a restart or set config commands.	2018-07-10 07:20:31 +05:30
Krishna Srinivas	e40a5e05e1	Do notification in background to not block S3 client REST calls (#6005 )	2018-07-03 11:09:36 -07:00
Bala FA	6a53dd1701	Implement HTTP POST based RPC (#5840 ) Added support for new RPC support using HTTP POST. RPC's arguments and reply are Gob encoded and sent as HTTP request/response body. This patch also removes Go RPC based implementation.	2018-06-06 14:21:56 +05:30
Krishna Srinivas	cc8178cdc4	Log errors only once for event notification errors (#5905 )	2018-05-09 15:59:45 -07:00
Harshavardhana	4886bfbc72	fix: Avoid more crashes due to concurrent map usage (#5912 ) This PR fixes another situation where a crash occurs thanks to @krishnasrinivas for reproducing this Fixes #5897	2018-05-09 15:11:51 -07:00
Harshavardhana	98f81ced86	fix: Avoid concurrent map writes in go-routines (#5898 ) Fixes #5897	2018-05-09 11:25:38 -07:00
Bala FA	0d52126023	Enhance policy handling to support SSE and WORM (#5790 ) - remove old bucket policy handling - add new policy handling - add new policy handling unit tests This patch brings support to bucket policy to have more control not limiting to anonymous. Bucket owner controls to allow/deny any rest API. For example server side encryption can be controlled by allowing PUT/GET objects with encryptions including bucket owner.	2018-04-24 15:53:30 -07:00
ebozduman	f16bfda2f2	Remove panic() and handle it appropriately (#5807 ) This is an effort to remove panic from the source. Add a new call called CriticalIf, that calls LogIf and exits. Replace panics with one of CriticalIf, FatalIf and a return of error.	2018-04-19 17:24:43 -07:00
kannappanr	cef992a395	Remove error package and cause functions (#5784 )	2018-04-10 09:36:37 -07:00
kannappanr	f8a3fd0c2a	Create logger package and rename errorIf to LogIf (#5678 ) Removing message from error logging Replace errors.Trace with LogIf	2018-04-05 15:04:40 -07:00
Harshavardhana	ef61b36c5a	Fix PUT bucket notification deadlocks (#5734 ) This PR fixes two different variants of deadlock in notification. - holding write lock on the bucket competing with read lock - holding competing locks on read/save notification config	2018-03-29 12:00:20 -07:00
Krishna Srinivas	9ede179a21	Use context.Background() instead of nil. Rename Context[Get|Set] -> [Get|Set]Context	2018-03-15 16:28:25 -07:00
Bala FA	0e4431725c	make notification as separate package (#5294 ) * Remove old notification files * Add net package * Add event package * Modify minio to take new notification system	2018-03-15 13:03:41 -07:00

247 Commits