- acquire single leader lock for all background operations:
  healing, crawling and applying lifecycle policies.
- simplify lifecycle to avoid network calls, which was a
  bug in the implementation - we should hold the leader lock
  and do everything from there, since we have access to the
  entire namespace.
- make listing and walking avoid interfering with other
  operations by slowing themselves down, like the crawler does.
- effectively use global context everywhere to ensure
proper shutdown, in cache, lifecycle, healing
- don't read `format.json` for prometheus metrics in
StorageInfo() call.
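For illustration, a minimal Go sketch of the single-leader pattern described above; `tryAcquireLeader` and the loop names are hypothetical stand-ins, not the server's actual internals:

```go
package main

import (
	"context"
	"time"
)

// tryAcquireLeader stands in for acquiring a cluster-wide leader lock
// (hypothetical; the real server uses its internal distributed locker).
func tryAcquireLeader() bool { return true }

func healLoop(ctx context.Context)      { <-ctx.Done() }
func crawlLoop(ctx context.Context)     { <-ctx.Done() }
func lifecycleLoop(ctx context.Context) { <-ctx.Done() }

func main() {
	// One global context drives shutdown of every background service.
	ctx, cancel := context.WithCancel(context.Background())
	defer cancel()

	for !tryAcquireLeader() {
		time.Sleep(10 * time.Second) // retry until this node is leader
	}

	// Only the leader runs these; it sees the entire namespace, so
	// applying lifecycle policies needs no cross-node network calls.
	go healLoop(ctx)
	go crawlLoop(ctx)
	go lifecycleLoop(ctx)

	<-ctx.Done() // blocks until shutdown cancels the global context
}
```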
To allow better control of the cache eviction process,
introduce the MINIO_CACHE_WATERMARK_LOW and
MINIO_CACHE_WATERMARK_HIGH environment variables to specify
when to stop/start the cache eviction process.
Deprecate the MINIO_CACHE_EXPIRY environment variable. Cache
GC sweeps at 30-minute intervals whenever the high watermark is
reached, clearing the least recently accessed entries in the cache
until sufficient space is freed to reach the low watermark.
Garbage collection uses an adaptive file scoring approach based
on last access time, with greater weights assigned to larger
objects and those with more hits, to find the candidates for
eviction (a sketch follows below).
Thanks to @klauspost for this file scoring algorithm
Co-authored-by: Klaus Post <klauspost@minio.io>
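A hedged sketch of a watermark-driven sweep with such a scoring function; the exact weights and type names here are assumptions for illustration, not the shipped formula:

```go
package main

import (
	"fmt"
	"math"
	"sort"
	"time"
)

type cacheEntry struct {
	name       string
	size       int64
	hits       int64
	lastAccess time.Time
}

// evictionScore grows with time since last access, and is weighted up
// for larger objects and objects with more hits, per the description
// above; the exact formula here is an assumption.
func evictionScore(e cacheEntry, now time.Time) float64 {
	age := now.Sub(e.lastAccess).Hours()
	return age * math.Log1p(float64(e.size)) * float64(1+e.hits)
}

// evict drops the highest-scoring entries until usage falls to the
// low watermark, mirroring the high/low watermark sweep.
func evict(entries []cacheEntry, usage, capacity int64, lowWM float64) []string {
	now := time.Now()
	sort.Slice(entries, func(i, j int) bool {
		return evictionScore(entries[i], now) > evictionScore(entries[j], now)
	})
	var victims []string
	for _, e := range entries {
		if float64(usage) <= lowWM*float64(capacity) {
			break
		}
		usage -= e.size
		victims = append(victims, e.name)
	}
	return victims
}

func main() {
	entries := []cacheEntry{
		{"a", 1 << 20, 1, time.Now().Add(-48 * time.Hour)},
		{"b", 4 << 20, 9, time.Now().Add(-1 * time.Hour)},
	}
	fmt.Println(evict(entries, 90, 100, 0.70)) // sweep down to 70%
}
```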
X-Cache is set to a cache status of HIT if the object is
served from the disk cache, or MISS otherwise.
X-Cache-Lookup is set to HIT if the object was found
in the cache even if not served from it (e.g. if the cache
entry was invalidated by ETag verification).
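A minimal sketch of how a handler could set these two headers; the handler shape and helper are illustrative:

```go
package main

import "net/http"

func setCacheHeaders(w http.ResponseWriter, foundInCache, servedFromCache bool) {
	lookup, status := "MISS", "MISS"
	if foundInCache {
		lookup = "HIT" // present in cache, even if ETag check invalidated it
	}
	if servedFromCache {
		status = "HIT" // actually served from the disk cache
	}
	w.Header().Set("X-Cache-Lookup", lookup)
	w.Header().Set("X-Cache", status)
}

func main() {
	http.HandleFunc("/", func(w http.ResponseWriter, r *http.Request) {
		// Found in cache but revalidated and served from the backend.
		setCacheHeaders(w, true, false)
		w.Write([]byte("object body"))
	})
	http.ListenAndServe(":8080", nil)
}
```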
This PR adds support for the below metrics
- Cache Hit Count
- Cache Miss Count
- Data served from Cache (in Bytes)
- Bytes received from AWS S3
- Bytes sent to AWS S3
- Number of requests sent to AWS S3
Fixes #8549
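A sketch of exposing such counters with prometheus/client_golang; the metric names below are illustrative, not necessarily the exact names MinIO exports:

```go
package main

import (
	"net/http"

	"github.com/prometheus/client_golang/prometheus"
	"github.com/prometheus/client_golang/prometheus/promhttp"
)

var (
	cacheHits   = prometheus.NewCounter(prometheus.CounterOpts{Name: "cache_hits_total", Help: "Cache hit count"})
	cacheMisses = prometheus.NewCounter(prometheus.CounterOpts{Name: "cache_misses_total", Help: "Cache miss count"})
	cacheServed = prometheus.NewCounter(prometheus.CounterOpts{Name: "cache_data_served_bytes", Help: "Data served from cache in bytes"})
	s3Received  = prometheus.NewCounter(prometheus.CounterOpts{Name: "s3_received_bytes", Help: "Bytes received from AWS S3"})
	s3Sent      = prometheus.NewCounter(prometheus.CounterOpts{Name: "s3_sent_bytes", Help: "Bytes sent to AWS S3"})
	s3Requests  = prometheus.NewCounter(prometheus.CounterOpts{Name: "s3_requests_total", Help: "Requests sent to AWS S3"})
)

func main() {
	prometheus.MustRegister(cacheHits, cacheMisses, cacheServed, s3Received, s3Sent, s3Requests)
	http.Handle("/minio/prometheus/metrics", promhttp.Handler())
	http.ListenAndServe(":9000", nil)
}
```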
Object retention at the object level - this PR builds on
#8120, which added the PutBucketObjectLockConfiguration and
GetBucketObjectLockConfiguration APIs.
This PR implements the PutObjectRetention and
GetObjectRetention APIs and enhances the PUT and GET API
operations to display governance metadata if permissions allow.
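For reference, the retention document PutObjectRetention accepts follows the AWS S3 object-lock XML shape; a small Go sketch that builds it (field names per the S3 spec, the rest illustrative):

```go
package main

import (
	"encoding/xml"
	"fmt"
	"time"
)

// Retention mirrors the S3 object-lock retention body.
type Retention struct {
	XMLName         xml.Name  `xml:"Retention"`
	Mode            string    `xml:"Mode"` // GOVERNANCE or COMPLIANCE
	RetainUntilDate time.Time `xml:"RetainUntilDate"`
}

func main() {
	b, _ := xml.MarshalIndent(Retention{
		Mode:            "GOVERNANCE",
		RetainUntilDate: time.Now().AddDate(0, 0, 30).UTC(),
	}, "", "  ")
	fmt.Println(string(b))
}
```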
This PR moves locking from a global entity to
a more localized set-level entity, allowing locks
to be held only on the resources which are being written
to on a collection of disks rather than at a global level.
In the process, this PR also raises the top-level
limit of 32 nodes to an unlimited number of nodes. This
is a precursor change before bringing in bucket expansion.
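An illustrative sketch of the idea: a key deterministically maps to one erasure set, and only that set's lock is taken (crc32 placement here is an assumption, not the actual hash used):

```go
package main

import (
	"fmt"
	"hash/crc32"
	"sync"
)

type setLockers struct{ locks []sync.RWMutex }

// forKey maps an object deterministically to one set's lock, so
// writers on different sets never contend with each other.
func (s *setLockers) forKey(object string) *sync.RWMutex {
	idx := crc32.ChecksumIEEE([]byte(object)) % uint32(len(s.locks))
	return &s.locks[idx]
}

func main() {
	sets := &setLockers{locks: make([]sync.RWMutex, 16)}
	lk := sets.forKey("bucket/object.txt")
	lk.Lock() // held only for this set's disks, not the whole cluster
	defer lk.Unlock()
	fmt.Println("writing under a set-level lock")
}
```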
This PR refactors object layer handling such
that upon failure in sub-system initialization the
server reaches a stage of safe-mode operation
wherein only certain API operations are enabled
and available.
This allows for fixing many scenarios such as
- incorrect configuration in vault, etcd,
notification targets
- missing files, incomplete config migrations,
inability to read encrypted content, etc.
- any other issues related to notification,
policies, lifecycle etc
- This PR allows config KVS to be validated properly
without being affected by ENV overrides, rejects
invalid values during set operation
- Expands unit tests and refactors the error handling
for notification targets, returns error instead of
ignoring targets for invalid KVS
- Does all the prep-work for implementing safe-mode
style operation for MinIO server, introduces a new
global variable to toggle safe mode based operations
NOTE: this PR itself doesn't provide safe mode operations
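A minimal sketch of how such a global safe-mode toggle could gate handlers; the variable name and gating policy are illustrative, not this PR's actual code:

```go
package main

import (
	"fmt"
	"net/http"
	"sync/atomic"
)

// globalSafeMode mirrors the "new global variable" mentioned above;
// the name and policy here are assumptions.
var globalSafeMode atomic.Bool

func guard(h http.HandlerFunc, allowedInSafeMode bool) http.HandlerFunc {
	return func(w http.ResponseWriter, r *http.Request) {
		if globalSafeMode.Load() && !allowedInSafeMode {
			http.Error(w, "server is in safe mode; fix configuration first",
				http.StatusServiceUnavailable)
			return
		}
		h(w, r)
	}
}

func main() {
	globalSafeMode.Store(true) // e.g. vault/etcd config failed to load
	http.HandleFunc("/admin/config", guard(func(w http.ResponseWriter, r *http.Request) {
		fmt.Fprintln(w, "config APIs stay available to repair the setup")
	}, true))
	http.ListenAndServe(":9000", nil)
}
```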
- adding oauth support to MinIO browser (#8400) by @kanagaraj
- supports multi-line get/set/del for all config fields
- add support for comments, allow toggle
- add extensive validation of config before saving
- support proper claims in the MinIO browser, using STS tokens
- env support for all config parameters, legacy envs are also
supported with all documentation now pointing to latest ENVs
- preserve accessKey/secretKey from FS mode setups
- add history support implements three APIs
- ClearHistory
- RestoreHistory
- ListHistory
- add help command support for each config parameters
- all the bug fixes after migration to KV, and other bug
fixes encountered during testing.
This change is related to the larger config migration PR;
it is a first-stage change to move our configs to
`cmd/config/` - divided into subsystems
This PR changes the cache-on-PUT behavior to fill the cache in the
background after PutObject completes. This avoids concurrency issues
as in #8219. Also adds cleanup of partially filled cache entries to
prevent cache corruption.
- Fixes #8208
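A minimal sketch of the background-fill-after-PUT behavior under these assumptions (the cache type and helper names are hypothetical):

```go
package main

import (
	"bytes"
	"fmt"
	"io"
	"sync"
)

type diskCache struct {
	mu      sync.Mutex
	objects map[string][]byte
}

func (c *diskCache) fill(key string, r io.Reader) {
	data, err := io.ReadAll(r)
	if err != nil {
		return // partial fill: drop it rather than corrupt the cache
	}
	c.mu.Lock()
	c.objects[key] = data
	c.mu.Unlock()
}

func putObject(c *diskCache, key string, body []byte) {
	// 1. Complete the backend PUT first (elided here).
	// 2. Then fill the cache asynchronously, so concurrent readers
	//    never observe a half-written entry. Real code would also
	//    track completion of this goroutine.
	go c.fill(key, bytes.NewReader(body))
}

func main() {
	c := &diskCache{objects: map[string][]byte{}}
	putObject(c, "bucket/obj", []byte("hello"))
	fmt.Println("PUT returned; cache fills in the background")
}
```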
Fixes #7458 Fixes #7573 Fixes #7938 Fixes #6934 Fixes #6265 Fixes #6630
This will allow the cache to consistently work for
server and gateways. Range GET requests will
be cached in the background after the request
is served from the backend.
- All cached content is automatically bitrot protected.
- Avoid ETag verification if a cache-control header
is set and the cached content is still valid.
- This PR changes the cache backend format, and all existing
content will be migrated to the new format. Until the data is
migrated completely, all content will be served from the backend.
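A small sketch of the cache-control shortcut mentioned above, assuming a stored timestamp for each cache entry (helper name illustrative):

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
	"time"
)

// stillValid reports whether a cached entry is fresh per its
// Cache-Control max-age, in which case ETag verification against
// the backend can be skipped.
func stillValid(cacheControl string, cachedAt time.Time) bool {
	for _, d := range strings.Split(cacheControl, ",") {
		d = strings.TrimSpace(d)
		if strings.HasPrefix(d, "max-age=") {
			secs, err := strconv.Atoi(strings.TrimPrefix(d, "max-age="))
			if err == nil {
				return time.Since(cachedAt) < time.Duration(secs)*time.Second
			}
		}
	}
	return false // no freshness info: fall back to ETag verification
}

func main() {
	fmt.Println(stillValid("public, max-age=3600",
		time.Now().Add(-10*time.Minute))) // true: serve from cache
}
```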
- Background Heal routine receives heal requests from a channel, either to
heal format, buckets or objects
- Daily sweeper lists all objects in all buckets; these objects
don't necessarily have read quorum, so they can be removed if
they are unhealable
- Heal daily ops receives objects from the daily sweeper
and sends them to the heal routine.
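An illustrative sketch of this flow: a sweeper feeding a heal channel consumed by one routine (task fields are assumptions):

```go
package main

import (
	"context"
	"fmt"
)

type healTask struct {
	bucket, object string // empty object could mean heal bucket/format
}

func healRoutine(ctx context.Context, tasks <-chan healTask) {
	for {
		select {
		case <-ctx.Done():
			return
		case t := <-tasks:
			fmt.Printf("healing %s/%s\n", t.bucket, t.object)
		}
	}
}

func dailySweeper(ctx context.Context, tasks chan<- healTask) {
	// A real sweeper lists all objects in all buckets; objects that
	// turn out to be unhealable can be removed instead.
	for _, o := range []string{"a.txt", "b.txt"} {
		select {
		case <-ctx.Done():
			return
		case tasks <- healTask{bucket: "mybucket", object: o}:
		}
	}
}

func main() {
	ctx, cancel := context.WithCancel(context.Background())
	tasks := make(chan healTask)
	done := make(chan struct{})
	go func() { healRoutine(ctx, tasks); close(done) }()
	dailySweeper(ctx, tasks)
	cancel() // global shutdown stops the heal routine
	<-done
}
```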
This will allow cache to consistently work for
server and gateways. Range GET requests will
be cached in the background after the request
is served from the backend.
Fixes: #7458, #7573, #6265, #6630
Bulk delete at storage level in Multiple Delete Objects API
In order to accelerate bulk delete in the Multiple Delete Objects API,
a new bulk delete is introduced in the storage layer, which accepts
a list of objects to delete rather than only one. Consequently,
a new API also needs to be added to the Object API.
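A hedged sketch of what such a storage-layer bulk delete could look like; the interface shape is illustrative, not the exact StorageAPI:

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
)

type storage interface {
	// DeleteFileBulk deletes many objects in one call and reports a
	// per-object error so partial failures stay visible.
	DeleteFileBulk(volume string, paths []string) ([]error, error)
}

type posix struct{ root string }

func (p *posix) DeleteFileBulk(volume string, paths []string) ([]error, error) {
	errs := make([]error, len(paths))
	for i, pth := range paths {
		errs[i] = os.Remove(filepath.Join(p.root, volume, pth))
	}
	return errs, nil
}

func main() {
	var s storage = &posix{root: "/tmp/minio"}
	errs, _ := s.DeleteFileBulk("mybucket", []string{"a.txt", "b.txt"})
	fmt.Println(errs)
}
```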
Other listing optimizations include
- remove double sorting while filtering object entries
- improve error message when upload-id is not in quorum
- use jsoniter for full JSON unmarshaling, instead of gjson
- remove unused code
- [x] Support bucket and regular object operations
- [x] Supports Select API on HDFS
- [x] Implement multipart API support
- [x] Completion of ListObjects support
Move CopyObject precondition checks into GetObjectReader
in order to perform SSE-C precondition checks using the
last 32 bytes of the encrypted ETag rather than the decrypted
ETag.
This also necessitates moving precondition checks for
gateways into the gateway layer rather than the object handler.
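A minimal sketch of the trailing-32-bytes comparison, under the assumption that the stored encrypted ETag simply ends in one MD5 hex digest:

```go
package main

import "fmt"

// etagMatches compares an If-Match value against only the trailing 32
// characters (one MD5 hex digest) of the stored encrypted ETag, so no
// decryption is needed for the precondition check. Illustrative only.
func etagMatches(storedEncryptedETag, ifMatch string) bool {
	const n = 32
	if len(storedEncryptedETag) < n {
		return storedEncryptedETag == ifMatch
	}
	return storedEncryptedETag[len(storedEncryptedETag)-n:] == ifMatch
}

func main() {
	stored := "32f1aaf4c72ef85b37125237dd90b55f" + "0123456789abcdef0123456789abcdef"
	fmt.Println(etagMatches(stored, "0123456789abcdef0123456789abcdef")) // true
}
```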
To conform with the AWS S3 spec on ETag for SSE-S3 encrypted objects,
encrypt the client-sent MD5Sum and store it on the backend as the
ETag. Extend this behavior to SSE-C encrypted objects.
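An illustrative sketch of encrypting the client-sent MD5Sum before storing it as the ETag; AES-GCM here is a stand-in, not necessarily the actual encryption format used:

```go
package main

import (
	"crypto/aes"
	"crypto/cipher"
	"crypto/rand"
	"encoding/hex"
	"fmt"
	"io"
)

// encryptETag seals the plaintext MD5 ETag under the object key, so
// the stored ETag does not expose the plaintext MD5 for SSE objects.
func encryptETag(key []byte, etag string) (string, error) {
	block, err := aes.NewCipher(key)
	if err != nil {
		return "", err
	}
	gcm, err := cipher.NewGCM(block)
	if err != nil {
		return "", err
	}
	nonce := make([]byte, gcm.NonceSize())
	if _, err := io.ReadFull(rand.Reader, nonce); err != nil {
		return "", err
	}
	sealed := gcm.Seal(nonce, nonce, []byte(etag), nil)
	return hex.EncodeToString(sealed), nil
}

func main() {
	key := make([]byte, 32) // object encryption key (derivation elided)
	etag, _ := encryptETag(key, "9bb58f26192e4ba00f01e2e7b136bbd8")
	fmt.Println(etag)
}
```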
The new call combines GetObjectInfo and GetObject, and returns an
object with a ReadCloser interface.
Also adds a number of end-to-end encryption tests at the handler
level.
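A minimal sketch of the combined call shape: info and body returned together, with Close releasing the single lock (types here are illustrative):

```go
package main

import (
	"fmt"
	"io"
	"strings"
	"sync"
)

type ObjectInfo struct {
	Name string
	Size int64
}

// GetObjectReader bundles ObjectInfo with an io.ReadCloser; Close
// also releases the lock held across the whole operation.
type GetObjectReader struct {
	ObjInfo ObjectInfo
	io.ReadCloser
}

type unlockingReader struct {
	io.ReadCloser
	unlock func()
}

func (r unlockingReader) Close() error {
	defer r.unlock()
	return r.ReadCloser.Close()
}

type objectLayer struct{ mu sync.RWMutex }

// GetObjectNInfo stats and opens the object under one lock, fixing
// the race between GetObjectInfo and GetObject.
func (o *objectLayer) GetObjectNInfo(bucket, object string) (*GetObjectReader, error) {
	o.mu.RLock()
	body := io.NopCloser(strings.NewReader("object body"))
	return &GetObjectReader{
		ObjInfo:    ObjectInfo{Name: object, Size: 11},
		ReadCloser: unlockingReader{body, o.mu.RUnlock},
	}, nil
}

func main() {
	ol := &objectLayer{}
	gr, _ := ol.GetObjectNInfo("bucket", "obj")
	defer gr.Close()
	data, _ := io.ReadAll(gr)
	fmt.Println(gr.ObjInfo.Name, len(data))
}
```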
* Revert "Encrypted reader wrapped in NewGetObjectReader should be closed (#6383)"
This reverts commit 53a0bbeb5b.
* Revert "Change SelectAPI to use new GetObjectNInfo API (#6373)"
This reverts commit 5b05df215a.
* Revert "Implement GetObjectNInfo object layer call (#6290)"
This reverts commit e6d740ce09.
This combines calling GetObjectInfo and GetObject while returning a
io.ReadCloser for the object's body. This allows the two operations to
be under a single lock, fixing a race between getting object info and
reading the object body.
Continuing from PR 157ed65c35
Our posix.go implementation did not handle I/O errors
properly on the disks; this led to situations where
top-level callers such as ListObjects might return early
without even verifying all the available disks.
This commit tries to address this for Kubernetes and drbd/nbd
based persistent volumes, which can disconnect under load and
result in situations where disks return I/O errors.
This commit also simplifies the listing operation: listing
never returns any error. We can do this since we pretty
much ignore most of the errors anyway. When objects are
accessed directly, we return proper errors.
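An illustrative sketch of mapping raw I/O errors to a "faulty disk" sentinel so callers can skip the disk instead of failing early:

```go
package main

import (
	"errors"
	"fmt"
	"io/fs"
	"os"
	"syscall"
)

var errFaultyDisk = errors.New("disk is faulty")

// toStorageErr normalizes low-level errors: drbd/nbd-backed volumes
// can disconnect under load and surface EIO, which we translate to a
// sentinel so top-level callers treat the disk as offline.
func toStorageErr(err error) error {
	if err == nil {
		return nil
	}
	var pathErr *fs.PathError
	if errors.As(err, &pathErr) && errors.Is(pathErr.Err, syscall.EIO) {
		return errFaultyDisk
	}
	return err
}

func main() {
	_, err := os.Open("/nonexistent")
	fmt.Println(toStorageErr(err))
}
```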
Better support for HEAD and listing of zero-sized objects with a
trailing slash (a.k.a. empty directories). For that, an isLeafDir
function is added to indicate whether the specified object is an
empty directory or not. Each backend (xl, fs) has the
responsibility to store that information. Currently, in both XL
and FS, an empty directory is represented by an empty directory
in the backend.
isLeafDir() checks whether the given path is an empty directory.
Since directory listing is costly when a directory contains too
many objects, readDirN() is added in this PR to list only N
entries. In isLeafDir(), we will only list one entry to check if
a directory is empty or not.
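A sketch of these two helpers under the stated behavior; this is illustrative, not the actual posix code:

```go
package main

import (
	"fmt"
	"io"
	"os"
)

// readDirN returns at most n entry names from dir; n <= 0 means all.
func readDirN(dir string, n int) ([]string, error) {
	f, err := os.Open(dir)
	if err != nil {
		return nil, err
	}
	defer f.Close()
	names, err := f.Readdirnames(n)
	if err == io.EOF {
		err = nil // fewer than n entries is not an error here
	}
	return names, err
}

// isLeafDir reports whether dir is an empty directory (an object with
// a trailing slash); reading a single entry is enough to decide.
func isLeafDir(dir string) bool {
	names, err := readDirN(dir, 1)
	return err == nil && len(names) == 0
}

func main() {
	fmt.Println(isLeafDir(os.TempDir()))
}
```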
This PR adds disk-based edge caching support for the MinIO server.
Cache settings can be configured in config.json with a list of disk
drives, cache expiry in days and file patterns to exclude from the
cache, or via the environment variables MINIO_CACHE_DRIVES,
MINIO_CACHE_EXCLUDE and MINIO_CACHE_EXPIRY.
Design assumes that atime support is enabled and the list of cache
drives is fixed.
- Objects are cached on both GET and PUT/POST operations.
- Expiry is used as a hint to evict older entries from the cache;
eviction also kicks in once 80% of cache capacity is filled.
- When object storage backend is down, GET, LIST and HEAD operations fetch
object seamlessly from cache.
Current Limitations
- Bucket policies are not cached, so anonymous operations are not supported in
offline mode.
- Objects are distributed using deterministic hashing among the list
of cache drives specified. If one or more drives go offline, or the
cache drive configuration is altered, performance could degrade to
a linear lookup.
Fixes #4026
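A sketch of the deterministic placement mentioned in the limitations; crc32 is an illustrative choice, not necessarily the hash used:

```go
package main

import (
	"fmt"
	"hash/crc32"
)

// cacheDriveFor maps an object name to one cache drive. The same
// object always maps to the same drive; changing the drive list
// changes the mapping, which is why altering the configuration can
// degrade lookups to a linear scan across drives.
func cacheDriveFor(object string, drives []string) string {
	return drives[crc32.ChecksumIEEE([]byte(object))%uint32(len(drives))]
}

func main() {
	drives := []string{"/mnt/cache1", "/mnt/cache2", "/mnt/cache3"}
	fmt.Println(cacheDriveFor("mybucket/photo.jpg", drives))
}
```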