minio

mirror of https://github.com/minio/minio.git synced 2024-12-26 23:25:54 -05:00

Author	SHA1	Message	Date
Harshavardhana	e84f533c6c	add missing wait groups for certain io.Pipe() usage (#12264 ) wait groups are necessary with io.Pipes() to avoid races when a blocking function may not be expected and a Write() -> Close() before Read() races on each other. We should avoid such situations. Co-authored-by: Klaus Post <klauspost@gmail.com>	2021-05-11 09:18:37 -07:00
Harshavardhana	0d3ddf7286	fix: improve NewObjectReader implementation for careful cleanup usage (#12199 ) cleanup functions should never be cleaned before the reader is instantiated, this type of design leads to situations where order of lockers and places for them to use becomes confusing. Allow WithCleanupFuncs() if the caller wishes to add cleanupFns to be run upon close() or an error during initialization of the reader. Also make sure streams are closed before we unlock the resources, this allows for ordered cleanup of resources.	2021-04-30 18:37:58 -07:00
Harshavardhana	64f6020854	fix: cleanup locking, cancel context upon lock timeout (#12183 ) upon errors to acquire lock context would still leak, since the cancel would never be called. since the lock is never acquired - proactively clear it before returning.	2021-04-29 20:55:21 -07:00
Anis Elleuch	9e797532dc	lock: Always cancel the returned Get(R)Lock context (#12162 ) * lock: Always cancel the returned Get(R)Lock context There is a leak with cancel created inside the locking mechanism. The cancel purpose was to cancel operations such as erasure get/put that are holding non-refreshable locks. This PR will ensure the created context.Cancel is passed to the unlock API so it will cleanup and avoid leaks. * locks: Avoid returning nil cancel in local lockers Since there is no Refresh mechanism in the local locking mechanism, we do not generate a new context or cancel. Currently, a nil cancel function is returned but this can cause a crash. Return a dummy function instead.	2021-04-27 16:12:50 -07:00
Harshavardhana	069432566f	update license change for MinIO Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 11:58:53 -07:00
Andreas Auernhammer	885c170a64	introduce new package pkg/kms (#12019 ) This commit introduces a new package `pkg/kms`. It contains basic types and functions to interact with various KMS implementations. This commit also moves KMS-related code from `cmd/crypto` to `pkg/kms`. Now, it is possible to implement a KMS-based config data encryption in the `pkg/config` package.	2021-04-15 08:47:33 -07:00
Andreas Auernhammer	97aa831352	add new pkg/fips for FIPS 140-2 (#12051 ) This commit introduces a new package `pkg/fips` that bundles functionality to handle and configure cryptographic protocols in case of FIPS 140. If it is compiled with `--tags=fips` it assumes that a FIPS 140-2 cryptographic module is used to implement all FIPS compliant cryptographic primitives - like AES, SHA-256, ... In "FIPS mode" it excludes all non-FIPS compliant cryptographic primitives from the protocol parameters.	2021-04-14 08:29:56 -07:00
Anis Elleuch	7be7109471	locking: Add Refresh for better locking cleanup (#11535 ) Co-authored-by: Anis Elleuch <anis@min.io> Co-authored-by: Harshavardhana <harsha@minio.io>	2021-03-03 18:36:43 -08:00
Harshavardhana	289e1d8b2a	fix: reduce crawler memory usage by orders of magnitude (#11556 ) currently crawler waits for an entire readdir call to return until it processes usage, lifecycle, replication and healing - instead we should pass the applicator all the way down to avoid building any special stack for all the contents in a single directory. This allows for - no need to remember the entire list of entries per directory before applying the required functions - no need to wait for entire readdir() call to finish before applying the required functions	2021-02-17 15:34:42 -08:00
Poorna Krishnamoorthy	93fd248b52	fix: save ModTime properly in disk cache (#11522 ) fix #11414	2021-02-11 19:25:47 -08:00
Andreas Auernhammer	8cdf2106b0	refactor cmd/crypto code for SSE handling and parsing (#11045 ) This commit refactors the code in `cmd/crypto` and separates SSE-S3, SSE-C and SSE-KMS. This commit should not cause any behavior change except for: - `IsRequested(http.Header)` which now returns the requested type {SSE-C, SSE-S3, SSE-KMS} and does not consider SSE-C copy headers. However, SSE-C copy headers alone are anyway not valid.	2020-12-22 09:19:32 -08:00
Harshavardhana	df93102235	fix: unwrapping issues with os.Is* functions (#10949 ) reduces 3 stat calls, reducing the overall startup time significantly.	2020-11-23 08:36:49 -08:00
Poorna Krishnamoorthy	d295ce5708	Fix disk cache usage percent for prometheus (#10898 ) Fixes: #10895 Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io>	2020-11-14 19:18:00 -08:00
Klaus Post	2294e53a0b	Don't retain context in locker (#10515 ) Use the context for internal timeouts, but disconnect it from outgoing calls so we always receive the results and cancel it remotely.	2020-11-04 08:25:42 -08:00
Poorna Krishnamoorthy	03fdbc3ec2	Add async caching commit option in diskcache (#10742 ) Add a store-and-forward option for single-part uploads when async mode is enabled with env MINIO_CACHE_COMMIT=writeback. It defaults to `writethrough` if unspecified.	2020-11-02 10:00:45 -08:00
Kaloyan Raev	be7f67268d	fix: Do not cleanup range files in cache SaveMetadata when total hits are false (#10728 )	2020-10-28 09:23:17 -07:00
Harshavardhana	2b4eb87d77	pick disks which are common maximally used (#10600 ) further optimization to ensure that good disks are always used for listing, other than healing we only use disks that are maximally used.	2020-09-29 22:54:02 -07:00
Klaus Post	b7438fe4e6	Copy metadata before spawning goroutine + prealloc maps (#10458 ) In `(*cacheObjects).GetObjectNInfo` copy the metadata before spawning a goroutine. Clean up a few map[string]string copies as well, reducing allocs and simplifying the code. Fixes #10426	2020-09-10 11:37:22 -07:00
Harshavardhana	c13afd56e8	Remove MaxConnsPerHost settings to avoid potential hangs (#10438 ) MaxConnsPerHost can potentially hang a call without any way to timeout, we do not need this setting for our proxy and gateway implementations instead IdleConn settings are good enough. Also ensure to use NewRequestWithContext and make sure to take the disks offline only for network errors. Fixes #10304	2020-09-08 14:22:04 -07:00
Klaus Post	650dccfa9e	cache: Only start at high watermark (#10403 ) Currently, cache purges are triggered as soon as the low watermark is exceeded. To reduce IO this should only be done when reaching the high watermark. This simplifies checks and reduces all calls for a GC to go through `dcache.diskSpaceAvailable(size)`. While a comment claims that `dcache.triggerGC <- struct{}{}` was non-blocking I don't see how that was possible. Instead, we add a 1 size to the queue channel and use channel semantics to avoid blocking when a GC has already been requested. `bytesToClear` now takes the high watermark into account so it will not request any bytes to be cleared until that is reached.	2020-09-02 17:48:44 -07:00
Harshavardhana	caad314faa	add ruleguard support, fix all the reported issues (#10335 )	2020-08-24 12:11:20 -07:00
Harshavardhana	e57c742674	use single dynamic timeout for most locked API/heal ops (#10275 ) newDynamicTimeout should be allocated once, in-case of temporary locks in config and IAM we should have allocated timeout once before the `for loop` This PR doesn't fix any issue as such, but provides enough dynamism for the timeout as per expectation.	2020-08-17 11:29:58 -07:00
Harshavardhana	14b1c9f8e4	fix: return Range errors after If-Matches (#10045 ) closes #7292	2020-07-17 13:01:22 -07:00
Andreas Auernhammer	a317a2531c	admin: new API for creating KMS master keys (#9982 ) This commit adds a new admin API for creating master keys. An admin client can send a POST request to: ``` /minio/admin/v3/kms/key/create?key-id=<keyID> ``` The name / ID of the new key is specified as request query parameter `key-id=<ID>`. Creating new master keys requires KES - it does not work with the native Vault KMS (deprecated) nor with a static master key (deprecated). Further, this commit removes the `UpdateKey` method from the `KMS` interface. This method is not needed and not used anymore.	2020-07-08 18:50:43 -07:00
poornas	55a3b071ea	Allow optionally to disable range caching. (#9908 ) The default behavior is to cache each range requested to cache drive. Add an environment variable `MINIO_RANGE_CACHE` - when set to off, it disables range caching and instead downloads entire object in the background. Fixes #9870	2020-06-29 13:25:29 -07:00
Harshavardhana	f9aa239973	fix: export prometheus metrics for cache GC triggers (#9815 ) Bonus change to use channel to serialize triggers, instead of using atomic variables. More efficient mechanism for synchronization. Co-authored-by: Nitish Tiwari <nitish@minio.io>	2020-06-15 09:05:35 -07:00
Harshavardhana	4915433bd2	Support bucket versioning (#9377 ) - Implement a new xl.json 2.0.0 format to support, this moves the entire marshaling logic to POSIX layer, top layer always consumes a common FileInfo construct which simplifies the metadata reads. - Implement list object versions - Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	2020-06-12 20:04:01 -07:00
Erkki Eilonen	301de169e9	in cache build ranges metadata as needed (#9698 )	2020-05-25 00:17:03 -07:00
Harshavardhana	27d716c663	simplify usage of mutexes and atomic constants (#9501 )	2020-05-03 22:35:40 -07:00
poornas	9a547dcbfb	Add API's for managing bucket quota (#9379 ) This PR allows setting a "hard" or "fifo" quota restriction at the bucket level. Buckets that have reached the FIFO quota configured, will automatically be cleaned up in FIFO manner until bucket usage drops to configured quota. If a bucket is configured with a "hard" quota ceiling, all further writes are disallowed.	2020-04-30 15:55:54 -07:00
Harshavardhana	957ecb1b64	use optimal memory while purging cache (#9426 ) re-implement the cache purging routine to avoid using ioutil.ReadDir which can lead to high allocations when there are cache directories with lots of content, or when cache is installed in memory-constrained environments. Instead rely on a callback function where we use no more than 8KiB of memory per cycle. Precursor for this change refer #9425, original issue pointed by Caleb Case <caleb@storj.io>	2020-04-23 12:26:13 -07:00
Harshavardhana	282c9f790a	fix: validate partNumber in queryParam as part of preConditions (#9386 )	2020-04-20 22:01:59 -07:00
Harshavardhana	f44cfb2863	use GlobalContext whenever possible (#9280 ) This change is throughout the codebase to ensure that all codepaths honor GlobalContext	2020-04-09 09:30:02 -07:00
Harshavardhana	cfc9cfd84a	fix: various optimizations, idiomatic changes (#9179 ) - acquire single leader lock for all background operations - healing, crawling and applying lifecycle policies. - simplify lifecycle to avoid network calls, which was a bug in implementation - we should hold a leader and do everything from there, we have access to entire name space. - make listing, walking not interfere by slowing itself down like the crawler. - effectively use global context everywhere to ensure proper shutdown, in cache, lifecycle, healing - don't read `format.json` for prometheus metrics in StorageInfo() call.	2020-03-22 12:16:36 -07:00
Kody A Kantor	06e30b5aa1	Skip building directio on platforms that don't support Direct IO (#9059 )	2020-03-12 18:57:41 -07:00
poornas	224b4f13b8	Add cache eviction low and high watermarks (#8958 ) To allow better control of the cache eviction process, introduce MINIO_CACHE_WATERMARK_LOW and MINIO_CACHE_WATERMARK_HIGH env. variables to specify when to stop/start cache eviction process. Deprecate MINIO_CACHE_EXPIRY environment variable. Cache gc sweeps at 30 minute intervals whenever high watermark is reached to clear least recently accessed entries in the cache until sufficient space is cleared to reach the low watermark. Garbage collection uses an adaptive file scoring approach based on last access time, with greater weights assigned to larger objects and those with more hits to find the candidates for eviction. Thanks to @klauspost for this file scoring algorithm Co-authored-by: Klaus Post <klauspost@minio.io>	2020-02-23 19:03:39 +05:30
poornas	013773065c	Save metadata correctly in cache.json on PUT (#8985 ) fixes #8979	2020-02-13 08:49:32 +05:30
poornas	278a165674	Allow caching based on a configurable number of hits. (#8891 ) Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-04 09:10:01 +05:30
poornas	9199033db7	Set X-Cache and X-Cache-Lookup headers for cache (#8794 ) X-Cache sets cache status of HIT if object is served from the disk cache, or MISS otherwise. X-Cache-Lookup is set to HIT if object was found in the cache even if not served (for e.g. if cache entry was invalidated by ETag verification)	2020-01-10 20:21:13 -08:00
poornas	04de3ea4bd	Change cache purge routine granularity to hours (#8660 ) With this PR, cache eviction will continue until no LRU entries older than an hour can be evicted or a sufficient percentage of disk space has been reclaimed.	2019-12-18 13:49:10 -08:00
poornas	3c30e4503d	Cache only the range requested for range GETs (#8599 )	2019-12-08 13:58:04 -08:00
Nitish Tiwari	3df7285c3c	Add Support for Cache and S3 related metrics in Prometheus endpoint (#8591 ) This PR adds support for the metrics below - Cache Hit Count - Cache Miss Count - Data served from Cache (in Bytes) - Bytes received from AWS S3 - Bytes sent to AWS S3 - Number of requests sent to AWS S3 Fixes #8549	2019-12-05 23:16:06 -08:00
Harshavardhana	5d3d57c12a	Start using error wrapping with fmt.Errorf (#8588 ) Use fatih/errwrap to fix all the code to use error wrapping with fmt.Errorf()	2019-12-02 09:28:01 -08:00
Harshavardhana	ee4a6a823d	Migrate config to KV data format (#8392 ) - adding oauth support to MinIO browser (#8400) by @kanagaraj - supports multi-line get/set/del for all config fields - add support for comments, allow toggle - add extensive validation of config before saving - support MinIO browser to support proper claims, using STS tokens - env support for all config parameters, legacy envs are also supported with all documentation now pointing to latest ENVs - preserve accessKey/secretKey from FS mode setups - add history support implements three APIs - ClearHistory - RestoreHistory - ListHistory - add help command support for each config parameters - all the bug fixes after migration to KV, and other bug fixes encountered during testing.	2019-10-22 22:59:13 -07:00
poornas	f01d53b20f	cache: do not evict entry on ErrClosedPipe (#8432 ) Fixes: #8431. If client prematurely closes the read end of the pipe, cache entry should not be evicted.	2019-10-22 15:04:25 -07:00
Harshavardhana	d48fd6fde9	Remove unused params and functions (#8399 )	2019-10-15 18:35:41 -07:00
Harshavardhana	290ad0996f	Move etcd, logger, crypto into their own packages (#8366 ) - Deprecates _MINIO_PROFILER, `mc admin profile` does the job - Move ENVs to common location in cmd/config/	2019-10-08 11:17:56 +05:30
Harshavardhana	3b8adf7528	Move storageclass config handling into cmd/config/storageclass (#8360 ) Continuation of the changes done in PR #8351 to refactor, add tests and move global handling into a more idiomatic style for Go as packages.	2019-10-07 11:20:24 +05:30
Harshavardhana	589e32a4ed	Refactor config and split them in packages (#8351 ) This change is related to larger config migration PR change, this is a first stage change to move our configs to `cmd/config/` - divided into its subsystems	2019-10-04 23:05:33 +05:30
Harshavardhana	ff5bf51952	admin/heal: Fix deep healing to heal objects under more conditions (#8321 ) - Heal if the part.1 is truncated from its original size - Heal if the part.1 fails while being verified in between - Heal if the part.1 fails while being at a certain offset Other cleanups include making sure to flush the HTTP responses properly from storage-rest-server and avoiding 'defer' to improve call latency. 'defer' incurs latency, so avoid it in hot paths such as storage-rest handlers. Fixes #8319	2019-10-02 01:42:15 +05:30
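
Commit e84f533c6c describes pairing io.Pipe() with a wait group so that a caller does not return while the writer goroutine is still running. The following is a minimal Go sketch of that pattern, not code from the MinIO source; `copyThroughPipe` is a hypothetical helper introduced only for illustration.

```go
package main

import (
	"fmt"
	"io"
	"strings"
	"sync"
)

// copyThroughPipe (hypothetical helper) streams src through an io.Pipe and
// returns what the reader side received. The WaitGroup makes the caller wait
// for the writer goroutine to finish before returning.
func copyThroughPipe(src string) (string, error) {
	pr, pw := io.Pipe()

	var wg sync.WaitGroup
	wg.Add(1)
	go func() {
		defer wg.Done()
		_, err := io.Copy(pw, strings.NewReader(src))
		// CloseWithError(nil) behaves like Close(): the reader sees io.EOF.
		pw.CloseWithError(err)
	}()

	data, err := io.ReadAll(pr)
	// Closing the read side unblocks the writer if reading stopped early.
	pr.Close()
	// Do not return until the writer goroutine has exited.
	wg.Wait()
	return string(data), err
}

func main() {
	out, err := copyThroughPipe("hello")
	fmt.Println(out, err)
}
```

The read side is closed before `wg.Wait()` so that a writer still blocked in `Write()` is released rather than deadlocking the wait.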

57 Commits