minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	124544d834	add pre-conditions support for PUT calls during replication (#15674 ) PUT shall only proceed if pre-conditions are met, the new code uses - x-minio-source-mtime - x-minio-source-etag to verify if the object indeed needs to be replicated or not, allowing us to avoid StatObject() call.	2022-09-14 18:44:04 -07:00
Harshavardhana	228c6686f8	allow non-standards fallback for all http.TimeFormats (#15662 ) fixes #15645	2022-09-07 07:24:54 -07:00
Klaus Post	8e4a45ec41	fix: encrypt checksums in metadata (#15620 )	2022-08-31 08:13:23 -07:00
Klaus Post	a9f1ad7924	Add extended checksum support (#15433 )	2022-08-29 16:57:16 -07:00
Harshavardhana	433b6fa8fe	upgrade golang-lint to the latest (#15600 )	2022-08-26 12:52:29 -07:00
Harshavardhana	5cffd3780a	fix: multiple fixes in prefix exclude implementation (#14877 ) - do not need to restrict prefix exclusions that do not have `/` as suffix, relax this requirement as spark may have staging folders with other autogenerated characters , so we are better off doing full prefix March and skip. - multiple delete objects was incorrectly creating a null delete marker on a versioned bucket instead of creating a proper versioned delete marker. - do not suspend paths on the excluded prefixes during delete operations to avoid creating `null` delete markers, honor suspension of versioning only at bucket level for delete markers.	2022-05-07 22:06:44 -07:00
Krishnan Parthasarathi	ad8e611098	feat: implement prefix-level versioning exclusion (#14828 ) Spark/Hadoop workloads which use Hadoop MR Committer v1/v2 algorithm upload objects to a temporary prefix in a bucket. These objects are 'renamed' to a different prefix on Job commit. Object storage admins are forced to configure separate ILM policies to expire these objects and their versions to reclaim space. Our solution: This can be avoided by simply marking objects under these prefixes to be excluded from versioning, as shown below. Consequently, these objects are excluded from replication, and don't require ILM policies to prune unnecessary versions. - MinIO Extension to Bucket Version Configuration ```xml <VersioningConfiguration xmlns="http://s3.amazonaws.com/doc/2006-03-01/"> <Status>Enabled</Status> <ExcludeFolders>true</ExcludeFolders> <ExcludedPrefixes> <Prefix>app1-jobs//_temporary/</Prefix> </ExcludedPrefixes> <ExcludedPrefixes> <Prefix>app2-jobs//__magic/</Prefix> </ExcludedPrefixes> <!-- .. up to 10 prefixes in all --> </VersioningConfiguration> ``` Note: `ExcludeFolders` excludes all folders in a bucket from versioning. This is required to prevent the parent folders from accumulating delete markers, especially those which are shared across spark workloads spanning projects/teams. - To enable version exclusion on a list of prefixes ``` mc version enable --excluded-prefixes "app1-jobs//_temporary/,app2-jobs//_magic," --exclude-prefix-marker myminio/test ```	2022-05-06 19:05:28 -07:00
Harshavardhana	f527c708f2	run gofumpt cleanup across code-base (#14015 )	2022-01-02 09:15:06 -08:00
Krishnan Parthasarathi	3da9ee15d3	Add MaxNoncurrentVersions to NoncurrentExpiration action (#13580 ) This unit allows users to limit the maximum number of noncurrent versions of an object. To enable this rule you need the following ilm.json ``` cat >> ilm.json <<EOF { "Rules": [ { "ID": "test-max-noncurrent", "Status": "Enabled", "Filter": { "Prefix": "user-uploads/" }, "NoncurrentVersionExpiration": { "MaxNoncurrentVersions": 5 } } ] } EOF mc ilm import myminio/mybucket < ilm.json ```	2021-11-19 17:54:10 -08:00
Krishnan Parthasarathi	d0963974a5	pkg/lifecycle: Add SetPredictionHeaders method (#12755 ) This method is used to add expected expiration and transition time for an object in GET/HEAD Object response headers. Also fixed bugs in lifecycle.PredictTransitionTime and getLifecycleTransitionTier in handling current and non-current versions.	2021-07-20 17:36:55 -07:00
Harshavardhana	cdeccb5510	feat: Deprecate embedded browser and import console (#12460 ) This feature also changes the default port where the browser is running, now the port has moved to 9001 and it can be configured with ``` --console-address ":9001" ```	2021-06-17 20:27:04 -07:00
Harshavardhana	1f262daf6f	rename all remaining packages to internal/ (#12418 ) This is to ensure that there are no projects that try to import `minio/minio/pkg` into their own repo. Any such common packages should go to `https://github.com/minio/pkg`	2021-06-01 14:59:40 -07:00
Harshavardhana	f7a87b30bf	Revert "deprecate embedded browser (#12163 )" This reverts commit `736d8cbac4`. Bring contrib files for older contributions	2021-04-30 08:50:39 -07:00
Harshavardhana	736d8cbac4	deprecate embedded browser (#12163 ) https://github.com/minio/console takes over the functionality for the future object browser development Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-27 10:52:12 -07:00
Harshavardhana	069432566f	update license change for MinIO Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 11:58:53 -07:00
Harshavardhana	7875d472bc	avoid notification for non-existent delete objects (#11514 ) Skip notifications on objects that might have had an error during deletion, this also avoids unnecessary replication attempt on such objects. Refactor some places to make sure that we have notified the client before we - notify - schedule for replication - lifecycle etc.	2021-02-10 22:00:42 -08:00
Harshavardhana	a6c146bd00	validate storage class across pools when setting config (#11320 ) ``` mc admin config set alias/ storage_class standard=EC:3 ``` should only succeed if parity ratio is valid for all server pools, if not we should fail proactively. This PR also needs to bring other changes now that we need to cater for variadic drive counts per pool. Bonus fixes also various bugs reproduced with - GetObjectWithPartNumber() - CopyObjectPartWithOffsets() - CopyObjectWithMetadata() - PutObjectPart,PutObject with truncated streams	2021-01-22 12:09:24 -08:00
Anis Elleuch	284a2b9021	ilm: Send delete marker creation event when appropriate (#10696 ) Before this commit, the crawler ILM will always send object delete event notification though this is wrong.	2020-10-16 21:22:12 -07:00
Harshavardhana	736e58dd68	fix: handle concurrent lockers with multiple optimizations (#10640 ) - select lockers which are non-local and online to have affinity towards remote servers for lock contention - optimize lock retry interval to avoid sending too many messages during lock contention, reduces average CPU usage as well - if bucket is not set, when deleteObject fails make sure setPutObjHeaders() honors lifecycle only if bucket name is set. - fix top locks to list out always the oldest lockers always, avoid getting bogged down into map's unordered nature.	2020-10-08 12:32:32 -07:00
Harshavardhana	8e7c00f3d4	add missing request-id from DeleteObject events (#10623 ) fixes #10621	2020-10-02 13:36:13 -07:00
Harshavardhana	0104af6bcc	delayed locks until we have started reading the body (#10474 ) This is to ensure that Go contexts work properly, after some interesting experiments I found that Go net/http doesn't cancel the context when Body is non-zero and hasn't been read till EOF. The following gist explains this, this can lead to pile up of go-routines on the server which will never be canceled and will die at a really later point in time, which can simply overwhelm the server. https://gist.github.com/harshavardhana/c51dcfd055780eaeb71db54f9c589150 To avoid this refactor the locking such that we take locks after we have started reading from the body and only take locks when needed. Also, remove contextReader as it's not useful, doesn't work as expected context is not canceled until the body reaches EOF so there is no point in wrapping it with context and putting a `select {` on it which can unnecessarily increase the CPU overhead. We will still use the context to cancel the lockers etc. Additional simplification in the locker code to avoid timers as re-using them is a complicated ordeal avoid them in the hot path, since locking is very common this may avoid lots of allocations.	2020-09-14 15:57:13 -07:00
Harshavardhana	14b1c9f8e4	fix: return Range errors after If-Matches (#10045 ) closes #7292	2020-07-17 13:01:22 -07:00
Harshavardhana	4915433bd2	Support bucket versioning (#9377 ) - Implement a new xl.json 2.0.0 format to support, this moves the entire marshaling logic to POSIX layer, top layer always consumes a common FileInfo construct which simplifies the metadata reads. - Implement list object versions - Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	2020-06-12 20:04:01 -07:00
kannappanr	2c372a9894	Send Partscount only when partnumber is specified (#9793 ) Fixes #9789	2020-06-10 09:22:15 -07:00
Anis Elleuch	790323ac37	lifecycle: Fix object expiration date (#9791 ) re-use PredictExpiryTime() in ComputeAction()	2020-06-09 09:40:53 -07:00
Anis Elleuch	cdf4815a6b	Add x-amz-expiration header in some S3 responses (#9667 ) x-amz-expiration is described in the S3 specification as a header which indicates if the object in question will expire any time in the future.	2020-05-21 14:12:52 -07:00
Harshavardhana	7290d23b26	Apply partNumber checks only on multipart objects (#9528 )	2020-05-06 16:58:09 -07:00
Harshavardhana	282c9f790a	fix: validate partNumber in queryParam as part of preConditions (#9386 )	2020-04-20 22:01:59 -07:00
Harshavardhana	c43f745449	Ensure that we use constants everywhere (#7845 ) This allows for canonicalization of the strings throughout our code and provides a common space for all these constants to reside. This list is rather non-exhaustive but captures all the headers used in AWS S3 API operations	2019-07-02 22:34:32 -07:00
kannappanr	5ecac91a55	Replace Minio refs in docs with MinIO and links (#7494 )	2019-04-09 11:39:42 -07:00
Harshavardhana	a2e344bf30	Preserve ETag case for S3 compatibility (#7498 ) Most hadoop distributions hortonworks, cloudera all depend on aws-sdk-java 1.7.x to 1.10.x - the releases which have bugs related case sensitive check for ETag header. Go changes the case of the headers set to be canonical but only preserves them when set through a direct map. This fixes most compatibility issues we have had in the past supporting older hadoop distributions.	2019-04-08 16:54:46 -07:00
poornas	023866642c	canonicalize ETag correctly (#7442 ) Fixes #7441 Trim extra quotes prefixing/suffixing ETag in CompleteMultipartUpload request.	2019-04-01 12:19:52 -07:00
Anis Elleuch	8689ec258b	Don't decrypt ETag in validation when source is SSEC multipart (#7423 ) Copying an encrypted SSEC object when this latter is uploaded using multipart mechanism was failing because ETag in case of encrypted multipart upload is not encrypted. This PR fixes the behavior.	2019-03-25 12:17:31 -07:00
Harshavardhana	e0a87e96de	Populate host value from GetSourceIP directly (#7417 )	2019-03-25 11:45:42 -07:00
Anis Elleuch	b05825ffe8	s3: Fix precondition failed in CopyObjectPart when src is encrypted (#7276 ) CopyObject precondition checks into GetObjectReader in order to perform SSE-C pre-condition checks using the last 32 bytes of encrypted ETag rather than the decrypted ETag This also necessitates moving precondition checks for gateways to gateway layer rather than object handler check	2019-03-06 12:38:41 -08:00
Harshavardhana	a51781e5cf	Use context to fill in more details about error XML (#7232 )	2019-02-13 16:07:21 -08:00
Harshavardhana	df35d7db9d	Introduce staticcheck for stricter builds (#7035 )	2019-02-13 18:29:36 +05:30
Harshavardhana	fef5416b3c	Support unknown gateway errors and convert at handler layer (#7219 ) Different gateway implementations due to different backend API errors, might return different unsupported errors at our handler layer. Current code posed a problem for us because this information was lost and we would convert it to InternalError in this situation all S3 clients end up retrying the request. To avoid this unexpected situation implement a way to support this cleanly such that the underlying information is not lost which is returned by gateway.	2019-02-12 14:55:52 +05:30
Harshavardhana	dba61867e8	Redirect browser requests returning AccessDenied (#6848 ) Anonymous requests from S3 resources returning AccessDenied should be auto redirected to browser for login.	2018-11-26 12:15:12 -08:00
Harshavardhana	6c85706c24	Use GetSourceIP for source ip as request params (#6109 ) Fixes #6108	2018-07-02 14:40:18 -07:00
poornas	a3e806ed61	Add disk based edge caching support. (#5182 ) This PR adds disk based edge caching support for minio server. Cache settings can be configured in config.json to take list of disk drives, cache expiry in days and file patterns to exclude from cache or via environment variables MINIO_CACHE_DRIVES, MINIO_CACHE_EXCLUDE and MINIO_CACHE_EXPIRY Design assumes that Atime support is enabled and the list of cache drives is fixed. - Objects are cached on both GET and PUT/POST operations. - Expiry is used as hint to evict older entries from cache, or if 80% of cache capacity is filled. - When object storage backend is down, GET, LIST and HEAD operations fetch object seamlessly from cache. Current Limitations - Bucket policies are not cached, so anonymous operations are not supported in offline mode. - Objects are distributed using deterministic hashing among list of cache drives specified.If one or more drives go offline, or cache drive configuration is altered - performance could degrade to linear lookup. Fixes #4026	2018-03-28 14:14:06 -07:00
Krishna Srinivas	e452377b24	Add context to the object-interface methods. Make necessary changes to xl fs azure sia	2018-03-15 16:28:25 -07:00
Bala FA	0e4431725c	make notification as separate package (#5294 ) * Remove old notification files * Add net package * Add event package * Modify minio to take new notification system	2018-03-15 13:03:41 -07:00
poornas	0bb6247056	Move nslocking from s3 layer to object layer (#5382 ) Fixes #5350	2018-01-13 10:04:52 +05:30
Frank Wessels	61e0b1454a	Add support for timeouts for locks (#4377 )	2017-08-31 14:43:59 -07:00
Harshavardhana	155a90403a	fs/erasure: Rename meta 'md5Sum' as 'etag'. (#4319 ) This PR also does backend format change to 1.0.1 from 1.0.0. Backward compatible changes are still kept to read the 'md5Sum' key. But all new objects will be stored with the same details under 'etag'. Fixes #4312	2017-05-14 12:05:51 -07:00
Harshavardhana	57c5c75611	web: Simplify and converge common functions in web/obj API. (#4179 ) RemoveObject() in webAPI currently re-implements some part of the code to remove objects combine them for simplicity and code convergence.	2017-04-26 23:27:48 -07:00
Anis Elleuch	e4bd882f11	handlers: Ignore malformatted datetime type header (#4097 ) Ignore headers, such as If-Modified-Since, If-Unmodified-Since, etc.. when they are received with a format other than HTTP date.	2017-04-12 12:34:57 -07:00
Harshavardhana	77a192a7b5	Implement CopyObjectPart API (#3663 ) This API is implemented to allow copying data from an existing source object to an ongoing multipart operation http://docs.aws.amazon.com/AmazonS3/latest/API/mpUploadUploadPartCopy.html Fixes #3662	2017-01-31 09:38:34 -08:00
Harshavardhana	926c75d0b5	api: Set appropriate content-type for success/error responses. (#3537 ) Golang HTTP client automatically detects content-type but for S3 clients this content-type might be incorrect or might misbehave. For example: ``` Content-Type: text/xml; charset=utf-8 ``` Should be ``` Content-Type: application/xml ``` Allow this to be set properly.	2017-01-06 00:37:00 -08:00

1 2

52 Commits