minio

mirror of https://github.com/minio/minio.git synced 2024-12-29 00:23:21 -05:00

Author	SHA1	Message	Date
Klaus Post	4ac9ed4248	CopyObject: Do not remove crypto info when compressed (#11702 ) Removing crypto info makes it impossible to copy encrypted+compressed objects. Disable destination compression when encrypted.	2021-03-08 12:57:54 -08:00
Andreas Auernhammer	f14cc6c943	etag: add FromContentMD5 to parse content-md5 as ETag (#11688 ) This commit adds the `FromContentMD5` function to parse a client-provided content-md5 as ETag. Further, it also adds multipart ETag computation for future needs.	2021-03-03 12:58:28 -08:00
Harshavardhana	37960cbc2f	fix: avoid writing more content on network with O_DIRECT reads (#11659 ) An io.LimitReader was missing for the 'length' parameter for ranged requests, which would cause clients to get truncated responses and errors. fixes #11651	2021-02-28 15:33:03 -08:00
Andreas Auernhammer	d4b822d697	pkg/etag: add new package for S3 ETag handling (#11577 ) This commit adds a new package `etag` for dealing with S3 ETags. Even though ETag is often viewed as MD5 checksum of an object, handling S3 ETags correctly is a surprisingly complex task. While it is true that the ETag corresponds to the MD5 for the most basic S3 API operations, there are many exceptions in case of multipart uploads or encryption. Even worse, some S3 clients expect very specific behavior when it comes to ETags. For example, some clients expect that the ETag is a double-quoted string and fail otherwise. Non-AWS compliant ETag handling has been a source of many bugs in the past. Therefore, this commit adds a dedicated `etag` package that provides functionality for parsing, generating and converting S3 ETags. Further, this commit removes the ETag computation from the `hash` package. Instead, the `hash` package (i.e. `hash.Reader`) should focus only on computing and verifying the content-sha256. One core feature of this commit is to provide a mechanism to communicate a computed ETag from a low-level `io.Reader` to a high-level `io.Reader`. This problem occurs when an S3 server receives a request and has to compute the ETag of the content. However, the server may also wrap the initial body with several other `io.Reader`, e.g. when encrypting or compressing the content: ``` reader := Encrypt(Compress(ETag(content))) ``` In such a case, the ETag should be accessible by the high-level `io.Reader`. The `etag` package provides a mechanism to wrap `io.Reader` implementations such that the `ETag` can be accessed by a type-check. This technique is applied to the PUT, COPY and Upload handlers.	2021-02-23 12:31:53 -08:00
mailsmail	173284903b	fix incorrect http range in SelectObjectContentHandler (#11585 )	2021-02-19 17:55:28 -08:00
Harshavardhana	7875d472bc	avoid notification for non-existent delete objects (#11514 ) Skip notifications on objects that might have had an error during deletion, this also avoids unnecessary replication attempt on such objects. Refactor some places to make sure that we have notified the client before we - notify - schedule for replication - lifecycle etc.	2021-02-10 22:00:42 -08:00
Poorna Krishnamoorthy	e6b4ea7618	More fixes for delete marker replication (#11504 ) continuation of PR#11491 for multiple server pools and bi-directional replication. Moving proxying for GET/HEAD to handler level rather than server pool layer as this was also causing incorrect proxying of HEAD. Also fixing metadata update on CopyObject - minio-go was not passing source version ID in X-Amz-Copy-Source header	2021-02-10 17:25:04 -08:00
Krishnan Parthasarathi	b87fae0049	Simplify PutObjReader for plain-text reader usage (#11470 ) This change moves away from a unified constructor for plaintext and encrypted usage. NewPutObjReader is simplified for the plain-text reader use. For encrypted reader use, WithEncryption should be called on an initialized PutObjReader. Plaintext: func NewPutObjReader(rawReader hash.Reader) PutObjReader The hash.Reader is used to provide payload size and md5sum to the downstream consumers. This is different from the previous version in that there is no need to pass nil values for unused parameters. Encrypted: func WithEncryption(encReader hash.Reader, key crypto.ObjectKey) (*PutObjReader, error) This method sets up encrypted reader along with the key to seal the md5sum produced by the plain-text reader (already setup when NewPutObjReader was called). Usage: ``` pReader := NewPutObjReader(rawReader) // ... other object handler code goes here // Prepare the encrypted hashed reader pReader, err = pReader.WithEncryption(encReader, objEncKey) ```	2021-02-10 08:52:50 -08:00
Ritesh H Shukla	3d74efa6b1	fix: copy object for encrypted objects (#11490 )	2021-02-08 19:58:17 -08:00
Anis Elleuch	275f7a63e8	lc: Apply DeleteAction correctly to objects (#11471 ) When lifecycle decides to Delete an object and not a version in a versioned bucket, the code should create a delete marker and not remove the scanned version. This commit fixes the issue.	2021-02-06 16:10:33 -08:00
Harshavardhana	f108873c48	fix: replication metadata comparison and other fixes (#11410 ) - using miniogo.ObjectInfo.UserMetadata is not correct - using UserTags from Map->String() can change order - ContentType comparison needs to be removed. - Compare both lowercase and uppercase key names. - do not silently error out constructing PutObjectOptions if tag parsing fails - avoid notification for empty object info, failed operations should rely on valid objInfo for notification in all situations - optimize copyObject implementation, also introduce a new replication event - clone ObjectInfo() before scheduling for replication - add additional headers for comparison - remove strings.EqualFold comparison to avoid unexpected bugs - fix pool based proxying with multiple pools - compare only specific metadata Co-authored-by: Poorna Krishnamoorthy <poornas@users.noreply.github.com>	2021-02-03 20:41:33 -08:00
Andreas Auernhammer	871b450dbd	crypto: add support for decrypting SSE-KMS metadata (#11415 ) This commit refactors the SSE implementation and adds S3-compatible SSE-KMS context handling. SSE-KMS differs from SSE-S3 in two main aspects: 1. The client can request a particular key and specify a KMS context as part of the request. 2. The ETag of an SSE-KMS encrypted object is not the MD5 sum of the object content. This commit only focuses on the 1st aspect. A client can send an optional SSE context when using SSE-KMS. This context is remembered by the S3 server such that the client does not have to specify the context again (during multipart PUT / GET / HEAD ...). The crypto context also includes the bucket/object name to prevent renaming objects at the backend. Now, AWS S3 behaves as follows: - If the user does not provide a SSE-KMS context it does not store one - resp. does not include the SSE-KMS context header in the response (e.g. HEAD). - If the user specifies a SSE-KMS context without the bucket/object name then AWS stores the exact context the client provided but adds the bucket/object name internally. The response contains the KMS context without the bucket/object name. - If the user specifies a SSE-KMS context with the bucket/object name then AWS again stores the exact context provided by the client. The response contains the KMS context with the bucket/object name. This commit implements this behavior w.r.t. SSE-KMS. However, as of now, no such object can be created since the server rejects SSE-KMS encryption requests. This commit is one stepping stone for SSE-KMS support. Co-authored-by: Harshavardhana <harsha@minio.io>	2021-02-03 15:19:08 -08:00
Anis Elleuch	e96fdcd5ec	tagging: Add event notif for PUT object tagging (#11366 ) An optimization to avoid double calls during PutObject tagging	2021-02-01 13:52:51 -08:00
Anis Elleuch	65aa2bc614	ilm: Remove object in HEAD/GET if having an applicable ILM rule (#11296 ) Remove an object on the fly if there is a lifecycle rule with delete expiry action for the corresponding object.	2021-02-01 09:52:11 -08:00
Anis Elleuch	00cff1aac5	audit: per object send pool number, set number and servers per operation (#11233 )	2021-01-26 13:21:51 -08:00
Harshavardhana	a6c146bd00	validate storage class across pools when setting config (#11320 ) ``` mc admin config set alias/ storage_class standard=EC:3 ``` should only succeed if parity ratio is valid for all server pools, if not we should fail proactively. This PR also needs to bring other changes now that we need to cater for variadic drive counts per pool. Bonus: also fixes various bugs reproduced with - GetObjectWithPartNumber() - CopyObjectPartWithOffsets() - CopyObjectWithMetadata() - PutObjectPart,PutObject with truncated streams	2021-01-22 12:09:24 -08:00
Klaus Post	19fb1086b2	select: Fix leak on compressed files (#11302 ) Properly close gzip reader when done reading fixes #11300	2021-01-19 17:51:46 -08:00
Poorna Krishnamoorthy	7090bcc8e0	fix: doc links and delete replication permissions enforcement (#11285 )	2021-01-15 15:22:55 -08:00
Poorna Krishnamoorthy	7824e19d20	Allow synchronous replication if enabled. (#11165 ) Synchronous replication can be enabled by setting the --sync flag while adding a remote replication target. This PR also adds proxying on GET/HEAD to another node in an active-active replication setup in the event of a 404 on the current node.	2021-01-11 22:36:51 -08:00
Klaus Post	eb9172eecb	Allow Compression + encryption (#11103 )	2021-01-05 20:08:35 -08:00
Harshavardhana	c4b1d394d6	erasure: avoid io.Copy in hotpaths to reduce allocation (#11213 )	2021-01-03 16:27:34 -08:00
Andreas Auernhammer	8cdf2106b0	refactor cmd/crypto code for SSE handling and parsing (#11045 ) This commit refactors the code in `cmd/crypto` and separates SSE-S3, SSE-C and SSE-KMS. This commit should not cause any behavior change except for: - `IsRequested(http.Header)` which now returns the requested type {SSE-C, SSE-S3, SSE-KMS} and does not consider SSE-C copy headers. However, SSE-C copy headers alone are anyway not valid.	2020-12-22 09:19:32 -08:00
Harshavardhana	5c451d1690	update x/net/http2 to address few bugs (#11144 ) additionally also configure http2 healthcheck values to quickly detect unstable connections and let them timeout. also use single transport for proxying requests	2020-12-21 21:42:38 -08:00
Harshavardhana	bdd094bc39	fix: avoid sending errors on missing objects on locked buckets (#10994 ) make sure multi-object delete returned errors that are AWS S3 compatible	2020-11-28 21:15:45 -08:00
Poorna Krishnamoorthy	2ff655a745	Refactor replication, ILM handling in DELETE API (#10945 )	2020-11-25 11:24:50 -08:00
Poorna Krishnamoorthy	39f3d5493b	Show Delete replication status header (#10946 ) X-Minio-Replication-Delete-Status header shows the status of the replication of a permanent delete of a version. All GETs are disallowed and return 405 on this object version. In the case of replicating delete markers, X-Minio-Replication-DeleteMarker-Status shows the status of replication, and would similarly return 405. Additionally, this PR adds reporting of delete marker event completion and updates documentation	2020-11-21 23:48:50 -08:00
Poorna Krishnamoorthy	251c1ef6da	Add support for replication of object tags, retention metadata (#10880 )	2020-11-19 18:56:09 -08:00
Poorna Krishnamoorthy	f60b6eb82e	fix validation for deletemarker replication on object locked bucket (#10892 )	2020-11-19 18:47:19 -08:00
Poorna Krishnamoorthy	1ebf6f146a	Add support for ILM transition (#10565 ) This PR adds transition support for ILM to transition data to another MinIO target represented by a storage class ARN. Subsequent GET or HEAD for that object will be streamed from the transition tier. If PostRestoreObject API is invoked, the transitioned object can be restored for duration specified to the source cluster.	2020-11-19 18:47:17 -08:00
Harshavardhana	9a34fd5c4a	Revert "Revert "Add delete marker replication support (#10396 )"" This reverts commit `267d7bf0a9`.	2020-11-19 18:43:58 -08:00
Harshavardhana	267d7bf0a9	Revert "Add delete marker replication support (#10396 )" This reverts commit `50c10a5087`. PR is moved to origin/dev branch	2020-11-12 11:43:14 -08:00
Poorna Krishnamoorthy	50c10a5087	Add delete marker replication support (#10396 ) Delete marker replication is implemented for V2 configuration specified in AWS spec (though AWS allows it only in the V1 configuration). This PR also brings in a MinIO only extension of replicating permanent deletes, i.e. deletes specifying version id are replicated to target cluster.	2020-11-10 15:24:14 -08:00
Bill Thorp	4a1efabda4	Context based AccessKey passing (#10615 ) A new field called AccessKey is added to the ReqInfo struct and populated. Because ReqInfo is added to the context, this allows the AccessKey to be accessed from 3rd-party code, such as a custom ObjectLayer. Co-authored-by: Harshavardhana <harsha@minio.io> Co-authored-by: Kaloyan Raev <kaloyan@storj.io>	2020-11-04 09:13:34 -08:00
Harshavardhana	8e7c00f3d4	add missing request-id from DeleteObject events (#10623 ) fixes #10621	2020-10-02 13:36:13 -07:00
Anis Elleuch	71403be912	fix: consider partNumber in GET/HEAD requests (#10618 )	2020-10-01 15:41:12 -07:00
Anis Elleuch	9603489dd3	federation: Honor range with UploadObjectPart to a different cluster (#10570 ) Use gr & length instead of srcInfo.Reader & srcInfo.Size because they don't honor range header	2020-09-25 12:06:42 -07:00
Harshavardhana	d616d8a857	serialize replication and feed it through task model (#10500 ) this allows for eventually controlling the concurrency of replication and overall control of throughput	2020-09-16 16:04:55 -07:00
Ritesh H Shukla	5c47ce456e	Run replication in the background (#10491 )	2020-09-15 18:44:58 -07:00
Harshavardhana	80fab03b63	fix: S3 gateway doesn't support full passthrough for encryption (#10484 ) The entire encryption layer is dependent on the fact that KMS should be configured for S3 encryption to work properly and we only support passing the headers as is to the backend for encryption only if KMS is configured. Make sure that this predictability is maintained, currently the code was allowing encryption to go through and fail later to indicate that KMS was not configured. We should simply reply "NotImplemented" if KMS is not configured, this allows clients to simply proceed with their tests.	2020-09-15 13:57:15 -07:00
Klaus Post	34859c6d4b	Preallocate (safe) slices when we know the size (#10459 )	2020-09-14 20:44:18 -07:00
Harshavardhana	0104af6bcc	delayed locks until we have started reading the body (#10474 ) This is to ensure that Go contexts work properly. After some interesting experiments I found that Go net/http doesn't cancel the context when Body is non-zero and hasn't been read till EOF. The following gist explains this; it can lead to a pile-up of goroutines on the server which will never be canceled and will die only at a much later point in time, which can simply overwhelm the server. https://gist.github.com/harshavardhana/c51dcfd055780eaeb71db54f9c589150 To avoid this, refactor the locking such that we take locks after we have started reading from the body and only take locks when needed. Also, remove contextReader as it's not useful: it doesn't work as expected because the context is not canceled until the body reaches EOF, so there is no point in wrapping it with context and putting a `select {` on it, which can unnecessarily increase the CPU overhead. We will still use the context to cancel the lockers etc. Additional simplification in the locker code avoids timers, as re-using them is a complicated ordeal; avoid them in the hot path, since locking is very common and this may avoid lots of allocations.	2020-09-14 15:57:13 -07:00
Harshavardhana	f355374962	add support for configurable remote transport deadline (#10447 ) configurable remote transport timeouts for some special cases where this value needs to be bumped to a higher value when transferring large data between federated instances.	2020-09-11 23:03:08 -07:00
Harshavardhana	eb2934f0c1	simplify webhook DNS further generalize for gateway (#10448 ) continuation of the changes from `eaaf05a7cc` this further simplifies, enables this for gateway deployments as well	2020-09-10 14:19:32 -07:00
Harshavardhana	4a2928eb49	generate missing object delete bucket notifications (#10449 ) fixes #10381	2020-09-09 18:23:08 -07:00
Nitish Tiwari	eaaf05a7cc	Add Kubernetes operator webhook server as DNS target (#10404 ) This PR adds a DNS target that updates an entry in the Kubernetes operator when a bucket is created or deleted. See minio/operator#264 for details. Co-authored-by: Harshavardhana <harsha@minio.io>	2020-09-09 12:20:49 -07:00
Harshavardhana	2acb530ccd	update rulesguard with new rules (#10392 ) Co-authored-by: Nitish Tiwari <nitish@minio.io> Co-authored-by: Praveen raj Mani <praveen@minio.io>	2020-09-01 16:58:13 -07:00
kannappanr	d15a5ad4cc	S3 Gateway: Check for encryption headers properly (#10309 )	2020-08-22 11:41:49 -07:00
Harshavardhana	e7ba78beee	use GlobalContext instead of context.Background when possible (#10254 )	2020-08-13 09:16:01 -07:00
poornas	79e21601b0	fix: web handlers to enforce replication (#10249 ) This PR also preserves source ETag for replication	2020-08-12 17:32:24 -07:00
poornas	a8dd7b3eda	Refactor replication target management. (#10154 ) Generalize replication target management so that remote targets for a bucket can be managed with ARNs. `mc admin bucket remote` command will be used to manage targets.	2020-07-30 19:55:22 -07:00
Harshavardhana	25a55bae6f	fix: avoid buffering of server sent events by proxies (#10164 )	2020-07-30 19:45:12 -07:00
Harshavardhana	57ff9abca2	Apply quota usage cache invalidation per second (#10127 ) Allow faster lookups for quota check enforcement	2020-07-24 12:24:21 -07:00
poornas	c43da3005a	Add support for server side bucket replication (#9882 )	2020-07-21 17:49:56 -07:00
Harshavardhana	d53e560ce0	fix: copyObject key rotation issue (#10085 ) - copyObject in-place decryption failed due to incorrect verification of headers - do not decode ETag when object is encrypted with SSE-C, so that pre-conditions don't fail prematurely.	2020-07-18 17:36:32 -07:00
Harshavardhana	3fe27c8411	fix: In federated setup dial all hosts to figure out online host (#10074 ) In federated NAS gateway setups, one of multiple hosts in srvRecords was picked at random, which could mean that if one of the hosts was down the request could fail, and would succeed only if the client retried. Instead, allow the server to figure out the current online host quickly such that we can exclude the host which is down. At most, the attempt to look for a downed node takes 300 milliseconds; if the node is taking longer to respond than this value we simply ignore it and move to the next node. Total attempts are equal to the number of srvRecords; if no server is online we simply fall back to the last dialed host.	2020-07-17 14:25:47 -07:00
Harshavardhana	14b1c9f8e4	fix: return Range errors after If-Matches (#10045 ) closes #7292	2020-07-17 13:01:22 -07:00
Harshavardhana	4bfc50411c	fix: return versionId in tagging APIs (#10068 )	2020-07-16 22:38:58 -07:00
Anis Elleuch	778e9c864f	Move dependency from minio-go v6 to v7 (#10042 )	2020-07-14 09:38:05 -07:00
Harshavardhana	2743d4ca87	fix: Add support for preserving mtime for replication (#9995 ) This PR is needed for bucket replication support	2020-07-08 17:36:56 -07:00
Harshavardhana	6136a963c8	fix: bump the response header timeout for forwarder as well (#9994 ) continuation of #9986, add more place where the lower timeout comes into effect.	2020-07-08 10:55:24 -07:00
Klaus Post	aa4d1021eb	Remove timeout from putobject and listobjects (#9986 ) Use a separate client for these calls that can take a long time. Add request context to these so they are canceled when the client disconnects, except for ListObject, which doesn't have any equivalent.	2020-07-07 12:19:57 -07:00
Harshavardhana	cdb0e6ffed	support proper values for listMultipartUploads/listParts (#9970 ) When object KMS is configured with auto-encryption, there were issues when using docker registry - this had gone unnoticed for a while. This PR fixes an issue with compatibility. Additionally also fix the continuation-token implementation infinite loop issue which was missed as part of #9939 Also fix the heal token to be generated as a client facing value instead of what is remembered by the server, this allows for the server to be stateless regarding the token's behavior.	2020-07-03 19:27:13 -07:00
Harshavardhana	810a4f0723	fix: return proper errors Get/HeadObject for deleteMarkers (#9957 )	2020-07-02 16:17:27 -07:00
kannappanr	5089a7167d	Handle empty retention in get/put object retention (#9948 ) Fixes #9943	2020-06-30 16:44:24 -07:00
Harshavardhana	67062840c1	fix: perform CopyObject under more conditions (#9879 ) - x-amz-storage-class specified CopyObject should proceed regardless, it's not a precondition - sourceVersionID is specified CopyObject should proceed regardless, it's not a precondition	2020-06-19 13:53:45 -07:00
Harshavardhana	b912c8f035	fix: generate new version when replacing metadata in CopyObject (#9871 )	2020-06-19 08:44:51 -07:00
Harshavardhana	e79874f58e	[feat] Preserve version supplied by client (#9854 ) Just like GET/DELETE APIs it is possible to preserve client supplied versionId's, of course the versionIds have to be uuid, if an existing versionId is found it is overwritten if no object locking policies are found. - PUT /bucketname/objectname?versionId=<id> - POST /bucketname/objectname?uploads=&versionId=<id> - PUT /bucketname/objectname?versionId=<id> (with x-amz-copy-source)	2020-06-17 11:13:41 -07:00
Harshavardhana	087aaaf894	fix: save deleteMarker properly, precision up to UnixNano() (#9843 )	2020-06-16 07:54:27 -07:00
Harshavardhana	4915433bd2	Support bucket versioning (#9377 ) - Implement a new xl.json 2.0.0 format to support, this moves the entire marshaling logic to POSIX layer, top layer always consumes a common FileInfo construct which simplifies the metadata reads. - Implement list object versions - Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	2020-06-12 20:04:01 -07:00
poornas	d26b24f670	avoid storing X-Amz-Tagging-Directive in metadata (#9800 )	2020-06-10 14:29:24 -07:00
kannappanr	2c372a9894	Send Partscount only when partnumber is specified (#9793 ) Fixes #9789	2020-06-10 09:22:15 -07:00
poornas	3d3b75fb8d	Avoid overwriting object tags when changing lock (#9794 )	2020-06-10 08:16:30 -07:00
Harshavardhana	41688a936b	fix: CopyObject behavior on expanded zones (#9729 ) CopyObject was not correctly figuring out the correct destination object location and would end up creating duplicate objects on two different zones, reproduced by doing encryption based key rotation.	2020-05-28 14:36:38 -07:00
Harshavardhana	b330c2c57e	Introduce simpler GetMultipartInfo call for performance (#9722 ) Advantages: avoids 100's of stats which are needed for each upload operation in FS/NAS gateway mode when uploading a large multipart object, dramatically increases performance for multipart uploads by avoiding recursive calls. For other gateways, this simplifies the approach since azure, gcs, hdfs gateways don't capture any specific metadata during upload which needs handler validation for encryption/compression. Erasure coding was already optimized; this additionally just avoids small allocations of a large data structure. Fixes #7206	2020-05-28 12:36:20 -07:00
P R	9d39fb3604	add copyobject tagging replace directive for gateway (#9711 )	2020-05-26 17:32:53 -07:00
Harshavardhana	7ea026ff1d	fix: reply back user-metadata in lower case form (#9697 ) some clients such as veeam expect the x-amz-meta to be sent in lower cased form, while this does indeed defeat the HTTP protocol contract it is harder to change these applications, while these applications get fixed appropriately in future. x-amz-meta is usually sent in lowercased form by AWS S3 and some applications like veeam incorrectly end up relying on the case sensitivity of the HTTP headers. Bonus fixes - Fix the iso8601 time format to keep it same as AWS S3 response - Increase maxObjectList to 50,000 and use maxDeleteList as 10,000 whenever multi-object deletes are needed.	2020-05-25 16:51:32 -07:00
Harshavardhana	0c71ce3398	fix size accounting for encrypted/compressed objects (#9690 ) size calculation in crawler was using the real size of the object instead of its actual size i.e. either a decrypted or uncompressed size. this is needed to make sure all other accounting such as bucket quota and mcs UI to display the correct values.	2020-05-24 11:19:17 -07:00
P R	3f6d624c7b	add gateway object tagging support (#9124 )	2020-05-23 11:09:35 -07:00
Anis Elleuch	cdf4815a6b	Add x-amz-expiration header in some S3 responses (#9667 ) x-amz-expiration is described in the S3 specification as a header which indicates if the object in question will expire any time in the future.	2020-05-21 14:12:52 -07:00
Harshavardhana	bd032d13ff	migrate all bucket metadata into a single file (#9586 ) this is a major overhaul by migrating off all bucket metadata related configs into a single object '.metadata.bin' this allows us for faster bootups across 1000's of buckets and as well as keeps the code simple enough for future work and additions. Additionally also fixes #9396, #9394	2020-05-19 13:53:54 -07:00
Harshavardhana	d31eaddba3	fix: avoid double body reads in SelectObject call (#9638 ) Bonus fix handle encryption headers in response properly for both notification and response to the client.	2020-05-19 02:01:08 -07:00
Harshavardhana	1bc32215b9	enable full linter across the codebase (#9620 ) enable linter using golangci-lint across codebase to run a bunch of linters together, we shall enable new linters as we fix more things in the codebase. This PR fixes the first stage of this cleanup.	2020-05-18 09:59:45 -07:00
poornas	011a2c0b78	Add docs for bucket quota feature (#9503 ) This PR also adds a check to not enforce bucket quota for server-side metadata copy of an object onto itself.	2020-05-16 19:27:33 -07:00
Harshavardhana	d348ec0f6c	avoid double listObjectParts calls improves performance (#9606 ) this PR is to avoid double calls across multiple calls in APIs - CopyObjectPart - PutObjectPart	2020-05-15 08:06:45 -07:00
Harshavardhana	a1de9cec58	cleanup object-lock/bucket tagging for gateways (#9548 ) This PR is to ensure that we call the relevant object layer APIs for necessary S3 API level functionalities allowing gateway implementations to return proper errors as NotImplemented{} This allows for all our tests in mint to behave appropriately and can be handled appropriately as well.	2020-05-08 13:44:44 -07:00
Bala FA	3773874cd3	add bucket tagging support (#9389 ) This patch also simplifies object tagging support	2020-05-05 14:18:13 -07:00
Harshavardhana	7b58dcb28c	fix: return context error from context reader (#9507 )	2020-05-04 14:33:49 -07:00
poornas	9a547dcbfb	Add API's for managing bucket quota (#9379 ) This PR allows setting a "hard" or "fifo" quota restriction at the bucket level. Buckets that have reached the FIFO quota configured, will automatically be cleaned up in FIFO manner until bucket usage drops to configured quota. If a bucket is configured with a "hard" quota ceiling, all further writes are disallowed.	2020-04-30 15:55:54 -07:00
P R	5dd9cf4398	fix: CopyObject with REPLACE directive deletes existing tags (#9478 ) Fixes #9477	2020-04-29 10:26:37 +05:30
Harshavardhana	60d415bb8a	deprecate/remove global WORM mode (#9436 ) global WORM mode is a complex piece for which the time has passed, with the advent of S3 compatible object locking and retention implementation global WORM is sort of deprecated, this has been mentioned in our documentation for some time, now the time has come for this to go.	2020-04-24 16:37:05 -07:00
BigUstad	45e22cf8aa	fix: selectObject to return error when object does not exist (#9423 )	2020-04-24 13:51:48 -07:00
Harshavardhana	282c9f790a	fix: validate partNumber in queryParam as part of preConditions (#9386 )	2020-04-20 22:01:59 -07:00
Klaus Post	c4464e36c8	fix: limit HTTP transport tunables to affordable values (#9383 ) Close connections pro-actively in transient calls	2020-04-17 11:20:56 -07:00
Harshavardhana	8bae956df6	allow copyObject to rotate storageClass of objects (#9362 ) Added additional mint tests as well to verify, this functionality. Fixes #9357	2020-04-16 17:42:44 -07:00
kannappanr	1fa65c7f2f	fix: object lock behavior when default lock config is enabled (#9305 )	2020-04-13 14:03:23 -07:00
Harshavardhana	29e0727b58	fix: regression in CopyObject not preserving ETag in --compat (#9322 ) issue found after `git bisect` to commit `db41953618`	2020-04-11 20:20:30 -07:00
Andreas Auernhammer	db41953618	avoid unnecessary KMS requests during single-part PUT (#9220 ) This commit fixes a performance issue caused by too many calls to the external KMS - i.e. for single-part PUT requests. In general, the issue is caused by a sub-optimal code structure. In particular, when the server encrypts an object it requests a new data encryption key from the KMS. With this key it does some key derivation and encrypts the object content and ETag. However, to behave S3-compatible the MinIO server has to return the plaintext ETag to the client in case of SSE-S3. Therefore, the server code used to decrypt the (previously encrypted) ETag again by requesting the data encryption key (KMS decrypt API) from the KMS. This leads to 2 KMS API calls (1 generate key and 1 decrypt key) per PUT operation - while only one KMS call is necessary. This commit fixes this by fetching a data key only once from the KMS and keeping the derived object encryption key around (for the lifetime of the request). This leads to a significant performance improvement w.r.t. PUT workloads: ``` Operation: PUT Operations: 161 -> 239 Duration: 28s -> 29s * Average: +47.56% (+25.8 MiB/s) throughput, +47.56% (+2.6) obj/s * Fastest: +55.49% (+34.5 MiB/s) throughput, +55.49% (+3.5) obj/s * 50% Median: +58.24% (+32.8 MiB/s) throughput, +58.24% (+3.3) obj/s * Slowest: +1.83% (+0.6 MiB/s) throughput, +1.83% (+0.1) obj/s ```	2020-04-09 17:01:45 -07:00
Harshavardhana	43a3778b45	fix: support object-remaining-retention-days policy condition (#9259 ) This PR also tries to simplify the approach taken in object-locking implementation by preferential treatment given towards full validation. This in-turn has fixed a couple of bugs related to how policy should have been honored when ByPassGovernance is provided. Simplifies code a bit, but also duplicates code intentionally for clarity due to complex nature of object locking implementation.	2020-04-06 13:44:16 -07:00
Harshavardhana	3d3beb6a9d	Add response header timeouts (#9170 ) - Add conservative timeouts up to 3 minutes for internode communication - Add aggressive timeouts of 30 seconds for gateway communication Fixes #9105 Fixes #8732 Fixes #8881 Fixes #8376 Fixes #9028	2020-03-21 22:10:13 -07:00
Klaus Post	8d98662633	re-implement data usage crawler to be more efficient (#9075 ) Implementation overview: https://gist.github.com/klauspost/1801c858d5e0df391114436fdad6987b	2020-03-18 16:19:29 -07:00
kannappanr	8b880a246a	fix: deleteObjectTagging should 204 on success (#9150 )	2020-03-16 23:21:24 -07:00
poornas	9fc7537f2a	Enforce md5sum checks for object retention APIs (#9030 ) this PR enforces md5sum verification for following API's to be compatible with AWS S3 spec - PutObjectRetention - PutObjectLegalHold Co-authored-by: Harshavardhana <harsha@minio.io>	2020-03-04 07:04:12 -08:00
Harshavardhana	23a8411732	Add a generic Walk()'er to list a bucket, optionally prefix (#9026 ) This generic Walk() is used by likes of Lifecycle, or KMS to rotate keys or any other functionality which relies on this functionality.	2020-02-25 21:22:28 +05:30
Harshavardhana	51a9d1bdb7	Avoid unnecessary allocations for XML parsing (#9017 )	2020-02-23 09:06:46 +05:30
poornas	02a59a04d1	Fix error messages returned by (Put)GetObjectLegalHold (#9013 ) fiixing some minor discrepancies between aws s3 responses vs minio server	2020-02-19 08:15:48 +05:30
Harshavardhana	712e82344c	acl: Support PUT calls with success for 'private' ACL's (#9000 ) Add dummy calls which respond with success when ACLs are set to be private and fail if the user tries to change them from their default 'private'. Some applications such as nuxeo may have an unnecessary requirement for this operation; we support this anyway so that we don't have to fully implement the functionality, just respond with success for default ACLs	2020-02-16 11:37:52 +05:30
poornas	716a52f261	Fix hang in cache copyobject call (#8993 ) Avoid GetObjectNInfo call from cache in CopyObjectHandler - in the case of server side copy with metadata replacement, the reader returned from cache is never consumed, but the net effect of GetObjectNInfo from cache layer, is cache holding a write lock to fill the cache. Subsequent stat operation on cache in CopyObject is not able to acquire a read lock, thus causing the hang. Fixes #8991	2020-02-13 15:32:26 -08:00
Harshavardhana	c56c2f5fd3	fix routing issue for esoteric characters in gorilla/mux (#8967 ) First step is to ensure that Path component is not decoded by gorilla/mux to avoid routing issues while handling certain characters while uploading through PutObject() Delay the decoding and use PathUnescape() to escape the `object` path component. Thanks to @buengese and @ncw for neat test cases for us to test with. Fixes #8950 Fixes #8647	2020-02-12 09:08:02 +05:30
poornas	9b4d46a6ed	evict cached entry for server side copy (#8947 ) Fixes #8942	2020-02-07 14:36:46 -08:00
Nitish Tiwari	e5951e30d0	Add support for Object Tagging in LifeCycle configuration (#8880 ) Fixes #8870 Co-Authored-By: Krishnan Parthasarathi <krisis@users.noreply.github.com>	2020-02-06 13:20:10 +05:30
Krishnan Parthasarathi	026265f8f7	Add support for bucket encryption feature (#8890 ) - pkg/bucket/encryption provides support for handling bucket encryption configuration - changes under cmd/ provide support for AES256 algorithm only Co-Authored-By: Poorna <poornas@users.noreply.github.com> Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-05 15:12:34 +05:30
Harshavardhana	0cbebf0f57	Rename pkg/{tagging,lifecycle} to pkg/bucket sub-directory (#8892 ) Rename to allow for more such features to come in a more proper hierarchical manner.	2020-01-27 14:12:34 -08:00
Harshavardhana	f14f60a487	fix: Avoid double usage calculation on every restart (#8856 ) On every restart of the server, usage was being calculated which is not useful instead wait for sufficient time to start the crawling routine. This PR also avoids lots of double allocations through strings, optimizes usage of string builders and also avoids crawling through symbolic links. Fixes #8844	2020-01-21 14:07:49 -08:00
Nitish Tiwari	61c17c8933	Add ObjectTagging Support (#8754 ) This PR adds support for AWS S3 ObjectTagging API as explained here https://docs.aws.amazon.com/AmazonS3/latest/dev/object-tagging.html	2020-01-20 08:45:59 -08:00
poornas	60e60f68dd	Add support for object locking with legal hold. (#8634 )	2020-01-16 15:41:56 -08:00
poornas	30922148fb	Fix bug preventing overwrite of object if (#8796 ) object lock config is enabled for a bucket. Creating a bucket with object lock configuration enabled does not automatically cause WORM protection to be applied. PUT operation needs to specifically request object locking or bucket has to have default retention settings configured. Fixes regression introduced in #8657	2020-01-13 17:29:31 -08:00
Harshavardhana	669c9da85d	Disable federated buckets when etcd is namespaced (#8709 ) This is to ensure that when we have multiple tenants deployed, all sharing the same etcd for global bucket, they avoid listing each other's buckets. This leads to an information leak which should be avoided unless etcd is not namespaced for IAM assets, in which case it can be assumed that it's a federated setup. Federated setup and namespaced IAM assets on etcd is not supported since namespacing is only useful when you wish to separate the tenants as isolated instances of MinIO. This PR allows a new type of behavior, primarily driven by the use case of m3(mkube) multi-tenant deployments with global bucket support.	2019-12-29 08:56:45 -08:00
Klaus Post	3211cb5df6	Add encryption buffer (#8626 ) Quite hard to measure difference: ``` λ warp cmp put-before.csv.zst put-after2.csv.zst Operation: PUT Operations: 340 -> 353 * Average: +4.11% (+22.7 MB/s) throughput, +4.11% (+0.2) obj/s * 50% Median: +1.58% (+7.3 MB/s) throughput, +1.58% (+0.1) obj/s ``` Difference is likely bigger on Intel platforms due to higher syscall costs.	2019-12-12 10:01:15 -08:00
Nitish Tiwari	3df7285c3c	Add Support for Cache and S3 related metrics in Prometheus endpoint (#8591 ) This PR adds support below metrics - Cache Hit Count - Cache Miss Count - Data served from Cache (in Bytes) - Bytes received from AWS S3 - Bytes sent to AWS S3 - Number of requests sent to AWS S3 Fixes #8549	2019-12-05 23:16:06 -08:00
Harshavardhana	e542084c37	Add etcd path prefix for all IAM assets (#8569 ) Currently, we use the top-level prefix "config/" for all our IAM assets; instead, to provide tenant-level separation, bring 'path_prefix' to namespace the access properly. Fixes #8567	2019-11-25 16:33:34 -08:00
poornas	f931fc7bfb	Fix retention enforcement in Compliance mode (#8556 ) In compliance mode, the retention date can be extended with governance bypass permissions	2019-11-25 10:58:39 -08:00
poornas	ca96560d56	Add object retention at the per object (#8528 ) level - this PR builds on #8120 which added PutBucketObjectLockConfiguration and GetBucketObjectLockConfiguration APIS This PR implements PutObjectRetention, GetObjectRetention API and enhances PUT and GET API operations to display governance metadata if permissions allow.	2019-11-20 13:18:09 -08:00
poornas	13e2b97ad9	Fix regression in caching on single PUT (#8526 ) Regression caused by #8120	2019-11-15 15:46:27 +05:30
Bala FA	fb48ca5020	Add Get/Put Bucket Lock Configuration API support (#8120 ) This feature implements [PUT Bucket object lock configuration][1] and [GET Bucket object lock configuration][2]. After object lock configuration is set, existing and new objects are set to WORM for specified duration. Currently Governance mode works exactly like Compliance mode. Fixes #8101 [1] https://docs.aws.amazon.com/AmazonS3/latest/API/RESTBucketPUTObjectLockConfiguration.html [2] https://docs.aws.amazon.com/AmazonS3/latest/API/RESTBucketGETObjectLockConfiguration.html	2019-11-12 14:50:18 -08:00
Harshavardhana	ee4a6a823d	Migrate config to KV data format (#8392 ) - adding oauth support to MinIO browser (#8400) by @kanagaraj - supports multi-line get/set/del for all config fields - add support for comments, allow toggle - add extensive validation of config before saving - support MinIO browser to support proper claims, using STS tokens - env support for all config parameters, legacy envs are also supported with all documentation now pointing to latest ENVs - preserve accessKey/secretKey from FS mode setups - add history support implements three APIs - ClearHistory - RestoreHistory - ListHistory - add help command support for each config parameters - all the bug fixes after migration to KV, and other bug fixes encountered during testing.	2019-10-22 22:59:13 -07:00
poornas	1b74ce3924	Ensure actual object size is sent in notification (#8418 ) Fixes: #8407	2019-10-20 23:48:19 -07:00
Harshavardhana	5afb1b6747	Add support for {jwt:sub} substitutions for policies (#8393 ) Fixes #8345	2019-10-16 08:59:59 -07:00
Harshavardhana	d48fd6fde9	Remove unused params and functions (#8399 )	2019-10-15 18:35:41 -07:00
poornas	d7060c4c32	Allow logging targets to be configured to receive `minio` (#8347 ) specific errors, `application` errors or `all` by default. console logging on server by default lists all logs - enhance admin console API to accept `type` as query parameter to subscribe to application/minio logs.	2019-10-11 18:50:54 -07:00
Harshavardhana	3b8adf7528	Move storageclass config handling into cmd/config/storageclass (#8360 ) Continuation of the changes done in PR #8351 to refactor, add tests and move global handling into a more idiomatic style for Go as packages.	2019-10-07 11:20:24 +05:30
Klaus Post	ff726969aa	Switch to Snappy -> S2 compression (#8189 )	2019-09-25 23:08:24 -07:00
Andreas Auernhammer	cb7d23cb17	remove SSE-S3 key rotation in CopyObject (#8278 ) This commit removes the SSE-S3 key rotation functionality from CopyObject since there will be a dedicated Admin-API for this purpose. Also update the security documentation to link to mc and the admin documentation.	2019-09-24 02:05:04 +05:30
Andreas Auernhammer	2b51fe9f26	make SSE request header check comprehensive (#8276 ) This commit refactors the SSE header check by moving it into the `crypto` package, adds a unit test for it and makes the check comprehensive.	2019-09-21 03:26:12 +05:30
poornas	29f64355ce	Allow caching on single PutObject (#8100 )	2019-09-05 19:50:16 +05:30
Harshavardhana	f13f421e84	Allow CopyObject in pathStyle across federated instances (#8064 ) Fixes #7976	2019-08-21 22:02:39 -10:00
Harshavardhana	069badc7e9	Allow CopyObjectPart to work in federated setups (#8066 ) Fixes #8065	2019-08-20 07:19:22 -10:00
poornas	3385bf3da8	Rewrite cache implementation to cache only on GET (#7694 ) Fixes #7458 Fixes #7573 Fixes #7938 Fixes #6934 Fixes #6265 Fixes #6630 This will allow the cache to consistently work for server and gateways. Range GET requests will be cached in the background after the request is served from the backend. - All cached content is automatically bitrot protected. - Avoid ETag verification if a cache-control header is set and the cached content is still valid. - This PR changes the cache backend format, and all existing content will be migrated to the new format. Until the data is migrated completely, all content will be served from the backend.	2019-08-09 17:09:08 -07:00
Harshavardhana	e6d8e272ce	Use const slashSeparator instead of "/" everywhere (#8028 )	2019-08-06 12:08:58 -07:00
Harshavardhana	5c0acbc6fc	Add text/event-stream for long running http connections (#7909 ) When MinIO is behind a proxy, proxies end up killing clients when no data is seen on the connection, adding the right content-type ensures that proxies do not come in the way.	2019-07-11 13:19:25 -07:00
Harshavardhana	c43f745449	Ensure that we use constants everywhere (#7845 ) This allows for canonicalization of the strings throughout our code and provides a common space for all these constants to reside. This list is rather non-exhaustive but captures all the headers used in AWS S3 API operations	2019-07-02 22:34:32 -07:00
kannappanr	70b350c383	Remove DeploymentID from response headers (#7815 ) Response headers need not contain deployment ID.	2019-07-01 12:22:01 -07:00
Krishna Srinivas	338e9a9be9	Put object client disconnect (#7824 ) Fail putObject and postpolicy in case client prematurely disconnects Use request's context to cancel lock requests on client disconnects	2019-06-28 22:09:17 -07:00
Andreas Auernhammer	98d3913a1e	enable SSE-KMS pass-through on S3 gateway (#7788 ) This commit relaxes the restriction that the MinIO gateway does not accept SSE-KMS headers. Now, the S3 gateway allows SSE-KMS headers for PUT and MULTIPART PUT requests and forwards them to the S3 gateway backend (AWS). This is considered SSE pass-through mode. Fixes #7753	2019-06-19 17:37:08 -07:00
Harshavardhana	2c0b3cadfc	Update go mod with sem versions of our libraries (#7687 )	2019-05-29 16:35:12 -07:00
Dee Koder	e252114f06	Revert "cache: Rewrite to cache only on download (#7575 )" (#7684 ) This reverts commit `a13b58f630`.	2019-05-22 14:54:15 -07:00
poornas	a13b58f630	cache: Rewrite to cache only on download (#7575 ) This will allow cache to consistently work for server and gateways. Range GET requests will be cached in the background after the request is served from the backend. Fixes: #7458, #7573, #6265, #6630	2019-05-22 08:30:27 +05:30
Harshavardhana	72929ec05b	Turn off md5sum optionally if content-md5 is not set (#7609 ) This PR also brings --compat option to run MinIO in strict S3 compatibility mode, MinIO by default will now try to run high performance mode.	2019-05-08 18:35:40 -07:00
kannappanr	4b858b562a	Compression: Handle auto encryption when size is unknown (#7600 ) When size is unknown and auto encryption is enabled, and compression is set to true, putobject API is failing. Move adding the SSE-S3 header to the request to before the check for whether compression can be done; otherwise the size is set to -1 and that seems to cause problems.	2019-05-02 08:28:18 -07:00
poornas	a74cb93666	Worm: Permit key-rotation of S3 encrypted objects (#7429 ) Fixes : #7399	2019-04-10 11:31:50 -07:00
kannappanr	5ecac91a55	Replace Minio refs in docs with MinIO and links (#7494 )	2019-04-09 11:39:42 -07:00
Harshavardhana	a2e344bf30	Preserve ETag case for S3 compatibility (#7498 ) Most hadoop distributions (hortonworks, cloudera) depend on aws-sdk-java 1.7.x to 1.10.x - the releases which have bugs related to the case-sensitive check for the ETag header. Go changes the case of the headers set to be canonical but only preserves them when set through a direct map. This fixes most compatibility issues we have had in the past supporting older hadoop distributions.	2019-04-08 16:54:46 -07:00
Harshavardhana	e0a87e96de	Populate host value from GetSourceIP directly (#7417 )	2019-03-25 11:45:42 -07:00
Anis Elleuch	b05825ffe8	s3: Fix precondition failed in CopyObjectPart when src is encrypted (#7276 ) CopyObject precondition checks into GetObjectReader in order to perform SSE-C pre-condition checks using the last 32 bytes of encrypted ETag rather than the decrypted ETag This also necessitates moving precondition checks for gateways to gateway layer rather than object handler check	2019-03-06 12:38:41 -08:00
Kale Blankenship	ef132c5714	Replace snappy.Writer/io.Pipe with snappyCompressReader. (#7316 ) Prevents deferred close functions from being called while still attempting to copy reader to snappyWriter. Reduces code duplication when compressing objects.	2019-03-05 08:35:37 -08:00
Harshavardhana	c3ca954684	Implement AssumeRole API for Minio users (#7267 ) For actual API reference read here https://docs.aws.amazon.com/STS/latest/APIReference/API_AssumeRole.html Documentation is added and updated as well at docs/sts/assume-role.md Fixes #6381	2019-02-27 17:46:55 -08:00
Anis Elleuch	5efbe8a1b3	s3: Add support of encodingType parameter (#7265 ) This commit honors encoding-type parameter in object listing, parts listing and multipart uploads listing.	2019-02-24 11:44:24 +05:30
Harshavardhana	7923b83953	Support multiple-domains in MINIO_DOMAIN (#7274 ) Fixes #7173	2019-02-23 08:48:01 +05:30
Harshavardhana	bedcb7442a	Write xml.Header first instead of spaces to handle XML parsers (#7253 ) XML parsers in clients like AWS SDK Java and AWS cli are unable to handle leading `\r\n` characters. To avoid these errors, send the XML header first and only then write white space characters. Also handle cases to avoid double WriteHeader calls	2019-02-21 11:50:15 +05:30
poornas	755e675d5c	Fix: send decrypted size to notification event (#7248 )	2019-02-19 14:14:26 +05:30
Harshavardhana	a51781e5cf	Use context to fill in more details about error XML (#7232 )	2019-02-13 16:07:21 -08:00
Harshavardhana	df35d7db9d	Introduce staticcheck for stricter builds (#7035 )	2019-02-13 18:29:36 +05:30
Harshavardhana	4ba77a916d	Select should return early errors as XML (#7230 ) Currently, we were sending errors in Select binary format, which is incompatible with AWS S3 behavior, errors in binary are sent after HTTP status code is already 200 OK - i.e it happens during the evaluation of the record reader.	2019-02-13 13:18:11 +05:30
Harshavardhana	fef5416b3c	Support unknown gateway errors and convert at handler layer (#7219 ) Different gateway implementations due to different backend API errors, might return different unsupported errors at our handler layer. Current code posed a problem for us because this information was lost and we would convert it to InternalError in this situation all S3 clients end up retrying the request. To avoid this unexpected situation implement a way to support this cleanly such that the underlying information is not lost which is returned by gateway.	2019-02-12 14:55:52 +05:30
poornas	40b8d11209	Move metadata into ObjectOptions for NewMultipart and PutObject (#7060 )	2019-02-09 11:01:06 +05:30
Harshavardhana	85e939636f	Fix JSON parser handling for certain objects (#7162 ) This PR also adds some comments and simplifies the code. Primary handling is done to ensure that we make sure to honor cached buffer. Added unit tests as well Fixes #7141	2019-02-07 08:04:42 +05:30
Krishna Srinivas	3dfbe0f68c	Send white spaces to client till completeMultipart() process completes (#7198 )	2019-02-05 20:58:09 -08:00
kannappanr	9a65f6dc97	Remove duplicate code in object-handlers.go (#7176 ) removed duplicate code in CompleteMultipartUploadHandler and CopyObjectPartHandler.	2019-02-05 13:36:38 -08:00
Anis Elleuch	36dae04671	CopyObjectPart: remove duplicated etag decryption (#7174 )	2019-01-30 19:33:31 -08:00
Aditya Manthramurthy	2786055df4	Add new SQL parser to support S3 Select syntax (#7102 ) - New parser written from scratch, allows easier and complete parsing of the full S3 Select SQL syntax. Parser definition is directly provided by the AST defined for the SQL grammar. - Bring support to parse and interpret SQL involving JSON path expressions; evaluation of JSON path expressions will be subsequently added. - Bring automatic type inference and conversion for untyped values (e.g. CSV data).	2019-01-28 17:59:48 -08:00
Harshavardhana	5353edcc38	Support policy variable replacement (#7085 ) This PR supports iam and bucket policies to have policy variable replacements in resource and condition key values. For example - ${aws:username} - ${aws:userid}	2019-01-21 10:27:14 +05:30
Harshavardhana	6dd13e68c2	Support V2 signatures when autoencryption is enabled (#7084 ) When auto-encryption is turned on, we pro-actively add SSEHeader for all PUT, POST operations. This is unusual for V2 signature calculation because V2 signature doesn't have a pre-defined set of signed headers in the request like V4 signature. According to V2 we should canonicalize all incoming supported HTTP headers. Make sure to validate signatures before we mutate http headers	2019-01-16 12:12:06 -08:00
Bala FA	e23a42305c	Rebase minio/parquet-go and fix null handling. (#7067 )	2019-01-16 21:52:04 +05:30
Nick Craig-Wood	9c26fe47b0	Fix server side copy of files with `?` in - fixes #7058 (#7059 ) Before this change the CopyObjectHandler and the CopyObjectPartHandler both looked for a `versionId` parameter on the `X-Amz-Copy-Source` URL for the version of the object to be copied on the URL unescaped version of the header. This meant that files that had question marks in were truncated after the question mark so that files with `?` in their names could not be server side copied. After this change the URL unescaping is done during the parsing of the `versionId` parameter which fixes the problem. This change also introduces the same logic for the `X-Amz-Copy-Source-Version-Id` header field which was previously ignored, namely returning an error if it is present and not `null` since minio does not currently support versions. S3 Docs: - https://docs.aws.amazon.com/AmazonS3/latest/API/RESTObjectCOPY.html - https://docs.aws.amazon.com/AmazonS3/latest/API/mpUploadUploadPartCopy.html	2019-01-10 13:10:10 -08:00
poornas	ed1275a063	Fix copy from encrypted multipart to single encrypted part (#7056 ) When source is encrypted multipart object and the parts are not evenly divisible by DARE package block size, target encrypted size will not necessarily be the same as encrypted source object.	2019-01-09 15:17:21 -08:00
Bala FA	b0deea27df	Refactor s3select to support parquet. (#7023 ) Also handle pretty formatted JSON documents.	2019-01-08 16:53:04 -08:00
poornas	5a80cbec2a	Add double encryption at S3 gateway. (#6423 ) This PR adds pass-through, single encryption at gateway and double encryption support (gateway encryption with pass through of SSE headers to backend). If KMS is set up (either with Vault as KMS or using MINIO_SSE_MASTER_KEY), gateway will automatically perform single encryption. If MINIO_GATEWAY_SSE is set up in addition to Vault KMS, double encryption is performed. When neither KMS nor MINIO_GATEWAY_SSE is set, do a pass through to backend. When double encryption is specified, MINIO_GATEWAY_SSE can be set to "C" for SSE-C encryption at gateway and backend, "S3" for SSE-S3 encryption at gateway/backend or both to support more than one option. Fixes #6323, #6696	2019-01-05 14:16:42 -08:00
Harshavardhana	d2f8f8c7ee	Fix ETag handling with auto-encryption with CopyObject conditions (#7000 ) minio-java tests were failing under multiple places when auto encryption was turned on, handle all the cases properly This PR fixes - CopyObject should decrypt ETag before it does if-match - CopyObject should not try to preserve metadata of source when rotating keys, unless explicitly asked by the user. - We should not try to decrypt Compressed object etag, the potential case was if user sets encryption headers along with compression enabled.	2018-12-19 14:12:53 -08:00
Harshavardhana	d1e41695fe	Add support for federation on browser (#6891 )	2018-12-19 18:43:47 +05:30
poornas	7c9f934875	Disallow SSE requests when object layer has encryption disabled (#6981 )	2018-12-14 21:39:59 -08:00
Andreas Auernhammer	d264d2c899	add auto-encryption feature (#6523 ) This commit adds an auto-encryption feature which allows the Minio operator to ensure that uploaded objects are always encrypted. This change adds the `autoEncryption` configuration option as part of the KMS configuration and the ENV. variable `MINIO_SSE_AUTO_ENCRYPTION:{on,off}`. It also updates the KMS documentation according to the changes. Fixes #6502	2018-12-14 13:35:48 -08:00
Harshavardhana	52b159b1db	Allow versionId to be null for Delete,CopyObjectPart (#6972 )	2018-12-14 11:34:37 +05:30
Harshavardhana	6f7c99a333	Allow versionId to be null for Copy,Get,Head API calls (#6942 ) Fixes #6935	2018-12-12 11:43:44 -08:00
Praveen raj Mani	9af7d627ac	Preserve the compression headers while copying (#6952 ) Fixes #6951	2018-12-11 12:05:41 -08:00
Harshavardhana	4c7c571875	Support JSON to CSV and CSV to JSON output format conversion (#6910 ) This PR implements one of the pending items in issue #6286 in S3 API a user can request CSV output for a JSON document and a JSON output for a CSV document. This PR refactors the code a little bit to bring this feature.	2018-12-07 14:55:32 -08:00
Harshavardhana	bef7c01c58	Choose right users in federation mode for CopyObject (#6895 )	2018-11-29 17:35:11 -08:00
Harshavardhana	83fe70f710	Fix CopyObject regression calculating md5sum (#6868 ) CopyObject() failed to calculate proper md5sum when without encryption headers. This is a regression fix perhaps introduced in commit `5f6d717b7a` Fixes https://github.com/minio/minio-go/issues/1044	2018-11-27 13:23:32 -08:00
Harshavardhana	dba61867e8	Redirect browser requests returning AccessDenied (#6848 ) Anonymous requests from S3 resources returning AccessDenied should be auto redirected to browser for login.	2018-11-26 12:15:12 -08:00
Harshavardhana	9e3fce441e	Audit log claims from token (#6847 )	2018-11-22 09:33:24 +05:30
Harshavardhana	bfb505aa8e	Refactor logging in more Go idiomatic style (#6816 ) This refactor brings a change which allows targets to be added in a cleaner way and also audit is now moved out. This PR also simplifies logger dependency for auditing	2018-11-19 14:47:03 -08:00
poornas	5f6d717b7a	Fix: Preserve MD5Sum for SSE encrypted objects (#6680 ) To conform with AWS S3 Spec on ETag for SSE-S3 encrypted objects, encrypt client sent MD5Sum and store it on backend as ETag. Extend this behavior to SSE-C encrypted objects.	2018-11-14 17:36:41 -08:00
Harshavardhana	7e1661f4fa	Performance improvements to SELECT API on certain query operations (#6752 ) This improves the performance of certain queries dramatically, such as 'count()' etc. Without this PR ``` ~ time mc select --query "select count() from S3Object" myminio/sjm-airlines/star2000.csv.gz 2173762 real 0m42.464s user 0m0.071s sys 0m0.010s ``` With this PR ``` ~ time mc select --query "select count(*) from S3Object" myminio/sjm-airlines/star2000.csv.gz 2173762 real 0m17.603s user 0m0.093s sys 0m0.008s ``` Almost a 250% improvement in performance. This PR avoids a lot of type conversions and instead relies on raw sequences of data and interprets them lazily. ``` benchcmp old new benchmark old ns/op new ns/op delta BenchmarkSQLAggregate_100K-4 551213 259782 -52.87% BenchmarkSQLAggregate_1M-4 6981901985 2432413729 -65.16% BenchmarkSQLAggregate_2M-4 13511978488 4536903552 -66.42% BenchmarkSQLAggregate_10M-4 68427084908 23266283336 -66.00% benchmark old allocs new allocs delta BenchmarkSQLAggregate_100K-4 2366 485 -79.50% BenchmarkSQLAggregate_1M-4 47455492 21462860 -54.77% BenchmarkSQLAggregate_2M-4 95163637 43110771 -54.70% BenchmarkSQLAggregate_10M-4 476959550 216906510 -54.52% benchmark old bytes new bytes delta BenchmarkSQLAggregate_100K-4 1233079 1086024 -11.93% BenchmarkSQLAggregate_1M-4 2607984120 557038536 -78.64% BenchmarkSQLAggregate_2M-4 5254103616 1128149168 -78.53% BenchmarkSQLAggregate_10M-4 26443524872 5722715992 -78.36% ```	2018-11-14 15:55:10 -08:00
Harshavardhana	a55a298e00	Make sure to log unhandled errors always (#6784 ) In many situations, while testing we encounter ErrInternalError, to reduce logging we have removed logging from quite a few places which is acceptable but when ErrInternalError occurs we should have a facility to log the corresponding error, this helps to debug Minio server.	2018-11-12 11:07:43 -08:00
kannappanr	df2d75a2a3	Cleanup unnecessary logs (#6788 )	2018-11-09 14:03:37 -08:00
Andreas Auernhammer	d07fb41fe8	add key-rotation for SSE-S3 objects (#6755 ) This commit adds key-rotation for SSE-S3 objects. To execute a key-rotation a SSE-S3 client must - specify the `X-Amz-Server-Side-Encryption: AES256` header for the destination - The source == destination for the COPY operation. Fixes #6754	2018-11-05 10:26:10 -08:00
Harshavardhana	bef0318c36	Support audit logs with additional fields (#6738 ) This PR adds support - Request query params - Request headers - Response headers AuditLogEntry is exported and versioned as well starting with this PR.	2018-11-02 18:40:08 -07:00
kannappanr	9ed7fb4916	Do not call multiple response.WriteHeader calls (#6733 ) Execute method in s3Select package makes a response.WriteHeader call. Not calling it again in SelectObjectContentHandler function in case of error in s3Select.Execute call.	2018-10-31 14:09:26 -07:00
Harshavardhana	f162d7bd97	Performance improvements by re-using record buffer (#6622 ) Avoid unnecessary pointer reference allocations when not needed, for example - SelectFuncs{} - Row{}	2018-10-31 08:48:01 +05:30
Harshavardhana	36990aeafd	Avoid double bucket validation in DeleteObjectHandler (#6720 ) On a heavily loaded server, getBucketInfo() becomes slow, one can easily observe deleting an object causes many additional network calls. This PR is to let the underlying call return the actual error and write it back to the client.	2018-10-30 16:07:57 -07:00
poornas	1c911c5f40	Fix: Validate copy-part encryption header and metadata (#6725 ) Otherwise CopyObjectPart would continue to upload part with incorrect encryption option and fail when upload is finalized	2018-10-29 06:40:34 -07:00
Anis Elleuch	88c3dd49c6	copy: Ensure that the user has GET access to the src object (#6715 )	2018-10-26 16:12:44 -07:00
kannappanr	6869f6d9dd	Remove unwanted logs (#6708 )	2018-10-26 14:41:25 -07:00
Harshavardhana	555d54371c	Fix CopyObjectPart broken source encryption support (#6699 ) Current master didn't support CopyObjectPart when source was encrypted, this PR fixes this by allowing range CopySource decryption at different sequence numbers. Fixes #6698	2018-10-25 08:50:06 -07:00
Praveen raj Mani	ecb042aa1c	Copy and CopyPart changes for compression (#6669 ) This PR fixes - The target object should be compressed even if the source object is not compressed. - The actual size for an encrypted object should be the `decryptedSize`	2018-10-23 11:46:20 -07:00
Andreas Auernhammer	586466584f	fix wrong actual part size assignment in CopyObjectPart (#6652 ) This commit fixes a wrong assignment to `actualPartSize`. The `actualPartSize` for an encrypted src object is not `srcInfo.Size` because that's the encrypted object size which is larger than the actual object size. So the actual part size for an encrypted object is the decrypted size of `srcInfo.Size`.	2018-10-22 14:23:23 -07:00
Ashish Kumar Sinha	c0b4bf0a3e	SQL select query for CSV/JSON (#6648 ) select * , select column names have been implemented for CSV. select * is implemented for JSON.	2018-10-22 12:12:22 -07:00
Andreas Auernhammer	8a6c3aa3cd	crypto: add RemoveInternalEntries function (#6616 ) This commit adds a function for removing crypto-specific internal entries from the object metadata. See #6604	2018-10-19 10:50:52 -07:00
Harshavardhana	62b560510b	Fix SSE-C source decryption handling (#6671 ) Without this fix we have room for two different type of errors. - Source is encrypted and we didn't provide any source encryption keys This results in Incomplete body error to be returned back to the client since source is encrypted and we gave the reader as is to the object layer which was of a decrypted value leading to "IncompleteBody" - Source is not encrypted and we provided source encryption keys. This results in a corrupted object on the destination which is considered encrypted but cannot be read by the server and returns the following error. ``` <Error><Code>XMinioObjectTampered</Code><Message>The requested object was modified and may be compromised</Message><Resource>/id-platform-gamma/ </Resource><RequestId>155EDC3E86BFD4DA</RequestId><HostId>3L137</HostId> </Error> ```	2018-10-19 10:41:13 -07:00
poornas	7e0f1eb8b5	Fix: verify client sent md5sum in encrypted PutObjectPart request (#6668 ) This PR also removes check for SSE-S3 headers as this is not required by S3 specification.	2018-10-18 16:05:05 -07:00
Pontus Leitzler	b43e8337b1	Add error handling in api-resource.go (#6651 )	2018-10-18 07:31:46 -07:00
Praveen raj Mani	cef044178c	Treat columns with spaces in between [s3Select] (#6597 ) replace the double/single quotes with backticks for the xwb1989/sqlparser to recognise such queries. Fixes #6589	2018-10-17 11:01:26 -07:00
kannappanr	c7f180ffa9	Add code to translate errInvalidEncryptionParameters to APIErrcode (#6625 ) Fixes #6623	2018-10-16 12:27:34 -07:00
kannappanr	b8bd8d6a03	Validate user provided SSE-C key on Head Object API (#6600 ) Fixes #6598	2018-10-16 12:24:27 -07:00
Anis Elleuch	5b3090dffc	encryption: Fix copy from encrypted multipart to single part (#6604 ) CopyObject handler forgot to remove multipart encryption flag in metadata when source is an encrypted multipart object and the target is also encrypted but single part object. This PR also simplifies the code to facilitate review.	2018-10-15 11:07:36 -07:00
Harshavardhana	b0c9ae7490	Add audit logging for S3 and Web handlers (#6571 ) This PR brings an additional logger implementation called AuditLog which logs to http targets The intention is to use AuditLog to log all incoming requests, this is used as a mechanism by external log collection entities for processing Minio requests.	2018-10-12 12:25:59 -07:00
poornas	110458cd10	Fix: Disallow requests with SSE-KMS headers (#6587 ) Addresses issue #6582. Minio server currently does not have SSE-KMS support. Reject requests with SSE-KMS headers with NotImplementedErr	2018-10-09 15:04:53 -07:00
Harshavardhana	54ae364def	Introduce STS client grants API and OPA policy integration (#6168 ) This PR introduces two new features - AWS STS compatible STS API named AssumeRoleWithClientGrants ``` POST /?Action=AssumeRoleWithClientGrants&Token=<jwt> ``` This API endpoint returns temporary access credentials, access tokens signature types supported by this API - RSA keys - ECDSA keys Fetches the required public key from the JWKS endpoints, provides them as rsa or ecdsa public keys. - External policy engine support, in this case OPA policy engine - Credentials are stored on disks	2018-10-09 14:00:01 -07:00
Harshavardhana	8c29f69b00	Fix racy error communication inside go-routine (#6539 ) Use CloseWithError to communicate errors in pipe, this PR also fixes potential shadowing of error	2018-09-28 13:14:59 +05:30
Praveen raj Mani	ce9d36d954	Add object compression support (#6292 ) Add support for streaming (golang/LZ77/snappy) compression.	2018-09-28 09:06:17 +05:30
poornas	ed703c065d	Add ObjectOptions to GetObjectNInfo (#6533 )	2018-09-27 15:36:45 +05:30
Anis Elleuch	aa4e2b1542	Use GetObjectNInfo in CopyObject and CopyObjectPart (#6489 )	2018-09-25 12:39:46 -07:00
Harshavardhana	48bfebe442	HEAD on an object should mimic GET without body (#6508 ) Add "Range" header support etc.	2018-09-23 22:54:10 +05:30
Aditya Manthramurthy	584cb61bb8	Switch back to GetObjectInfo for HEAD requests (#6513 )	2018-09-21 13:48:58 -07:00
Aditya Manthramurthy	36e51d0cee	Add GetObjectNInfo to object layer (#6449 ) The new call combines GetObjectInfo and GetObject, and returns an object with a ReadCloser interface. Also adds a number of end-to-end encryption tests at the handler level.	2018-09-20 19:22:09 -07:00
Harshavardhana	b62ed5dc90	select API CSV may not be specified (#6493 ) This should be present until we support JSON	2018-09-20 15:04:26 +05:30
Harshavardhana	a0683d3c1f	Send progress only when requested by client in SelectObject (#6467 )	2018-09-17 11:52:46 +05:30
poornas	14fa0097b0	fix: UploadPart,CopyObjectPart does not need sse-s3 header (#6386 ) S3 API spec for UploadPart requires encryption headers to be specified only for SSE-C	2018-09-13 14:53:03 -07:00
poornas	5c0b98abf0	Add ObjectOptions to ObjectLayer calls (#6382 )	2018-09-10 09:42:43 -07:00
Praveen raj Mani	30d4a2cf53	s3select should honour custom record delimiter (#6419 ) Allow custom delimiters like `\r\n`, `a`, `\r` etc in input csv and replace with `\n`. Fixes #6403	2018-09-10 21:50:28 +05:30
Harshavardhana	4487f70f08	Revert all GetObjectNInfo related PRs (#6398 ) * Revert "Encrypted reader wrapped in NewGetObjectReader should be closed (#6383)" This reverts commit `53a0bbeb5b`. * Revert "Change SelectAPI to use new GetObjectNInfo API (#6373)" This reverts commit `5b05df215a`. * Revert "Implement GetObjectNInfo object layer call (#6290)" This reverts commit `e6d740ce09`.	2018-08-31 13:10:12 -07:00
Harshavardhana	53a0bbeb5b	Encrypted reader wrapped in NewGetObjectReader should be closed (#6383 )	2018-08-29 19:18:00 -07:00
Harshavardhana	5b05df215a	Change SelectAPI to use new GetObjectNInfo API (#6373 ) This PR also removes some double checks	2018-08-28 13:08:30 -07:00
Aditya Manthramurthy	e6d740ce09	Implement GetObjectNInfo object layer call (#6290 ) This combines calling GetObjectInfo and GetObject while returning an io.ReadCloser for the object's body. This allows the two operations to be under a single lock, fixing a race between getting object info and reading the object body.	2018-08-27 15:28:23 +05:30
poornas	d547873b17	webhandler - display encryption errors properly (#6339 ) For encrypted objects, download errors need to be displayed in web response format instead of xml format. Fixes #6327	2018-08-24 07:56:24 -07:00
kannappanr	add57a6938	Add content-length as part of event notification structure (#6341 ) Fixes #6321	2018-08-23 14:40:54 -07:00
Andreas Auernhammer	d531080b7e	add SSE-KMS not-implemented error handling (#6234 ) This commit adds error handling for SSE-KMS requests to HEAD, GET, PUT and COPY operations. The server responds with `not implemented` if a client sends an SSE-KMS request.	2018-08-17 21:07:19 -07:00
Harshavardhana	5a4a57700b	Add select docs and fix return values for Select API (#6300 )	2018-08-17 17:11:39 -07:00
poornas	e71ef905f9	Add support for SSE-S3 server side encryption with vault (#6192 ) Add support for sse-s3 encryption with vault as KMS. Also refactoring code to make use of headers and functions defined in crypto package and clean up duplicated code.	2018-08-17 12:52:14 -07:00
Arjun Mishra	7c14cdb60e	S3 Select API Support for CSV (#6127 ) Add support for trivial where clause cases	2018-08-15 03:30:19 -07:00
Anis Elleuch	5a1ae862a7	Avoid sending an error after 206 HTTP code (#6264 ) When an S3 client sends a GET Object with a range header, a 206 HTTP code is returned indicating success. However, the call to the object layer's GetObject() inside the handler can still return an error, which would lead to writing an XML error message. That is wrong, since the 206 HTTP code has already been sent. In that case, we just stop sending data to the S3 client, which can still detect the error by comparing the received data with the Content-Length header in the Get Object response.	2018-08-08 15:39:47 -07:00
kannappanr	76ddf4d32f	Log x-amz-request-id as log and XML error response (#6173 ) Currently, requestid field in logEntry is not populated, as the requestid field gets set at the very end. It is now set before regular handler functions. This is also useful in setting it as part of the XML error response. Travis build for ppc64le has been quite inconsistent and stays queued for most of the time. Removing this build as part of Travis.yml for the time being.	2018-07-20 18:46:32 -07:00
Nitish Tiwari	2aa18cafc6	Update federation target to etcd/clientv3 (#6119 ) With CoreDNS now supporting etcdv3 as the DNS backend, we can update our federation target to etcdv3. Users will now be able to use etcdv3 server as the federation backbone. Minio will update bucket data to etcdv3 and CoreDNS can pick that data up and serve it as bucket style DNS path.	2018-07-12 14:12:40 -07:00
Andreas Auernhammer	adf7340394	fix size computation for en/decrypted objects (#6147 ) This PR fixes the size calculation for encrypted multipart objects.	2018-07-12 11:23:32 -07:00
Andreas Auernhammer	15771ebe8d	Fix decrypted object size and key derivation in CopyObjectPart (#6141 ) This commit fixes the size calculation for multipart objects. The decrypted size of an encrypted multipart object is the sum of the decrypted part sizes. Also fixes the key derivation in CopyObjectPart. Instead of using the same object-encryption-key for each part, a unique per-part key is now derived. Updates #6139	2018-07-12 21:59:56 +05:30
Praveen raj Mani	44865596db	SignatureV4 validation with Metadata in the presignedUrl (#5894 ) The `X-Amz-Meta-`/`X-Minio-Meta-` will now be recognized in query string also. Fixes #5857 #5950	2018-07-10 20:27:10 -07:00
Andreas Auernhammer	b181a693fb	fix object rebinding SSE-C security guarantee violation (#6121 ) This commit fixes a weakness of the key-encryption-key derivation for SSE-C encrypted objects. Before this change the key-encryption-key was not bound to / didn't depend on the object path. This allowed an attacker to replace objects - encrypted with the same client-key - with each other. This change fixes this issue by updating the key-encryption-key derivation to include: - the domain (in this case SSE-C) - a canonical object path representation - the encryption & key derivation algorithm Changing the object path now causes the KDF to derive a different key-encryption-key such that the object-key unsealing fails. Including the domain (SSE-C) and encryption & key derivation algorithm is not directly necessary for this fix. However, both will be included for the SSE-S3 KDF. So they are included here to avoid updating the KDF again when we add SSE-S3. The legacy KDF 'DARE-SHA256' is only used for existing objects and never for new objects / key rotation.	2018-07-09 17:18:28 -07:00
Harshavardhana	6c85706c24	Use GetSourceIP for source ip as request params (#6109 ) Fixes #6108	2018-07-02 14:40:18 -07:00
Praveen raj Mani	360f3f9335	Checking the existence of the bucket in DeleteObjectHandler (#6085 ) Fixes #6077	2018-06-30 22:35:43 -07:00
Nitish Tiwari	3dc13323e5	Use random host from among multiple hosts to create requests Also use hosts passed to Minio startup command to populate IP addresses if MINIO_PUBLIC_IPS is not set.	2018-06-08 10:22:01 -07:00
Nitish Tiwari	6ce7265c8c	Add support for CopyObject across regions and multiple Minio IPs This PR adds CopyObject support for objects residing in buckets in different Minio instances (where Minio instances are part of a federated setup). Also, added support for multiple Minio domain IPs. This is required for distributed deployments, where one deployment may have multiple nodes, each with a different public IP.	2018-06-08 10:22:01 -07:00
Krishna Srinivas	d6df9b16ac	Return NoSuchKey for anonReqs with s3:ListBucket policy (#5876 )	2018-05-02 12:13:27 +05:30

539 Commits