minio

Commit Graph

Author	SHA1	Message	Date
ebozduman	a91cfa03e7	extend the HINT on backend ownership and its contents (#9846 )	2020-06-16 15:32:29 -07:00
Harshavardhana	4915433bd2	Support bucket versioning (#9377 ) - Implement a new xl.json 2.0.0 format to support, this moves the entire marshaling logic to POSIX layer, top layer always consumes a common FileInfo construct which simplifies the metadata reads. - Implement list object versions - Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	2020-06-12 20:04:01 -07:00
Klaus Post	43d6e3ae06	merge object lifecycle checks into usage crawler (#9579 )	2020-06-12 10:28:21 -07:00
Harshavardhana	febe9cc26a	fix: avoid timer leaks in dsync/lsync (#9781 ) At a customer setup with lots of concurrent calls it can be observed that in newRetryTimer there were lots of tiny alloations which are not relinquished upon retries, in this codepath we were only interested in re-using the timer and use it wisely for each locker. ``` (pprof) top Showing nodes accounting for 8.68TB, 97.02% of 8.95TB total Dropped 1198 nodes (cum <= 0.04TB) Showing top 10 nodes out of 79 flat flat% sum% cum cum% 5.95TB 66.50% 66.50% 5.95TB 66.50% time.NewTimer 1.16TB 13.02% 79.51% 1.16TB 13.02% github.com/ncw/directio.AlignedBlock 0.67TB 7.53% 87.04% 0.70TB 7.78% github.com/minio/minio/cmd.xlObjects.putObject 0.21TB 2.36% 89.40% 0.21TB 2.36% github.com/minio/minio/cmd.(posix).Walk 0.19TB 2.08% 91.49% 0.27TB 2.99% os.statNolog 0.14TB 1.59% 93.08% 0.14TB 1.60% os.(File).readdirnames 0.10TB 1.09% 94.17% 0.11TB 1.25% github.com/minio/minio/cmd.readDirN 0.10TB 1.07% 95.23% 0.10TB 1.07% syscall.ByteSliceFromString 0.09TB 1.03% 96.27% 0.09TB 1.03% strings.(Builder).grow 0.07TB 0.75% 97.02% 0.07TB 0.75% path.(lazybuf).append ```	2020-06-08 11:28:40 -07:00
Harshavardhana	5686a7e273	fix NAS gateway support for policy/notification (#9765 ) Fixes #9764	2020-06-03 13:18:54 -07:00
Harshavardhana	b2db8123ec	Preserve errors returned by diskInfo to detect disk errors (#9727 ) This PR basically reverts #9720 and re-implements it differently	2020-05-28 13:03:04 -07:00
Harshavardhana	6e0575a53d	Revert "Disable crawler in FS/NAS gateway mode (#9695 )" (#9702 ) This reverts commit `eba423bb9d`. Additionally also address the FS crawler to properly calculate the sizes for encrypted/compressed content.	2020-05-25 11:32:53 -07:00
Harshavardhana	eba423bb9d	Disable crawler in FS/NAS gateway mode (#9695 ) No one really uses FS for large scale accounting usage, neither we crawl in NAS gateway mode. It is worthwhile to simply disable this feature as its not useful for anyone. Bonus disable bucket quota ops as well in, FS and gateway mode	2020-05-25 00:17:52 -07:00
P R	3f6d624c7b	add gateway object tagging support (#9124 )	2020-05-23 11:09:35 -07:00
Harshavardhana	6656fa3066	simplify further bucket configuration properly (#9650 ) This PR is a continuation from #9586, now the entire parsing logic is fully merged into bucket metadata sub-system, simplify the quota API further by reducing the remove quota handler implementation.	2020-05-20 10:18:15 -07:00
Harshavardhana	bd032d13ff	migrate all bucket metadata into a single file (#9586 ) this is a major overhaul by migrating off all bucket metadata related configs into a single object '.metadata.bin' this allows us for faster bootups across 1000's of buckets and as well as keeps the code simple enough for future work and additions. Additionally also fixes #9396, #9394	2020-05-19 13:53:54 -07:00
Harshavardhana	b730bd1396	fix: possible race in FS local lockMap (#9598 )	2020-05-14 23:59:07 -07:00
Harshavardhana	a1de9cec58	cleanup object-lock/bucket tagging for gateways (#9548 ) This PR is to ensure that we call the relevant object layer APIs for necessary S3 API level functionalities allowing gateway implementations to return proper errors as NotImplemented{} This allows for all our tests in mint to behave appropriately and can be handled appropriately as well.	2020-05-08 13:44:44 -07:00
Anis Elleuch	6885c72f32	disable check for DirectIO in standalone FS mode (#9558 )	2020-05-08 12:07:51 -07:00
Harshavardhana	2dc46cb153	Report correct error when O_DIRECT is not supported (#9545 ) fixes #9537	2020-05-07 16:12:16 -07:00
Bala FA	3773874cd3	add bucket tagging support (#9389 ) This patch also simplifies object tagging support	2020-05-05 14:18:13 -07:00
Klaus Post	073aac3d92	add data update tracking using bloom filter (#9208 ) By monitoring PUT/DELETE and heal operations it is possible to track changed paths and keep a bloom filter for this data. This can help prioritize paths to scan. The bloom filter can identify paths that have not changed, and the few collisions will only result in a marginal extra workload. This can be implemented on either a bucket+(1 prefix level) with reasonable performance. The bloom filter is set to have a false positive rate at 1% at 1M entries. A bloom table of this size is about ~2500 bytes when serialized. To not force a full scan of all paths that have changed cycle bloom filters would need to be kept, so we guarantee that dirty paths have been scanned within cycle runs. Until cycle bloom filters have been collected all paths are considered dirty.	2020-04-27 10:06:21 -07:00
Harshavardhana	60d415bb8a	deprecate/remove global WORM mode (#9436 ) global WORM mode is a complex piece for which the time has passed, with the advent of S3 compatible object locking and retention implementation global WORM is sort of deprecated, this has been mentioned in our documentation for some time, now the time has come for this to go.	2020-04-24 16:37:05 -07:00
Harshavardhana	282c9f790a	fix: validate partNumber in queryParam as part of preConditions (#9386 )	2020-04-20 22:01:59 -07:00
Harshavardhana	69fb68ef0b	fix simplify code to start using context (#9350 )	2020-04-16 10:56:18 -07:00
Harshavardhana	f44cfb2863	use GlobalContext whenever possible (#9280 ) This change is throughout the codebase to ensure that all codepaths honor GlobalContext	2020-04-09 09:30:02 -07:00
Pontus Leitzler	a973402821	add object api check in fs-v1 before returning ready (#9285 ) fs-v1 in server mode only checks to see if the path exist, so that it returns ready before it is indeed ready. This change adds a check to ensure that the global object api is available too before reporting ready. Fixes #9283	2020-04-08 08:53:20 -07:00
Bala FA	2c3e34f001	add force delete option of non-empty bucket (#9166 ) passing HTTP header `x-minio-force-delete: true` would allow standard S3 API DeleteBucket to delete a non-empty bucket forcefully.	2020-03-27 21:52:59 -07:00
Harshavardhana	b1a2169dcc	fix: data usage crawler env handling, usage-cache.bin location (#9163 ) canonicalize the ENVs such that we can bring these ENVs as part of the config values, as a subsequent change. - fix location of per bucket usage to `.minio.sys/buckets/<bucket_name>/usage-cache.bin` - fix location of the overall usage in `json` at `.minio.sys/buckets/.usage.json` (avoid conflicts with a bucket named `usage.json` ) - fix location of the overall usage in `msgp` at `.minio.sys/buckets/.usage.bin` (avoid conflicts with a bucket named `usage.bin`	2020-03-19 09:47:47 -07:00
Anis Elleuch	db2155551a	heal: Pass scan mode to HealObjects to deep scan full quorum objects (#9159 ) As an optimization of the healing, HealObjects() avoid sending an object to the background healing subsystem when the object is present in all disks. However, HealObjects() should have checked the scan type, if this deep, always pass the object to the healing subsystem.	2020-03-18 17:50:00 -07:00
Klaus Post	8d98662633	re-implement data usage crawler to be more efficient (#9075 ) Implementation overview: https://gist.github.com/klauspost/1801c858d5e0df391114436fdad6987b	2020-03-18 16:19:29 -07:00
Krishna Srinivas	2e9fed1a14	non-empty dirs should not be listed as objects (#9129 )	2020-03-13 17:43:00 -07:00
Harshavardhana	23a8411732	Add a generic Walk()'er to list a bucket, optinally prefix (#9026 ) This generic Walk() is used by likes of Lifecyle, or KMS to rotate keys or any other functionality which relies on this functionality.	2020-02-25 21:22:28 +05:30
Harshavardhana	ab7d3cd508	fix: Speed up multi-object delete by taking bulk locks (#8974 ) Change distributed locking to allow taking bulk locks across objects, reduces usually 1000 calls to 1. Also allows for situations where multiple clients sends delete requests to objects with following names ``` {1,2,3,4,5} ``` ``` {5,4,3,2,1} ``` will block and ensure that we do not fail the request on each other.	2020-02-21 11:29:57 +05:30
Anis Elleuch	d4dcf1d722	metrics: Use StorageInfo() instead to have consistent info (#9006 ) Metrics used to have its own code to calculate offline disks. StorageInfo() was avoided because it is an expensive operation by sending calls to all nodes. To make metrics & server info share the same code, a new argument `local` is added to StorageInfo() so it will only query local disks when needed. Metrics now calls StorageInfo() as server info handler does but with the local flag set to false. Co-authored-by: Praveen raj Mani <praveen@minio.io> Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-20 09:21:33 +05:30
Krishnan Parthasarathi	026265f8f7	Add support for bucket encryption feature (#8890 ) - pkg/bucket/encryption provides support for handling bucket encryption configuration - changes under cmd/ provide support for AES256 algorithm only Co-Authored-By: Poorna <poornas@users.noreply.github.com> Co-authored-by: Harshavardhana <harsha@minio.io>	2020-02-05 15:12:34 +05:30
Harshavardhana	2d295a31de	Avoid select inside a recursive function to avoid CPU spikes (#8923 ) Additionally also allow configurable go-routines	2020-02-03 16:45:59 -08:00
Harshavardhana	f98616dce7	heal: Optimize heal listing by avoiding batches (#8901 ) Also limit the heal per object if there is incoming requests by suspending heal for longer periods of time.	2020-01-29 12:05:44 +05:30
Harshavardhana	0cbebf0f57	Rename pkg/{tagging,lifecycle} to pkg/bucket sub-directory (#8892 ) Rename to allow for more such features to come in a more proper hierarchical manner.	2020-01-27 14:12:34 -08:00
Harshavardhana	f14f60a487	fix: Avoid double usage calculation on every restart (#8856 ) On every restart of the server, usage was being calculated which is not useful instead wait for sufficient time to start the crawling routine. This PR also avoids lots of double allocations through strings, optimizes usage of string builders and also avoids crawling through symbolic links. Fixes #8844	2020-01-21 14:07:49 -08:00
Nitish Tiwari	61c17c8933	Add ObjectTagging Support (#8754 ) This PR adds support for AWS S3 ObjectTagging API as explained here https://docs.aws.amazon.com/AmazonS3/latest/dev/object-tagging.html	2020-01-20 08:45:59 -08:00
poornas	30922148fb	Fix bug preventing overwrite of object if (#8796 ) object lock config is enabled for a bucket. Creating a bucket with object lock configuration enabled does not automatically cause WORM protection to be applied. PUT operation needs to specifically request object locking or bucket has to have default retention settings configured. Fixes regression introduced in #8657	2020-01-13 17:29:31 -08:00
Praveen raj Mani	5d09233115	Fix Readiness check (#8681 ) - Remove goroutine-check in Readiness check - Bring in quorum check for readiness Fixes #8385 Co-authored-by: Harshavardhana <harsha@minio.io>	2019-12-28 22:24:43 +05:30
Anis Elleuch	c31e67dcce	Better error when the server is unable to write in the backend (#8697 )	2019-12-25 22:05:54 -08:00
Harshavardhana	5f2318567e	Allow metadata updates on meta bucket even in WORM mode (#8657 ) This ensures that we can update the - .minio.sys is updated for accounting/data usage purposes - .minio.sys is updated to indicate if backend is encrypted or not.	2019-12-17 10:13:12 -08:00
Anis Elleuch	555969ee42	Add data usage collect with its new admin API (#8553 ) Admin data usage info API returns the following (Only FS & XL, for now) - Number of buckets - Number of objects - The total size of objects - Objects histogram - Bucket sizes	2019-12-12 06:02:37 -08:00
Nitish Tiwari	3df7285c3c	Add Support for Cache and S3 related metrics in Prometheus endpoint (#8591 ) This PR adds support below metrics - Cache Hit Count - Cache Miss Count - Data served from Cache (in Bytes) - Bytes received from AWS S3 - Bytes sent to AWS S3 - Number of requests sent to AWS S3 Fixes #8549	2019-12-05 23:16:06 -08:00
Harshavardhana	2ab8d5e47f	Enable build verification with race (#8583 )	2019-12-02 15:54:26 -08:00
poornas	ca96560d56	Add object retention at the per object (#8528 ) level - this PR builds on #8120 which added PutBucketObjectLockConfiguration and GetBucketObjectLockConfiguration APIS This PR implements PutObjectRetention, GetObjectRetention API and enhances PUT and GET API operations to display governance metadata if permissions allow.	2019-11-20 13:18:09 -08:00
Harshavardhana	e9b2bf00ad	Support MinIO to be deployed on more than 32 nodes (#8492 ) This PR implements locking from a global entity into a more localized set level entity, allowing for locks to be held only on the resources which are writing to a collection of disks rather than a global level. In this process this PR also removes the top-level limit of 32 nodes to an unlimited number of nodes. This is a precursor change before bring in bucket expansion.	2019-11-13 12:17:45 -08:00
Bala FA	fb48ca5020	Add Get/Put Bucket Lock Configuration API support (#8120 ) This feature implements [PUT Bucket object lock configuration][1] and [GET Bucket object lock configuration][2]. After object lock configuration is set, existing and new objects are set to WORM for specified duration. Currently Governance mode works exactly like Compliance mode. Fixes #8101 [1] https://docs.aws.amazon.com/AmazonS3/latest/API/RESTBucketPUTObjectLockConfiguration.html [2] https://docs.aws.amazon.com/AmazonS3/latest/API/RESTBucketGETObjectLockConfiguration.html	2019-11-12 14:50:18 -08:00
Praveen raj Mani	fa325665b1	Do not append the endpoint for fs/xl disks in StorageInfo (#8472 )	2019-10-31 09:13:54 -07:00
Praveen raj Mani	8836d57e3c	The prometheus metrics refractoring (#8003 ) The measures are consolidated to the following metrics - `disk_storage_used` : Disk space used by the disk. - `disk_storage_available`: Available disk space left on the disk. - `disk_storage_total`: Total disk space on the disk. - `disks_offline`: Total number of offline disks in current MinIO instance. - `disks_total`: Total number of disks in current MinIO instance. - `s3_requests_total`: Total number of s3 requests in current MinIO instance. - `s3_errors_total`: Total number of errors in s3 requests in current MinIO instance. - `s3_requests_current`: Total number of active s3 requests in current MinIO instance. - `internode_rx_bytes_total`: Total number of internode bytes received by current MinIO server instance. - `internode_tx_bytes_total`: Total number of bytes sent to the other nodes by current MinIO server instance. - `s3_rx_bytes_total`: Total number of s3 bytes received by current MinIO server instance. - `s3_tx_bytes_total`: Total number of s3 bytes sent by current MinIO server instance. - `minio_version_info`: Current MinIO version with commit-id. - `s3_ttfb_seconds_bucket`: Histogram that holds the latency information of the requests. And this PR also modifies the current StorageInfo queries - Decouples StorageInfo from ServerInfo . - StorageInfo is enhanced to give endpoint information. NOTE: ADMIN API VERSION IS BUMPED UP IN THIS PR Fixes #7873	2019-10-22 21:01:14 -07:00
poornas	d7060c4c32	Allow logging targets to be configured to receive `minio` (#8347 ) specific errors, `application` errors or `all` by default. console logging on server by default lists all logs - enhance admin console API to accept `type` as query parameter to subscribe to application/minio logs.	2019-10-11 18:50:54 -07:00
Harshavardhana	589e32a4ed	Refactor config and split them in packages (#8351 ) This change is related to larger config migration PR change, this is a first stage change to move our configs to `cmd/config/` - divided into its subsystems	2019-10-04 23:05:33 +05:30
Harshavardhana	b52a3e523c	Avoid using fastjson parser pool, move back to jsoniter (#8190 ) It looks like from implementation point of view fastjson parser pool doesn't behave the same way as expected when dealing many `xl.json` from multiple disks. The fastjson parser pool usage ends up returning incorrect xl.json entries for checksums, with references pointing to older entries. This led to the subtle bug where checksum info is duplicated from a previous xl.json read of a different file from different disk.	2019-09-06 04:21:27 +05:30
Harshavardhana	9ca7470ccc	Avoid using jsoniter, move to fastjson (#8063 ) This is to avoid using unsafe.Pointer type code dependency for MinIO, this causes crashes on ARM64 platforms Refer #8005 collection of runtime crashes due to unsafe.Pointer usage incorrectly. We have seen issues like this before when using jsoniter library in the past. This PR hopes to fix this using fastjson	2019-08-19 08:35:52 -10:00
Harshavardhana	e6d8e272ce	Use const slashSeparator instead of "/" everywhere (#8028 )	2019-08-06 12:08:58 -07:00
Harshavardhana	ac82798d0a	Remove uneeded calls on FS (#7967 )	2019-07-24 15:59:13 +05:30
Krishnan Parthasarathi	559a59220e	Add initial support for bucket lifecycle (#7563 ) This PR is based off @sinhaashish's PR for object lifecycle management, which includes support only for, - Expiration of object - Filter using object prefix (_not_ object tags) N B the code for actual expiration of objects will be included in a subsequent PR.	2019-07-19 21:20:33 +01:00
Krishna Srinivas	338e9a9be9	Put object client disconnect (#7824 ) Fail putObject and postpolicy in case client prematurely disconnects Use request's context to cancel lock requests on client disconnects	2019-06-28 22:09:17 -07:00
Harshavardhana	38224a4c1a	Ignore errors reading fs.json (#7777 )	2019-06-12 16:42:03 -07:00
Anis Elleuch	7abadfccc2	Add self-healing feature (#7604 ) - Background Heal routine receives heal requests from a channel, either to heal format, buckets or objects - Daily sweeper lists all objects in all buckets, these objects don't necessarly have read quorum so they can be removed if these objects are unhealable - Heal daily ops receives objects from the daily sweeper and send them to the heal routine.	2019-06-08 22:14:07 -07:00
Harshavardhana	2c0b3cadfc	Update go mod with sem versions of our libraries (#7687 )	2019-05-29 16:35:12 -07:00
Anis Elleuch	9c90a28546	Implement bulk delete (#7607 ) Bulk delete at storage level in Multiple Delete Objects API In order to accelerate bulk delete in Multiple Delete objects API, a new bulk delete is introduced in storage layer, which will accept a list of objects to delete rather than only one. Consequently, a new API is also need to be added to Object API.	2019-05-13 12:25:49 -07:00
Praveen raj Mani	d9a7f80f68	Remove duplicate checkPutObjectArgs in PutObject and (#7396 ) Fixes #7384	2019-05-13 10:12:06 -07:00
Harshavardhana	64998fc4ab	Remove delayIsLeaf requirement simplify ListObjects further (#7593 )	2019-05-02 10:36:57 +05:30
Harshavardhana	f767a2538a	Optimize listing with leaf check offloaded to posix (#7541 ) Other listing optimizations include - remove double sorting while filtering object entries - improve error message when upload-id is not in quorum - use jsoniter for full unmarshal json, instead of gjson - remove unused code	2019-04-23 14:54:28 -07:00
Harshavardhana	620e462413	Implement S3-HDFS gateway (#7440 ) - [x] Support bucket and regular object operations - [x] Supports Select API on HDFS - [x] Implement multipart API support - [x] Completion of ListObjects support	2019-04-17 09:52:08 -07:00
kannappanr	5ecac91a55	Replace Minio refs in docs with MinIO and links (#7494 )	2019-04-09 11:39:42 -07:00
Harshavardhana	0188009c7e	Expose total and available disk space (#7453 )	2019-04-05 09:51:50 +05:30
Harshavardhana	c184038b6a	Add proper custom errors object creations (#7387 ) In scenario 1 ``` - bucket/object-prefix - bucket/object-prefix/object ``` Server responds with `XMinioParentIsObject` In scenario 2 ``` - bucket/object-prefix/object - bucket/object-prefix ``` Server responds with `XMinioObjectExistsAsDirectory` Fixes #6566	2019-03-20 13:06:53 -07:00
Anis Elleuch	facbd653ba	Add normal/deep type of heal scanning (#7251 ) Healing scan used to read all objects parts to check for bitrot checksum. This commit will add a quicker way of healing scan by only checking if parts are actually present in disks or not.	2019-03-14 13:08:51 -07:00
Harshavardhana	7079abc931	Implement HealObjects API to simplify healing (#7351 )	2019-03-13 17:35:09 -07:00
Anis Elleuch	b05825ffe8	s3: Fix precondition failed in CopyObjectPart when src is encrypted (#7276 ) CopyObject precondition checks into GetObjectReader in order to perform SSE-C pre-condition checks using the last 32 bytes of encrypted ETag rather than the decrypted ETag This also necessitates moving precondition checks for gateways to gateway layer rather than object handler check	2019-03-06 12:38:41 -08:00
kannappanr	c57159a0fe	fs mode: List already existing buckets with capital letters (#7244 ) if a bucket with `Captialized letters` is created, `InvalidBucketName` error will be returned. In the case of pre-existing buckets, it will be listed. Fixes #6938	2019-03-05 10:42:32 -08:00
poornas	8022a6efd9	Return ETag for 0-byte object prefixes (#7291 ) Fixes: #7290	2019-02-26 15:09:14 -08:00
Harshavardhana	df35d7db9d	Introduce staticcheck for stricter builds (#7035 )	2019-02-13 18:29:36 +05:30
Harshavardhana	082f777281	Revamp bucket metadata healing (#7208 ) Bucket metadata healing in the current code was executed multiple times each time for a given set. Bucket metadata just like objects are hashed in accordance with its name on any given set, to allow hashing to play a role we should let the top level code decide where to navigate. Current code also had 3 bucket metadata files hardcoded, whereas we should make it generic by listing and navigating the .minio.sys to heal such objects. We also had another bug where due to isObjectDangling changes without pre-existing bucket metadata files, we were erroneously reporting it as grey/corrupted objects. This PR fixes all of the above items.	2019-02-11 09:23:13 +05:30
poornas	40b8d11209	Move metadata into ObjectOptions for NewMultipart and PutObject (#7060 )	2019-02-09 11:01:06 +05:30
Harshavardhana	30135eed86	Redo how to handle stale dangling files (#7171 ) foo.CORRUPTED should never be created because when multiple sets are involved we would hash the file to wrong a location, this PR removes the code. But allows DeleteBucket() to work properly to delete dangling buckets/objects. Also adds another option to Healing where a user needs to specify `--remove` such that all dangling objects will be deleted with user confirmation.	2019-02-05 17:58:48 -08:00
poornas	5a80cbec2a	Add double encryption at S3 gateway. (#6423 ) This PR adds pass-through, single encryption at gateway and double encryption support (gateway encryption with pass through of SSE headers to backend). If KMS is set up (either with Vault as KMS or using MINIO_SSE_MASTER_KEY),gateway will automatically perform single encryption. If MINIO_GATEWAY_SSE is set up in addition to Vault KMS, double encryption is performed.When neither KMS nor MINIO_GATEWAY_SSE is set, do a pass through to backend. When double encryption is specified, MINIO_GATEWAY_SSE can be set to "C" for SSE-C encryption at gateway and backend, "S3" for SSE-S3 encryption at gateway/backend or both to support more than one option. Fixes #6323, #6696	2019-01-05 14:16:42 -08:00
Anis Elleuch	632022971b	s3: Don't set NextMarker when listing is not truncated (#7012 ) Setting NextMarker when IsTruncated is not set seems to be confusing AWS C++ SDK, this commit will avoid setting any string in NextMarker.	2018-12-20 13:30:25 -08:00
poornas	f6980c4630	fix ConfigSys and NotificationSys initialization for NAS (#6920 )	2018-12-05 14:03:42 -08:00
Nitish Tiwari	2a810c7da2	Improve du thread performance (#6849 )	2018-11-26 10:35:14 +05:30
poornas	5f6d717b7a	Fix: Preserve MD5Sum for SSE encrypted objects (#6680 ) To conform with AWS S3 Spec on ETag for SSE-S3 encrypted objects, encrypt client sent MD5Sum and store it on backend as ETag.Extend this behavior to SSE-C encrypted objects.	2018-11-14 17:36:41 -08:00
kannappanr	c872c1f1dc	Return default ETag if fs.json is empty (#6787 )	2018-11-09 10:34:59 -08:00
Harshavardhana	54ae364def	Introduce STS client grants API and OPA policy integration (#6168 ) This PR introduces two new features - AWS STS compatible STS API named AssumeRoleWithClientGrants ``` POST /?Action=AssumeRoleWithClientGrants&Token=<jwt> ``` This API endpoint returns temporary access credentials, access tokens signature types supported by this API - RSA keys - ECDSA keys Fetches the required public key from the JWKS endpoints, provides them as rsa or ecdsa public keys. - External policy engine support, in this case OPA policy engine - Credentials are stored on disks	2018-10-09 14:00:01 -07:00
Praveen raj Mani	c7722fbb1b	Simplify pkg `mimedb` (#6549 ) Content-Type resolution can now use a function `TypeByExtension(extension)` to resolve to the respective content-type.	2018-10-02 11:48:17 +05:30
Praveen raj Mani	ce9d36d954	Add object compression support (#6292 ) Add support for streaming (golang/LZ77/snappy) compression.	2018-09-28 09:06:17 +05:30
poornas	ed703c065d	Add ObjectOptions to GetObjectNInfo (#6533 )	2018-09-27 15:36:45 +05:30
Anis Elleuch	aa4e2b1542	Use GetObjectNInfo in CopyObject and CopyObjectPart (#6489 )	2018-09-25 12:39:46 -07:00
Aditya Manthramurthy	36e51d0cee	Add GetObjectNInfo to object layer (#6449 ) The new call combines GetObjectInfo and GetObject, and returns an object with a ReadCloser interface. Also adds a number of end-to-end encryption tests at the handler level.	2018-09-20 19:22:09 -07:00
poornas	5c0b98abf0	Add ObjectOptions to ObjectLayer calls (#6382 )	2018-09-10 09:42:43 -07:00
Harshavardhana	4487f70f08	Revert all GetObjectNInfo related PRs (#6398 ) * Revert "Encrypted reader wrapped in NewGetObjectReader should be closed (#6383)" This reverts commit `53a0bbeb5b`. * Revert "Change SelectAPI to use new GetObjectNInfo API (#6373)" This reverts commit `5b05df215a`. * Revert "Implement GetObjectNInfo object layer call (#6290)" This reverts commit `e6d740ce09`.	2018-08-31 13:10:12 -07:00
Aditya Manthramurthy	e6d740ce09	Implement GetObjectNInfo object layer call (#6290 ) This combines calling GetObjectInfo and GetObject while returning a io.ReadCloser for the object's body. This allows the two operations to be under a single lock, fixing a race between getting object info and reading the object body.	2018-08-27 15:28:23 +05:30
Krishna Srinivas	52f6d5aafc	Rename of structs and methods (#6230 ) Rename of ErasureStorage to Erasure (and rename of related variables and methods)	2018-08-23 23:35:37 -07:00
Harshavardhana	1ffa6adcd4	Ignore io.EOF returned by ReadFrom for zero byte `fs.json` (#6346 ) Fixes #6256	2018-08-24 11:34:21 +05:30
Harshavardhana	556a51120c	Deprecate ListLocks and ClearLocks (#6233 ) No locks are ever left in memory, we also have a periodic interval of clearing stale locks anyways. The lock instrumentation was not complete and was seldom used. Deprecate this for now and bring it back later if it is really needed. This also in-turn seems to improve performance slightly.	2018-08-02 23:09:42 +05:30
Harshavardhana	ad86454580	Make sure to handle FaultyDisks in listing ops (#6204 ) Continuing from PR `157ed65c35` Our posix.go implementation did not handle I/O errors properly on the disks, this led to situations where top-level callers such as ListObjects might return early without even verifying all the available disks. This commit tries to address this in Kubernetes, drbd/nbd based persistent volumes which can disconnect under load and result in the situations with disks return I/O errors. This commit also simplifies listing operation, listing never returns any error. We can avoid this since we pretty much ignore most of the errors anyways. When objects are accessed directly we return proper errors.	2018-07-27 15:32:19 -07:00
Anis Elleuch	be1700f595	Avoid startup abort when a notify target is down (#6126 ) Minio server was preventing itself to start when any notification target is down and not running. The PR changes the behavior by avoiding startup abort in that case, so the user will still be able to access Minio server using mc admin commands after a restart or set config commands.	2018-07-10 07:20:31 +05:30
wd256	25f9b0bc3b	Handle ListObjectsV2 start-after parameter in ObjectLayer (#6078 )	2018-07-01 09:52:45 +05:30
Harshavardhana	e5e522fc61	docs: fix all Chinese doc links for the new docs site (#6097 ) Additionally fix typos, default to US locale words	2018-06-28 16:02:02 -07:00
Harshavardhana	de251483d1	Avoid ticker timer to simplify disk usage (#6101 ) This PR simplifies the code to avoid tracking any running usage events. This PR also brings in an upper threshold of upto 1 minute suspend the usage function after which the usage would proceed without waiting any longer.	2018-06-28 15:05:45 -07:00
Praveen raj Mani	ea76e72054	Incorrect error message for insufficient volume fix (#6099 ) Reply back with appropriate error message when the server is spawn with volume of insufficient size (< 1GiB). Fixes #5993.	2018-06-28 12:01:05 -07:00
Harshavardhana	25de775560	disable disk-usage when export is root mount path (#6091 ) disk usage crawling is not needed when a tenant is not sharing the same disk for multiple other tenants. This PR adds an optimization when we see a setup uses entire disk, we simply rely on statvfs() to give us total usage. This PR also additionally adds low priority scheduling for usage check routine, such that other go-routines blocked will be automatically unblocked and prioritized before usage.	2018-06-27 18:59:38 -07:00
Harshavardhana	abf209b1dd	load bucket policies using object layer API (#6084 ) This PR fixes an issue during gateway mode where underlying policies were not translated into meaningful policies.	2018-06-27 12:29:48 +05:30
Nitish Tiwari	ad79c626c6	Throw 404 for head requests for prefixes without trailing "/" (#5966 ) Minio server returns 403 (access denied) for head requests to prefixes without trailing "/", this is different from S3 behaviour. S3 returns 404 in such cases. Fixes #6080	2018-06-26 06:54:00 +05:30
Ashish Kumar Sinha	0bbdd02a57	Updating disk storage for FS/Erasure mode (#6081 ) Updating the disk storage stats for FS/Erasure coded backend	2018-06-25 10:46:48 -07:00
Harshavardhana	6fb0604502	Allow usage check to be configurable (#6006 )	2018-06-04 18:35:41 -07:00
Harshavardhana	000e360196	Deprecate showing drive capacity and total free (#5976 ) This addresses a situation that we shouldn't be displaying Total/Free anymore, instead we should simply show the total usage.	2018-05-23 17:30:25 -07:00
Harshavardhana	e6ec645035	Implement support for calculating disk usage per tenant (#5969 ) Fixes #5961	2018-05-23 15:41:29 +05:30
Bala FA	4eb788df79	rename checkPathValid() to getValidPath() (#5949 )	2018-05-17 07:27:07 -07:00
Krishna Srinivas	bb34bd91f1	Fix unnecessary log messages to avoid flooding the logs (#5900 )	2018-05-09 01:38:27 -07:00
Anis Elleuch	6d5f2a4391	Better support of empty directories (#5890 ) Better support of HEAD and listing of zero sized objects with trailing slash (a.k.a empty directory). For that, isLeafDir function is added to indicate if the specified object is an empty directory or not. Each backend (xl, fs) has the responsibility to store that information. Currently, in both of XL & FS, an empty directory is represented by an empty directory in the backend. isLeafDir() checks if the given path is an empty directory or not, since dir listing is costly if the latter contains too many objects, readDirN() is added in this PR to list only N number of entries. In isLeadDir(), we will only list one entry to check if a directory is empty or not.	2018-05-09 01:38:21 -07:00
Anis Elleuch	32700fca52	Enhance fatal errors printing of common issues seen by users (#5878 )	2018-05-08 19:04:36 -07:00
Harshavardhana	c98d8cb1c7	fs: fix a regression allow reading of existing files (#5889 )	2018-05-07 17:00:44 -07:00
kannappanr	fe126de98b	Regenerate fs.json if it is corrupted in FS mode (#5778 ) Also return a default e-tag for pre-existing objects. Fixes #5712	2018-04-24 17:36:43 -07:00
Bala FA	0d52126023	Enhance policy handling to support SSE and WORM (#5790 ) - remove old bucket policy handling - add new policy handling - add new policy handling unit tests This patch brings support to bucket policy to have more control not limiting to anonymous. Bucket owner controls to allow/deny any rest API. For example server side encryption can be controlled by allowing PUT/GET objects with encryptions including bucket owner.	2018-04-24 15:53:30 -07:00
Harshavardhana	ccdb7bc286	Fix s3 compatibility fixes for getBucketLocation,headBucket,deleteBucket (#5842 ) - getBucketLocation - headBucket - deleteBucket Should return 404 or NoSuchBucket even for invalid bucket names, invalid bucket names are only validated during MakeBucket operation	2018-04-24 08:57:33 +05:30
kannappanr	cef992a395	Remove error package and cause functions (#5784 )	2018-04-10 09:36:37 -07:00
Harshavardhana	217fb470a7	Add a check to check if disk is writable (#5662 ) This check is a pre-emptive check to return error early before we attempt to use the disk for any other operations later. refer #5645	2018-04-10 09:26:09 +05:30
Harshavardhana	1d31ad499f	Make sure to re-load reference format after HealFormat (#5772 ) This PR introduces ReloadFormat API call at objectlayer to facilitate this. Previously we repurposed HealFormat but we never ended up updating our reference format on peers. Fixes #5700	2018-04-09 22:55:41 +05:30
kannappanr	f8a3fd0c2a	Create logger package and rename errorIf to LogIf (#5678 ) Removing message from error logging Replace errors.Trace with LogIf	2018-04-05 15:04:40 -07:00
Krishna Srinivas	804a4f9c15	Fix backend format for disk-cache - not to use FS format.json (#5732 )	2018-03-29 14:38:26 -07:00
poornas	af024a9c69	Remove deadcode related to multipart cleanup for fs (#5716 ) The cleanup code is no longer needed as we moved to lockfree multipart backend for fs	2018-03-29 08:26:52 +05:30
poornas	a3e806ed61	Add disk based edge caching support. (#5182 ) This PR adds disk based edge caching support for minio server. Cache settings can be configured in config.json to take list of disk drives, cache expiry in days and file patterns to exclude from cache or via environment variables MINIO_CACHE_DRIVES, MINIO_CACHE_EXCLUDE and MINIO_CACHE_EXPIRY Design assumes that Atime support is enabled and the list of cache drives is fixed. - Objects are cached on both GET and PUT/POST operations. - Expiry is used as hint to evict older entries from cache, or if 80% of cache capacity is filled. - When object storage backend is down, GET, LIST and HEAD operations fetch object seamlessly from cache. Current Limitations - Bucket policies are not cached, so anonymous operations are not supported in offline mode. - Objects are distributed using deterministic hashing among list of cache drives specified.If one or more drives go offline, or cache drive configuration is altered - performance could degrade to linear lookup. Fixes #4026	2018-03-28 14:14:06 -07:00
poornas	76d1e8bbcd	change fs.json format to include checksum fields (#5685 )	2018-03-27 17:23:10 -07:00
Bala FA	3ebe61abdf	Quick support to server level WORM (#5602 ) This is a trival fix to support server level WORM. The feature comes with an environment variable `MINIO_WORM`. Usage: ``` $ export MINIO_WORM=on $ minio server endpoint ```	2018-03-27 16:44:45 -07:00
Krishna Srinivas	e452377b24	Add context to the object-interface methods. Make necessary changes to xl fs azure sia	2018-03-15 16:28:25 -07:00
Krishna Srinivas	9083bc152e	Flat multipart backend implementation for Erasure backend (#5447 )	2018-03-15 13:55:23 -07:00
Bala FA	0e4431725c	make notification as separate package (#5294 ) * Remove old notification files * Add net package * Add event package * Modify minio to take new notification system	2018-03-15 13:03:41 -07:00
Harshavardhana	29ef7d29e4	Fix deadlock in in-place CopyObject decryption/encryption (#5637 ) In-place decryption/encryption already holds write locks on them, attempting to acquire a read lock would fail.	2018-03-12 13:52:38 -07:00
Harshavardhana	7aaf01eb74	Save ETag when updating metadata (#5626 ) Fixes #5622	2018-03-09 10:50:39 -08:00
Harshavardhana	52eea7b9c1	Support SSE-C multipart source objects in CopyObject (#5603 ) Current code didn't implement the logic to support decrypting encrypted multiple parts, this PR fixes by supporting copying encrypted multipart objects.	2018-03-02 17:24:02 -08:00
Anis Elleuch	120b061966	Add multipart support in SSE-C encryption (#5576 ) ) Add Put/Get support of multipart in encryption ) Add GET Range support for encryption ) Add CopyPart encrypted support ) Support decrypting of large single PUT object	2018-03-01 11:37:57 -08:00
Harshavardhana	7cc678c653	Support encryption for CopyObject, GET-Range requests (#5544 ) - Implement CopyObject encryption support - Handle Range GETs for encrypted objects Fixes #5193	2018-02-23 15:07:21 -08:00
Harshavardhana	0ea54c9858	Change CopyObject{Part} to single srcInfo argument (#5553 ) Refactor such that metadata and etag are combined to a single argument `srcInfo`. This is a precursor change for #5544 making it easier for us to provide encryption/decryption functions.	2018-02-21 14:18:47 +05:30
poornas	25107c2e11	Add NAS gateway support (#5516 )	2018-02-20 12:21:12 -08:00
Harshavardhana	fb96779a8a	Add large bucket support for erasure coded backend (#5160 ) This PR implements an object layer which combines input erasure sets of XL layers into a unified namespace. This object layer extends the existing erasure coded implementation, it is assumed in this design that providing > 16 disks is a static configuration as well i.e if you started the setup with 32 disks with 4 sets 8 disks per pack then you would need to provide 4 sets always. Some design details and restrictions: - Objects are distributed using consistent ordering to a unique erasure coded layer. - Each pack has its own dsync so locks are synchronized properly at pack (erasure layer). - Each pack still has a maximum of 16 disks requirement, you can start with multiple such sets statically. - Static sets set of disks and cannot be changed, there is no elastic expansion allowed. - Static sets set of disks and cannot be changed, there is no elastic removal allowed. - ListObjects() across sets can be noticeably slower since List happens on all servers, and is merged at this sets layer. Fixes #5465 Fixes #5464 Fixes #5461 Fixes #5460 Fixes #5459 Fixes #5458 Fixes #5460 Fixes #5488 Fixes #5489 Fixes #5497 Fixes #5496	2018-02-15 17:45:57 -08:00
Harshavardhana	91101b11bb	Converge repeated code to common deleteBucketMetadata() (#5508 )	2018-02-12 18:34:30 -08:00
poornas	4f73fd9487	Unify gateway and object layer. (#5487 ) * Unify gateway and object layer. Bring bucket policies into object layer.	2018-02-09 15:19:30 -08:00
Harshavardhana	033cfb5cef	Remove stale code from minio server (#5479 )	2018-01-31 18:28:28 -08:00
Krishna Srinivas	3b2486ebaf	Lock free multipart backend implementation for FS (#5401 )	2018-01-31 13:17:24 -08:00
Harshavardhana	3ea28e9771	Support creating directories on erasure coded backend (#5443 ) This PR continues from #5049 where we started supporting directories for erasure coded backend	2018-01-30 08:13:13 +05:30
Aditya Manthramurthy	a337ea4d11	Move admin APIs to new path and add redesigned heal APIs (#5351 ) - Changes related to moving admin APIs - admin APIs now have an endpoint under /minio/admin - admin APIs are now versioned - a new API to server the version is added at "GET /minio/admin/version" and all API operations have the path prefix /minio/admin/v1/<operation> - new service stop API added - credentials change API is moved to /minio/admin/v1/config/credential - credentials change API and configuration get/set API now require TLS so that credentials are protected - all API requests now receive JSON - heal APIs are disabled as they will be changed substantially - Heal API changes Heal API is now provided at a single endpoint with the ability for a client to start a heal sequence on all the data in the server, a single bucket, or under a prefix within a bucket. When a heal sequence is started, the server returns a unique token that needs to be used for subsequent 'status' requests to fetch heal results. On each status request from the client, the server returns heal result records that it has accumulated since the previous status request. The server accumulates upto 1000 records and pauses healing further objects until the client requests for status. If the client does not request any further records for a long time, the server aborts the heal sequence automatically. A heal result record is returned for each entity healed on the server, such as system metadata, object metadata, buckets and objects, and has information about the before and after states on each disk. A client may request to force restart a heal sequence - this causes the running heal sequence to be aborted at the next safe spot and starts a new heal sequence.	2018-01-22 14:54:55 -08:00
Aditya Manthramurthy	aa7e5c71e9	Remove upload healing related dead code (#5404 )	2018-01-15 18:20:39 -08:00
Harshavardhana	12f67d47f1	Fix a possible race during PutObject() (#5376 ) Under any concurrent removeObjects in progress might have removed the parents of the same prefix for which there is an ongoing putObject request. An inconsistent situation may arise as explained below even under sufficient locking. PutObject is almost successful at the last stage when a temporary file is renamed to its actual namespace at `a/b/c/object1`. Concurrently a RemoveObject is also in progress at the same prefix for an `a/b/c/object2`. To create the object1 at location `a/b/c` PutObject has to create all the parents recursively. ``` a/b/c - os.MkdirAll loops through has now created 'a/' and 'b/' about to create 'c/' a/b/c/object2 - at this point 'c/' and 'object2' are deleted about to delete b/ ``` Now for os.MkdirAll loop the expected situation is that top level parent 'a/b/' exists which it created , such that it can create 'c/' - since removeObject and putObject do not compete for lock due to holding locks at different resources. removeObject proceeds to delete parent 'b/' since 'c/' is not yet present, once deleted 'os.MkdirAll' would receive an error as syscall.ENOENT which would fail the putObject request. This PR tries to address this issue by implementing a safer/guarded approach where we would retry an operation such as `os.MkdirAll` and `os.Rename` if both operations observe syscall.ENOENT. Fixes #5254	2018-01-13 22:43:02 +05:30
poornas	0bb6247056	Move nslocking from s3 layer to object layer (#5382 ) Fixes #5350	2018-01-13 10:04:52 +05:30
kannappanr	20584dc08f	Remove unnecessary errors printed on the console (#5386 ) Some of the errors printed on server console can be removed as those error message is unnecessary. Fixes #5385	2018-01-11 11:42:05 -08:00
Harshavardhana	490c30f853	erasure: Support cleaning up of stale multipart objects (#5250 ) Just like our single directory/disk setup, this PR brings the functionality to cleanup stale multipart objects older > 2 weeks.	2017-11-30 18:11:42 -08:00
Harshavardhana	8efa82126b	Convert errors tracer into a separate package (#5221 )	2017-11-25 11:58:29 -08:00
Nitish Tiwari	f7b6f7b22f	Update getObjectInfo to stat for objects with trailing / (#5179 ) Apache Spark sends getObject requests with trailing "/". This PR updates the getObjectInfo to stat for files even if they are sent with trailing "/". Fixes #2965	2017-11-16 16:00:27 -08:00
Harshavardhana	5eb210dd2e	Set etag properly to calculated value if available (#5106 ) Fixes #5100	2017-10-24 12:25:42 -07:00
Harshavardhana	1d8a8c63db	Simplify data verification with HashReader. (#5071 ) Verify() was being called by caller after the data has been successfully read after io.EOF. This disconnection opens a race under concurrent access to such an object. Verification is not necessary outside of Read() call, we can simply just do checksum verification right inside Read() call at io.EOF. This approach simplifies the usage.	2017-10-22 11:00:34 +05:30

1 2 3 4 5 ...

335 Commits