minio

Commit Graph

Author	SHA1	Message	Date
Aditya Manthramurthy	aa7e5c71e9	Remove upload healing related dead code (#5404 )	2018-01-15 18:20:39 -08:00
Harshavardhana	12f67d47f1	Fix a possible race during PutObject() (#5376 ) Under any concurrent removeObjects in progress might have removed the parents of the same prefix for which there is an ongoing putObject request. An inconsistent situation may arise as explained below even under sufficient locking. PutObject is almost successful at the last stage when a temporary file is renamed to its actual namespace at `a/b/c/object1`. Concurrently a RemoveObject is also in progress at the same prefix for an `a/b/c/object2`. To create the object1 at location `a/b/c` PutObject has to create all the parents recursively. ``` a/b/c - os.MkdirAll loops through has now created 'a/' and 'b/' about to create 'c/' a/b/c/object2 - at this point 'c/' and 'object2' are deleted about to delete b/ ``` Now for os.MkdirAll loop the expected situation is that top level parent 'a/b/' exists which it created , such that it can create 'c/' - since removeObject and putObject do not compete for lock due to holding locks at different resources. removeObject proceeds to delete parent 'b/' since 'c/' is not yet present, once deleted 'os.MkdirAll' would receive an error as syscall.ENOENT which would fail the putObject request. This PR tries to address this issue by implementing a safer/guarded approach where we would retry an operation such as `os.MkdirAll` and `os.Rename` if both operations observe syscall.ENOENT. Fixes #5254	2018-01-13 22:43:02 +05:30
poornas	0bb6247056	Move nslocking from s3 layer to object layer (#5382 ) Fixes #5350	2018-01-13 10:04:52 +05:30
kannappanr	20584dc08f	Remove unnecessary errors printed on the console (#5386 ) Some of the errors printed on server console can be removed as those error message is unnecessary. Fixes #5385	2018-01-11 11:42:05 -08:00
Harshavardhana	490c30f853	erasure: Support cleaning up of stale multipart objects (#5250 ) Just like our single directory/disk setup, this PR brings the functionality to cleanup stale multipart objects older > 2 weeks.	2017-11-30 18:11:42 -08:00
Harshavardhana	8efa82126b	Convert errors tracer into a separate package (#5221 )	2017-11-25 11:58:29 -08:00
Nitish Tiwari	f7b6f7b22f	Update getObjectInfo to stat for objects with trailing / (#5179 ) Apache Spark sends getObject requests with trailing "/". This PR updates the getObjectInfo to stat for files even if they are sent with trailing "/". Fixes #2965	2017-11-16 16:00:27 -08:00
Harshavardhana	5eb210dd2e	Set etag properly to calculated value if available (#5106 ) Fixes #5100	2017-10-24 12:25:42 -07:00
Harshavardhana	1d8a8c63db	Simplify data verification with HashReader. (#5071 ) Verify() was being called by caller after the data has been successfully read after io.EOF. This disconnection opens a race under concurrent access to such an object. Verification is not necessary outside of Read() call, we can simply just do checksum verification right inside Read() call at io.EOF. This approach simplifies the usage.	2017-10-22 11:00:34 +05:30
Harshavardhana	b2cbade477	Support creating empty directories. (#5049 ) Every so often we get requirements for creating directories/prefixes and we end up rejecting such requirements. This PR implements this and allows empty directories without any new file addition to backend. Existing lower APIs themselves are leveraged to provide this behavior. Only FS backend supports this for the time being as desired.	2017-10-16 17:20:54 -07:00
Harshavardhana	3d0dced23c	Remove go1.9 specific code for windows (#5033 ) Following fix https://go-review.googlesource.com/#/c/41834/ has been merged upstream and released with go1.9.	2017-10-13 15:31:15 +05:30
Harshavardhana	0b546ddfd4	Return errors in PutObject()/PutObjectPart() if input size is -1. (#5015 ) Amazon S3 API expects all incoming stream has a content-length set it was superflous for us to support object layer which supports unknown sized stream as well, this PR removes such requirements and explicitly error out if input stream is less than zero.	2017-10-06 09:38:01 -07:00
Harshavardhana	d3eb5815d9	Avoid DDOS in PutObject() when objectName is '/' and size '0' (#4962 ) It can happen that an incoming PutObject() request might have inputs of following form eg:- - bucketName is 'testbucket' - objectName is '/' bucketName exists and was previously created but there are no other objects in this bucket. In a situation like this parentDirIsObject() goes into an infinite loop. Verifying that if '/' is an object fails on both backends but the resulting `path.Dir('/')` returns `'/'` this causes the closure to loop onto itself. Fixes #4940	2017-09-25 14:47:58 -07:00
Andreas Auernhammer	79ba4d3f33	refactor ObjectLayer PutObject and PutObjectPart (#4925 ) This change refactor the ObjectLayer PutObject and PutObjectPart functions. Instead of passing an io.Reader and a size to PUT operations ObejectLayer expects an HashReader. A HashReader verifies the MD5 sum (and SHA256 sum if required) of the object. This change updates all all PutObject(Part) calls and removes unnecessary code in all ObjectLayer implementations. Fixes #4923	2017-09-19 12:40:27 -07:00
Frank Wessels	61e0b1454a	Add support for timeouts for locks (#4377 )	2017-08-31 14:43:59 -07:00
Harshavardhana	1bb9d49eaa	fs: ListObjects() was reading ETag at wrong offsets (#4846 ) Current code was just using io.ReadAll() on an fd() which might have moved underneath due to a concurrent read operation. Subsequent read will result in EOF We should always seek back and read again. pread() is allowed on all platforms use io.SectionReader to read from the beginning of the file. Fixes #4842	2017-08-23 17:59:14 -07:00
Harshavardhana	d864e00e24	posix: Deprecate custom removeAll/mkdirAll implementations. (#4808 ) Since go1.8 os.RemoveAll and os.MkdirAll both support long path names i.e UNC path on windows. The code we are carrying was directly borrowed from `pkg/os` package and doesn't need to be in our repo anymore. As a side affect this also addresses our codecoverage issue. Refer #4658	2017-08-12 19:25:43 -07:00
Harshavardhana	3544e5ad01	fs: Fix Shutdown() behavior and handle tests properly. (#4796 ) Fixes #4795	2017-08-10 14:11:57 -07:00
Harshavardhana	e7cdd8f02c	fs: Avoid non-idempotent code flow in ListBuckets() (#4798 ) Under the call flow ``` Readdir + \| \| \| path-entry \| \| v StatDir ``` Existing code was written in a manner where say a bucket/top-level directory was indeed deleted between Readdir() and before StatDir() we would ignore certain errors. This is not a plausible situation and might not happen in almost all practical cases. We do not have to look for or interpret these errors returned by StatDir() instead we can just collect the successful values and return back to the client. We do not need to pre-maturely decide on bucket access we just let filesystem decide subsequently for real I/O operations. Refer #4658	2017-08-10 13:36:11 -07:00
Krishnan Parthasarathi	75c43bfb6c	ListMultipartUploads, ListObjectParts return empty response (#4694 ) Also, periodically removes incomplete multipart uploads older than 2 weeks.	2017-08-04 10:45:57 -07:00
Harshavardhana	cc8a8cb877	posix: Check for min disk space and inodes (#4618 ) This is needed such that we don't start or allow writing to a posix disk which doesn't have minimum total disk space available. One part fix for #4617	2017-07-10 18:14:48 -07:00
Frank Wessels	46897b1100	Name return values to prevent the need (and unnecessary code bloat) (#4576 ) This is done to explicitly instantiate objects for every return statement.	2017-06-21 19:53:09 -07:00
Harshavardhana	353f2d3a6e	fs: Hold `format.json` readLock ref to avoid GC. (#4532 ) Looks like if we follow pattern such as ``` _ = rlk ``` Go can potentially kick in GC and close the fd when the reference is lost, only speculation is that the cause here is `SetFinalizer` which is set on `os.close()` internally in `os` stdlib. This is unexpected and unsual endeavour for Go, but we have to make sure the reference is never lost and always dies with the server. Fixes #4530	2017-06-13 08:29:07 -07:00
Harshavardhana	075b8903d7	fs: Add safe locking semantics for `format.json` (#4523 ) This patch also reverts previous changes which were merged for migration to the newer disk format. We will be bringing these changes in subsequent releases. But we wish to add protection in this release such that future release migrations are protected. Revert "fs: Migration should handle bucketConfigs as regular objects. (#4482)" This reverts commit `976870a391`. Revert "fs: Migrate object metadata to objects directory. (#4195)" This reverts commit `76f4f20609`.	2017-06-12 17:40:28 -07:00
Harshavardhana	976870a391	fs: Migration should handle bucketConfigs as regular objects. (#4482 ) Current code failed to anticipate the existence of files which could have been created to corrupt the namespace such as `policy.json` file created at the bucket top level. In the current release creating such as file conflicts with the namespace for future bucket policy operations. We implemented migration of backend format to avoid situations such as these. This PR handles this situation, makes sure that the erroneous files should have been moved properly. Fixes #4478	2017-06-06 12:15:35 -07:00
poornas	18c4e5d357	Enable browser support for gateway (#4425 )	2017-06-01 09:43:20 -07:00
Harshavardhana	072fcf3ba6	fs: Make sure to validate bucket first in PutObject() (#4427 ) Currently even when bucket doesn't exist we wrongly return success, when an object is a directory prefix with '/' as suffix and is of size 0. This PR fixes this behavior.	2017-05-25 09:22:43 -07:00
Harshavardhana	155a90403a	fs/erasure: Rename meta 'md5Sum' as 'etag'. (#4319 ) This PR also does backend format change to 1.0.1 from 1.0.0. Backward compatible changes are still kept to read the 'md5Sum' key. But all new objects will be stored with the same details under 'etag'. Fixes #4312	2017-05-14 12:05:51 -07:00
Harshavardhana	fa3f6d75b6	fs: Verify if parent is an object before i/o. (#4304 ) PutObject() needs to verify and fail. Fixes #4301	2017-05-09 17:46:46 -07:00
Harshavardhana	298b470f69	fs/erasure: Ignore objects with / even for DeleteObject() (#4303 ) Additionally GetObject() also returns errFileNotFound similar to HeadObject(). Fixes #4302	2017-05-09 14:32:24 -07:00
Harshavardhana	76f4f20609	fs: Migrate object metadata to objects directory. (#4195 ) Fixes #3352	2017-05-05 08:49:09 -07:00
Harshavardhana	f0b5c0ec7c	windows: Support all REPARSE_POINT attrib files properly. (#4203 ) This change adopts the upstream fix in this regard at https://go-review.googlesource.com/#/c/41834/ for Minio's purposes. Go's current os.Stat() lacks support for lot of strange windows files such as - share symlinks on SMB2 - symlinks on docker nanoserver - de-duplicated files on NTFS de-duplicated volume. This PR attempts to incorporate the change mentioned here https://blogs.msdn.microsoft.com/oldnewthing/20100212-00/?p=14963/ The article suggests to use Windows I/O manager to dereference the symbolic link. Fixes #4122	2017-05-02 02:35:27 -07:00
Anis Elleuch	14f0047295	fs: Remove fs meta lock when PutObject() fails (#4114 ) Removing the fs meta lock file when PutObject() encounters any error during its execution, such as upload getting permatuerly cancelled by the client.	2017-04-14 12:06:24 -07:00
Harshavardhana	4747adfcb4	fs: Enable returning ETag along with ListObjects() (#4042 ) This is to comply with S3 behavior, we previously removed reading `fs.json` for optimization reasons but we have a reason to believe that providing ETag and using gjson provides needed benefit of not having to deal with unmarshalling overhead of golang stdlib. Fixes #4028	2017-04-04 09:14:03 -07:00
Krishnan Parthasarathi	2bd694dbc8	Add disksUnavailable healStatus const (#3990 ) `disksUnavailable` healStatus constant indicates that a given object needs healing but one or more of disks requiring heal are offline. This can be used by admin heal API consumers to distinguish between a successful heal and a no-op since the outdated disks were offline.	2017-03-31 17:55:15 -07:00
Krishnan Parthasarathi	051f9bb5c6	Implement list uploads heal admin API (#3885 )	2017-03-16 00:15:06 -07:00
Anis Elleuch	a5e60706a2	xl,fs: Return 404 if object ends with a separator (#3897 ) HEAD Object for FS and XL was returning invalid object name when an object name has a trailing slash separator, this PR changes the behavior and will always return 404 object not found, this guarantees a better compatibility with S3 spec.	2017-03-13 22:20:46 -07:00
Harshavardhana	47ac410ab0	Code cleanup - simplify server side code. (#3870 ) Fix all the issues reported by `gosimple` tool.	2017-03-08 10:00:47 -08:00
Anis Elleuch	79e0b9e69a	Relax minio server start when disk threshold is reached and adds space check in FS (#3865 ) * fs: Rename tempObjPath variable in fsCreateFile() * fs/posix: Factor checkDiskFree() function * fs: Add disk free check in fsCreateFile() * posix: Move free disk check to createFile() * xl: Relax free disk check in POSIX initialization * fs: checkDiskFree checks for space to store data	2017-03-07 12:25:40 -08:00
Harshavardhana	bcc5b6e1ef	xl: Rename getOrderedDisks as shuffleDisks appropriately. (#3796 ) This PR is for readability cleanup - getOrderedDisks as shuffleDisks - getOrderedPartsMetadata as shufflePartsMetadata Distribution is now a second argument instead being the primary input argument for brevity. Also change the usage of type casted int64(0), instead rely on direct type reference as `var variable int64` everywhere.	2017-02-24 09:20:40 -08:00
Harshavardhana	7ea1de8245	copyObject: Be case sensitive for windows only server. (#3766 ) For case sensitive platforms we should honor case. Fixes #3765 ``` 1) python s3cmd -c s3cfg_localminio put logo.png s3://testbucket/xyz/etc2/logo.PNG 2) python s3cmd -c s3cfg_localminio ls s3://testbucket/xyz/etc2/ 2017-02-18 10:58 22059 s3://testbucket/xyz/etc2/logo.PNG 3) python s3cmd -c s3cfg_localminio cp s3://testbucket/xyz/etc2/logo.PNG s3://testbucket/xyz/etc2/logo.png remote copy: 's3://testbucket/xyz/etc2/logo.PNG' -> 's3://testbucket/xyz/etc2/logo.png' 4) python s3cmd -c s3cfg_localminio ls s3://testbucket/xyz/etc2/ 2017-02-18 10:58 22059 s3://testbucket/xyz/etc2/logo.PNG 2017-02-18 11:10 22059 s3://testbucket/xyz/etc2/logo.png ```	2017-02-18 13:41:59 -08:00
Harshavardhana	50b4e54a75	fs: Do not return reservedBucket names in ListBuckets() (#3754 ) Make sure to skip reserved bucket names in `ListBuckets()` current code didn't skip this properly and also generalize this behavior for both XL and FS.	2017-02-16 14:52:14 -08:00
Krishna Srinivas	152cdf1c05	fs: Move traceError() to lower functions where possible. (#3633 )	2017-01-26 15:40:10 -08:00
Krishna Srinivas	17dd1c19df	cleanup: refactor common code between FS and XL listDirFactory. (#3639 )	2017-01-26 15:39:22 -08:00
Harshavardhana	dafdc74605	fs: if `fs.json` is empty ignore it while reading metadata. (#3634 ) This is needed so that we don't send wrong errors on previously failed PutObject() which would have left a stale `fs.json` entry.	2017-01-26 10:19:07 -08:00
Krishna Srinivas	82373e3d50	fs: cleanup - do not cache size of metafiles (#3630 ) * Remove Size() method and size field from lock.LockedFile * WriteTo method of fsMeta and uploadsV1 now takes concrete type *lock.LockedFile	2017-01-25 12:29:06 -08:00
Harshavardhana	51fa4f7fe3	Make PutObject a nop for an object which ends with "/" and size is '0' (#3603 ) This helps majority of S3 compatible applications while not returning an error upon directory create request. Fixes #2965	2017-01-20 16:33:01 -08:00
Andrei Kopats	c3f7d1026f	fs: start even if there are not enough free space (#3606 )	2017-01-20 09:30:20 -08:00
Jeffery Utter	9e1f1b50e0	Don't Check Available Inodes on NFS (#3598 ) In some cases (such as with VirutualBox, this value gets hardcoded to 1000, which is less than the required minimum of 10000. Fixes #3592	2017-01-19 10:39:44 -08:00
Anis Elleuch	0715032598	heal: Add ListBucketsHeal object API (#3563 ) ListBucketsHeal will list which buckets that need to be healed: * ListBucketsHeal() (buckets []BucketInfo, err error)	2017-01-19 09:34:18 -08:00

1 2

94 Commits