minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	b9b353db4b	Add env to support synchronous ops for all calls (#6877 )	2018-12-11 16:22:56 -08:00
Nitish Tiwari	2a810c7da2	Improve du thread performance (#6849 )	2018-11-26 10:35:14 +05:30
Harshavardhana	f1f23f6f11	Add sync mode for 'xl.json' (#6798 ) xl.json is the source of truth for all erasure coded objects, without which we won't be able to read the objects properly. This PR enables sync mode for writing `xl.json` such that all writes hit the disk and are persistent under situations such as abrupt power failures on servers running Minio.	2018-11-14 19:48:35 +05:30
Praveen raj Mani	ce9d36d954	Add object compression support (#6292 ) Add support for streaming (golang/LZ77/snappy) compression.	2018-09-28 09:06:17 +05:30
Anis Elleuch	7571582000	Print storage errors during distributed initialization (#6441 ) This commit will print connection failures to other disks in other nodes after 5 retries. It is useful for users to understand why the distributed cluster fails to boot up.	2018-09-10 16:21:59 -07:00
Krishna Srinivas	ce02ab613d	Simplify erasure code by separating bitrot from erasure code (#5959 )	2018-08-06 15:14:08 -07:00
Oleg Kovalov	37de2dbd3b	simplifying if-else chains to switches (#6208 )	2018-08-06 10:26:40 -07:00
Harshavardhana	ad86454580	Make sure to handle FaultyDisks in listing ops (#6204 ) Continuing from PR `157ed65c35`. Our posix.go implementation did not handle I/O errors properly on the disks; this led to situations where top-level callers such as ListObjects might return early without even verifying all the available disks. This commit tries to address this in Kubernetes, drbd/nbd based persistent volumes which can disconnect under load and result in situations where disks return I/O errors. This commit also simplifies the listing operation: listing never returns any error. We can avoid this since we pretty much ignore most of the errors anyway. When objects are accessed directly we return proper errors.	2018-07-27 15:32:19 -07:00
Krishna Srinivas	40ed0d1f5d	Support 1GB disk size (#6137 ) Pivotal CF by default has 1GB disk option which causes minio to not start	2018-07-09 18:23:49 -07:00
Harshavardhana	de251483d1	Avoid ticker timer to simplify disk usage (#6101 ) This PR simplifies the code to avoid tracking any running usage events. This PR also brings in an upper threshold of up to 1 minute for suspending the usage function, after which the usage would proceed without waiting any longer.	2018-06-28 15:05:45 -07:00
Praveen raj Mani	ea76e72054	Incorrect error message for insufficient volume fix (#6099 ) Reply back with appropriate error message when the server is spawned with a volume of insufficient size (< 1GiB). Fixes #5993.	2018-06-28 12:01:05 -07:00
Harshavardhana	25de775560	disable disk-usage when export is root mount path (#6091 ) disk usage crawling is not needed when a tenant is not sharing the same disk for multiple other tenants. This PR adds an optimization when we see a setup uses entire disk, we simply rely on statvfs() to give us total usage. This PR also additionally adds low priority scheduling for usage check routine, such that other go-routines blocked will be automatically unblocked and prioritized before usage.	2018-06-27 18:59:38 -07:00
Ashish Kumar Sinha	0bbdd02a57	Updating disk storage for FS/Erasure mode (#6081 ) Updating the disk storage stats for FS/Erasure coded backend	2018-06-25 10:46:48 -07:00
Harshavardhana	cb9ee1584a	Fix TestHealStartNStatusHandler sporadic failure (#6015 ) Fixes #5818	2018-06-12 16:36:31 -07:00
Bala FA	6a8bfcef1c	remove separate file for posix utils. (#5948 )	2018-06-07 12:31:40 +05:30
Bala FA	6a53dd1701	Implement HTTP POST based RPC (#5840 ) Added support for new RPC support using HTTP POST. RPC's arguments and reply are Gob encoded and sent as HTTP request/response body. This patch also removes Go RPC based implementation.	2018-06-06 14:21:56 +05:30
Harshavardhana	6fb0604502	Allow usage check to be configurable (#6006 )	2018-06-04 18:35:41 -07:00
Harshavardhana	000e360196	Deprecate showing drive capacity and total free (#5976 ) This addresses a situation that we shouldn't be displaying Total/Free anymore, instead we should simply show the total usage.	2018-05-23 17:30:25 -07:00
Harshavardhana	e6ec645035	Implement support for calculating disk usage per tenant (#5969 ) Fixes #5961	2018-05-23 15:41:29 +05:30
Bala FA	4eb788df79	rename checkPathValid() to getValidPath() (#5949 )	2018-05-17 07:27:07 -07:00
Anis Elleuch	6d5f2a4391	Better support of empty directories (#5890 ) Better support of HEAD and listing of zero sized objects with trailing slash (a.k.a empty directory). For that, isLeafDir function is added to indicate if the specified object is an empty directory or not. Each backend (xl, fs) has the responsibility to store that information. Currently, in both of XL & FS, an empty directory is represented by an empty directory in the backend. isLeafDir() checks if the given path is an empty directory or not, since dir listing is costly if the latter contains too many objects, readDirN() is added in this PR to list only N number of entries. In isLeafDir(), we will only list one entry to check if a directory is empty or not.	2018-05-09 01:38:21 -07:00
Harshavardhana	ccdb7bc286	Fix s3 compatibility fixes for getBucketLocation,headBucket,deleteBucket (#5842 ) - getBucketLocation - headBucket - deleteBucket Should return 404 or NoSuchBucket even for invalid bucket names, invalid bucket names are only validated during MakeBucket operation	2018-04-24 08:57:33 +05:30
Harshavardhana	4a874dfbc1	Ignore prefix renames when dest directory is not empty (#5798 ) Also make sure to not modify the underlying errors from layers, we should return the error as is and one object layer should translate the errors. Fixes #5797	2018-04-11 17:15:42 -07:00
Harshavardhana	217fb470a7	Add a check to check if disk is writable (#5662 ) This check is a pre-emptive check to return error early before we attempt to use the disk for any other operations later. refer #5645	2018-04-10 09:26:09 +05:30
kannappanr	f8a3fd0c2a	Create logger package and rename errorIf to LogIf (#5678 ) Removing message from error logging Replace errors.Trace with LogIf	2018-04-05 15:04:40 -07:00
Aditya Manthramurthy	ea8973b7d7	Return bit-rot verified data instead of re-reading from disk (#5568 ) - Data from disk was being read after bitrot verification to return data for GetObject. Strictly speaking this does not guarantee bitrot protection, as disks may return bad data even temporarily. - This fix reads data from disk, verifies data for bitrot and then returns data to the client directly.	2018-03-04 14:16:45 -08:00
Anis Elleuch	d2d49f6c6c	xl: Avoid removing directory content in Delete API (#5548 ) Delete & Multi Delete API should not try to remove the directory content. The only permitted case is with zero size object with a trailing slash in its name.	2018-02-20 15:33:26 -08:00
Anis Elleuch	926e480156	posix.RenameFile(): Allow overwriting an empty directory (#5551 ) Overwriting files is allowed, but since the introduction of the object directory, we will also need to allow overwriting an empty directory. Putting twice the same object directory won't fail with 403 error anymore.	2018-02-20 12:20:18 -08:00
Harshavardhana	fb96779a8a	Add large bucket support for erasure coded backend (#5160 ) This PR implements an object layer which combines input erasure sets of XL layers into a unified namespace. This object layer extends the existing erasure coded implementation, it is assumed in this design that providing > 16 disks is a static configuration as well i.e if you started the setup with 32 disks with 4 sets 8 disks per pack then you would need to provide 4 sets always. Some design details and restrictions: - Objects are distributed using consistent ordering to a unique erasure coded layer. - Each pack has its own dsync so locks are synchronized properly at pack (erasure layer). - Each pack still has a maximum of 16 disks requirement, you can start with multiple such sets statically. - Sets of disks are static and cannot be changed, there is no elastic expansion allowed. - Sets of disks are static and cannot be changed, there is no elastic removal allowed. - ListObjects() across sets can be noticeably slower since List happens on all servers, and is merged at this sets layer. Fixes #5465 Fixes #5464 Fixes #5461 Fixes #5460 Fixes #5459 Fixes #5458 Fixes #5460 Fixes #5488 Fixes #5489 Fixes #5497 Fixes #5496	2018-02-15 17:45:57 -08:00
Harshavardhana	3ea28e9771	Support creating directories on erasure coded backend (#5443 ) This PR continues from #5049 where we started supporting directories for erasure coded backend	2018-01-30 08:13:13 +05:30
Harshavardhana	12f67d47f1	Fix a possible race during PutObject() (#5376 ) Under any concurrent removeObjects in progress might have removed the parents of the same prefix for which there is an ongoing putObject request. An inconsistent situation may arise as explained below even under sufficient locking. PutObject is almost successful at the last stage when a temporary file is renamed to its actual namespace at `a/b/c/object1`. Concurrently a RemoveObject is also in progress at the same prefix for an `a/b/c/object2`. To create the object1 at location `a/b/c` PutObject has to create all the parents recursively. ``` a/b/c - os.MkdirAll loops through has now created 'a/' and 'b/' about to create 'c/' a/b/c/object2 - at this point 'c/' and 'object2' are deleted about to delete b/ ``` Now for os.MkdirAll loop the expected situation is that top level parent 'a/b/' exists which it created , such that it can create 'c/' - since removeObject and putObject do not compete for lock due to holding locks at different resources. removeObject proceeds to delete parent 'b/' since 'c/' is not yet present, once deleted 'os.MkdirAll' would receive an error as syscall.ENOENT which would fail the putObject request. This PR tries to address this issue by implementing a safer/guarded approach where we would retry an operation such as `os.MkdirAll` and `os.Rename` if both operations observe syscall.ENOENT. Fixes #5254	2018-01-13 22:43:02 +05:30
Harshavardhana	3d0dced23c	Remove go1.9 specific code for windows (#5033 ) Following fix https://go-review.googlesource.com/#/c/41834/ has been merged upstream and released with go1.9.	2017-10-13 15:31:15 +05:30
Andreas Auernhammer	7e6b5bdbb7	remove ReadFileWithVerify from StorageAPI (#4947 ) This change removes the ReadFileWithVerify function from the StorageAPI. The ReadFile was basically a redirection to ReadFileWithVerify. This change removes the redirection and moves the logic of ReadFileWithVerify directly into ReadFile. This removes a lot of unnecessary code in all StorageAPI implementations. Fixes #4946 * review: fix doc and typos	2017-09-25 11:32:56 -07:00
Harshavardhana	879cef37a1	Fail to start server if detected cross-device mounts. (#4807 ) Fixes #4764	2017-08-15 15:10:50 -07:00
Andreas Auernhammer	85fcee1919	erasure: simplify XL backend operations (#4649 ) (#4758 ) This change provides new implementations of the XL backend operations: - create file - read file - heal file Further this change adds table based tests for all three operations. This affects also the bitrot algorithm integration. Algorithms are now integrated in an idiomatic way (like crypto.Hash). Fixes #4696 Fixes #4649 Fixes #4359	2017-08-14 18:08:42 -07:00
Harshavardhana	d864e00e24	posix: Deprecate custom removeAll/mkdirAll implementations. (#4808 ) Since go1.8 os.RemoveAll and os.MkdirAll both support long path names i.e UNC path on windows. The code we are carrying was directly borrowed from `pkg/os` package and doesn't need to be in our repo anymore. As a side effect this also addresses our code coverage issue. Refer #4658	2017-08-12 19:25:43 -07:00
Brendan Ashworth	aeafe668d8	posix: do not upstream errors in deleteFile (#4771 ) This commit changes posix's deleteFile() to not upstream errors from removing parent directories. This fixes a race condition. The race condition occurs when multiple deleteFile()s are called on the same parent directory, but different child files. Because deleteFile() recursively removes parent directories if they are empty, but deleteFile() errors if the selected deletePath does not exist, there was an opportunity for a race condition. The two processes would remove the child directories successfully, then depend on the parent directory still existing. In some cases this is an invalid assumption, because other processes can remove the parent directory beforehand. This commit changes deleteFile() to not upstream an error if one occurs, because the only required error should be from the immediate deletePath, not from a parent path. In the specific bug report, multiple CompleteMultipartUpload requests would launch multiple deleteFile() requests. Because they chain up on parent directories, ultimately at the end, there would be multiple remove files for the ultimate parent directory, .minio.sys/multipart/{bucket}. Because only one will succeed and one will fail, an error would be upstreamed saying that the file does not exist, and the CompleteMultipartUpload code interpreted this as NoSuchKey, or that the object/part id doesn't exist. This was faulty behavior and is now fixed. The added test fails before this change and passes after this change. Fixes: https://github.com/minio/minio/issues/4727	2017-08-04 16:51:20 -07:00
Brendan Ashworth	28bc5899fd	posix: test isDirEmpty, change error conditional (#4743 ) This commit adds a new test for isDirEmpty (for code coverage) and changes around the error conditional. Previously, there was a `return nil` statement that would only be triggered under a race condition and would trip up our test coverage for no real reason. With this new error conditional, there's no awkward 'else'-esque condition, which means test coverage will not change between runs for no reason in this specific test. It's also a cleaner read.	2017-08-04 10:43:51 -07:00
Nitish Tiwari	fcc61fa46a	Remove minimum inodes reqd check (#4747 )	2017-08-03 20:07:22 -07:00
Brendan Ashworth	bccc386994	fs: drop Stat() call from fsDeleteFile,deleteFile (#4744 ) This commit makes fsDeleteFile() simply call deleteFile() after calling the relevant path length checking functions. This DRYs the code base. This commit removes the Stat() call from deleteFile(). This improves performance and removes any possibility of a race condition. This additionally adds tests and a benchmark for said function. The results aren't very consistent, although I'd expect this commit to make it faster.	2017-08-03 20:04:28 -07:00
Harshavardhana	cc8a8cb877	posix: Check for min disk space and inodes (#4618 ) This is needed such that we don't start or allow writing to a posix disk which doesn't have minimum total disk space available. One part fix for #4617	2017-07-10 18:14:48 -07:00
Harshavardhana	075b8903d7	fs: Add safe locking semantics for `format.json` (#4523 ) This patch also reverts previous changes which were merged for migration to the newer disk format. We will be bringing these changes in subsequent releases. But we wish to add protection in this release such that future release migrations are protected. Revert "fs: Migration should handle bucketConfigs as regular objects. (#4482)" This reverts commit `976870a391`. Revert "fs: Migrate object metadata to objects directory. (#4195)" This reverts commit `76f4f20609`.	2017-06-12 17:40:28 -07:00
Frank Wessels	0f0758aece	Load IO error count for posix atomically (#4448 ) * Load error count atomically in order to check for maximum allowed number of IO errors. * Remove unused (previously atomic) network IO error count	2017-05-31 09:22:53 -07:00
Aditya Manthramurthy	8975da4e84	Add new ReadFileWithVerify storage-layer API (#4349 ) This is an enhancement to the XL/distributed-XL mode. FS mode is unaffected. The ReadFileWithVerify storage-layer call is similar to ReadFile with the additional functionality of performing bit-rot checking. It accepts additional parameters for a hashing algorithm to use and the expected hex-encoded hash string. This patch provides significant performance improvement because: 1. combines the step of reading the file (during erasure-decoding/reconstruction) with bit-rot verification; 2. limits the number of file-reads; and 3. avoids transferring the file over the network for bit-rot verification. ReadFile API is implemented as ReadFileWithVerify with empty hashing arguments. Credits to AB and Harsha for the algorithmic improvement. Fixes #4236.	2017-05-16 14:21:52 -07:00
Harshavardhana	76f4f20609	fs: Migrate object metadata to objects directory. (#4195 ) Fixes #3352	2017-05-05 08:49:09 -07:00
Harshavardhana	f0b5c0ec7c	windows: Support all REPARSE_POINT attrib files properly. (#4203 ) This change adopts the upstream fix in this regard at https://go-review.googlesource.com/#/c/41834/ for Minio's purposes. Go's current os.Stat() lacks support for a lot of strange windows files such as - share symlinks on SMB2 - symlinks on docker nanoserver - de-duplicated files on NTFS de-duplicated volume. This PR attempts to incorporate the change mentioned here https://blogs.msdn.microsoft.com/oldnewthing/20100212-00/?p=14963/ The article suggests using the Windows I/O manager to dereference the symbolic link. Fixes #4122	2017-05-02 02:35:27 -07:00
Anis Elleuch	79e0b9e69a	Relax minio server start when disk threshold is reached and adds space check in FS (#3865 ) * fs: Rename tempObjPath variable in fsCreateFile() * fs/posix: Factor checkDiskFree() function * fs: Add disk free check in fsCreateFile() * posix: Move free disk check to createFile() * xl: Relax free disk check in POSIX initialization * fs: checkDiskFree checks for space to store data	2017-03-07 12:25:40 -08:00
Harshavardhana	50b4e54a75	fs: Do not return reservedBucket names in ListBuckets() (#3754 ) Make sure to skip reserved bucket names in `ListBuckets()` current code didn't skip this properly and also generalize this behavior for both XL and FS.	2017-02-16 14:52:14 -08:00
Krishnan Parthasarathi	e5773e11c6	Make minio server compile on OpenBSD, NetBSD, Solaris (#3719 )	2017-02-08 22:27:35 -08:00
Jeffery Utter	9e1f1b50e0	Don't Check Available Inodes on NFS (#3598 ) In some cases (such as with VirtualBox), this value gets hardcoded to 1000, which is less than the required minimum of 10000. Fixes #3592	2017-01-19 10:39:44 -08:00


73 Commits