minio

mirror of https://github.com/minio/minio.git synced 2024-12-26 07:05:55 -05:00

Author	SHA1	Message	Date
Praveen raj Mani	ce9d36d954	Add object compression support (#6292 ) Add support for streaming (golang/LZ77/snappy) compression.	2018-09-28 09:06:17 +05:30
Anis Elleuch	7571582000	Print storage errors during distributed initialization (#6441 ) This commit will print connection failures to other disks in other nodes after 5 retries. It is useful for users to understand why the distribued cluster fails to boot up.	2018-09-10 16:21:59 -07:00
Krishna Srinivas	ce02ab613d	Simplify erasure code by separating bitrot from erasure code (#5959 )	2018-08-06 15:14:08 -07:00
Oleg Kovalov	37de2dbd3b	simplifying if-else chains to switches (#6208 )	2018-08-06 10:26:40 -07:00
Harshavardhana	ad86454580	Make sure to handle FaultyDisks in listing ops (#6204 ) Continuing from PR `157ed65c35` Our posix.go implementation did not handle I/O errors properly on the disks, this led to situations where top-level callers such as ListObjects might return early without even verifying all the available disks. This commit tries to address this in Kubernetes, drbd/nbd based persistent volumes which can disconnect under load and result in the situations with disks return I/O errors. This commit also simplifies listing operation, listing never returns any error. We can avoid this since we pretty much ignore most of the errors anyways. When objects are accessed directly we return proper errors.	2018-07-27 15:32:19 -07:00
Krishna Srinivas	40ed0d1f5d	Support 1GB disk size (#6137 ) Pivotal CF by default has 1GB disk option which causes minio to not start	2018-07-09 18:23:49 -07:00
Harshavardhana	de251483d1	Avoid ticker timer to simplify disk usage (#6101 ) This PR simplifies the code to avoid tracking any running usage events. This PR also brings in an upper threshold of upto 1 minute suspend the usage function after which the usage would proceed without waiting any longer.	2018-06-28 15:05:45 -07:00
Praveen raj Mani	ea76e72054	Incorrect error message for insufficient volume fix (#6099 ) Reply back with appropriate error message when the server is spawn with volume of insufficient size (< 1GiB). Fixes #5993.	2018-06-28 12:01:05 -07:00
Harshavardhana	25de775560	disable disk-usage when export is root mount path (#6091 ) disk usage crawling is not needed when a tenant is not sharing the same disk for multiple other tenants. This PR adds an optimization when we see a setup uses entire disk, we simply rely on statvfs() to give us total usage. This PR also additionally adds low priority scheduling for usage check routine, such that other go-routines blocked will be automatically unblocked and prioritized before usage.	2018-06-27 18:59:38 -07:00
Ashish Kumar Sinha	0bbdd02a57	Updating disk storage for FS/Erasure mode (#6081 ) Updating the disk storage stats for FS/Erasure coded backend	2018-06-25 10:46:48 -07:00
Harshavardhana	cb9ee1584a	Fix TestHealStartNStatusHandler sporadic failure (#6015 ) Fixes #5818	2018-06-12 16:36:31 -07:00
Bala FA	6a8bfcef1c	remove separate file for posix utils. (#5948 )	2018-06-07 12:31:40 +05:30
Bala FA	6a53dd1701	Implement HTTP POST based RPC (#5840 ) Added support for new RPC support using HTTP POST. RPC's arguments and reply are Gob encoded and sent as HTTP request/response body. This patch also removes Go RPC based implementation.	2018-06-06 14:21:56 +05:30
Harshavardhana	6fb0604502	Allow usage check to be configurable (#6006 )	2018-06-04 18:35:41 -07:00
Harshavardhana	000e360196	Deprecate showing drive capacity and total free (#5976 ) This addresses a situation that we shouldn't be displaying Total/Free anymore, instead we should simply show the total usage.	2018-05-23 17:30:25 -07:00
Harshavardhana	e6ec645035	Implement support for calculating disk usage per tenant (#5969 ) Fixes #5961	2018-05-23 15:41:29 +05:30
Bala FA	4eb788df79	rename checkPathValid() to getValidPath() (#5949 )	2018-05-17 07:27:07 -07:00
Anis Elleuch	6d5f2a4391	Better support of empty directories (#5890 ) Better support of HEAD and listing of zero sized objects with trailing slash (a.k.a empty directory). For that, isLeafDir function is added to indicate if the specified object is an empty directory or not. Each backend (xl, fs) has the responsibility to store that information. Currently, in both of XL & FS, an empty directory is represented by an empty directory in the backend. isLeafDir() checks if the given path is an empty directory or not, since dir listing is costly if the latter contains too many objects, readDirN() is added in this PR to list only N number of entries. In isLeadDir(), we will only list one entry to check if a directory is empty or not.	2018-05-09 01:38:21 -07:00
Harshavardhana	ccdb7bc286	Fix s3 compatibility fixes for getBucketLocation,headBucket,deleteBucket (#5842 ) - getBucketLocation - headBucket - deleteBucket Should return 404 or NoSuchBucket even for invalid bucket names, invalid bucket names are only validated during MakeBucket operation	2018-04-24 08:57:33 +05:30
Harshavardhana	4a874dfbc1	Ignore prefix renames when dest directory is not empty (#5798 ) Also make sure to not modify the underlying errors from layers, we should return the error as is and one object layer should translate the errors. Fixes #5797	2018-04-11 17:15:42 -07:00
Harshavardhana	217fb470a7	Add a check to check if disk is writable (#5662 ) This check is a pre-emptive check to return error early before we attempt to use the disk for any other operations later. refer #5645	2018-04-10 09:26:09 +05:30
kannappanr	f8a3fd0c2a	Create logger package and rename errorIf to LogIf (#5678 ) Removing message from error logging Replace errors.Trace with LogIf	2018-04-05 15:04:40 -07:00
Aditya Manthramurthy	ea8973b7d7	Return bit-rot verified data instead of re-reading from disk (#5568 ) - Data from disk was being read after bitrot verification to return data for GetObject. Strictly speaking this does not guarantee bitrot protection, as disks may return bad data even temporarily. - This fix reads data from disk, verifies data for bitrot and then returns data to the client directly.	2018-03-04 14:16:45 -08:00
Anis Elleuch	d2d49f6c6c	xl: Avoid removing directory content in Delete API (#5548 ) Delete & Multi Delete API should not try to remove the directory content. The only permitted case is with zero size object with a trailing slash in its name.	2018-02-20 15:33:26 -08:00
Anis Elleuch	926e480156	posix.RenameFile(): Allow overwriting an empty directory (#5551 ) Overwriting files is allowed, but since the introduction of the object directory, we will aslo need to allow overwriting an empty directory. Putting twice the same object directory won't fail with 403 error anymore.	2018-02-20 12:20:18 -08:00
Harshavardhana	fb96779a8a	Add large bucket support for erasure coded backend (#5160 ) This PR implements an object layer which combines input erasure sets of XL layers into a unified namespace. This object layer extends the existing erasure coded implementation, it is assumed in this design that providing > 16 disks is a static configuration as well i.e if you started the setup with 32 disks with 4 sets 8 disks per pack then you would need to provide 4 sets always. Some design details and restrictions: - Objects are distributed using consistent ordering to a unique erasure coded layer. - Each pack has its own dsync so locks are synchronized properly at pack (erasure layer). - Each pack still has a maximum of 16 disks requirement, you can start with multiple such sets statically. - Static sets set of disks and cannot be changed, there is no elastic expansion allowed. - Static sets set of disks and cannot be changed, there is no elastic removal allowed. - ListObjects() across sets can be noticeably slower since List happens on all servers, and is merged at this sets layer. Fixes #5465 Fixes #5464 Fixes #5461 Fixes #5460 Fixes #5459 Fixes #5458 Fixes #5460 Fixes #5488 Fixes #5489 Fixes #5497 Fixes #5496	2018-02-15 17:45:57 -08:00
Harshavardhana	3ea28e9771	Support creating directories on erasure coded backend (#5443 ) This PR continues from #5049 where we started supporting directories for erasure coded backend	2018-01-30 08:13:13 +05:30
Harshavardhana	12f67d47f1	Fix a possible race during PutObject() (#5376 ) Under any concurrent removeObjects in progress might have removed the parents of the same prefix for which there is an ongoing putObject request. An inconsistent situation may arise as explained below even under sufficient locking. PutObject is almost successful at the last stage when a temporary file is renamed to its actual namespace at `a/b/c/object1`. Concurrently a RemoveObject is also in progress at the same prefix for an `a/b/c/object2`. To create the object1 at location `a/b/c` PutObject has to create all the parents recursively. ``` a/b/c - os.MkdirAll loops through has now created 'a/' and 'b/' about to create 'c/' a/b/c/object2 - at this point 'c/' and 'object2' are deleted about to delete b/ ``` Now for os.MkdirAll loop the expected situation is that top level parent 'a/b/' exists which it created , such that it can create 'c/' - since removeObject and putObject do not compete for lock due to holding locks at different resources. removeObject proceeds to delete parent 'b/' since 'c/' is not yet present, once deleted 'os.MkdirAll' would receive an error as syscall.ENOENT which would fail the putObject request. This PR tries to address this issue by implementing a safer/guarded approach where we would retry an operation such as `os.MkdirAll` and `os.Rename` if both operations observe syscall.ENOENT. Fixes #5254	2018-01-13 22:43:02 +05:30
Harshavardhana	3d0dced23c	Remove go1.9 specific code for windows (#5033 ) Following fix https://go-review.googlesource.com/#/c/41834/ has been merged upstream and released with go1.9.	2017-10-13 15:31:15 +05:30
Andreas Auernhammer	7e6b5bdbb7	remove ReadFileWithVerify from StorageAPI (#4947 ) This change removes the ReadFileWithVerify function from the StorageAPI. The ReadFile was basically a redirection to ReadFileWithVerify. This change removes the redirection and moves the logic of ReadFileWithVerify directly into ReadFile. This removes a lot of unnecessary code in all StorageAPI implementations. Fixes #4946 * review: fix doc and typos	2017-09-25 11:32:56 -07:00
Harshavardhana	879cef37a1	Fail to start server if detected cross-device mounts. (#4807 ) Fixes #4764	2017-08-15 15:10:50 -07:00
Andreas Auernhammer	85fcee1919	erasure: simplify XL backend operations (#4649 ) (#4758 ) This change provides new implementations of the XL backend operations: - create file - read file - heal file Further this change adds table based tests for all three operations. This affects also the bitrot algorithm integration. Algorithms are now integrated in an idiomatic way (like crypto.Hash). Fixes #4696 Fixes #4649 Fixes #4359	2017-08-14 18:08:42 -07:00
Harshavardhana	d864e00e24	posix: Deprecate custom removeAll/mkdirAll implementations. (#4808 ) Since go1.8 os.RemoveAll and os.MkdirAll both support long path names i.e UNC path on windows. The code we are carrying was directly borrowed from `pkg/os` package and doesn't need to be in our repo anymore. As a side affect this also addresses our codecoverage issue. Refer #4658	2017-08-12 19:25:43 -07:00
Brendan Ashworth	aeafe668d8	posix: do not upstream errors in deleteFile (#4771 ) This commit changes posix's deleteFile() to not upstream errors from removing parent directories. This fixes a race condition. The race condition occurs when multiple deleteFile()s are called on the same parent directory, but different child files. Because deleteFile() recursively removes parent directories if they are empty, but deleteFile() errors if the selected deletePath does not exist, there was an opportunity for a race condition. The two processes would remove the child directories successfully, then depend on the parent directory still existing. In some cases this is an invalid assumption, because other processes can remove the parent directory beforehand. This commit changes deleteFile() to not upstream an error if one occurs, because the only required error should be from the immediate deletePath, not from a parent path. In the specific bug report, multiple CompleteMultipartUpload requests would launch multiple deleteFile() requests. Because they chain up on parent directories, ultimately at the end, there would be multiple remove files for the ultimate parent directory, .minio.sys/multipart/{bucket}. Because only one will succeed and one will fail, an error would be upstreamed saying that the file does not exist, and the CompleteMultipartUpload code interpreted this as NoSuchKey, or that the object/part id doesn't exist. This was faulty behavior and is now fixed. The added test fails before this change and passes after this change. Fixes: https://github.com/minio/minio/issues/4727	2017-08-04 16:51:20 -07:00
Brendan Ashworth	28bc5899fd	posix: test isDirEmpty, change error conditional (#4743 ) This commit adds a new test for isDirEmpty (for code coverage) and changes around the error conditional. Previously, there was a `return nil` statement that would only be triggered under a race condition and would trip up our test coverage for no real reason. With this new error conditional, there's no awkward 'else'-esque condition, which means test coverage will not change between runs for no reason in this specific test. It's also a cleaner read.	2017-08-04 10:43:51 -07:00
Nitish Tiwari	fcc61fa46a	Remove minimum inodes reqd check (#4747 )	2017-08-03 20:07:22 -07:00
Brendan Ashworth	bccc386994	fs: drop Stat() call from fsDeleteFile,deleteFile (#4744 ) This commit makes fsDeleteFile() simply call deleteFile() after calling the relevant path length checking functions. This DRYs the code base. This commit removes the Stat() call from deleteFile(). This improves performance and removes any possibility of a race condition. This additionally adds tests and a benchmark for said function. The results aren't very consistent, although I'd expect this commit to make it faster.	2017-08-03 20:04:28 -07:00
Harshavardhana	cc8a8cb877	posix: Check for min disk space and inodes (#4618 ) This is needed such that we don't start or allow writing to a posix disk which doesn't have minimum total disk space available. One part fix for #4617	2017-07-10 18:14:48 -07:00
Harshavardhana	075b8903d7	fs: Add safe locking semantics for `format.json` (#4523 ) This patch also reverts previous changes which were merged for migration to the newer disk format. We will be bringing these changes in subsequent releases. But we wish to add protection in this release such that future release migrations are protected. Revert "fs: Migration should handle bucketConfigs as regular objects. (#4482)" This reverts commit `976870a391`. Revert "fs: Migrate object metadata to objects directory. (#4195)" This reverts commit `76f4f20609`.	2017-06-12 17:40:28 -07:00
Frank Wessels	0f0758aece	Load IO error count for posix atomically (#4448 ) * Load error count atomically in order to check for maximum allowed number of IO errors. * Remove unused (previously atomic) network IO error count	2017-05-31 09:22:53 -07:00
Aditya Manthramurthy	8975da4e84	Add new ReadFileWithVerify storage-layer API (#4349 ) This is an enhancement to the XL/distributed-XL mode. FS mode is unaffected. The ReadFileWithVerify storage-layer call is similar to ReadFile with the additional functionality of performing bit-rot checking. It accepts additional parameters for a hashing algorithm to use and the expected hex-encoded hash string. This patch provides significant performance improvement because: 1. combines the step of reading the file (during erasure-decoding/reconstruction) with bit-rot verification; 2. limits the number of file-reads; and 3. avoids transferring the file over the network for bit-rot verification. ReadFile API is implemented as ReadFileWithVerify with empty hashing arguments. Credits to AB and Harsha for the algorithmic improvement. Fixes #4236.	2017-05-16 14:21:52 -07:00
Harshavardhana	76f4f20609	fs: Migrate object metadata to objects directory. (#4195 ) Fixes #3352	2017-05-05 08:49:09 -07:00
Harshavardhana	f0b5c0ec7c	windows: Support all REPARSE_POINT attrib files properly. (#4203 ) This change adopts the upstream fix in this regard at https://go-review.googlesource.com/#/c/41834/ for Minio's purposes. Go's current os.Stat() lacks support for lot of strange windows files such as - share symlinks on SMB2 - symlinks on docker nanoserver - de-duplicated files on NTFS de-duplicated volume. This PR attempts to incorporate the change mentioned here https://blogs.msdn.microsoft.com/oldnewthing/20100212-00/?p=14963/ The article suggests to use Windows I/O manager to dereference the symbolic link. Fixes #4122	2017-05-02 02:35:27 -07:00
Anis Elleuch	79e0b9e69a	Relax minio server start when disk threshold is reached and adds space check in FS (#3865 ) * fs: Rename tempObjPath variable in fsCreateFile() * fs/posix: Factor checkDiskFree() function * fs: Add disk free check in fsCreateFile() * posix: Move free disk check to createFile() * xl: Relax free disk check in POSIX initialization * fs: checkDiskFree checks for space to store data	2017-03-07 12:25:40 -08:00
Harshavardhana	50b4e54a75	fs: Do not return reservedBucket names in ListBuckets() (#3754 ) Make sure to skip reserved bucket names in `ListBuckets()` current code didn't skip this properly and also generalize this behavior for both XL and FS.	2017-02-16 14:52:14 -08:00
Krishnan Parthasarathi	e5773e11c6	Make minio server compile on OpenBSD, NetBSD, Solaris (#3719 )	2017-02-08 22:27:35 -08:00
Jeffery Utter	9e1f1b50e0	Don't Check Available Inodes on NFS (#3598 ) In some cases (such as with VirutualBox, this value gets hardcoded to 1000, which is less than the required minimum of 10000. Fixes #3592	2017-01-19 10:39:44 -08:00
Harshavardhana	62f8343879	Add constants for commonly used values. (#3588 ) This is a consolidation effort, avoiding usage of naked strings in codebase. Whenever possible use constants which can be repurposed elsewhere. This also fixes `goconst ./...` reported issues.	2017-01-18 12:24:34 -08:00
Harshavardhana	1c699d8d3f	fs: Re-implement object layer to remember the fd (#3509 ) This patch re-writes FS backend to support shared backend sharing locks for safe concurrent access across multiple servers.	2017-01-16 17:05:00 -08:00
Harshavardhana	2062add05f	fs/posix: On windows use helpers and init format.json properly. (#3434 ) Fixes #3433	2016-12-12 15:43:41 -08:00
Krishnan Parthasarathi	feb6685359	posix: Use preparePath only for paths used with syscall or os functions (#3377 )	2016-11-30 20:56:15 -08:00
Harshavardhana	6efee2072d	objectLayer: Check for `format.json` in a wrapped disk. (#3311 ) This is needed to validate if the `format.json` indeed exists when a fresh node is brought online. This wrapped implementation also connects to the remote node by attempting a re-login. Subsequently after a successful connect `format.json` is validated as well. Fixes #3207	2016-11-23 15:48:10 -08:00
Bala FA	825000bc34	Use humanize constants for KiB, MiB and GiB units. (#3322 )	2016-11-22 18:18:22 -08:00
Anis Elleuch	4098025c11	Remove XL multipart tmp files when the latter is canceled (#3214 ) XL multipart fails to remove tmp files when an error occurs during upload, this case covers the scenario where an upload is canceled manually by the client in the middle of job.	2016-11-21 16:34:57 -08:00
Krishna Srinivas	afa4c7c3ef	fs/multipart: Append multipart parts in a proper Go routine in background. (#3282 )	2016-11-20 23:42:53 -08:00
Harshavardhana	0b9f0d14a1	auth/rpc: Take remote disk offline after maximum allowed attempts. (#3288 ) Disks when are offline for a long period of time, we should ignore the disk after trying Login upto 5 times. This is to reduce the network chattiness, this also reduces the overall time spent on `net.Dial`. Fixes #3286	2016-11-20 16:57:12 -08:00
Harshavardhana	2f7fb78692	rpc: Our rpcClient should make an attempt to reconnect. (#3221 ) rpcClient should attempt a reconnect if the call fails with 'rpc.ErrShutdown' this is needed since at times when the servers are taken down and brought back up. The hijacked connection from net.Dial is usually closed. So upon first attempt rpcClient might falsely indicate that disk to be down, to avoid this state make another dial attempt to really fail. Fixes #3206 Fixes #3205	2016-11-10 07:44:41 -08:00
Harshavardhana	f3c6c55719	posix: Fix windows performance issues. (#3132 ) Do not attempt to fetch volume/drive information for each i/o situation. In our case we do this in all calls `posix.go` this in-turn created a terrible situation for windows. This issue does not affect the i/o path on Unix platforms since statvfs calls are in the range of micro seconds on these platforms. This verification is only needed during startup and we let things fail at a later stage on windows.	2016-10-31 09:34:44 -07:00
Anis Elleuch	a47ce7ab22	Add support of fallocate for FS and XL backends (#3032 )	2016-10-29 12:44:44 -07:00
Harshavardhana	9e2d0ac50b	Move to URL based syntax formatting. (#3092 ) For command line arguments we are currently following - <node-1>:/path ... <node-n>:/path This patch changes this to - http://<node-1>/path ... http://<node-n>/path	2016-10-27 03:30:52 -07:00
Harshavardhana	e9c45102b0	posix: Use sync.Pool buffers to copy in large buffers. (#3106 ) These fixes are borrowed from the fixes required for GlusterFS i/o throughput.	2016-10-26 17:14:05 -07:00
Krishna Srinivas	d3aaf50a40	posix: Split on ":" in path d:\export makes minio use wrong disk. (#3027 ) As the host/path split happens at a higher layer now, split at posix is not needed. fixes part of #2987	2016-10-20 23:39:33 -07:00
Harshavardhana	95567c68bf	posix: Do not print errors in expected errors. (#3012 ) Fixes #3011	2016-10-20 09:26:18 -07:00
Harshavardhana	8d2347bc7b	storage: DeleteFile should return errFileNotFound for ENOENT. (#2978 )	2016-10-17 16:38:46 -07:00
Harshavardhana	6494b77d41	server: Add more elaborate startup messages. (#2731 ) These messages based on our prep stage during XL and prints more informative message regarding drive information. This change also does a much needed refactoring.	2016-10-05 12:48:07 -07:00
Anis Elleuch	9417614a8e	Recalculate free minimum disk space (#2788 ) * Fix calculating free space disk by using blocks available for unprivileged user * Use fixed minimal free disk space instead of percentage	2016-09-27 12:46:38 -07:00
Anis Elleuch	d936ed90ae	Avoid testing on system errors strings in posix (#2583 )	2016-09-13 21:18:30 -07:00
Harshavardhana	339425fd52	server: Fetch StorageInfo() from underlying disks transparently. (#2549 ) Fixes #2511	2016-09-13 21:18:30 -07:00
Mohit Agarwal	418921de89	minor cleanup - Reused contains() from utils.go at a couple of places - Cleanup in return statements and boolean checks	2016-08-24 22:54:34 -07:00
Harshavardhana	bccf549463	server: Move all the top level files into cmd folder. (#2490 ) This change brings a change which was done for the 'mc' package to allow for clean repo and have a cleaner github drop in experience.	2016-08-18 16:23:42 -07:00

1 2 3

120 Commits