minio

Commit Graph

Author	SHA1	Message	Date
Klaus Post	faf013ec84	Improve performance on multiple versions (#13573 ) Existing: ```go type xlMetaV2 struct { Versions []xlMetaV2Version `json:"Versions" msg:"Versions"` } ``` Serialized as regular MessagePack. ```go //msgp:tuple xlMetaV2VersionHeader type xlMetaV2VersionHeader struct { VersionID [16]byte ModTime int64 Type VersionType Flags xlFlags } ``` Serialize as streaming MessagePack, format: ``` int(headerVersion) int(xlmetaVersion) int(nVersions) for each version { binary blob, xlMetaV2VersionHeader, serialized binary blob, xlMetaV2Version, serialized. } ``` xlMetaV2VersionHeader is <= 30 bytes serialized. Deserialized struct can easily be reused and does not contain pointers, so efficient as a slice (single allocation) This allows quickly parsing everything as slices of bytes (no copy). Versions are always saved sorted by modTime, newest first. No more need to sort on load. * Allows checking if a version exists. * Allows reading single version without unmarshal all. * Allows reading latest version of type without unmarshal all. * Allows reading latest version without unmarshal of all. * Allows checking if the latest is deleteMarker by reading first entry. * Allows adding/updating/deleting a version with only header deserialization. * Reduces allocations on conversion to FileInfo(s).	2021-11-18 12:15:22 -08:00
Anis Elleuch	4caed7cc0d	metrics: Add replication latency metrics (#13515 ) Add a new Prometheus metric for bucket replication latency e.g.: minio_bucket_replication_latency_ns{ bucket="testbucket", operation="upload", range="LESS_THAN_1_MiB", server="127.0.0.1:9001", targetArn="arn:minio:replication::45da043c-14f5-4da4-9316-aba5f77bf730:testbucket"} 2.2015663e+07 Co-authored-by: Klaus Post <klauspost@gmail.com>	2021-11-17 12:10:57 -08:00
Harshavardhana	661b263e77	add gocritic/ruleguard checks back again, cleanup code. (#13665 ) - remove some duplicated code - reported a bug, separately fixed in #13664 - using strings.ReplaceAll() when needed - using filepath.ToSlash() use when needed - remove all non-Go style comments from the codebase Co-authored-by: Aditya Manthramurthy <donatello@users.noreply.github.com>	2021-11-16 09:28:29 -08:00
Harshavardhana	4ed0eb7012	remove double reads updating object metadata (#13542 ) Removes RLock/RUnlock for updating metadata, since we already take a write lock to update metadata, this change removes reading of xl.meta as well as an additional lock, the performance gain should increase 3x theoretically for - PutObjectRetention - PutObjectLegalHold This optimization is mainly for Veeam like workloads that require a certain level of iops from these API calls, we were losing iops.	2021-10-30 08:22:04 -07:00
Poorna K	e7f559c582	Fixes to replication metrics (#13493 ) For reporting ReplicaSize and loading initial replication metrics correctly.	2021-10-21 18:52:55 -07:00
Poorna Krishnamoorthy	7f6ed35347	Allow null versions to be replicated (#13310 ) for pre-existing objects present in a bucket prior to enabling existing object replication. Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io>	2021-09-28 10:26:12 -07:00
Poorna Krishnamoorthy	19ecdc75a8	replication: Simplify metrics calculation (#13274 ) Also doing some code cleanup	2021-09-22 10:48:45 -07:00
Poorna Krishnamoorthy	806b10b934	fix: improve error messages returned during replication setup (#13261 )	2021-09-21 13:03:20 -07:00
Poorna Krishnamoorthy	c4373ef290	Add support for multi site replication (#12880 )	2021-09-18 13:31:35 -07:00
Harshavardhana	0892f1e406	fix: multipart replication and encrypted etag for sse-s3 (#13171 ) Replication was not working properly for encrypted objects in single PUT object for preserving etag, We need to make sure to preserve etag such that replication works properly and not gets into infinite loops of copying due to ETag mismatches.	2021-09-08 22:25:23 -07:00
Poorna Krishnamoorthy	9af4e7b1da	Add healthcheck back for replication targets (#13168 ) This will allow objects to relinquish read lock held during replication earlier if the target is known to be down without waiting for connection timeout when replication is attempted.	2021-09-08 15:34:50 -07:00
Poorna Krishnamoorthy	a366143c5b	Remove replication permission check (#13135 ) Fixes #13105	2021-09-02 09:31:13 -07:00
Poorna Krishnamoorthy	6a7e22386e	Use part sizes correctly in multipart replication (#13061 ) fixes #13057	2021-08-24 14:41:05 -07:00
Poorna Krishnamoorthy	674c6f7a7b	fix: resync of replication of delete markers (#12932 ) Fixes #12919	2021-08-23 14:48:22 -07:00
Klaus Post	63f3e5c3fc	replication: Lock object while replicating (#13014 ) Introduce a replication lock that will ensure that only one replication operation will run for any given object at any time. Fixes #13013	2021-08-23 08:16:18 -07:00
Harshavardhana	3c34e18a4e	allow multipart uploads for single part multipart (#12821 ) its possible that some multipart uploads would have uploaded only single parts so relying on `len(o.Parts)` alone is not sufficient, we need to look for ETag pattern to be absolutely sure.	2021-07-28 22:11:55 -07:00
Poorna Krishnamoorthy	b6cd54779c	Increase context timeout for bandwidth throttled reader (#12820 ) increase default timeout up to one hour for toy setups. fixes #12812	2021-07-28 15:20:01 -07:00
Anis Elleuch	b8f95fb3d4	fix: Use correct replication status in replication healing (#12711 ) In case of replication healing, we always store completed status in the object metadata, which is wrong because replication could fail in the further retries.	2021-07-14 09:58:46 -07:00
Harshavardhana	4f6c74a257	simplify audit logging for replication and ILM (#12610 ) auditLog should be attempted right before the return of the function and not multiple times per function, this ensures that we only trigger it once per function call.	2021-07-01 14:02:44 -07:00
Poorna Krishnamoorthy	a3f0288262	Use multipart call for replication (#12535 ) if object was uploaded with multipart. This is to ensure that GetObject calls with partNumber in URI request parameters have same behavior on source and replication target.	2021-06-30 07:44:24 -07:00
Poorna Krishnamoorthy	a69c2a2fb3	Change replication to use read lock instead of writelock (#12581 ) Fixes #12573 This PR also adding audit logging for replication activity	2021-06-28 23:58:08 -07:00
Poorna Krishnamoorthy	d00783c923	Use rate.Limiter for bandwidth monitoring (#12506 ) Bonus: fixes a hang when bandwidth caps are enabled for synchronous replication	2021-06-24 18:29:30 -07:00
Harshavardhana	41caf89cf4	fix: apply pre-conditions first on object metadata (#12545 ) This change in error flow complies with AWS S3 behavior for applications depending on specific error conditions. fixes #12543	2021-06-24 09:44:00 -07:00
Harshavardhana	cdeccb5510	feat: Deprecate embedded browser and import console (#12460 ) This feature also changes the default port where the browser is running, now the port has moved to 9001 and it can be configured with ``` --console-address ":9001" ```	2021-06-17 20:27:04 -07:00
Poorna Krishnamoorthy	dbea8d2ee0	Add support for existing object replication. (#12109 ) Also adding an API to allow resyncing replication when existing object replication is enabled and the remote target is entirely lost. With the `mc replicate reset` command, the objects that are eligible for replication as per the replication config will be resynced to target if existing object replication is enabled on the rule.	2021-06-01 19:59:11 -07:00
Harshavardhana	1f262daf6f	rename all remaining packages to internal/ (#12418 ) This is to ensure that there are no projects that try to import `minio/minio/pkg` into their own repo. Any such common packages should go to `https://github.com/minio/pkg`	2021-06-01 14:59:40 -07:00
Harshavardhana	fdc2020b10	move to iam, bucket policy from minio/pkg (#12400 )	2021-05-29 21:16:42 -07:00
Poorna Krishnamoorthy	547bb7d0a1	replication: Init worker kill channel correctly (#12379 ) Signed-off-by: Poorna Krishnamoorthy <poorna@minio.io>	2021-05-28 13:28:37 -07:00
Poorna Krishnamoorthy	951acf561c	Add support for syncing replica modifications (#11104 ) when bidirectional replication is set up. If ReplicaModifications is enabled in the replication configuration, sync metadata updates to source if replication rules are met. By default, if this configuration is unset, MinIO automatically sync's metadata updates on replica back to the source.	2021-05-13 19:20:45 -07:00
Harshavardhana	1aa5858543	move madmin to github.com/minio/madmin-go (#12239 )	2021-05-06 08:52:02 -07:00
Harshavardhana	f7a87b30bf	Revert "deprecate embedded browser (#12163 )" This reverts commit `736d8cbac4`. Bring contrib files for older contributions	2021-04-30 08:50:39 -07:00
Harshavardhana	0faa4e6187	fix: make sure failed requests only to failed queue (#12196 ) failed queue should be used for retried requests to avoid cascading the failures into incoming queue, this would allow for a more fair retry for failed replicas. Additionally also avoid taking context in queue task to avoid confusion, simplifies its usage.	2021-04-29 18:20:39 -07:00
Poorna Krishnamoorthy	90112b5644	Update ReplicationStatus if metadata not updated correctly (#12191 ) There can be situations where replication completed but the `X-Amz-Replication-Status` metadata update failed such as when the server returns 503 under high load. This object version will continue to be picked up by the scanner and replicateObject would perform no action since the versions match between source and target. The metadata would never reflect that replication was successful without this fix, leading to repeated re-queuing.	2021-04-29 16:46:26 -07:00
Harshavardhana	c4b21ac7fa	fix: remove healthcheck routine for replication targets (#12192 ) Bonus also fix a racy lookup on arnsMap() without a read lock, hold read locks to avoid such race. moving the healthcheck logic to minio-go	2021-04-29 16:41:28 -07:00
Poorna Krishnamoorthy	632252ff1d	fix: change SetRemoteTarget API to allow editing remote target granularly (#12175 ) Currently, only credentials could be updated with `mc admin bucket remote edit`. Allow updating synchronous replication flag, path, bandwidth and healthcheck duration on buckets, and a flag to disable proxying in active-active replication.	2021-04-28 15:26:20 -07:00
Harshavardhana	736d8cbac4	deprecate embedded browser (#12163 ) https://github.com/minio/console takes over the functionality for the future object browser development Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-27 10:52:12 -07:00
Harshavardhana	82dc6aff1c	add support for configurable replication MRF workers (#12125 ) just like replication workers, allow failed replication workers to be configurable in situations like DR failures etc to catch up on replication sooner when DR is back online. Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 21:58:45 -07:00
Poorna Krishnamoorthy	014e419151	fix: ensure pending replication queued to MRF queue (#12138 ) Signed-off-by: Poorna Krishnamoorthy <poorna@minio.io>	2021-04-23 16:52:57 -07:00
Krishnan Parthasarathi	c829e3a13b	Support for remote tier management (#12090 ) With this change, MinIO's ILM supports transitioning objects to a remote tier. This change includes support for Azure Blob Storage, AWS S3 compatible object storage incl. MinIO and Google Cloud Storage as remote tier storage backends. Some new additions include: - Admin APIs remote tier configuration management - Simple journal to track remote objects to be 'collected' This is used by object API handlers which 'mutate' object versions by overwriting/replacing content (Put/CopyObject) or removing the version itself (e.g DeleteObjectVersion). - Rework of previous ILM transition to fit the new model In the new model, a storage class (a.k.a remote tier) is defined by the 'remote' object storage type (one of s3, azure, GCS), bucket name and a prefix. * Fixed bugs, review comments, and more unit-tests - Leverage inline small object feature - Migrate legacy objects to the latest object format before transitioning - Fix restore to particular version if specified - Extend SharedDataDirCount to handle transitioned and restored objects - Restore-object should accept version-id for version-suspended bucket (#12091) - Check if remote tier creds have sufficient permissions - Bonus minor fixes to existing error messages Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io> Co-authored-by: Krishna Srinivas <krishna@minio.io> Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 11:58:53 -07:00
Harshavardhana	069432566f	update license change for MinIO Signed-off-by: Harshavardhana <harsha@minio.io>	2021-04-23 11:58:53 -07:00
Harshavardhana	2ef824bbb2	collapse two distinct calls into single RenameData() call (#12093 ) This is an optimization by reducing one extra system call, and many network operations. This reduction should increase the performance for small file workloads.	2021-04-20 10:44:39 -07:00
Harshavardhana	0a9d8dfb0b	fix: crash in single drive mode for lifecycle (#12077 ) also make sure to close the channel on the producer side, not in a separate go-routine, this can lead to races between a writer and a closer. fixes #12073	2021-04-16 14:09:25 -07:00
Poorna Krishnamoorthy	d30c5d1cf0	Avoid metadata update for incoming replication failure (#12054 ) This is an optimization to save IOPS. The replication failures will be re-queued once more to re-attempt replication. If it still does not succeed, the replication status is set as `FAILED` and will be caught up on scanner cycle.	2021-04-15 16:32:00 -07:00
Harshavardhana	abb55bd49e	fix: properly close leaking bandwidth monitor channel (#11967 ) This PR fixes - close leaking bandwidth report channel leakage - remove the closer requirement for bandwidth monitor instead if Read() fails remember the error and return error for all subsequent reads. - use locking for usage-cache.bin updates, with inline data we cannot afford to have concurrent writes to usage-cache.bin corrupting xl.meta	2021-04-05 16:07:53 -07:00
Harshavardhana	09ee303244	add cluster support for realtime bucket stats (#11963 ) implementation in #11949 only catered from single node, but we need cluster metrics by capturing from all peers. introduce bucket stats API that will be used for capturing in-line bucket usage as well eventually	2021-04-04 15:34:33 -07:00
Harshavardhana	d46386246f	api: Introduce metadata update APIs to update only metadata (#11962 ) Current implementation heavily relies on readAllFileInfo but with the advent of xl.meta inlined with data, we cannot easily avoid reading data when we are only interested is updating metadata, this leads to invariably write amplification during metadata updates, repeatedly reading data when we are only interested in updating metadata. This PR ensures that we implement a metadata only update API at storage layer, that handles updates to metadata alone for any given version - given the version is valid and present. This helps reduce the chattiness for following calls.. - PutObjectTags - DeleteObjectTags - PutObjectLegalHold - PutObjectRetention - ReplicateObject (updates metadata on replication status)	2021-04-04 13:32:31 -07:00
Poorna Krishnamoorthy	47c09a1e6f	Various improvements in replication (#11949 ) - collect real time replication metrics for prometheus. - add pending_count, failed_count metric for total pending/failed replication operations. - add API to get replication metrics - add MRF worker to handle spill-over replication operations - multiple issues found with replication - fixes an issue when client sends a bucket name with `/` at the end from SetRemoteTarget API call make sure to trim the bucket name to avoid any extra `/`. - hold write locks in GetObjectNInfo during replication to ensure that object version stack is not overwritten while reading the content. - add additional protection during WriteMetadata() to ensure that we always write a valid FileInfo{} and avoid ever writing empty FileInfo{} to the lowest layers. Co-authored-by: Poorna Krishnamoorthy <poorna@minio.io> Co-authored-by: Harshavardhana <harsha@minio.io>	2021-04-03 09:03:42 -07:00
Harshavardhana	8e6e287729	fix: delete/delete marker replication versions consistent (#11932 ) replication didn't work as expected when deletion of delete markers was requested in DeleteMultipleObjects API, this is due to incorrect lookup elements being used to look for delete markers.	2021-03-30 17:15:36 -07:00
Poorna Krishnamoorthy	5e003549cc	Replication: Enforce DeleteMarker disable setting (#11720 ) This PR also enforces DeleteReplication disable setting	2021-03-13 10:28:35 -08:00
Poorna Krishnamoorthy	2f29719e6b	resize replication worker pool dynamically after config update (#11737 )	2021-03-09 02:56:42 -08:00

1 2 3

104 Commits