1
0
mirror of https://github.com/minio/minio.git synced 2025-01-15 08:45:00 -05:00

67 Commits

Author SHA1 Message Date
Poorna
d990661d1f
replication: enforce precondition for multipart () 2024-03-20 18:12:37 -07:00
Harshavardhana
fba883839d
feat: bring new HDD related performance enhancements ()
Optionally allows customers to enable 

- Enable an external cache to catch GET/HEAD responses 
- Enable skipping disks that are slow to respond in GET/HEAD 
  when we have already achieved a quorum
2023-11-22 13:46:17 -08:00
Harshavardhana
74e0c9ab9b
reduce unnecessary logging, simplify certain error handling ()
remove a bunch of unnecessary logs
2023-10-10 00:33:42 -07:00
Harshavardhana
62c9e500de
remove mTime requirement from pre-condition checks ()
given a versionId the mtime is always the same, it
can never be different than its original value.

versionIds also do not conflict, since they are uuid's
and unique practically forever.
2023-08-24 14:33:58 -07:00
Harshavardhana
f32efd5429
more compliance related fixes ()
- lifecycle must return InvalidArgument for rule errors
- do not return `null` versionId in HTTP header
- reject mixed SSE uploads with correct error message
2023-06-13 13:52:33 -07:00
Krishnan Parthasarathi
3e128c116e
Add lifecycle event source to audit log tags () 2023-05-22 15:28:56 -07:00
Krishnan Parthasarathi
0ec722bc54
Add tags to NewerNoncurrentVersions audit event () 2023-05-02 12:56:33 -07:00
Klaus Post
7fad0c8b41
Remove checksums from HTTP range request, add part checksums () 2023-04-28 08:26:32 -07:00
Krishnan Parthasarathi
e7cac8acef
Add tags to auditLogLifecycle () 2023-04-26 17:49:00 -07:00
Poorna
cd6dec49c0
Add trace support for ilm activity () 2023-04-11 19:22:32 -07:00
Harshavardhana
c06e0bfef9
set correct Host: value for replication event notification () 2023-04-06 10:20:53 -07:00
Anis Eleuch
111c7d4026
assumeRole return the correct http code for auth errors () 2023-04-05 22:19:31 -07:00
Shubhendu
5fe1b46bfd
Enabled to send audit log while version deletion ()
Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>
2023-04-03 11:58:04 -07:00
Harshavardhana
b66d7dc708
add missing x-amz-id-2 to event notification date () 2023-02-20 15:41:47 +05:30
Harshavardhana
11c7ecb5cf
support if-match/if-none-match with s3 uploads () 2023-02-06 18:58:29 -08:00
Harshavardhana
124544d834
add pre-conditions support for PUT calls during replication ()
PUT shall only proceed if pre-conditions are met, the new
code uses

- x-minio-source-mtime
- x-minio-source-etag

to verify if the object indeed needs to be replicated
or not, allowing us to avoid StatObject() call.
2022-09-14 18:44:04 -07:00
Harshavardhana
228c6686f8
allow non-standards fallback for all http.TimeFormats ()
fixes 
2022-09-07 07:24:54 -07:00
Klaus Post
8e4a45ec41
fix: encrypt checksums in metadata () 2022-08-31 08:13:23 -07:00
Klaus Post
a9f1ad7924
Add extended checksum support () 2022-08-29 16:57:16 -07:00
Harshavardhana
433b6fa8fe
upgrade golang-lint to the latest () 2022-08-26 12:52:29 -07:00
Harshavardhana
5cffd3780a
fix: multiple fixes in prefix exclude implementation ()
- do not need to restrict prefix exclusions that do not
  have `/` as suffix, relax this requirement as spark may
  have staging folders with other autogenerated characters
  , so we are better off doing full prefix March and skip. 

- multiple delete objects was incorrectly creating a
  null delete marker on a versioned bucket instead of
  creating a proper versioned delete marker.

- do not suspend paths on the excluded prefixes during
  delete operations to avoid creating `null` delete markers,
  honor suspension of versioning only at bucket level for
  delete markers.
2022-05-07 22:06:44 -07:00
Krishnan Parthasarathi
ad8e611098
feat: implement prefix-level versioning exclusion ()
Spark/Hadoop workloads which use Hadoop MR 
Committer v1/v2 algorithm upload objects to a 
temporary prefix in a bucket. These objects are 
'renamed' to a different prefix on Job commit. 
Object storage admins are forced to configure 
separate ILM policies to expire these objects 
and their versions to reclaim space.

Our solution:

This can be avoided by simply marking objects 
under these prefixes to be excluded from versioning, 
as shown below. Consequently, these objects are 
excluded from replication, and don't require ILM 
policies to prune unnecessary versions.

-  MinIO Extension to Bucket Version Configuration
```xml
<VersioningConfiguration xmlns="http://s3.amazonaws.com/doc/2006-03-01/"> 
        <Status>Enabled</Status>
        <ExcludeFolders>true</ExcludeFolders>
        <ExcludedPrefixes>
          <Prefix>app1-jobs/*/_temporary/</Prefix>
        </ExcludedPrefixes>
        <ExcludedPrefixes>
          <Prefix>app2-jobs/*/__magic/</Prefix>
        </ExcludedPrefixes>

        <!-- .. up to 10 prefixes in all -->     
</VersioningConfiguration>
```
Note: `ExcludeFolders` excludes all folders in a bucket 
from versioning. This is required to prevent the parent 
folders from accumulating delete markers, especially
those which are shared across spark workloads 
spanning projects/teams.

- To enable version exclusion on a list of prefixes

```
mc version enable --excluded-prefixes "app1-jobs/*/_temporary/,app2-jobs/*/_magic," --exclude-prefix-marker myminio/test
```
2022-05-06 19:05:28 -07:00
Harshavardhana
f527c708f2
run gofumpt cleanup across code-base () 2022-01-02 09:15:06 -08:00
Krishnan Parthasarathi
3da9ee15d3
Add MaxNoncurrentVersions to NoncurrentExpiration action ()
This unit allows users to limit the maximum number of noncurrent 
versions of an object.

To enable this rule you need the following *ilm.json*
```
cat >> ilm.json <<EOF
{
    "Rules": [
        {
            "ID": "test-max-noncurrent",
            "Status": "Enabled",
            "Filter": {
                "Prefix": "user-uploads/"
            },
            "NoncurrentVersionExpiration": {
                "MaxNoncurrentVersions": 5
            }
        }
    ]
}
EOF
mc ilm import myminio/mybucket < ilm.json
```
2021-11-19 17:54:10 -08:00
Krishnan Parthasarathi
d0963974a5
pkg/lifecycle: Add SetPredictionHeaders method ()
This method is used to add expected expiration and transition time 
for an object in GET/HEAD Object response headers.

Also fixed bugs in lifecycle.PredictTransitionTime and
getLifecycleTransitionTier in handling current and 
non-current versions.
2021-07-20 17:36:55 -07:00
Harshavardhana
cdeccb5510
feat: Deprecate embedded browser and import console ()
This feature also changes the default port where
the browser is running, now the port has moved
to 9001 and it can be configured with

```
--console-address ":9001"
```
2021-06-17 20:27:04 -07:00
Harshavardhana
1f262daf6f
rename all remaining packages to internal/ ()
This is to ensure that there are no projects
that try to import `minio/minio/pkg` into
their own repo. Any such common packages should
go to `https://github.com/minio/pkg`
2021-06-01 14:59:40 -07:00
Harshavardhana
f7a87b30bf Revert "deprecate embedded browser ()"
This reverts commit 736d8cbac483d8bf56c3422ca9a9c4c3e043c6cf.

Bring contrib files for older contributions
2021-04-30 08:50:39 -07:00
Harshavardhana
736d8cbac4
deprecate embedded browser ()
https://github.com/minio/console takes over the functionality for the
future object browser development

Signed-off-by: Harshavardhana <harsha@minio.io>
2021-04-27 10:52:12 -07:00
Harshavardhana
069432566f update license change for MinIO
Signed-off-by: Harshavardhana <harsha@minio.io>
2021-04-23 11:58:53 -07:00
Harshavardhana
7875d472bc
avoid notification for non-existent delete objects ()
Skip notifications on objects that might have had
an error during deletion, this also avoids unnecessary
replication attempt on such objects.

Refactor some places to make sure that we have notified
the client before we

- notify
- schedule for replication
- lifecycle etc.
2021-02-10 22:00:42 -08:00
Harshavardhana
a6c146bd00
validate storage class across pools when setting config ()
```
mc admin config set alias/ storage_class standard=EC:3
```

should only succeed if parity ratio is valid for all
server pools, if not we should fail proactively.

This PR also needs to bring other changes now that
we need to cater for variadic drive counts per pool.

Bonus fixes also various bugs reproduced with

- GetObjectWithPartNumber()
- CopyObjectPartWithOffsets()
- CopyObjectWithMetadata()
- PutObjectPart,PutObject with truncated streams
2021-01-22 12:09:24 -08:00
Anis Elleuch
284a2b9021
ilm: Send delete marker creation event when appropriate ()
Before this commit, the crawler ILM will always send object delete event
notification though this is wrong.
2020-10-16 21:22:12 -07:00
Harshavardhana
736e58dd68
fix: handle concurrent lockers with multiple optimizations ()
- select lockers which are non-local and online to have
  affinity towards remote servers for lock contention

- optimize lock retry interval to avoid sending too many
  messages during lock contention, reduces average CPU
  usage as well

- if bucket is not set, when deleteObject fails make sure
  setPutObjHeaders() honors lifecycle only if bucket name
  is set.

- fix top locks to list out always the oldest lockers always,
  avoid getting bogged down into map's unordered nature.
2020-10-08 12:32:32 -07:00
Harshavardhana
8e7c00f3d4
add missing request-id from DeleteObject events ()
fixes 
2020-10-02 13:36:13 -07:00
Harshavardhana
0104af6bcc
delayed locks until we have started reading the body ()
This is to ensure that Go contexts work properly, after some
interesting experiments I found that Go net/http doesn't
cancel the context when Body is non-zero and hasn't been
read till EOF.

The following gist explains this, this can lead to pile up
of go-routines on the server which will never be canceled
and will die at a really later point in time, which can
simply overwhelm the server.

https://gist.github.com/harshavardhana/c51dcfd055780eaeb71db54f9c589150

To avoid this refactor the locking such that we take locks after we
have started reading from the body and only take locks when needed.

Also, remove contextReader as it's not useful, doesn't work as expected
context is not canceled until the body reaches EOF so there is no point
in wrapping it with context and putting a `select {` on it which
can unnecessarily increase the CPU overhead.

We will still use the context to cancel the lockers etc.
Additional simplification in the locker code to avoid timers
as re-using them is a complicated ordeal avoid them in
the hot path, since locking is very common this may avoid
lots of allocations.
2020-09-14 15:57:13 -07:00
Harshavardhana
14b1c9f8e4
fix: return Range errors after If-Matches ()
closes 
2020-07-17 13:01:22 -07:00
Harshavardhana
4915433bd2
Support bucket versioning ()
- Implement a new xl.json 2.0.0 format to support,
  this moves the entire marshaling logic to POSIX
  layer, top layer always consumes a common FileInfo
  construct which simplifies the metadata reads.
- Implement list object versions
- Migrate to siphash from crchash for new deployments
  for object placements.

Fixes 
2020-06-12 20:04:01 -07:00
kannappanr
2c372a9894
Send Partscount only when partnumber is specified ()
Fixes 
2020-06-10 09:22:15 -07:00
Anis Elleuch
790323ac37
lifecycle: Fix object expiration date ()
re-use PredictExpiryTime() in ComputeAction()
2020-06-09 09:40:53 -07:00
Anis Elleuch
cdf4815a6b
Add x-amz-expiration header in some S3 responses ()
x-amz-expiration is described in the S3 specification as a header which
indicates if the object in question will expire any time in the future.
2020-05-21 14:12:52 -07:00
Harshavardhana
7290d23b26
Apply partNumber checks only on multipart objects () 2020-05-06 16:58:09 -07:00
Harshavardhana
282c9f790a
fix: validate partNumber in queryParam as part of preConditions () 2020-04-20 22:01:59 -07:00
Harshavardhana
c43f745449
Ensure that we use constants everywhere ()
This allows for canonicalization of the strings
throughout our code and provides a common space
for all these constants to reside.

This list is rather non-exhaustive but captures
all the headers used in AWS S3 API operations
2019-07-02 22:34:32 -07:00
kannappanr
5ecac91a55
Replace Minio refs in docs with MinIO and links () 2019-04-09 11:39:42 -07:00
Harshavardhana
a2e344bf30 Preserve ETag case for S3 compatibility ()
Most hadoop distributions hortonworks, cloudera all
depend on aws-sdk-java 1.7.x to 1.10.x - the releases
which have bugs related case sensitive check for
ETag header. Go changes the case of the headers set
to be canonical but only preserves them when set
through a direct map.

This fixes most compatibility issues we have had
in the past supporting older hadoop distributions.
2019-04-08 16:54:46 -07:00
poornas
023866642c canonicalize ETag correctly ()
Fixes  
Trim extra quotes prefixing/suffixing ETag in
CompleteMultipartUpload request.
2019-04-01 12:19:52 -07:00
Anis Elleuch
8689ec258b Don't decrypt ETag in validation when source is SSEC multipart ()
Copying an encrypted SSEC object when this latter is uploaded using
multipart mechanism was failing because ETag in case of encrypted
multipart upload is not encrypted.

This PR fixes the behavior.
2019-03-25 12:17:31 -07:00
Harshavardhana
e0a87e96de
Populate host value from GetSourceIP directly () 2019-03-25 11:45:42 -07:00
Anis Elleuch
b05825ffe8 s3: Fix precondition failed in CopyObjectPart when src is encrypted ()
CopyObject precondition checks into GetObjectReader
in order to perform SSE-C pre-condition checks using the
last 32 bytes of encrypted ETag rather than the decrypted
ETag

This also necessitates moving precondition checks for
gateways to gateway layer rather than object handler check
2019-03-06 12:38:41 -08:00