minio/pkg
Harshavardhana 7e1661f4fa Performance improvements to SELECT API on certain query operations (#6752)
This improves the performance of certain queries dramatically,
such as 'count(*)' etc.

Without this PR
```
~ time mc select --query "select count(*) from S3Object" myminio/sjm-airlines/star2000.csv.gz
2173762

real	0m42.464s
user	0m0.071s
sys	0m0.010s
```

With this PR
```
~ time mc select --query "select count(*) from S3Object" myminio/sjm-airlines/star2000.csv.gz
2173762

real	0m17.603s
user	0m0.093s
sys	0m0.008s
```

Almost a 250% improvement in performance. This PR avoids a lot of type
conversions and instead relies on raw sequences of data and interprets
them lazily.

```
benchcmp old new
benchmark                        old ns/op       new ns/op       delta
BenchmarkSQLAggregate_100K-4     551213          259782          -52.87%
BenchmarkSQLAggregate_1M-4       6981901985      2432413729      -65.16%
BenchmarkSQLAggregate_2M-4       13511978488     4536903552      -66.42%
BenchmarkSQLAggregate_10M-4      68427084908     23266283336     -66.00%

benchmark                        old allocs     new allocs     delta
BenchmarkSQLAggregate_100K-4     2366           485            -79.50%
BenchmarkSQLAggregate_1M-4       47455492       21462860       -54.77%
BenchmarkSQLAggregate_2M-4       95163637       43110771       -54.70%
BenchmarkSQLAggregate_10M-4      476959550      216906510      -54.52%

benchmark                        old bytes       new bytes      delta
BenchmarkSQLAggregate_100K-4     1233079         1086024        -11.93%
BenchmarkSQLAggregate_1M-4       2607984120      557038536      -78.64%
BenchmarkSQLAggregate_2M-4       5254103616      1128149168     -78.53%
BenchmarkSQLAggregate_10M-4      26443524872     5722715992     -78.36%
```
2018-11-14 15:55:10 -08:00
..
auth Check key length before adding a new user. (#6790) 2018-11-09 15:48:24 -08:00
bpool Add large bucket support for erasure coded backend (#5160) 2018-02-15 17:45:57 -08:00
certs Watch for symlinked certs in container envs (#6282) 2018-08-16 18:37:21 -07:00
cgroup sys/stats: return cgroup mem limit, fall back to sysinfo() (#4002) 2017-04-02 10:46:16 -07:00
disk Implement support for calculating disk usage per tenant (#5969) 2018-05-23 15:41:29 +05:30
dns Update federation target to etcd/clientv3 (#6119) 2018-07-12 14:12:40 -07:00
ellipses Add large bucket support for erasure coded backend (#5160) 2018-02-15 17:45:57 -08:00
event Enable default vet flags (#6810) 2018-11-14 10:23:44 -08:00
handlers Avoid logging the body of the http 206 response (#6258) 2018-08-08 12:34:42 -07:00
hash Add object compression support (#6292) 2018-09-28 09:06:17 +05:30
iam Re-populate public key if JWT fails to parse (#6786) 2018-11-08 17:01:20 -08:00
ioutil Add GetObjectNInfo to object layer (#6449) 2018-09-20 19:22:09 -07:00
lock Fix goroutine test fatalf (#6682) 2018-10-23 09:44:20 -07:00
madmin Check key length before adding a new user. (#6790) 2018-11-09 15:48:24 -08:00
mimedb Fix mimedb update files (#6744) 2018-10-31 14:15:27 -07:00
mountinfo disable disk-usage when export is root mount path (#6091) 2018-06-27 18:59:38 -07:00
net Remove panic() and handle it appropriately (#5807) 2018-04-19 17:24:43 -07:00
policy Sid value can be any unicode character support it (#6676) 2018-10-23 16:11:06 +05:30
quick Migrate config.json from config-dir to backend (#6195) 2018-08-15 10:11:47 +05:30
s3select Performance improvements to SELECT API on certain query operations (#6752) 2018-11-14 15:55:10 -08:00
safe Remove go1.9 specific code for windows (#5033) 2017-10-13 15:31:15 +05:30
sync/errgroup Add large bucket support for erasure coded backend (#5160) 2018-02-15 17:45:57 -08:00
sys Push max threads to little less than kernel limit (#5001) 2017-10-03 10:37:45 -07:00
trie trie: new package (#3729) 2017-02-10 11:51:41 -08:00
wildcard pkg/wildcard: Simplify the wildcard logic further. (#2555) 2016-08-27 00:27:17 -07:00
words words: new package Damerau Levenshtein distance function. (#3929) 2017-03-19 14:23:05 -07:00