minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	2760fc86af	Bump default idleConnsPerHost to control conns in time_wait (#10653 ) This PR fixes a hang which occurs quite commonly at higher concurrency by allowing following changes - allowing lower connections in time_wait allows faster socket open's - lower idle connection timeout to ensure that we let kernel reclaim the time_wait connections quickly - increase somaxconn to 4096 instead of 2048 to allow larger tcp syn backlogs. fixes #10413	2020-10-12 14:19:46 -07:00
Harshavardhana	736e58dd68	fix: handle concurrent lockers with multiple optimizations (#10640 ) - select lockers which are non-local and online to have affinity towards remote servers for lock contention - optimize lock retry interval to avoid sending too many messages during lock contention, reduces average CPU usage as well - if bucket is not set, when deleteObject fails make sure setPutObjHeaders() honors lifecycle only if bucket name is set. - fix top locks to list out always the oldest lockers always, avoid getting bogged down into map's unordered nature.	2020-10-08 12:32:32 -07:00
Harshavardhana	1f9abbee4d	make sure to release locks upon timeout (#10596 ) fixes #10418	2020-09-29 15:18:34 -07:00
Harshavardhana	c13afd56e8	Remove MaxConnsPerHost settings to avoid potential hangs (#10438 ) MaxConnsPerHost can potentially hang a call without any way to timeout, we do not need this setting for our proxy and gateway implementations instead IdleConn settings are good enough. Also ensure to use NewRequestWithContext and make sure to take the disks offline only for network errors. Fixes #10304	2020-09-08 14:22:04 -07:00
Harshavardhana	b0e1d4ce78	re-attach offline drive after new drive replacement (#10416 ) inconsistent drive healing when one of the drive is offline while a new drive was replaced, this change is to ensure that we can add the offline drive back into the mix by healing it again.	2020-09-04 17:09:02 -07:00
Harshavardhana	eb19c8af40	Bump response header timeout for proxying list request (#10420 )	2020-09-04 16:07:40 -07:00
Harshavardhana	a359e36e35	tolerate listing with only readQuorum disks (#10357 ) We can reduce this further in the future, but this is a good value to keep around. With the advent of continuous healing, we can be assured that namespace will eventually be consistent so we are okay to avoid the necessity to a list across all drives on all sets. Bonus Pop()'s in parallel seem to have the potential to wait too on large drive setups and cause more slowness instead of gaining any performance remove it for now. Also, implement load balanced reply for local disks, ensuring that local disks have an affinity for - cleanupStaleMultipartUploads()	2020-08-26 19:29:35 -07:00
Harshavardhana	9fd836e51f	add dnsStore interface for upcoming operator webhook (#10077 )	2020-07-20 12:28:48 -07:00
Anis Elleuch	778e9c864f	Move dependency from minio-go v6 to v7 (#10042 )	2020-07-14 09:38:05 -07:00
Anis Elleuch	2be20588bf	Reroute requests based token heal/listing (#9939 ) When manual healing is triggered, one node in a cluster will become the authority to heal. mc regularly sends new requests to fetch the status of the ongoing healing process, but a load balancer could land the healing request to a node that is not doing the healing request. This PR will redirect a request to the node based on the node index found described as part of the client token. A similar technique is also used to proxy ListObjectsV2 requests by encoding this information in continuation-token	2020-07-03 11:53:03 -07:00
Krishna Srinivas	4c266df863	fix: proxy ListObjects request to one of the server based on hash(bucket) (#9881 )	2020-07-02 10:56:22 -07:00
Harshavardhana	c0ac25bfff	fix: readiness needs to be like liveness (#9941 ) Readiness as no reasoning to be cluster scope because that is not how the k8s networking works for pods, all the pods to a deployment are not sharing the network in a singleton. Instead they are run as local scopes to themselves, with readiness failures the pod is potentially taken out of the network to be resolvable - this affects the distributed setup in myriad of different ways. Instead readiness should behave like liveness with local scope alone, and should be a dummy implementation. This PR all the startup times and overal k8s startup time dramatically improves. Added another handler called as `/minio/health/cluster` to understand the cluster scope health.	2020-06-30 11:28:27 -07:00
Harshavardhana	4915433bd2	Support bucket versioning (#9377 ) - Implement a new xl.json 2.0.0 format to support, this moves the entire marshaling logic to POSIX layer, top layer always consumes a common FileInfo construct which simplifies the metadata reads. - Implement list object versions - Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	2020-06-12 20:04:01 -07:00
Harshavardhana	f44cfb2863	use GlobalContext whenever possible (#9280 ) This change is throughout the codebase to ensure that all codepaths honor GlobalContext	2020-04-09 09:30:02 -07:00
Harshavardhana	813e0fc1a8	fix: optimize isConnected to avoid url.String() conversions (#9202 ) Stringifying in a loop can tax the system, avoid this and convert the endpoints to strings early on and remember them for the lifetime of the server.	2020-03-24 18:53:24 -07:00
Harshavardhana	6f6a2214fc	Add rate limiter for S3 API layer (#9196 ) - total number of S3 API calls per server - maximum wait duration for any S3 API call This implementation is primarily meant for situations where HDDs are not capable enough to handle the incoming workload and there is no way to throttle the client. This feature allows MinIO server to throttle itself such that we do not overwhelm the HDDs.	2020-03-24 12:43:40 -07:00
Klaus Post	37b32199e3	Validate XL sets on format (#8779 ) When formatting a set validate if a host failure will likely lead to data loss. While we don't know what config will be set in the future evaluate to our best knowledge, assuming default settings.	2020-01-13 13:09:10 -08:00
Harshavardhana	5aa5dcdc6d	lock: improve locker initialization at init (#8776 ) Use reference format to initialize lockers during startup, also handle `nil` for NetLocker in dsync and remove errorLocker implementation Add further tuning parameters such as - DialTimeout is now 15 seconds from 30 seconds - KeepAliveTimeout is not 20 seconds, 5 seconds more than default 15 seconds - ResponseHeaderTimeout to 10 seconds - ExpectContinueTimeout is reduced to 3 seconds - DualStack is enabled by default remove setting it to `true` - Reduce IdleConnTimeout to 30 seconds from 1 minute to avoid idleConn build up Fixes #8773	2020-01-10 02:35:06 -08:00
Harshavardhana	60813bef29	Allow proper setCount SLAs across zones (#8752 ) Fixes scenario where zones are appropriately handled, along with supporting overriding set count. The new fix also ensures that we handle the various setup types properly. Update documentation to properly indicate the behavior. Fixes #8750 Co-authored-by: Nitish Tiwari <nitish@minio.io>	2020-01-07 09:13:44 -08:00
Harshavardhana	54431b3953	Change replica set detection for localhost on single endpoint (#8692 )	2019-12-24 11:31:32 -08:00
Harshavardhana	d140074773	fix: replica set deployment for multi tenants (#8673 ) Changes in IP underneath are dynamic in replica sets with multiple tenants, so deploying in that fashion will not work until we wait for atleast one participatory server to be local. This PR also ensures that multi-tenant zone expansion also works in replica set k8s deployments. Introduces a new ENV `KUBERNETES_REPLICA_SET` check to call appropriate code paths.	2019-12-19 13:45:56 -08:00
Harshavardhana	39face27cf	Simplify k8s replicated set deployment (#8666 ) Continuation from #8629 which basically broke zone deployments on k8s statefulset environment due to incorrect assumptions which made it work on replicated set. Fix this properly such that this container works for both replicated set and stateful set deployment	2019-12-18 17:05:24 -08:00
Harshavardhana	c9c0d5eec2	Allow CNAME records when specified as MINIO_PUBLIC_IPS (#8662 ) This is necessary for `m3` global bucket support	2019-12-18 11:02:45 +05:30
Harshavardhana	3e9ab5f4a9	Fix k8s replica set deployment (#8629 ) In replica sets, hosts resolve to localhost IP automatically until the deployment fully comes up. To avoid this issue we need to wait for such resolution.	2019-12-10 20:28:22 -08:00
Harshavardhana	5d3d57c12a	Start using error wrapping with fmt.Errorf (#8588 ) Use fatih/errwrap to fix all the code to use error wrapping with fmt.Errorf()	2019-12-02 09:28:01 -08:00
Harshavardhana	5d65428b29	Handle localhost distributed setups properly (#8577 ) Fixes an issue reported by @klauspost and @vadmeste This PR also allows users to expand their clusters from single node XL deployment to distributed mode.	2019-11-26 11:42:10 -08:00
Harshavardhana	c3771df641	Add bootstrap REST handler for verifying server config (#8550 )	2019-11-22 12:45:13 -08:00
Harshavardhana	4e9de58675	Avoid pointer based copy, instead use Clone() (#8547 ) This PR adds functional test to test expanded cluster syntax.	2019-11-21 17:54:51 +05:30
Harshavardhana	347b29d059	Implement bucket expansion (#8509 )	2019-11-19 17:42:27 -08:00
Harshavardhana	e9b2bf00ad	Support MinIO to be deployed on more than 32 nodes (#8492 ) This PR implements locking from a global entity into a more localized set level entity, allowing for locks to be held only on the resources which are writing to a collection of disks rather than a global level. In this process this PR also removes the top-level limit of 32 nodes to an unlimited number of nodes. This is a precursor change before bring in bucket expansion.	2019-11-13 12:17:45 -08:00
Harshavardhana	9e7a3e6adc	Extend further validation of config values (#8469 ) - This PR allows config KVS to be validated properly without being affected by ENV overrides, rejects invalid values during set operation - Expands unit tests and refactors the error handling for notification targets, returns error instead of ignoring targets for invalid KVS - Does all the prep-work for implementing safe-mode style operation for MinIO server, introduces a new global variable to toggle safe mode based operations NOTE: this PR itself doesn't provide safe mode operations	2019-10-30 23:39:09 -07:00
poornas	d7060c4c32	Allow logging targets to be configured to receive `minio` (#8347 ) specific errors, `application` errors or `all` by default. console logging on server by default lists all logs - enhance admin console API to accept `type` as query parameter to subscribe to application/minio logs.	2019-10-11 18:50:54 -07:00
Harshavardhana	36e12a6038	Assume local endpoints appropriately in k8s deployments (#8375 ) On Kubernetes/Docker setups DNS resolves inappropriately sometimes where there are situations same endpoints with multiple disks come online indicating either one of them is local and some of them are not local. This situation can never happen and its only a possibility in orchestrated deployments with dynamic DNS. Following code ensures that we treat if one of the endpoint says its local for a given host it is true for all endpoints for the same host. Following code ensures that this assumption is true and it works in all scenarios and it is safe to assume for a given host. This PR also adds validation such that we do not crash the server if there are bugs in the endpoints list in dsync initialization. Thanks to Daniel Valdivia <hola@danielvaldivia.com> for reproducing this, this fix is needed as part of the https://github.com/minio/m3 project.	2019-10-10 10:14:17 +05:30
Harshavardhana	290ad0996f	Move etcd, logger, crypto into their own packages (#8366 ) - Deprecates _MINIO_PROFILER, `mc admin profile` does the job - Move ENVs to common location in cmd/config/	2019-10-08 11:17:56 +05:30
Harshavardhana	589e32a4ed	Refactor config and split them in packages (#8351 ) This change is related to larger config migration PR change, this is a first stage change to move our configs to `cmd/config/` - divided into its subsystems	2019-10-04 23:05:33 +05:30
Harshavardhana	73e4e99942	Hosts should be skipped, when calculating local info (#8191 ) endpoint.IsLocal will not have .Host entries so using them to skip double entries will never work. change the code such that we look for endpoint.Host outside of endpoint.IsLocal logic to skip double hosts appropriately. Move these functions to their appropriate file.	2019-09-12 23:36:12 +05:30
Harshavardhana	e6d8e272ce	Use const slashSeparator instead of "/" everywhere (#8028 )	2019-08-06 12:08:58 -07:00
maihde	5cd9f10a02	Support Federation on a single machine (#8009 ) When checking if federation is necessary, the code compares the SRV record stored in etcd against the list of endpoints that the MinIO server is exposing. If there is an intersection in this list the request is forwarded. The SRV record includes both the host and the port, but the intersection check previously only looked at the IP address. This would prevent federation from working in situations where the endpoint IP is the same for multiple MinIO servers. Some examples of where this can occur are: - running mulitiple copies of MinIO on the same host - using multiple MinIO servers behind a NAT with port-forwarding	2019-08-02 12:40:51 -07:00
poornas	0505ef83b5	Fix host address returned in admin API calls (#7846 )	2019-07-05 20:41:35 -07:00
Harshavardhana	2c0b3cadfc	Update go mod with sem versions of our libraries (#7687 )	2019-05-29 16:35:12 -07:00
Harshavardhana	091b9b661f	Complain if we detect sub-optimal ordering in distributed setup (#7576 ) Fixes #6156	2019-04-29 10:10:50 +05:30
Praveen raj Mani	d96584ef58	Allow server to start if one of local nodes in docker/kubernetes setup is resolved (#7452 ) Allow server to start if one of the local nodes in docker/kubernetes setup is successfully resolved - The rule is that we need atleast one local node to work. We dont need to resolve the rest at that point. - In a non-orchestrational setup, we fail if we do not have atleast one local node up and running. - In an orchestrational setup (docker-swarm and kubernetes), We retry with a sleep of 5 seconds until any one local node shows up. Fixes #6995	2019-04-19 10:26:44 -07:00
kannappanr	5ecac91a55	Replace Minio refs in docs with MinIO and links (#7494 )	2019-04-09 11:39:42 -07:00
Harshavardhana	91576d416d	Fix GetLocalPeer usage in perf handlers (#7249 ) GetLocalPeer usage should be fixed and used only once per call for not all local endpoints.	2019-02-20 16:04:55 -08:00
Harshavardhana	df35d7db9d	Introduce staticcheck for stricter builds (#7035 )	2019-02-13 18:29:36 +05:30
Sidhartha Mani	34e7259f95	Add Historic CPU and memory stats (#7136 ) Collect historic cpu and mem stats. Also, use actual values instead of formatted strings while returning to the client. The string formatting prevents values from being processed by the server or by the client without parsing it. This change will allow the values to be processed (eg. compute rolling-average over the lifetime of the minio server) and offloads the formatting to the client.	2019-01-30 12:47:32 +05:30
Sidhartha Mani	f3f47d8cd3	Add ServerCPULoadInfo() and ServerMemUsageInfo() admin API (#7038 )	2019-01-09 19:04:19 -08:00
Nitish Tiwari	fcb56d864c	Add ServerDrivesPerfInfo() admin API (#6969 ) This is part of implementation for mc admin health command. The ServerDrivesPerfInfo() admin API returns read and write speed information for all the drives (local and remote) in a given Minio server deployment. Part of minio/mc#2606	2018-12-31 09:46:44 -08:00
Harshavardhana	bebaff269c	Support IPv6 in minio command line (#6947 ) Fixes #6946	2018-12-14 13:07:46 +05:30
Anis Elleuch	61145361fd	Fallback to non-loopback IF addresses for Domain IPs (#6918 ) When MINIO_PUBLIC_IPS is not specified and no endpoints are passed as arguments, fallback to the address of non loop-back interfaces. This is useful so users can avoid setting MINIO_PUBLIC_IPS in docker or orchestration scripts, ince users naturally setup an internal network that connects all instances.	2018-12-04 17:35:22 -08:00
Harshavardhana	6fe9a613c0	Prioritize HTTP requests over Heal (#6468 ) Additionally also heal 256 objects at any given time in parallel. Fixes #6196 Fixes #6241	2018-09-17 18:28:34 -07:00
Nitish Tiwari	3dc13323e5	Use random host from among multiple hosts to create requests Also use hosts passed to Minio startup command to populate IP addresses if MINIO_PUBLIC_IPS is not set.	2018-06-08 10:22:01 -07:00
Anis Elleuch	32700fca52	Enhance fatal errors printing of common issues seen by users (#5878 )	2018-05-08 19:04:36 -07:00
kannappanr	f8a3fd0c2a	Create logger package and rename errorIf to LogIf (#5678 ) Removing message from error logging Replace errors.Trace with LogIf	2018-04-05 15:04:40 -07:00
Bala FA	0e4431725c	make notification as separate package (#5294 ) * Remove old notification files * Add net package * Add event package * Modify minio to take new notification system	2018-03-15 13:03:41 -07:00
Harshavardhana	fb96779a8a	Add large bucket support for erasure coded backend (#5160 ) This PR implements an object layer which combines input erasure sets of XL layers into a unified namespace. This object layer extends the existing erasure coded implementation, it is assumed in this design that providing > 16 disks is a static configuration as well i.e if you started the setup with 32 disks with 4 sets 8 disks per pack then you would need to provide 4 sets always. Some design details and restrictions: - Objects are distributed using consistent ordering to a unique erasure coded layer. - Each pack has its own dsync so locks are synchronized properly at pack (erasure layer). - Each pack still has a maximum of 16 disks requirement, you can start with multiple such sets statically. - Static sets set of disks and cannot be changed, there is no elastic expansion allowed. - Static sets set of disks and cannot be changed, there is no elastic removal allowed. - ListObjects() across sets can be noticeably slower since List happens on all servers, and is merged at this sets layer. Fixes #5465 Fixes #5464 Fixes #5461 Fixes #5460 Fixes #5459 Fixes #5458 Fixes #5460 Fixes #5488 Fixes #5489 Fixes #5497 Fixes #5496	2018-02-15 17:45:57 -08:00
Harshavardhana	22897de4c7	fail when endpoints point to same path locally (#5523 )	2018-02-15 14:38:17 +05:30
Aditya Manthramurthy	a337ea4d11	Move admin APIs to new path and add redesigned heal APIs (#5351 ) - Changes related to moving admin APIs - admin APIs now have an endpoint under /minio/admin - admin APIs are now versioned - a new API to server the version is added at "GET /minio/admin/version" and all API operations have the path prefix /minio/admin/v1/<operation> - new service stop API added - credentials change API is moved to /minio/admin/v1/config/credential - credentials change API and configuration get/set API now require TLS so that credentials are protected - all API requests now receive JSON - heal APIs are disabled as they will be changed substantially - Heal API changes Heal API is now provided at a single endpoint with the ability for a client to start a heal sequence on all the data in the server, a single bucket, or under a prefix within a bucket. When a heal sequence is started, the server returns a unique token that needs to be used for subsequent 'status' requests to fetch heal results. On each status request from the client, the server returns heal result records that it has accumulated since the previous status request. The server accumulates upto 1000 records and pauses healing further objects until the client requests for status. If the client does not request any further records for a long time, the server aborts the heal sequence automatically. A heal result record is returned for each entity healed on the server, such as system metadata, object metadata, buckets and objects, and has information about the before and after states on each disk. A client may request to force restart a heal sequence - this causes the running heal sequence to be aborted at the next safe spot and starts a new heal sequence.	2018-01-22 14:54:55 -08:00
Harshavardhana	2755a0b763	Check if SSL is configured to validate input arguments (#5252 ) This PR handles following situations - secure endpoints provided, server should fail to start if TLS is not configured - insecure endpoints provided, server starts ignoring if TLS is configured or not. Fixes #5251	2017-12-04 12:17:12 +05:30
Harshavardhana	d10679866c	Fix minio distributed setup to properly work on windows (#5152 ) On windows having a preceding "/" will cause problems, if the command line already has C:/<export-folder/ in it. Final resulting path on windows might become C:/C:/ this will cause problems of starting minio server properly in distributed mode on windows. As a special case make sure to trim off the separator. NOTE: It is also perfectly fine for windows users to have a path without C:/ since at that point we treat it as relative path and obtain the full filesystem path as well. Providing C:/ style is necessary to provide paths other than C:/, such as F:/, D:/ etc. Another additional benefit here is that this style also supports providing UNC paths as well. Fixes #5136	2017-11-12 08:09:53 +05:30
Harshavardhana	db6b6e9518	S3 peers should be initialized properly (#5024 ) Fixes #4991	2017-10-08 20:23:42 -07:00
Harshavardhana	879cef37a1	Fail to start server if detected cross-device mounts. (#4807 ) Fixes #4764	2017-08-15 15:10:50 -07:00
Dee Koder	1978b9d8f9	Prevent minio server starting in standalone erasure mode for wrong inputs. (#4700 ) It is possible at times due to a typo when distributed mode was intended a user might end up starting standalone erasure mode causing confusion. Add code to check this based on some standard heuristic guess work and report an error to the user. Fixes #4686	2017-08-11 11:47:28 -07:00
Frank Wessels	46897b1100	Name return values to prevent the need (and unnecessary code bloat) (#4576 ) This is done to explicitly instantiate objects for every return statement.	2017-06-21 19:53:09 -07:00
Krishna Srinivas	ec2920e981	Allow "minio server ." to start minio in fs mode (#4513 )	2017-06-08 18:58:51 -07:00
Anis Elleuch	542f7ae42c	gateway: Reject endpoint pointing to local gateway (#4310 ) Show an error when the user enters an endpoint url pointing to the gateway server itself.	2017-05-16 21:13:29 -07:00
Frank	0d1e2ab509	Remove hardcoded min and max limit for erasure coding (#4157 )	2017-04-24 10:00:33 -07:00
Bala FA	d103d5fb7c	server: Error out if loopback addr is used for Distributed Erasure (#4105 )	2017-04-12 20:27:24 -07:00
Bala FA	de204a0a52	Add extensive endpoints validation (#4019 )	2017-04-11 15:44:27 -07:00

1 2 3

119 Commits