minio/docs/distributed/README.md
ebozduman fb4186f6b9 Adds missing info to docs for credentials and domain env. vars. (#6447)
* Adds missing information to documentation for credentials and domain environment variables for distributed minio server startup.
2018-09-10 17:14:40 -07:00

7.0 KiB
Raw Blame History

Distributed Minio Quickstart Guide Slack Go Report Card Docker Pulls codecov

Minio in distributed mode lets you pool multiple drives (even on different machines) into a single object storage server. As drives are distributed across several nodes, distributed Minio can withstand multiple node failures and yet ensure full data protection.

Why distributed Minio?

Minio in distributed mode can help you setup a highly-available storage system with a single object storage deployment. With distributed Minio, you can optimally use storage devices, irrespective of their location in a network.

Data protection

Distributed Minio provides protection against multiple node/drive failures and bit rot using erasure code. As the minimum disks required for distributed Minio is 4 (same as minimum disks required for erasure coding), erasure code automatically kicks in as you launch distributed Minio.

High availability

A stand-alone Minio server would go down if the server hosting the disks goes offline. In contrast, a distributed Minio setup with n disks will have your data safe as long as n/2 or more disks are online. You'll need a minimum of (n/2 + 1) Quorum disks to create new objects though.

For example, an 8-node distributed Minio setup, with 1 disk per node would stay put, even if upto 4 nodes are offline. But, you'll need at least 5 nodes online to create new objects.

Limits

As with Minio in stand-alone mode, distributed Minio has a per tenant limit of minimum 2 and maximum 32 servers. There are no limits on number of disks shared across these servers. If you need a multiple tenant setup, you can easily spin multiple Minio instances managed by orchestration tools like Kubernetes.

Note that with distributed Minio you can play around with the number of nodes and drives as long as the limits are adhered to. For example, you can have 2 nodes with 4 drives each, 4 nodes with 4 drives each, 8 nodes with 2 drives each, 32 servers with 24 drives each and so on.

You can also use storage classes to set custom data and parity distribution across total disks.

Consistency Guarantees

Minio follows strict read-after-write consistency model for all i/o operations both in distributed and standalone modes.

Get started

If you're aware of stand-alone Minio set up, the process remains largely the same, as the Minio server automatically switches to stand-alone or distributed mode, depending on the command line parameters.

1. Prerequisites

Install Minio - Minio Quickstart Guide.

2. Run distributed Minio

To start a distributed Minio instance, you just need to pass drive locations as parameters to the minio server command. Then, youll need to run the same command on all the participating nodes.

Note

  • All the nodes running distributed Minio need to have same access key and secret key for the nodes to connect. To achieve this, it is mandatory to export access key and secret key as environment variables, MINIO_ACCESS_KEY and MINIO_SECRET_KEY, on all the nodes before executing Minio server command.
  • All the nodes running distributed Minio need to be on homogenous environments i.e same operating system, same number of disks and same interconnects.
  • MINIO_DOMAIN environment variable should be defined and exported if domain is needed to be set.
  • Minio distributed mode requires fresh directories. If required, the drives can be shared with other applications. You can do this by using a sub-directory exclusive to minio. For example, if you have mounted your volume under /export, pass /export/data as arguments to Minio server.
  • The IP addresses and drive paths below are for demonstration purposes only, you need to replace these with the actual IP addresses and drive paths/folders.
  • Servers running distributed Minio instances should be less than 3 seconds apart. You can use NTP as a best practice to ensure consistent times across servers.
  • Running Distributed Minio on Windows is experimental as of now. Please proceed with caution.

Example 1: Start distributed Minio instance on 8 nodes with 1 disk each mounted at /export1 (pictured below), by running this command on all the 8 nodes: Distributed Minio, 8 nodes with 1 disk each

GNU/Linux and macOS

export MINIO_ACCESS_KEY=<ACCESS_KEY>
export MINIO_SECRET_KEY=<SECRET_KEY>
minio server http://192.168.1.1{1...18}/export1

Windows (experimental)

set MINIO_ACCESS_KEY=<ACCESS_KEY>
set MINIO_SECRET_KEY=<SECRET_KEY>
minio.exe server http://192.168.1.1{1...18}/C:/data

Example 2: Start distributed Minio instance on 4 nodes with 4 disks (pictured below), by running this command on all the 4 nodes: Distributed Minio, 4 nodes with 4 disks each

GNU/Linux and macOS

export MINIO_ACCESS_KEY=<ACCESS_KEY>
export MINIO_SECRET_KEY=<SECRET_KEY>
minio server http://192.168.1.1{1...14}/export{1...4}

Windows (experimental)

set MINIO_ACCESS_KEY=<ACCESS_KEY>
set MINIO_SECRET_KEY=<SECRET_KEY>
minio.exe server http://192.168.1.1{1...4}/C:/data{1...4}

NOTE: {1...n} shown have 3 dots! Using only 2 dots {1..4} will be interpreted by your shell and won't be passed to minio server, affecting the erasure coding order, which may impact performance and high availability. Always use {1...n} (3 dots!) to allow minio server to optimally erasure-code data

3. Test your setup

To test this setup, access the Minio server via browser or mc.

Explore Further