5 Matching Annotations
  1. Jun 2017
    1. Replication is important for two primary reasons:
      • HA with fail-over mechanism.
      • scale out your search volume/throughput as searches can happen based on your replicas in parallel.
    2. Sharding is important for two primary reasons:
      • horizontal scaling of a single index
      • parallelize seek operations on multiple shards when index gets too big
    3. , Elasticsearch provides the ability to subdivide your index into multiple pieces called shards.

      Think of shard as the indexes broken down further to span over multiple nodes in your cluster.

    4. An index can potentially store a large amount of data that can exceed the hardware limits of a single node. For example, a single index of a billion documents taking up 1TB of disk space may not fit on the disk of a single node or may be too slow to serve search requests from a single node alone.

      Indexes may overflow the disk space. Hence you want to get the most out of your instances by indexing the nodes.

    1. The create index API allows to instantiate an index. Elasticsearch provides support for multiple indices, including executing operations across several indices.

      In this you could create different shard size per Index basis on Elastic. Super useful when you have a single cluster but multi-tenant