What are chunks in MongoDB sharding and how does chunk splitting and migration work?

Problem Statement

Explanation

Chunks are logical groupings of documents based on shard key ranges. MongoDB divides the shard key space into chunks, and each chunk contains documents whose shard key values fall within a specific range. For example, one chunk might contain all documents with user IDs from 1 to 1000, while another contains 1001 to 2000. By default, each chunk has a maximum size of 64 MB. When a chunk grows beyond this size due to inserts or updates, MongoDB automatically splits it into two smaller chunks. Chunk splitting is a metadata operation that updates the config servers; no data is moved during a split. The split point is chosen to divide the chunk into roughly equal sizes. After chunks are split, the balancer monitors the distribution of chunks across shards. If one shard has significantly more chunks than another, the balancer migrates chunks from the heavily loaded shard to less loaded shards. Chunk migration involves copying documents from the source shard to the destination shard, then updating metadata on config servers to reflect the new location. During migration, the chunk being migrated remains on the source shard and continues serving queries until migration completes. Once all documents are copied and verified, the metadata is updated atomically. This ensures that chunk migrations are transparent to applications, though they do consume resources like network bandwidth and disk I/O. The balancer can be configured to run only during specific time windows to avoid impacting production workloads. You can also manually split chunks or move them if needed for special circumstances. However, automatic chunk management works well for most deployments. Chunk size affects migration frequency and efficiency. Larger chunks mean fewer but longer migrations. Smaller chunks mean more frequent but faster migrations. The default 64 MB is a good balance for most workloads.

Code Solution

SolutionRead Only

// View chunks for a collection
use config
db.chunks.find({ ns: "mydb.users" }).pretty()

// Example chunk
{
  _id: "mydb.users-userId_1000",
  ns: "mydb.users",
  min: { userId: 1000 },
  max: { userId: 2000 },
  shard: "shard0001"
}

// Chunk lifecycle:
// 1. Chunk grows beyond 64MB
// 2. MongoDB splits chunk at midpoint
//    Chunk A: userId 1000-1500
//    Chunk B: userId 1501-2000
// 3. Balancer detects imbalance
// 4. Balancer migrates Chunk B to another shard

// Manual chunk split (rarely needed)
sh.splitAt("mydb.users", { userId: 5000 })

// Change chunk size
use config
db.settings.updateOne(
  { _id: "chunksize" },
  { $set: { value: 128 } },  // 128MB chunks
  { upsert: true }
)

Practice Sets

This question appears in the following practice sets:

MongoDB Sharding & Scalability

Next Question

Explanation

Code Solution

SolutionRead Only

// View chunks for a collection
use config
db.chunks.find({ ns: "mydb.users" }).pretty()

// Example chunk
{
  _id: "mydb.users-userId_1000",
  ns: "mydb.users",
  min: { userId: 1000 },
  max: { userId: 2000 },
  shard: "shard0001"
}

// Chunk lifecycle:
// 1. Chunk grows beyond 64MB
// 2. MongoDB splits chunk at midpoint
//    Chunk A: userId 1000-1500
//    Chunk B: userId 1501-2000
// 3. Balancer detects imbalance
// 4. Balancer migrates Chunk B to another shard

// Manual chunk split (rarely needed)
sh.splitAt("mydb.users", { userId: 5000 })

// Change chunk size
use config
db.settings.updateOne(
  { _id: "chunksize" },
  { $set: { value: 128 } },  // 128MB chunks
  { upsert: true }
)

What are chunks in MongoDB sharding and how does chunk splitting and migration work?

Problem Statement

Explanation

Code Solution

Practice Sets

Related Questions

What format does MongoDB use to store data internally?

What field serves as the primary key in MongoDB documents?

Which of the following is NOT a valid MongoDB data type?

What is the main advantage of MongoDB's schema-less design?

Which method would you use to insert multiple documents at once in MongoDB?

More from MongoDB/NoSQL

Master Interviews
Anywhere, Anytime

What are chunks in MongoDB sharding and how does chunk splitting and migration work?

Problem Statement

Explanation

Code Solution

Practice Sets

Related Questions

What format does MongoDB use to store data internally?

What field serves as the primary key in MongoDB documents?

Which of the following is NOT a valid MongoDB data type?

What is the main advantage of MongoDB's schema-less design?

Which method would you use to insert multiple documents at once in MongoDB?

More from MongoDB/NoSQL

What are chunks in MongoDB sharding and how does chunk splitting and migration work?

Problem Statement

Explanation

Code Solution

Practice Sets

Related Questions

What format does MongoDB use to store data internally?

What field serves as the primary key in MongoDB documents?

Which of the following is NOT a valid MongoDB data type?

What is the main advantage of MongoDB's schema-less design?

Which method would you use to insert multiple documents at once in MongoDB?

More from MongoDB/NoSQL

Master Interviews Anywhere, Anytime

What are chunks in MongoDB sharding and how does chunk splitting and migration work?

Problem Statement

Explanation

Code Solution

Practice Sets

Related Questions

What format does MongoDB use to store data internally?

What field serves as the primary key in MongoDB documents?

Which of the following is NOT a valid MongoDB data type?

What is the main advantage of MongoDB's schema-less design?

Which method would you use to insert multiple documents at once in MongoDB?

More from MongoDB/NoSQL

Master Interviews
Anywhere, Anytime