AIExplainer
AI Hardware Intermediate

shard

A logical division of the training set or the model.

A logical division of the training set or the model. Typically, some process creates shards by dividing the examples or parameters into (usually) equal-sized chunks. Each shard is then assigned to a different machine. Sharding a model is called model parallelism; sharding data is called data parallelism.

Practitioners refer to shard when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.