What is a broadcasting?
Expanding the shape of an operand in a matrix math operation to dimensions compatible for that operation.
broadcasting explained in plain English
Expanding the shape of an operand in a matrix math operation to dimensions compatible for that operation. For example, linear algebra requires that the two operands in a matrix addition operation must have the same dimensions. Consequently, you can't add a matrix of shape (m, n) to a vector of length n. Broadcasting enables this operation by virtually expanding the vector of length n to a matrix of shape (m, n) by replicating the same values down each column.
Given the following definitions of A and B, linear algebra prohibits A+B because A and B have different dimensions:
However, broadcasting enables the operation A+B by virtually expanding B to:
Example
--- See the following description of broadcasting in NumPy for more details.
People also read
- Backpropagation
The process that tells a neural network which internal settings caused an error and how to adjust them, working backwards through layers.
- feature engineering
A process that involves the following steps: 1.
- Tensor
The primary data structure in TensorFlow programs.
- act
A stage in the agentic loop in which the agent executes the action chosen during the reason stage.
- AUC
A number between 0.
- auxiliary loss
A loss function—used in conjunction with a neural network model's main loss function—that helps accelerate training during the early iterations when weights are randomly initialized.
- bag of words
A representation of the words in a phrase or passage, irrespective of order.
- Bayesian neural network
A probabilistic neural network that accounts for uncertainty in weights and outputs.
- Bayesian optimization
A probabilistic regression model technique for optimizing computationally expensive objective functions by instead optimizing a surrogate that quantifies the uncertainty using a Bayesian learning technique.
- bidirectional language model
A language model that determines the probability that a given token is present at a given location in an excerpt of text based on the preceding and following text.