Deep Learning Intermediate 1 min read

What is an exploding gradient problem?

The tendency for gradients in deep neural networks (especially recurrent neural networks) to become surprisingly steep (high).

exploding gradient problem explained in plain English

The tendency for gradients in deep neural networks (especially recurrent neural networks) to become surprisingly steep (high). Steep gradients often cause very large updates to the weights of each node in a deep neural network. Models suffering from the exploding gradient problem become difficult or impossible to train. Gradient clipping can mitigate this problem. Compare to vanishing gradient problem.

Example

Practitioners refer to exploding gradient problem when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.

exploding gradient problem explained in plain English

Example

People also read