What is a convex function?
A function in which the region above the graph of the function is a convex set.
convex function explained in plain English
A function in which the region above the graph of the function is a convex set. The prototypical convex function is shaped something like the letter U. For example, the following are all convex functions: In contrast, the following function is not convex. Notice how the region above the graph is not a convex set: A strictly convex function has exactly one local minimum point, which is also the global minimum point. The classic U-shaped functions are strictly convex functions. However, some convex functions (for example, straight lines) are not U-shaped.
A lot of the common loss functions, including the following, are convex functions: - L2 loss - Log Loss - L1 regularization - L2 regularization Many variations of gradient descent are guaranteed to find a point close to the minimum of a strictly convex function. Similarly, many variations of stochastic gradient descent have a high probability (though, not a guarantee) of finding a point close to the minimum of a strictly convex function. The sum of two convex functions (for example, L2 loss + L1 regularization) is a convex function. Deep models are never convex functions. Remarkably, algorithms designed for convex optimization tend to find reasonably good solutions on deep networks anyway, even though those solutions are not guaranteed to be a global minimum. --- See Convergence and convex functions in Machine Learning Crash Course for more information.
Example
Practitioners refer to convex function when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.
People also read
- auxiliary loss
A loss function—used in conjunction with a neural network model's main loss function—that helps accelerate training during the early iterations when weights are randomly initialized.
- Backpropagation
The process that tells a neural network which internal settings caused an error and how to adjust them, working backwards through layers.
- Bayesian neural network
A probabilistic neural network that accounts for uncertainty in weights and outputs.
- convolution
In mathematics, casually speaking, a mixture of two functions.
- convolutional filter
One of the two actors in a convolutional operation.
- convolutional layer
A layer of a deep neural network in which a convolutional filter passes along an input matrix.
- convolutional operation
The following two-step mathematical operation: 1.
- embedding layer
A special hidden layer that trains on a high-dimensional categorical feature to gradually learn a lower dimension embedding vector.
- embedding vector
Broadly speaking, an array of floating-point numbers taken from any hidden layer that describe the inputs to that hidden layer.
- full softmax
Synonym for softmax.