What is a feature engineering?
A process that involves the following steps: 1.
feature engineering explained in plain English
A process that involves the following steps: 1. Determining which features might be useful in training a model. 2. Converting raw data from the dataset into efficient versions of those features. For example, you might determine that`temperature` might be a useful feature. Then, you might experiment with bucketing to optimize what the model can learn from different`temperature` ranges. Feature engineering is sometimes called feature extraction or featurization.
In TensorFlow, feature engineering often means converting raw log file entries to tf.Example protocol buffers. See also tf.Transform. --- See Numerical data: How a model ingests data using feature vectors in Machine Learning Crash Course for more information.
Example
Practitioners refer to feature engineering when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.
People also read
- Backpropagation
The process that tells a neural network which internal settings caused an error and how to adjust them, working backwards through layers.
- AUC
A number between 0.
- Bayesian neural network
A probabilistic neural network that accounts for uncertainty in weights and outputs.
- Bayesian optimization
A probabilistic regression model technique for optimizing computationally expensive objective functions by instead optimizing a surrogate that quantifies the uncertainty using a Bayesian learning technique.
- broadcasting
Expanding the shape of an operand in a matrix math operation to dimensions compatible for that operation.
- classification threshold
In a binary classification, a number between 0 and 1 that converts the raw output of a logistic regression model into a prediction of either the positive class or the negative class.
- configuration
The process of assigning the initial property values used to train a model, including: hyperparameters such as: - learning rate - iterations - optimizer - loss function In machine learning projects, c
- confusion matrix
An NxN table that summarizes the number of correct and incorrect predictions that a classification model made.
- cross-entropy
A generalization of Log Loss to multi-class classification problems.
- discriminative model
A model that predicts labels from a set of one or more features.