Hill Climbing

An algorithm for iteratively improving ("walking uphill") an ML model until the model stops improving ("reaches the top of a hill"). The general form of the algorithm is as follows:

Build a starting model.

Create new candidate models by making small adjustments to the way you train or fine-tune. This might entail working with a slightly different training set or different hyperparameters.

Evaluate the new candidate models and take one of the following actions:

If a candidate model outperforms the starting model, then that candidate model becomes the new starting model. In this case, repeat Steps 1, 2, and 3.

If no model outperforms the starting model, then you've reached the top of the hill and should stop iterating.

Note: Think of the top of the hill as a local maximum that isn't necessarily a global maximum. That is, hill climbing can help you find the best model within your current constraints. However, you might be able to build an even better model by starting over with a new approach.

See Deep Learning Tuning Playbook for guidance on hyperparameter tuning. See the Data modules of Machine Learning Crash Course for guidance on feature engineering.

Real-world uses

Created for this library

1.
An operations team uses hill climbing on a simple cost surface to optimize a small set of dispatch parameters when training a full RL policy is not justified.
2.
A research team uses hill climbing as a baseline optimizer to compare against more sophisticated methods on a low-dimensional problem.
3.
A pricing team uses hill climbing to tune a handful of business rules where gradient-based optimization is not applicable.

Back to glossary

An algorithm for iteratively improving ("walking uphill") an ML model until the model stops improving ("reaches the top of a hill"). The general form of the algorithm is as follows:

Build a starting model.

Create new candidate models by making small adjustments to the way you train or fine-tune. This might entail working with a slightly different training set or different hyperparameters.

Evaluate the new candidate models and take one of the following actions:

If a candidate model outperforms the starting model, then that candidate model becomes the new starting model. In this case, repeat Steps 1, 2, and 3.

If no model outperforms the starting model, then you've reached the top of the hill and should stop iterating.

See Deep Learning Tuning Playbook for guidance on hyperparameter tuning. See the Data modules of Machine Learning Crash Course for guidance on feature engineering.

Real-world uses

Created for this library

1.
An operations team uses hill climbing on a simple cost surface to optimize a small set of dispatch parameters when training a full RL policy is not justified.
2.
A research team uses hill climbing as a baseline optimizer to compare against more sophisticated methods on a low-dimensional problem.
3.
A pricing team uses hill climbing to tune a handful of business rules where gradient-based optimization is not applicable.

Back to glossary

Hill Climbing

Real-world uses

Related terms

Loading…

Hill Climbing

Real-world uses

Related terms