When fitting neural networks, I often run stochastic gradient descent multiple times and take the run with the lowest training loss. I'm trying to look up research literature on this practice, but I don't know what it's called. Any terms, keywords, or references are appreciated.

The closest thing I have found is "Stochastic Gradient Descent with Restarts", but I don't believe it's quite the same idea.

1 Answer

A common name for running a local optimization algorithm multiple times from different starting points and keeping the best result is "multi-start methods". This is studied in the field of global optimization. For a quick reference, see the MATLAB documentation for its Global Optimization Toolbox.
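
As a concrete illustration, here is a minimal multi-start sketch in PyTorch. The toy data, architecture, learning rate, and number of starts are placeholders chosen for the example, not anything prescribed by the method; the essential ingredient is simply repeating the local SGD run from a fresh random initialization each time and keeping the run with the lowest training loss.

```python
import torch
import torch.nn as nn

# Toy regression data (placeholders for the example).
torch.manual_seed(0)
X = torch.randn(128, 10)
y = torch.randn(128, 1)

def train_once(seed, epochs=200):
    """One local SGD run from a fresh random initialization."""
    torch.manual_seed(seed)
    model = nn.Sequential(nn.Linear(10, 32), nn.Tanh(), nn.Linear(32, 1))
    opt = torch.optim.SGD(model.parameters(), lr=0.05)
    loss_fn = nn.MSELoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss = loss_fn(model(X), y)
        loss.backward()
        opt.step()
    return loss.item(), model

# Multi-start: repeat the local optimization from several random
# initializations and keep the run with the lowest training loss.
best_loss, best_model = min(
    (train_once(seed) for seed in range(5)), key=lambda r: r[0]
)
print(f"best training loss over 5 starts: {best_loss:.4f}")
```

Note that selecting by *training* loss, as the question describes, only guards against bad local minima of the training objective; if the goal is generalization, one would typically select among the restarts using a held-out validation loss instead.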
