Gradient Descent is a method of finding the minimum of a function. This is extremely useful in machine learning and neural networks, as we want to find the combination of weights that minimises the error at the output of the network. We can treat the error function for a neural network with one hundred connections as a function with one hundred parameters: \( f(x_1, x_2, \ldots, x_{100}) \). The goal is to find the combination of \(x_i\) values that makes the function as close to zero as possible.
This is a difficult process to imagine in \(100\) dimensions, so let's imagine we have a neural network that has only two connections. We can then write the function we are trying to solve as \(f(x, y) = 0\), where \(x\) and \(y\) are the weights for the two connections. Each weight is a data point in the function we are trying to minimise.
In three dimensions, we can view the error function as a surface, and we are trying to find the \(x\) and \(y\) values of the surface's minimum.
There are three common types of gradient descent: Full Gradient Descent; Batch Gradient Descent; and Stochastic Gradient Descent.
In Full Gradient Descent, we find the error at every data point and take small steps proportional to the negative of the gradient:
\[
\mathbf{x_{i+1}}=\mathbf{x_i}-\gamma\nabla f(\mathbf{x_i})
\]
We tune how fast the function learns using the learning rate \(\gamma\).
Using this technique, we can find a local minimum of the function.
In the context of our three-dimensional function, we find the error with our current choice of \(x\) and \(y\), and then update both of them according to the full gradient descent update rule.
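As a rough sketch, the update rule above can be written in a few lines of Python for a two-parameter error function. The bowl-shaped function, starting point, and learning rate here are illustrative assumptions, not taken from any particular network:

```python
import numpy as np

def f(x):
    # Illustrative error surface: a bowl with its minimum at (1, -2)
    return (x[0] - 1.0) ** 2 + (x[1] + 2.0) ** 2

def grad_f(x):
    # Analytic gradient of f
    return np.array([2.0 * (x[0] - 1.0), 2.0 * (x[1] + 2.0)])

def full_gradient_descent(x0, gamma=0.1, steps=100):
    x = np.array(x0, dtype=float)
    for _ in range(steps):
        # x_{i+1} = x_i - gamma * grad f(x_i)
        x = x - gamma * grad_f(x)
    return x

print(full_gradient_descent([5.0, 5.0]))  # approaches (1, -2)
```

Every step moves both weights at once using the full gradient, which is what distinguishes this from the batch and stochastic variants.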
In Batch Gradient Descent, we split the dataset into batches of data points.
We then carry out full gradient descent on each batch, shuffling the data points each time.
\[
\mathbf{x'_{i+1}}=\mathbf{x'_i}-\gamma\nabla f(\mathbf{x'_i})
\quad \text{where} \quad
\mathbf{x'} \subseteq \mathbf{x}
\]
This process is often repeated multiple times.
Batch Gradient Descent can be useful if the dataset is extremely large, as we don't have to process all of the data at once.
In the context of our three-dimensional function, this is like finding and reducing the error in the \(x\) and \(y\) values separately.
Batch gradient descent doesn't make much sense in this context since we only have two possible data points.
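A minimal sketch of the batched update, following the framing above of splitting the values into shuffled batches and applying the update to each batch in turn. The quadratic error function, batch size, and learning rate are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

def grad_f(x):
    # Gradient of an illustrative bowl with its minimum at the origin
    return 2.0 * x

def batch_gradient_descent(x0, batch_size=2, gamma=0.1, epochs=50):
    x = np.array(x0, dtype=float)
    indices = np.arange(len(x))
    for _ in range(epochs):
        rng.shuffle(indices)  # shuffle the data points each pass
        for start in range(0, len(x), batch_size):
            batch = indices[start:start + batch_size]  # x' is a subset of x
            x[batch] -= gamma * grad_f(x)[batch]       # update only this batch
    return x

print(batch_gradient_descent([4.0, -3.0, 2.0, -1.0]))  # every entry approaches 0
```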
In Stochastic Gradient Descent, we pick one data point at random and adjust only that data point, leaving the rest unmodified.
We then repeat this many times until the error is small.
\[
x'_{i+1}=x'_i-\gamma\nabla f(x'_i)
\quad \text{where} \quad
x' \in \mathbf{x}
\]
Stochastic gradient descent usually converges faster (in terms of time) than full gradient descent, since instead of updating hundreds of parameters, only one parameter needs to be updated at each step.
The overall error in stochastic gradient descent tends to be greater than in full gradient descent.
This can be because the function reaches the optimal value and then oscillates around it.
In the context of our three-dimensional function, we randomly choose whether to update \(x\) or \(y\), and then update accordingly.
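Following the same framing, the stochastic variant picks one random entry per step and adjusts it alone. This is a sketch under the same assumed quadratic error function:

```python
import numpy as np

rng = np.random.default_rng(42)

def grad_f(x):
    # Gradient of an illustrative bowl with its minimum at the origin
    return 2.0 * x

def stochastic_gradient_descent(x0, gamma=0.1, steps=500):
    x = np.array(x0, dtype=float)
    for _ in range(steps):
        j = rng.integers(len(x))      # pick one random data point from x
        x[j] -= gamma * grad_f(x)[j]  # adjust only that entry
    return x

print(stochastic_gradient_descent([4.0, -3.0]))  # both entries approach 0
```

Each step is much cheaper than a full update, but the path towards the minimum is noisier, matching the oscillation described above.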
When carrying out a gradient descent method, the learning rate \(\gamma\) needs to be chosen carefully.
Choosing a value that is too high can cause the minimum of the function to be overshot, so the method may never converge.
Choosing a value that is too low will make convergence very slow.
Variable learning rates can be used to modify the learning rate under different conditions.
For example, if the error plateaus, this may be because the function is oscillating around the minimum; the learning rate is too high for it to get any closer.
In that case, the learning rate can be lowered.
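One way to sketch such a schedule is to halve the learning rate whenever the error has not improved for a few consecutive steps. The error function, patience, and decay factor below are all illustrative assumptions, and the default \(\gamma = 1.1\) is deliberately too high, so the method oscillates until the schedule lowers it:

```python
import numpy as np

def f(x):
    # Illustrative error function: a bowl with its minimum at the origin
    return float(np.sum(x ** 2))

def grad_f(x):
    return 2.0 * x

def descent_with_plateau_decay(x0, gamma=1.1, steps=100, patience=5, factor=0.5):
    x = np.array(x0, dtype=float)
    best_err, stale = f(x), 0
    for _ in range(steps):
        x = x - gamma * grad_f(x)
        err = f(x)
        if err < best_err:
            best_err, stale = err, 0  # still improving
        else:
            stale += 1
            if stale >= patience:
                gamma *= factor  # plateau detected: lower the learning rate
                stale = 0
    return x, gamma

x, final_gamma = descent_with_plateau_decay([1.0, 1.0])
```

With these numbers the error grows at first, the schedule halves \(\gamma\) after a few stale steps, and the method then converges.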
Due to the choice of learning rate and/or starting location, the gradient can grow so large that it causes an overflow error in the maths.
This is known as Gradient Explosion.
Ways to counter this include capping the maximum size of the new \(x\) and \(y\) values (known as gradient clipping), or decreasing the learning rate.
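A sketch of the clipping idea, capping the size of the gradient before each step. The steep quartic error function, threshold, and starting point are illustrative assumptions; with these numbers the uncapped update would overshoot further each step and eventually overflow:

```python
import numpy as np

def grad_f(x):
    # Gradient of an illustrative steep function f(x) = x^4,
    # which grows quickly away from the minimum at 0
    return 4.0 * x ** 3

def clipped_descent(x0, gamma=0.01, steps=500, clip=100.0):
    x = np.array(x0, dtype=float)
    for _ in range(steps):
        g = grad_f(x)
        norm = np.linalg.norm(g)
        if norm > clip:
            g = g * (clip / norm)  # cap the gradient so each step stays bounded
        x = x - gamma * g
    return x

print(clipped_descent([10.0]))  # stays finite and heads towards 0
```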
You can see how the different parameters affect gradient descent by modifying them.
You can apply a varying learning rate and clipping if you start to experience gradient explosion.
You can view the resulting vectors.
Each of these has its uses, but not every one will be useful in all situations.
The average vector is useful if the gradient descent method oscillates around a minimum.
The best vector is useful if the final vector is not the most optimal solution.
This might happen if the learning rate is too high.
You can view the process of finding the minimum on the surface plot here.
Along with this, you can view a graph of the error in the error graph tab.