Backpropagation from Scratch

This is a toy multilayer perceptron (MLP) with one hidden layer, used to demonstrate backpropagation from scratch.

The target vector y is fixed to the arbitrary value [10, -2] for simplicity. It can be modified to depend on the input vector x to make useful predictions.

Files

| File     | Description                      |
| -------- | -------------------------------- |
| Notebook | Download and modify the code! :) |

Computation-Graph

The computation graph (rendered in the notebook) uses the sigmoid activation σ(z) = 1/(1 + exp(-z)) and the loss L = (1/2) * (y - o)².
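Written out as equations (a sketch inferred from the gradient code below, with hidden pre-activation z, hidden activation h, and network output o), the forward pass is:

$$z = W^{(1)} x, \qquad h = \sigma(z), \qquad o = W^{(2)} h, \qquad L = \tfrac{1}{2}\,(y - o)^2$$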

Weight gradient derivation

W1-Matrix gradient:
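The rendered derivation lives in the notebook; applying the chain rule to the forward pass above gives (a sketch that matches the code below):

$$\frac{\partial L}{\partial W^{(1)}} = \Big[\big(W^{(2)}\big)^{\top}(o - y) \odot \sigma(z)\,\big(1 - \sigma(z)\big)\Big]\, x^{\top}$$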

W2-Matrix gradient:
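Likewise for the output layer (a sketch matching the code below):

$$\frac{\partial L}{\partial W^{(2)}} = (o - y)\, h^{\top}, \qquad h = \sigma\!\big(W^{(1)} x\big)$$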

Gradient Calculations for Weights in Code

Gradient of Weight Matrix W⁽¹⁾

z = np.dot(W1, x)                  # hidden-layer pre-activation z = W1 x
a = (-y + out).T                   # dL/do, transposed: (o - y)^T
b = np.dot(a, W2)                  # chain through the output layer: (o - y)^T W2
c = sigmoid(z) * (1 - sigmoid(z))  # sigmoid derivative σ'(z)
d = b.T * c                        # elementwise product gives dL/dz
dL_dW1 = d * x.T                   # outer product with the input x -> dL/dW1

Gradient of Weight Matrix W⁽²⁾

h = sigmoid(np.dot(W1, x))         # hidden activations h = σ(W1 x)
dL_dW2 = np.dot((-y + out), h.T)   # (o - y) h^T, same shape as W2
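As a quick sanity check (not part of the repository), the analytic gradients can be compared against finite differences. The snippet below is a self-contained sketch; the sizes of x, W1, and W2 are assumptions chosen to match the column-vector convention used above.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss(W1, W2, x, y):
    out = np.dot(W2, sigmoid(np.dot(W1, x)))  # forward pass: o = W2 σ(W1 x)
    return 0.5 * np.sum((y - out) ** 2)       # L = (1/2) Σ (y - o)²

rng = np.random.default_rng(0)
x  = rng.standard_normal((3, 1))              # input column vector (assumed size)
y  = np.array([[10.0], [-2.0]])               # fixed target vector from the README
W1 = rng.standard_normal((4, 3))              # hidden layer weights (assumed size)
W2 = rng.standard_normal((2, 4))              # output layer weights (assumed size)

# analytic gradients, same formulas as above
z = np.dot(W1, x)
h = sigmoid(z)
out = np.dot(W2, h)
dL_dW2 = np.dot(out - y, h.T)
dL_dW1 = (np.dot(W2.T, out - y) * h * (1 - h)) * x.T

# numerical gradient of a single entry, e.g. W1[0, 0]
eps = 1e-6
W1p, W1m = W1.copy(), W1.copy()
W1p[0, 0] += eps
W1m[0, 0] -= eps
numeric = (loss(W1p, W2, x, y) - loss(W1m, W2, x, y)) / (2 * eps)
print(dL_dW1[0, 0], numeric)                  # the two values should agree closely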

Result:

We can see that backpropagation is working and the correct gradients are calculated: the network learns and its loss decreases (the loss plot is in the notebook).
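For completeness, here is a minimal end-to-end training sketch (an assumed assembly of the pieces above, not the notebook's exact code) that applies both gradients with plain gradient descent; the layer sizes and learning rate are placeholder choices.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(42)
x  = rng.standard_normal((3, 1))   # arbitrary input vector (assumed size)
y  = np.array([[10.0], [-2.0]])    # fixed target vector from above
W1 = rng.standard_normal((4, 3))   # hidden layer weights (assumed size)
W2 = rng.standard_normal((2, 4))   # output layer weights (assumed size)
lr = 0.05                          # learning rate (assumed)

for step in range(201):
    # forward pass
    z   = np.dot(W1, x)
    h   = sigmoid(z)
    out = np.dot(W2, h)
    L   = 0.5 * np.sum((y - out) ** 2)

    # backward pass, using the gradients derived above
    dL_dW2 = np.dot(out - y, h.T)
    dL_dW1 = (np.dot(W2.T, out - y) * h * (1 - h)) * x.T

    # gradient descent update
    W2 -= lr * dL_dW2
    W1 -= lr * dL_dW1

    if step % 50 == 0:
        print(f"step {step:3d}  loss {L:.4f}")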
