DLVR 1 Deep Neural Networks
Regarding assignments: common mistakes, pasting code as screenshots, etc.
Deep Learning Computer Vision & Robotics Part I. Basic Deep Neural Networks
- 1980s - Rediscovery of Back-propagation
- 1990s - Winter for Neural Networks
- LeNet Handwritten Digits Recognition
- 2000s - Era of Internet & Gaming
- Massive Data & GPGPUs
- 2010s - Early Successes & Boom of ANNs
- AlexNet (CNN) on ImageNet
- LSTM & Transformers (Attention)
- 2024 - Nobel Prize Winners
- Physics - Artificial Neural Networks
- Chemistry - AlphaFold & Protein Design
Background & History - Artificial Intelligence & Neural Networks
“Shallow” Learning Driven AI Era
Computational Neuron Model (Neuroscience Inspired)
“Deep” Learning Driven AI Era
Face Detection: Classic Approach vs. Deep Learning
An Incomplete Map of Deep Learning Territory (Until 2023)
Start with “Shallow” NNs
Basic Building Blocks for CNNs
Example 1: Regression with Artificial Neural Networks (ANNs)
This example uses the Boston Housing dataset to predict the median value of owner-occupied homes (in 1000s of dollars). The features are listed in the table below, followed by a minimal model sketch.
| Feature | Description |
|---------|-------------|
| crim | per capita crime rate by town |
| zn | proportion of residential land zoned for lots over 25,000 sq. ft. |
| indus | proportion of non-retail business acres per town |
| chas | Charles River dummy variable (= 1 if tract bounds river; 0 otherwise) |
| nox | nitrogen oxides concentration (parts per 10 million) |
| rm | average number of rooms per dwelling |
| age | proportion of owner-occupied units built prior to 1940 |
| dis | weighted mean of distances to five Boston employment centres |
| rad | index of accessibility to radial highways |
| tax | full-value property-tax rate per $10,000 |
| ptratio | pupil-teacher ratio by town |
| black | 1000(Bk - 0.63)², where Bk is the proportion of blacks by town |
| lstat | lower status of the population (percent) |
| medv | median value of owner-occupied homes in $1000s (the prediction target) |
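Below is a minimal sketch of how such a regression network could be built, assuming the TensorFlow/Keras API is available; the layer sizes, optimizer, and training settings are illustrative choices, not prescribed ones.

```python
import tensorflow as tf

# Load the Boston Housing split bundled with Keras: 13 features, target = medv.
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.boston_housing.load_data()

# Standardize each feature to zero mean and unit variance (using training stats).
mean, std = x_train.mean(axis=0), x_train.std(axis=0)
x_train, x_test = (x_train - mean) / std, (x_test - mean) / std

# A small fully-connected regression network: two hidden layers, linear output.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(x_train.shape[1],)),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(1),  # single linear unit: predicted medv in $1000s
])
model.compile(optimizer="adam", loss="mse", metrics=["mae"])

model.fit(x_train, y_train, epochs=100, batch_size=16,
          validation_split=0.2, verbose=0)
test_mse, test_mae = model.evaluate(x_test, y_test, verbose=0)
print(f"Test MAE: {test_mae:.2f} ($1000s)")
```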
Recap - A Simple Neural Network Model
- $z_j$: input to node $j$ in layer $l$
- $g_j$: activation function for node $j$ in layer $l$ (applied to $z_j$)
- $a_j = g_j(z_j)$: the output/activation of node $j$ in layer $l$
- $b_j$: bias/offset for unit $j$ in layer $l$
- $w_{ij}$: weight connecting node $i$ in layer $(l - 1)$ to node $j$ in layer $l$
- $t_k$: target value for node $k$ in the output layer
The error for a single training example is the sum of squared differences between the network outputs and the targets over the $K$ output nodes:

\(E = \frac{1}{2} \sum_{k=1}^{K}(a_k - t_k)^2\)

Training Dataset
| # ID | X1 | Xn | $t_k$ |
|-------|------|------|-------|
| 00001 | …… | …… | …… |
| 00002 | …… | …… | …… |
| …… | …… | …… | …… |
| 99999 | …… | …… | …… |
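To make the notation above concrete, the following NumPy sketch runs a forward pass through one hidden layer and evaluates the cost $E$ for a single example; the layer sizes, random weights, and sigmoid activation are illustrative assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)

# Illustrative sizes: 3 inputs, 4 hidden units, 2 output units.
a_in = rng.normal(size=3)           # input activations a_i
W1 = rng.normal(size=(3, 4))        # w_ij: node i (layer l-1) -> node j (layer l)
b1 = np.zeros(4)                    # biases b_j
W2 = rng.normal(size=(4, 2))        # w_jk: hidden node j -> output node k
b2 = np.zeros(2)                    # biases b_k
t = np.array([1.0, 0.0])            # targets t_k

# Forward pass: z_j = sum_i w_ij a_i + b_j,  a_j = g(z_j)
z_hidden = a_in @ W1 + b1
a_hidden = sigmoid(z_hidden)
z_out = a_hidden @ W2 + b2
a_out = sigmoid(z_out)

# Cost for this example: E = 1/2 * sum_k (a_k - t_k)^2
E = 0.5 * np.sum((a_out - t) ** 2)
print("E =", E)
```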
- https://dustinstansbury.github.io/theclevermachine/derivation-backpropagation
The Output Layer - Gradient Descent
The Hidden Layer - Error Propagation
Back-Propagation
\(\delta_k = (a_k - t_k)g_k'(z_k) \rightarrow \frac{\partial E}{\partial w_{jk}} = \delta_k a_j\)
\[\delta_j = g_j'(z_j) \sum_k \delta_k w_{jk} \rightarrow \frac{\partial E}{\partial w_{ij}} = \delta_j a_i\]
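The two delta rules translate directly into code. The sketch below repeats the forward pass and then applies the output-layer and hidden-layer formulas; sigmoid activations are assumed (so $g'(z) = g(z)(1 - g(z))$), and the learning rate is an illustrative value.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)

# Same illustrative network as before: 3 inputs, 4 hidden units, 2 outputs.
a_in = rng.normal(size=3)
W1, b1 = rng.normal(size=(3, 4)), np.zeros(4)     # w_ij, b_j
W2, b2 = rng.normal(size=(4, 2)), np.zeros(2)     # w_jk, b_k
t = np.array([1.0, 0.0])                          # targets t_k

# Forward pass.
a_hidden = sigmoid(a_in @ W1 + b1)
a_out = sigmoid(a_hidden @ W2 + b2)

# Output layer: delta_k = (a_k - t_k) g'(z_k); for sigmoid, g'(z) = a (1 - a).
delta_out = (a_out - t) * a_out * (1.0 - a_out)
dE_dW2 = np.outer(a_hidden, delta_out)            # dE/dw_jk = delta_k * a_j
dE_db2 = delta_out

# Hidden layer: delta_j = g'(z_j) * sum_k delta_k w_jk
delta_hidden = a_hidden * (1.0 - a_hidden) * (W2 @ delta_out)
dE_dW1 = np.outer(a_in, delta_hidden)             # dE/dw_ij = delta_j * a_i
dE_db1 = delta_hidden

# One gradient-descent step with an illustrative learning rate.
eta = 0.1
W2 -= eta * dE_dW2; b2 -= eta * dE_db2
W1 -= eta * dE_dW1; b1 -= eta * dE_db1
```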