ImageNet Classification with Deep Convolutional Neural Networks

Architecture:

How many layers does AlexNet have in total?

1. 8 layers - 5 convolutional and 3 fully connected
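The layer count above can be checked with a small shape trace. This is a minimal sketch, not the authors' code: the kernel sizes, strides, and channel counts follow the paper, while the 227x227 input size and the padding values are commonly used assumptions that make the arithmetic work out, and the names `LAYERS`, `out_size`, `trace_shapes`, and `count_weight_layers` are hypothetical.

```python
# (name, kind, kernel, stride, padding, out_channels)
# Padding values and the 227x227 input are assumptions, not stated in the notes above.
LAYERS = [
    ("conv1", "conv", 11, 4, 0, 96),
    ("pool1", "pool", 3, 2, 0, None),
    ("conv2", "conv", 5, 1, 2, 256),
    ("pool2", "pool", 3, 2, 0, None),
    ("conv3", "conv", 3, 1, 1, 384),
    ("conv4", "conv", 3, 1, 1, 384),
    ("conv5", "conv", 3, 1, 1, 256),
    ("pool5", "pool", 3, 2, 0, None),
    ("fc6", "fc", None, None, None, 4096),
    ("fc7", "fc", None, None, None, 4096),
    ("fc8", "fc", None, None, None, 1000),
]


def out_size(size, kernel, stride, padding):
    """Spatial output size of a convolution or pooling window."""
    return (size + 2 * padding - kernel) // stride + 1


def trace_shapes(input_size=227, in_channels=3):
    """Return (name, channels, height, width) after each layer."""
    size, channels = input_size, in_channels
    shapes = []
    for name, kind, kernel, stride, padding, out_channels in LAYERS:
        if kind == "fc":
            # Fully connected layers flatten the input to a vector.
            size, channels = 1, out_channels
        else:
            size = out_size(size, kernel, stride, padding)
            if kind == "conv":
                channels = out_channels
        shapes.append((name, channels, size, size))
    return shapes


def count_weight_layers():
    """Count layers with learned weights: (convolutional, fully connected)."""
    convs = sum(1 for layer in LAYERS if layer[1] == "conv")
    fcs = sum(1 for layer in LAYERS if layer[1] == "fc")
    return convs, fcs


if __name__ == "__main__":
    for name, c, h, w in trace_shapes():
        print(f"{name}: {c} x {h} x {w}")
    print("conv, fc layers:", count_weight_layers())
```

Pooling layers are listed only to follow the spatial sizes; they have no learned weights and so do not count toward the 8 layers.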

What type of neural network is AlexNet (e.g., feedforward, recurrent, convolutional)?

1. Convolutional

What activation function did AlexNet popularize?

Name one regularization technique used in AlexNet to prevent overfitting.

Intermediate Level:

Explain the significance of AlexNet in the context of the deep learning revolution.

How did AlexNet utilize GPUs, and why was this important?

Describe the purpose and mechanism of the dropout technique used in AlexNet.

How did AlexNet's use of ReLU activation functions contribute to its success?

1. Deep convolutional neural networks with ReLUs train several times faster than their equivalents with tanh units.
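One way to see why a non-saturating unit can train faster is to compare gradients at increasingly large inputs. The sketch below is illustrative only: it uses scalar inputs and hypothetical helper names (`relu_grad`, `tanh_grad`), and it shows the saturation effect rather than reproducing the paper's training-speed experiment.

```python
import math


def relu(x):
    """ReLU activation: max(0, x)."""
    return max(0.0, x)


def relu_grad(x):
    """Derivative of ReLU: 1 for positive inputs, 0 otherwise."""
    return 1.0 if x > 0 else 0.0


def tanh_grad(x):
    """Derivative of tanh: 1 - tanh(x)^2, which shrinks toward 0 as |x| grows."""
    return 1.0 - math.tanh(x) ** 2


if __name__ == "__main__":
    for x in (0.5, 2.0, 5.0):
        print(f"x={x}: relu'={relu_grad(x)}, tanh'={tanh_grad(x):.6f}")
```

For positive inputs the ReLU gradient stays at 1, while the tanh gradient decays quickly once the unit saturates, so less gradient signal reaches the weights feeding a saturated tanh unit.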

Advanced Level:

Explain the update rule used in AlexNet's training process, including the role of momentum and weight decay.

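AlexNet was trained with stochastic gradient descent using a momentum of 0.9 and a weight decay of 0.0005. The update rule for a weight $w$ is:

$v_{i+1} := 0.9 \cdot v_i - 0.0005 \cdot \epsilon \cdot w_i - \epsilon \cdot \left\langle \frac{\partial L}{\partial w} \big|_{w_i} \right\rangle_{D_i}$

$w_{i+1} := w_i + v_{i+1}$

Here $i$ is the iteration index, $v$ is the momentum variable, $\epsilon$ is the learning rate, and $\left\langle \frac{\partial L}{\partial w} \big|_{w_i} \right\rangle_{D_i}$ is the average over the $i$th batch $D_i$ of the derivative of the objective with respect to $w$, evaluated at $w_i$.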
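The update with momentum and weight decay can be sketched in plain Python. The constants 0.9 (momentum) and 0.0005 (weight decay) are the values reported in the paper; the function name `sgd_step`, the scalar weight, and the example learning rate and gradient are assumptions made for illustration.

```python
MOMENTUM = 0.9         # momentum coefficient reported in the paper
WEIGHT_DECAY = 0.0005  # weight decay coefficient reported in the paper


def sgd_step(w, v, grad, lr):
    """Apply one update to a single weight.

    w    -- current weight w_i
    v    -- current momentum variable v_i
    grad -- batch-averaged gradient of the objective at w_i
    lr   -- learning rate (epsilon)
    """
    v_next = MOMENTUM * v - WEIGHT_DECAY * lr * w - lr * grad
    w_next = w + v_next
    return w_next, v_next


if __name__ == "__main__":
    w, v = 1.0, 0.0
    for step in range(3):
        # A constant gradient is used here purely for illustration.
        w, v = sgd_step(w, v, grad=2.0, lr=0.01)
        print(f"step {step}: w={w:.6f}, v={v:.6f}")
```

The momentum term carries a fraction of the previous step forward, and the weight decay term pulls each weight slightly toward zero in proportion to the learning rate.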

How did the splitting of the network across two GPUs affect the architecture and training process?

Discuss the implications of visualizing the learned features of AlexNet's first convolutional layer.

Compare and contrast AlexNet's approach to data augmentation with more modern techniques.