14. ??? ???? ??
??? ??? 1 ?????? N
W
a y
a y
W W
Backpropagation
????
(Activation Function)
f(a)
????, tanh,
Sigmoid, ReLU, ELU ?
Softmax(a)
Drop out
Weight Update
???
(Optimization)
SGD, AdaGrad,
Momentum, Adam ?
??? (Normalization)
???? (Loss Function)
Batch Size
Learning rate
epoch? ?? (Layer size)
?? ?? (Unit size)