youtu.be/aircAruvnKk 잘 만듬 Cost Function Let's first define a few variables that we will need to use: L = total number of layers in the network s_lsl = number of units (not counting bias unit) in layer l K = number of output units/classes Recall that in neural networks, we may have many output nodes. We denote h_\Theta(x)_khΘ(x)k as being a hypothesis that results in the k^{th}kth output. Our ..