After tuning our model, we find that the softmax classifier works well. Specifically, the last layer of our network computes scores for each class, and they are fed into the softmax function. The model achieves 100% accuracy on the training data. However, we observe that the training loss does not reach zero. What can we say about (cross-entropy) loss can never be zero?
#Why can Cross-entropy not be zero?
1 messages · Page 1 of 1 (latest)