I've been reading a book and they mention a use case for multiple outputs in a DL model that I don't really understand how it works.
There are also many use cases in which you may want to have multiple outputs:
...
- Another use case is as a regularization technique (i.e., a training constraint
whose objective is to reduce overfitting and thus improve the model’s ability to
generalize). For example, you may want to add an auxiliary output in a neural
network architecture (see Figure 10-15) to ensure that the underlying part of the network learns something useful on its own, without relying on the rest of the
network.
Would anyone happen to know how the auxiliary output works in regularizing the model and could explain it to me?