alt
Hacker News
whatshisface
•
10/12/2024
•
0 replies
•
view on HN
The network is linear, but the loss is ln(1+exp(x)), a soft ReLU.