Hacker News

whatshisface, 10/12/2024

The network is linear, but the loss is ln(1+exp(x)), a soft ReLU.
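For anyone who wants to see why ln(1+exp(x)) gets called a soft ReLU, here is a minimal sketch in plain Python. The function names `softplus` and `relu` are my own labels, not anything from the thread, and the stable rewrite max(x, 0) + ln(1 + exp(-|x|)) is just one common way to avoid overflow; it is algebraically the same function.

```python
import math


def softplus(x: float) -> float:
    """ln(1 + exp(x)), computed in an overflow-safe form."""
    # Identity: ln(1 + e^x) = max(x, 0) + ln(1 + e^(-|x|)).
    # exp(-|x|) is always in (0, 1], so it cannot overflow.
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def relu(x: float) -> float:
    """Hard ReLU, for comparison."""
    return max(x, 0.0)


if __name__ == "__main__":
    # The gap softplus(x) - relu(x) equals ln(1 + e^(-|x|)):
    # largest at x = 0 (where it is ln 2) and shrinking as |x| grows.
    for x in (-10.0, -1.0, 0.0, 1.0, 10.0):
        gap = softplus(x) - relu(x)
        print(f"x={x:6.1f}  softplus={softplus(x):.6f}  relu={relu(x):.1f}  gap={gap:.6f}")
```

The two curves agree far from zero and differ by at most ln 2 at x = 0, which is where the "soft" kink sits.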