logoalt Hacker News

esafak04/02/20251 replyview on HN

No, they are not. Model outputs can be discretized but the model parameters (excluding hyperparameters) are typically continuous. That's why we can use gradient descent.


Replies

bob102904/02/2025

Where are the model parameters stored and how are they represented?

show 1 reply