Wasn’t it already obvious given the awfully familiar parameter numbers?

alfiedotwtf • today at 4:45 PM • 1 reply • view on HN

Replies

That only tells what base architecture they used, but fine tuning does not increase the number of weights, it just adapts the weights to improve better on a fine tuning dataset- something they claimed they had done

alt Hacker News

Replies