Hacker News

Which one is more important: more parameters or more computation? (2021)

37 points by jxmorris12 yesterday at 4:44 PM | 4 comments

Comments

vorticalbox today at 5:50 PM

This reminds me of https://dnhkng.github.io/posts/rys/

David looks inside the LLM, finds the "thinking" layers, duplicates them, and puts the copies back to back.

This increases the LLM's scores with basically no overhead.

Very interesting read.
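
To make the idea concrete, here is a toy sketch of repeating a span of layers back to back, going only by the description above and not by the linked post's actual code. The names `duplicate_span` and `run` and the three arithmetic "layers" are hypothetical stand-ins for real transformer blocks. The point it shows: the repeated layer is the same object, so the parameter count stays the same while the computation per forward pass goes up.

```python
from typing import Callable, List

# A "layer" here is just a function on a float; a real model would use
# transformer blocks. Everything below is an illustrative assumption.
Layer = Callable[[float], float]


def duplicate_span(layers: List[Layer], start: int, end: int) -> List[Layer]:
    """Return a new stack in which layers[start:end] appears twice, back to back."""
    if not 0 <= start < end <= len(layers):
        raise ValueError("span must lie inside the layer stack")
    # The second copy reuses the same layer objects, so no new weights are created.
    return layers[:end] + layers[start:end] + layers[end:]


def run(layers: List[Layer], x: float) -> float:
    """Apply each layer in order, like a forward pass."""
    for layer in layers:
        x = layer(x)
    return x


stack: List[Layer] = [lambda x: x + 1.0, lambda x: x * 2.0, lambda x: x - 3.0]
repeated = duplicate_span(stack, 1, 2)  # repeat only the middle layer

print(len(stack), len(repeated))
print(run(stack, 1.0), run(repeated, 1.0))
```

The repeated stack has one more step to execute but no new layer objects, which is the "more computation, same parameters" side of the question in the title.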

kang today at 8:53 PM

The answer should be obvious: it's both.

Zurada was one of our AI textbooks, and it makes it visual that, from a simple classifier right up to a large language model, we are mathematically creating a shape (one that the signal interacts with). More parameters mean the shape can be curved in more ways, and more data means the curve gets higher definition.

They reach a result with data, treating the neural network as a black box, when it could be derived mathematically from the information we already know.
