First impression: Third-party benchmarks or gtfo. Personally, I've never heard of either of the...

fwipsy • yesterday at 2:11 PM • 4 replies • view on HN

First impression: Third-party benchmarks or gtfo. Personally, I've never heard of either of these companies before. We're just supposed to take their word that they've matched the best models on the market?

Sakana describes their model as a "Orchestration Model." Does that mean that it's actually a bunch of different models glued together?

Replies

lifeformed • yesterday at 2:52 PM

Is it actually that hard to make good models or is it just about the amount of resources you have to do training? (This is an actual question, I really don't know.) I'm sure it's not trivial but does it really take world class secret knowledge to build off of the known existing techniques? I feel like there's tons of low hanging fruit still to explore, and time and resources are the limiting factor.

➕ show 3 replies

alwa • yesterday at 9:21 PM

My impression is that the answer is yes, that it purports to dispense the glue on-the-fly in some kind of dynamic way rather than being some kind of new model-amalgam.

alt Hacker News

Replies