I really, really dislike when companies use GitHub to promote their product by posting a "research paper" and a code sample.
It's not even an SDK, library, etc., it's just advertising.
I've noticed a number of China-based labs do this; they will often post a really cool demo, some images, and then either an API or just nothing except advertising for their company (e.g. model may not even exist). Often they will also promise in some GitHub issue that they will release the weights, and never do.
I'd love to see some sort of study here, I wonder what % of "omg really cool AI model!!!" hype papers [1] never provide an API, [2] cannot be reproduced at all, and/or [3] promise but never provide weights. If this was any other field, academics would be up in arms about likely fraud, false advertising, etc.
I'm one of the authors of the paper. Thanks for raising a good point. It will be better if we upload the paper to arxiv but it's MLK in the US so submissions will be delayed by a couple of days. And we just can't wait to share some of the knowledge we gained from our experiments. Hope they will be useful for the community. Would much appreciate it if you have an idea about a better site for this. That said our API requests are open and we'll roll out more in the next few days depending on our server resources.
These types of "repositories" should contain some kind of flag/indication that it contains no source code, similar to when a repo is archived
…but they do provide an APi.
HN is really not beating the bikeshedding allegations
That is unfortunate but they do present some theoretical insights about scaling context length and probably a more efficient way to do RL. Even knowledge about it can have an effect on next iterations from other labs.
It's not just Chinese labs that do this, lots of companies upload a README to a GitHub repository then link that repository from the website, I guess so they can have a GitHub icon somewhere on the website?
Submission is basically a form for requesting access to their closed API (which ironically is called "OpenPlatform" for some reason).