logoalt Hacker News

Claude Science

168 pointsby lebovictoday at 5:07 PM68 commentsview on HN

Comments

lebovictoday at 6:16 PM

I built one of the connected tools included in this launch (the Biomni HPC [1]), and I have spent an inordinate amount of my life working on this problem. (I also worked at Anthropic, but not on this product.)

As other comments have pointed out, this is for data science – but it's capable of more than making plots and writing papers [2]. It has integrations with many databases and computational tools, including a researcher's institutional cluster.

That alone is valuable. I founded a startup after struggling with this problem at a bio startup; integrating these tools and databases is hard and time consuming. If the only outcome of this product is that great APIs are built for LLMs, it will be a massive positive impact. Many databases used in computational genomics are still only accessible through FTP!

LLMs are particularly good at navigating these tools and databases. It's often very specialized, but straightforward, work that benefits from in-context skills. Seeing an early glimpse of my former customers – bioinformaticians – using LLMs to solve this problem is what led me to join Anthropic in 2024.

Also, this pattern isn't fundamentally constrained to data science: you can also integrate with a wet lab or a CRO for some kinds of science. This is what I'm spending my time on now.

This type of science doesn't solve everything, but it's useful in some niches. For example, progress on many rare diseases is bottlenecked by researcher attention rather than a fundamental breakthrough.

[1] https://x.com/phylo_bio/article/2029233694775624096

[2] In comparison, OpenAI's science product – Prism – was effectively a LaTeX editor they acquired with Crixet.

show 3 replies
jszymborskitoday at 7:30 PM

Any other researchers paranoid of using LLMs for fear of them using your data and front running your publications/work?

Or incorporating it in training data and then spitting it out to a competing lab?

show 1 reply
minimaxirtoday at 5:30 PM

When I saw "Science" I didn't think they meant Data Science, which is what the UIs full of pandas code and plots imply. Even if the focus is on the sciences, I suspect that's the less valuable part of the announcement particularly with the implication of Jupyter Notebook 2.0.

Image-understanding for data viz is a use case that has been ignored, and modern LLMs are getting better at proper EDA. But, uh, I may need to update my resume.

show 3 replies
immmmmmtoday at 7:29 PM

When I was doing my phd, around 2 decades ago, I was often going to the library’s compactus to fish for a Phys Rev from the 80s. Back then papers were sparse and expensive. But the quality!

The Higgs boson is 3 papers, 6 authors and 6 pages in total!

At the end of my phd, 30++ pages slop papers were the norm.

Nowadays, well..

The paper by Higgs was one page. The guy probably published less than a hundred pages in his career.

One reason that made me abandon a career was the disgust caused by the publishing frienzy.

And now tokens..

show 1 reply
Recursingtoday at 6:11 PM

This seems to have unblocked Claude Desktop for Linux ( https://code.claude.com/docs/en/desktop-linux )

show 1 reply
Sol-today at 5:25 PM

So it's like Claude Cowork for Science, i.e. for less tech-savvy users? I would imagine scientists with some coding background might just prefer to use Claude Code normally and integrate it with their stack of choice, but perhaps the comfort and ease of use of Claude Science still wins out.

qwerty_clickstoday at 7:03 PM

Should be called Claude-bio-big-bucks.

What about earth science, physics, engineering? The connectors and skills are all just biology and pharma. Boo

theplumbertoday at 7:08 PM

They forgot to include an example of prompt error on “cancer” with Fable in that “nice” video.

raphmantoday at 5:25 PM

tl;dr: Use this if you don't like doing science or doing things well. It hallucinates references.

Seems to be based on https://github.com/swaruplab/operon as evidenced by the authorization dialog and https://x.com/testingcatalog/status/2037684573161783373 .

Mostly targeted at life sciences - e.g. integration for FDA, PubMed, genomics databases but no ACM / IEEE as far as I can tell.

Edit: arXiv search seems to be supported - but not Google Scholar etc. So, this tool is of little use for most researchers outside life sciences.

Edit 2: Quick walkthrough: the AppImage starts a browser window with an onboarding wizard and a chat interface. It suggests a few things one might do at the start of a research project - e.g. do a quick literature review. When I chose that option, wrote Python scripts that used MCP calls to do arXiv searches. Stayed seemingly stuck there for a few minutes not returning anything. Then:

> The free-text search returned too much noise

Claude decided to choose a certain paper as a starting point for further research. Shortly afterwards:

> That DOI resolved to the wrong paper. Let me find the correct anchor papers by title/author search directly.

Then it meandered a few more minutes doing research and creating a citation graph (that it did not show to me).

> I have a complete picture. Let me verify the key DOIs resolve and then write the review.

Then:

> The lint flags em-dash overuse. Let me reduce them, then save.

Then: a nice but verbose literature overview of my chosen topic

<blink>BUT it includes at least one hallucinated reference!</blink>

P.S.: What does this mean?

  [reviewer] verifier_mode=default-on downgraded to off: pro subscription tier, autoReviewer withheld (frame=f2a81cb2)
show 2 replies
fastaguy88today at 7:10 PM

Download for mac. Find out I need a different subscription. Cannot quit program (must force quit).

Perhaps I need AI to use it.

stanford_labrattoday at 5:41 PM

impressive to me, but sadly i feel a little misleading since this is only the data-science part of life sciences.

every few weeks though i test claude and chatgpt on their scientific reasoning and it has definitely improved over time. in my experience without specific instruction on what is known/unknown they typically are lagging behind the leading edge of the field (dev bio/pluripotency in my case). probably because scientific research articles are not open-source so they can't crawl them.

claude has definitely outperformed chatgpt in this regard however, it's scientific reasoning is impressive.

JoshGlazebrooktoday at 5:14 PM

The fact that we are coming up on a month of Fable being unavailable with essentially zero actual signal from Anthropic around when it may be back is crazy to me. Yet still we have these random new products coming out?

show 3 replies
khurstoday at 5:27 PM

Big Pharama = Big Budgets.

So targeting them with a tailored product is understandable.

show 1 reply
domrdytoday at 5:56 PM

It has Sonnet 5 as a usable model. Interesting.

show 2 replies
jvanderbottoday at 5:25 PM

Thought I'd give it a whirl - crashed immediately.

I was tickled they had a "Download for linux" button prominently shown, but nothing yet.

nickandbrotoday at 5:20 PM

So I guess they released this instead of Sonnet 5?

trallnagtoday at 6:43 PM

"Pre-configured for your domain [...] cheminformatics" as in something like ChEMBL?

brcmthrowawaytoday at 7:06 PM

DoA

cmiles8today at 5:19 PM

Science isn’t suffering from a lack of papers. It’s suffering from a lack of good papers. Making it easier to just pump out paper-mill publications is about the last thing science needs right now.

show 2 replies
imdsmtoday at 5:46 PM

Weird that it runs as a local webserver rather than as an app

game_the0rytoday at 5:43 PM

Disappointing that science came after cowork. Shows how their priorities are for profitability first and help humanity second.

show 1 reply
tripleeetoday at 5:21 PM

maxed out on coding improvements so now they're trying to expand to other markets

show 1 reply
CamperBob2today at 6:43 PM

Claude: "Not that science"

bozdemirtoday at 5:32 PM

Another overrated packaged workspace to drain more usage... No thank you.

Retr0idtoday at 6:07 PM

> every step from data wrangling to *publication*

Do they have no shame?

Edit: seems like no https://news.ycombinator.com/item?id=48736814

calldacopsidgaftoday at 5:26 PM

this a great application for the sycophantic, non-deterministic lying machine!

show 1 reply
aplthrowaway67today at 5:24 PM

[dead]

bigyabaitoday at 5:14 PM

How about no?

AI brand identity has made the unfortunate pivot to "how much do you trust us" which is going be a real race to the bottom. I don't want LLMs managing nuclear reactors or replacing junior lab technicians. I don't trust any of these LLMs to do the bare minimum, regardless of how good it is for your brand.

It's gross watching these stunts unfold. Next ChatGPT will fly a passenger jet, which Claude will one-up with an agentic surgery, which OpenAI will respond to by putting a humanoid robot on the moon. If this is what 21st century market competition looks like, we are all fucked.

show 1 reply