> Known generative-AI crawlers are disallowed in robots.txt. This is a research catalogue assembl...

lekevicius • yesterday at 10:57 PM • 3 replies • view on HN

> Known generative-AI crawlers are disallowed in robots.txt. This is a research catalogue assembled from primary sources; it is not training data, and a model fine-tuned on these paragraphs would launder out exactly the part — the citations — that gives the prose its value.

This reads like distaste for LLMs - but generally website reads (and is designed as!) very LLMy.

Replies

zetalyrae • yesterday at 11:14 PM

If the About page said who made it, i.e. if someone was putting their reputation on the line, I might be more receptive. But the website has enough LLM design tics to make me suspicious.

It's sad. I come to Hacker News to see cool stuff and when I click on a link and see something obviously put together by an LLM I feel like I've been tricked :(

➕ show 3 replies

egeozcan • today at 12:09 AM

They may have used LLMs to design the site but IMHO the content is fine and well-sourced. Example: https://storiedcolors.com/color/blaze-orange/

Even if LLMs were used to help, someone must have spent a lot of time on making it read well. At least that's how it feels like.

➕ show 1 reply

1f60c • yesterday at 11:55 PM

"One color a day, told as it ought to be told: with its provenance, its chemistry, and the people who paid for it in poison." is so Claude it hurts. :'D

alt Hacker News

Replies