Hacker News

amelius | last Thursday at 3:42 PM | 3 replies

A dataset with only data from before 2024 will soon be worth billions.


Replies

blahyawnblah | last Thursday at 3:54 PM

2022, when ChatGPT first came out. https://arstechnica.com/ai/2025/06/why-one-man-is-archiving-...

Workaccount2 | last Thursday at 10:53 PM

Synthetic data is already being embraced. It turns out you actually can create good training data with these models.