Hacker News

mNovak 12/08/2024

"The data collection process involved a daily ritual of manually visiting the Tagesschau website to capture links"

I don't know what to say... I'm amazed they kept this up so long, but this really should never have been the game plan.

I also had some data science hobby projects around covid; I got busy and lost interest after 6 months. But the scrapers keep running in the cloud, in case I get motivated again (anyone need structured data on eBay listings for laptops since 2020?). That's the beauty of automation for these sorts of things.


Replies

plaidfuji 12/08/2024

Do you just pay the bill for the resources indefinitely?
