logoalt Hacker News

tossandthrowtoday at 1:30 PM4 repliesview on HN

Have a look at this article: https://www.washingtonpost.com/technology/interactive/2023/a...

NY Times is 0.06% of common crawl.

These news media outlets provide a drop in the ocean worth of information. Both qualitatively and quantitatively.

The news / media industry is really just trying to hold on to their lifeboat before inevitably becoming entirely irrelevant.

(I do find this sad, but it is like the reality - I can already now get considerably better journalism using LLMs than actual journalists - both click bait stuff and high quality stuff)


Replies

pimlottctoday at 1:44 PM

That seems like a reductive way to consider it. What percent of music was created by Led Zeppelin? What percent of art was painted by Monet? What percent of films by Alfred Hitchcock? It may be a small percentage objectively but they are hugely influential.

show 1 reply
Gigachadtoday at 3:41 PM

90% of common crawl is complete junk. While the tiny bit of news articles powers almost all the ai answers in Google search.

datsci_est_2015today at 4:31 PM

How many Reddit, HN, etc. posts are based on NYT articles? How many derivative news articles, blog posts, YouTube videos, TikToks, etc. are responses to those articles?

At least NYT is probably on the correct side of Sturgeon’s Law: https://en.wikipedia.org/wiki/Sturgeon%27s_law

show 1 reply
Melatonictoday at 4:42 PM

0.06% is way higher than I would expect