logoalt Hacker News

alenmangattulast Sunday at 5:52 PM2 repliesview on HN

I’ve spent the last 3 months building a crawler to index the public parts of Telegram (https://telehunt.org). The native search is essentially a black box that favors the top 0.1% of bot almost invisible. The Tech: I had to deal with rate limits and the lack of a global 'sitemap'. I’m currently using a hybrid approach of metadata scraping to keep the index fresh. The Goal: It’s an experiment in making 'un-indexable' bot data discoverable.


Replies

duskwufftoday at 8:16 AM

You may be overestimating the number of bots that meaningfully exist. The vast majority of bots (and public channels) on the platform are nonfunctional and/or spam.

Antibabelictoday at 6:43 AM

Where is the search engine? The site says that it's a bot directory.

show 1 reply