delecti • today at 1:53 PM • 2 replies

I'm sure their crawler can handle a zip bomb. Plus it might interpret that as "this site doesn't have a robots.txt" and start the very scraping that OP is trying to prevent with their current robots.txt.

Replies

marginalia_nu • today at 3:46 PM

Pretty sure every crawler can. You kinda have to go out of your way not to, given how the gzread API looks.

https://refspecs.linuxbase.org/LSB_3.0.0/LSB-Core-generic/LS...
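To illustrate the point: gzread takes a destination buffer and a length, so the caller decides how many decompressed bytes it is willing to accept per call. Below is a rough sketch of the same idea in Python rather than C, using zlib's `max_length` argument. The function name `bounded_gunzip` and the limit values are made up for this example, and this is not any particular crawler's code.

```python
import gzip
import zlib


def bounded_gunzip(data: bytes, limit: int) -> bytes:
    """Decompress gzip data, refusing to produce more than `limit` bytes.

    `bounded_gunzip` and `limit` are hypothetical names for this sketch.
    """
    # Adding 16 to the window bits tells zlib to expect a gzip header.
    decomp = zlib.decompressobj(zlib.MAX_WBITS | 16)
    # Ask for at most limit + 1 bytes; the extra byte only signals overflow.
    out = decomp.decompress(data, limit + 1)
    if len(out) > limit:
        raise ValueError("decompressed body exceeds limit")
    return out


# 10 MB of zeros compresses to a few KB, a small stand-in for a zip bomb.
bomb = gzip.compress(b"\0" * 10_000_000)
try:
    bounded_gunzip(bomb, 1_000_000)
    rejected = False
except ValueError:
    rejected = True
```

The decompressor never materializes more than `limit + 1` bytes, so memory use stays bounded no matter how large the payload would expand.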

1e1a • today at 2:02 PM

Could allow only the path to the zip bomb for this user agent.
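A sketch of what such a rule set might look like, checked with Python's standard `urllib.robotparser`. The user agent `ExampleBot` and the path `/bomb.gz` are placeholders invented for this example, not anything from OP's site.

```python
from urllib import robotparser

# Hypothetical robots.txt: one bot may fetch only the bomb path.
rules = """\
User-agent: ExampleBot
Allow: /bomb.gz
Disallow: /
"""

rp = robotparser.RobotFileParser()
rp.parse(rules.splitlines())

bomb_allowed = rp.can_fetch("ExampleBot", "/bomb.gz")
site_allowed = rp.can_fetch("ExampleBot", "/index.html")
other_allowed = rp.can_fetch("OtherBot", "/index.html")
```

Python's parser applies the first matching rule, which is why `Allow` is listed before `Disallow`. Parsers that follow the longest-match rule from RFC 9309 should reach the same result here, since `/bomb.gz` is a longer match than `/`.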
