I use a text-only browser as an offline HTML reader
I make HTTP requests with a TCP client
There are no "false positives"
I only request the resources that I want, e.g., the HTML from the primary domain, JSON from the API domain, etc.
I also use custom filters written in C to extract the information I want from the retreived HTML or JSON and transform it into SQL or "pretty print"
There is nothing to "block" because I'm not using software that automatically tries to request resources I do not want from domains I never indicated I wanted to contact