• Vittelius@feddit.orgOP
    link
    fedilink
    arrow-up
    0
    ·
    9 days ago

    The repo, despite its name, doesn’t only contain a robots.txt. It also has files for popular reverse proxies to block crawlers outright.

    • zod000@lemmy.dbzer0.com
      link
      fedilink
      arrow-up
      0
      ·
      9 days ago

      That was kind of the point of my comment since the name didn’t indicate that. Also many tools that companies would use won’t/can’t use these files, but could still make use of the info. As I am specifically in that case, I wanted people to know that it could still be worth their time taking a look.