We need to crowdsource a db of websites that have opted to exclude themselves from the Wayback Machine (WM). The WM has become essential with so many Tor-blocking and blocking sites. I don't want to see WM-excluded sites in my search results. Such archive-resisting sites also degrade the blogs that link to them (a dead link invalidates part of an article when there's no archive).

@resist1984 is that page still available? I do think there are some legitimate reasons to want to block iabot from archiving your site, just like there are for indexing.

@edsu it has moved to git.nogafam.es/deCloudflare/de
I'm not clear on what legitimate reason you have in mind for blocking bots from harvesting. Can you give an example?

@resist1984 thanks! Generally speaking I think that's up to the publisher to decide. The Internet Archive doesn't own the web and if you don't want them to serve up your content in perpetuity I think that's ok.

@edsu Archive.org gives publishers control, most likely to avoid legal problems. So while it is up to the publisher, as users we have a right to judge that choice. Now that the Wayback Machine (WBM) has become indispensable (due to Tor-hostility), those who act against the WBM act against Tor & thus against privacy. They are not our friends and we have a right to resist propagation of their website URLs.

@edsu The blocklist is merely objective data for people to use as they see fit. What I hope will happen is that someone will cross-reference the WBM blocklist with Tor-blocking sites, and reduce the search rankings of sites that block both.
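
To make the cross-referencing idea concrete, here is a rough sketch of how it might look. Everything in it is an assumption for illustration: the function name, the example domains, the idea that both lists are plain sets of domain names, and the 0.5 penalty factor. No real search engine or blocklist format is implied.

```python
# Hypothetical sketch: demote search results for sites that appear on BOTH
# a Wayback Machine exclusion list and a Tor-blocking list.
# All names, domains, and the penalty value are illustrative assumptions.

def demote_double_blockers(results, wbm_blocked, tor_blocked, penalty=0.5):
    """Return (domain, score) pairs re-ranked, highest score first.

    results     -- list of (domain, score) tuples from some search backend
    wbm_blocked -- set of domains that exclude themselves from the archive
    tor_blocked -- set of domains that block Tor users
    penalty     -- multiplier applied to the score of sites on both lists
    """
    # Cross-reference the two lists: only sites on both are penalized.
    both = wbm_blocked & tor_blocked
    rescored = [
        (domain, score * penalty if domain in both else score)
        for domain, score in results
    ]
    # Highest score first; sorted() is stable, so ties keep their input order.
    return sorted(rescored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    results = [("a.example", 0.9), ("b.example", 0.8), ("c.example", 0.5)]
    wbm_blocked = {"a.example", "c.example"}
    tor_blocked = {"a.example"}
    for domain, score in demote_double_blockers(results, wbm_blocked, tor_blocked):
        print(domain, score)
```

Only a.example is on both lists here, so it is the only result whose score is reduced; a site on just one list keeps its original ranking.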
