Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://mamot.fr/users/pluralistic/statuses/111127187714057889">Cory Doctorow (pluralistic@mamot.fr)'s status on Saturday, 23-Dec-2023 13:41:13 JST</a><a href="https://mamot.fr/@pluralistic" title="pluralistic@mamot.fr"><img src="https://gnusocial.jp/avatar/1478-48-20240201044531.webp" width="48" height="48" alt="Cory Doctorow" style="position: absolute; left: 0; top: 0;">Cory Doctorow</a><div><a href="https://mamot.fr/@pluralistic/111127187063069815" rel="in-reply-to">in reply to</a></div></section><article><p>For the most part, the Internet Archive limits its scraping to websites that permit it. The <a href="https://mamot.fr/tags/RobotsExclusionProtocol" rel="tag">#RobotsExclusionProtocol</a> (AKA <a href="https://mamot.fr/tags/robots" rel="tag">#robots</a>.txt) makes it easy for webmasters to tell different kinds of crawlers whether or not they are welcome. If your site has a robots.txt file that tells the Archive's crawler to buzz off, it'll go elsewhere.</p><p>Mostly.</p><p>7/</p></article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/2489526#notice-4926021">In conversation</a><time datetime="2023-12-23T13:41:13+09:00" title="Saturday, 23-Dec-2023 13:41:13 JST">about a year ago</time> <span>from <span><a href="https://mamot.fr/@pluralistic/111127187714057889" rel="external" title="Sent from mamot.fr via ActivityPub">mamot.fr</a></span></span><a href="https://mamot.fr/@pluralistic/111127187714057889">permalink</a></footer></blockquote>

Corresponding Notice

Embed this notice
Cory Doctorow (pluralistic@mamot.fr)'s status on Saturday, 23-Dec-2023 13:41:13 JST Cory Doctorow
in reply to
For the most part, the Internet Archive limits its scraping to websites that permit it. The #RobotsExclusionProtocol (AKA #robots.txt) makes it easy for webmasters to tell different kinds of crawlers whether or not they are welcome. If your site has a robots.txt file that tells the Archive's crawler to buzz off, it'll go elsewhere.
Mostly.
7/
In conversationabout a year ago from mamot.frpermalink

Public

Embed Notice

HTML Code

Corresponding Notice