GNU social JP
GNU social JP is a Japanese GNU social server.
  1. tyil (tyil@fedi.tyil.nl)'s status on Wednesday, 21-May-2025 23:04:40 JST
    in reply to
    • 翠星石
    • VnPower

    @Suiseiseki@freesoftwareextremist.com @vnpower@mstdn.maud.io

    > It's quite easy. If it says "AppleWebKit" and/or "Chrome" and/or "Safari" it's not human.

    Whether you like it or not, that's not the case. Actual browsers use those in their user agent, and you said yourself earlier we cannot expect users to fix this.

    > It's a massive SKILL ISSUE putting "Mozilla" where "AppleWebKit" belongs;

    Then so too it's a skill issue to not identify IceCat as IceCat/1.0 or whatever version it is.

    > As you can see, most LLM scrapers are banking on blocking or attacking iSheep or Chrome used being too costly

    I can see only 3 potential LLM scrapers there, out of many. For the record, I am blocking those, and without Anubis my cgit instance will still serve literal tens of thousands of requests to connections pretending to be users. So no, not "most", only a few, maybe two dozen or so, are identifying themselves appropriately. The other dozens if not hundreds do not.
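
    The point about browser user agents can be illustrated with a minimal sketch. It is not taken from the thread: the function name `naive_is_bot` and the sample strings are assumptions, and the user-agent values only follow the typical shape that Chrome, Safari and Firefox send (exact version numbers will differ). It shows that a filter rejecting any user agent containing "AppleWebKit", "Chrome" or "Safari" would also reject ordinary Chrome and Safari users.

```python
# Tokens the quoted rule would treat as "not human".
BLOCKED_TOKENS = ("AppleWebKit", "Chrome", "Safari")


def naive_is_bot(user_agent: str) -> bool:
    """Hypothetical filter: flag a request if its UA contains any blocked token."""
    return any(token in user_agent for token in BLOCKED_TOKENS)


# Illustrative user-agent strings in the usual format of each browser.
# The version numbers are placeholders, not measurements.
SAMPLE_USER_AGENTS = {
    "Chrome on Linux": (
        "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 "
        "(KHTML, like Gecko) Chrome/124.0.0.0 Safari/537.36"
    ),
    "Safari on macOS": (
        "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 "
        "(KHTML, like Gecko) Version/17.4 Safari/605.1.15"
    ),
    "Firefox on Linux": (
        "Mozilla/5.0 (X11; Linux x86_64; rv:125.0) Gecko/20100101 Firefox/125.0"
    ),
}

if __name__ == "__main__":
    for name, user_agent in SAMPLE_USER_AGENTS.items():
        verdict = "blocked" if naive_is_bot(user_agent) else "allowed"
        print(f"{name}: {verdict}")
```

    Under these assumptions, only the Firefox-style string passes, so the rule would lock out real people on the two WebKit/Blink-based browsers, while a scraper that sends a Firefox-style string would get through.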

    In conversation, about 15 days ago, from fedi.tyil.nl


GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.