GNU social JP
GNU social JP is a GNU social server in Japan.

Notices by Jonathan Corbet (corbet@social.kernel.org)

  1. Jonathan Corbet (corbet@social.kernel.org)'s status on Thursday, 01-May-2025 05:46:21 JST
    I have often complained that, even though thousands of developers are paid to work on the Linux kernel, there is not a single person whose job it is to write documentation for the kernel. The problem is wider than that, though: Alejandro Colomar, who has been maintaining the man pages collection for the last four years, can no longer afford to do it for free.

    https://lwn.net/ml/all/4d7tq6a7febsoru3wjium4ekttuw2ouocv6jstdkthnacmzr6x@f2zfbe5hs7h5
    In conversation about 22 days ago from social.kernel.org permalink

    Attachments

    1. Linux man-pages project maintenance [LWN.net]
  2. Jonathan Corbet (corbet@social.kernel.org)'s status on Monday, 07-Apr-2025 22:52:18 JST
    20 Years ago: the BitKeeper license changed, making it unavailable for kernel development.

    https://lwn.net/Articles/130746/

    It drove home the perils of relying on proprietary software and spurred the creation of Git - a significant event, overall.
    In conversation about a month ago from social.kernel.org permalink

    Attachments

    1. The kernel and BitKeeper part ways [LWN.net]
  3. Jonathan Corbet (corbet@social.kernel.org)'s status on Monday, 31-Mar-2025 01:11:38 JST
    Today I got a cheery email from somebody who claims to be the "ethics and compliance" officer for a company called Bright Data. He wanted to have a "no pressure" conversation about the whole AI scraperbot problem. Looking at their web site, this company offers an API that, and I quote, "Bypasses anti-scraping mechanisms and solves CAPTCHAs, ensuring uninterrupted access to the most protected web sites".

    After careful consideration for several milliseconds, I have concluded that I really don't have anything to discuss with this person.

    But at least their claimed "100M+" of residential IP addresses that they use for their DDOS attacks are "ethically sourced".
    In conversation about 2 months ago from social.kernel.org permalink
  4. Jonathan Corbet (corbet@social.kernel.org)'s status on Thursday, 13-Mar-2025 08:29:12 JST
    Ah the memories one finds at the bottom of a desk drawer... Once upon a time this was a really cool thing.
    In conversation about 2 months ago from social.kernel.org permalink

    Attachments


    1. https://media.social.kernel.org/media/144d85af8e8376c2559d43d395c2c2b81b1b6da10c24cbea51a51557577ee677.jpg
  5. Jonathan Corbet (corbet@social.kernel.org)'s status on Wednesday, 26-Feb-2025 04:46:59 JST
    • Cory Doctorow
    So, while I think this article declares victory a bit too soon, I think we also need the occasional optimistic view that we may actually get through this administration.

    https://prospect.org/politics/2025-02-24-trump-coup-has-failed/

    (by way of @pluralistic)
    In conversation about 3 months ago from social.kernel.org permalink

    Attachments

    1. The Coup Has Failed
      from https://prospect.org/topics/david-dayen/
      Trump’s falling approval ratings reveal an out-of-touch presidency, and have given space for allies to turn against him.
  6. Jonathan Corbet (corbet@social.kernel.org)'s status on Friday, 24-Jan-2025 23:44:25 JST
    in reply to
    • LWN.net
    • Michael K Johnson
    @mcdanlj @LWN What a lot of people are suggesting (nepenthes and such) will work great against a single abusive robot. None of it will help much when tens of thousands of sites are grabbing a few URLs each. Most of them will never step into the honeypot, and the ones that do will not be seen again regardless.
    In conversation about 4 months ago from social.kernel.org permalink
  7. Jonathan Corbet (corbet@social.kernel.org)'s status on Thursday, 23-Jan-2025 23:39:10 JST
    in reply to
    • penguin42
    @penguin42 They don't tell me what they are doing with the data... the distributed scraping is an easily observable fact, though. Perhaps they are firehosing the data back to the mothership for training?
    In conversation about 4 months ago from social.kernel.org permalink
  8. Jonathan Corbet (corbet@social.kernel.org)'s status on Thursday, 23-Jan-2025 12:25:08 JST
    in reply to
    • K. Ryabitsev
    • smxi
    @smxi @monsieuricon Suggestions for these countermeasures - and how to apply them without hosing legitimate users - would be much appreciated. I'm glad they are obvious to you, please do share!
    In conversation about 4 months ago from social.kernel.org permalink
  9. Jonathan Corbet (corbet@social.kernel.org)'s status on Thursday, 23-Jan-2025 11:59:18 JST
    So I guess I'm famous now :)

    https://www.heise.de/en/news/AI-bots-paralyze-Linux-news-site-and-others-10252162.html

    To be clear, LWN has never "crashed" as a result of this onslaught. We'll not talk about what happened after I pushed up some code trying to address it...

    Most seriously, though: I'm surprised that this situation is surprising to anybody at this point. This is a net-wide problem, it surely is not limited to free-software-oriented sites. But if the problem is starting to get wider attention, that is fine with me...
    In conversation about 4 months ago from social.kernel.org permalink

    Attachments

    1. AI bots paralyze Linux news site and others
      from heise online
      Since the beginning of the year, AI bots have apparently been causing websites such as LWN.net to crash more frequently. It is said to be a major problem.
  10. Jonathan Corbet (corbet@social.kernel.org)'s status on Thursday, 23-Jan-2025 08:20:50 JST
    A followup for folks who are curious about the whole AI botswarm problem...

    Some of these bots are clearly running on a bunch of machines on the same net. I have been able to reduce the traffic significantly by treating everything as a class-C net and doing subnet-level throttling. That and simply blocking a couple of them.

    But that leaves a lot of traffic with an interesting characteristic: there are millions of obvious bot hits (following a pattern through the site, for example) that all come from a different IP. An access log with 9M lines has over 1M IP addresses, and few of them appear more than about three times.

    So these things are running on widely distributed botnets, likely on compromised computers, and they are doing their best to evade any sort of recognition or throttling. I don't think that any sort of throttling or database of known-bot IPs is going to help here...not quite sure what to do about it.

    What a world we have made for ourselves...
    In conversation about 4 months ago from social.kernel.org permalink
  11. Jonathan Corbet (corbet@social.kernel.org)'s status on Wednesday, 22-Jan-2025 23:18:12 JST
    in reply to
    • LWN.net
    • Daniel Bovensiepen
    @daniel @LWN The problem with restricting reading to logged-in people is that it will surely interfere with our long-term goal to have the entire world reading LWN. We really don't want to put roadblocks in front of the people we are trying to reach.
    In conversation about 4 months ago from social.kernel.org permalink
  12. Jonathan Corbet (corbet@social.kernel.org)'s status on Wednesday, 22-Jan-2025 07:33:44 JST
    in reply to
    • Kevin P. Fleming
    • DamonHD
    @DamonHD @kevin So how does Enphase cut off access to a local resource like that? Have they said why such a thing would happen?
    In conversation about 4 months ago from social.kernel.org permalink
  13. Jonathan Corbet (corbet@social.kernel.org)'s status on Wednesday, 22-Jan-2025 07:05:41 JST
    in reply to
    • LWN.net
    • AndresFreundTec
    @AndresFreundTec @LWN Yes, a lot of really silly traffic. About 1/3 of it results in redirects from bots hitting port 80; you don't see them coming back with TLS, they just keep pounding their head against the same wall.

    It is weird; somebody has clearly put some thought into creating a distributed source of traffic that avoids tripping the per-IP circuit breakers. But the rest of it is brainless.
    In conversation about 4 months ago from social.kernel.org permalink
  14. Jonathan Corbet (corbet@social.kernel.org)'s status on Wednesday, 22-Jan-2025 06:14:17 JST
    in reply to
    • LWN.net
    • Ronny Adsetts
    @RonnyAdsetts @LWN The user agent field is pure fiction for most of this traffic.
    In conversation about 4 months ago from social.kernel.org permalink
  15. Jonathan Corbet (corbet@social.kernel.org)'s status on Wednesday, 22-Jan-2025 05:56:42 JST
    in reply to
    • LWN.net
    • Adelie
    @adelie @LWN Blocking a subnet is not hard; the harder part is figuring out *which* subnets without just blocking huge parts of the net as a whole.
    In conversation about 4 months ago from social.kernel.org permalink
  16. Jonathan Corbet (corbet@social.kernel.org)'s status on Wednesday, 22-Jan-2025 05:55:37 JST
    @gme I assume you're referring to https://blog.cloudflare.com/declaring-your-aindependence-block-ai-bots-scrapers-and-crawlers-with-a-single-click/ ?

    It would appear to force readers to enable JavaScript, which we don't want to do. Plus it requires running all of our readers through cloudflare, of course...and I suspect that the "free tier" is designed to exclude sites like ours. So probably not a solution for us, but it could well work for others.
    In conversation about 4 months ago from social.kernel.org permalink

    Attachments

    1. Declare your AIndependence: block AI bots, scrapers and crawlers with a single click
      To help preserve a safe Internet for content creators, we’ve just launched a brand new “easy button” to block all AI bots. It’s available for all customers, including those on our free tier.
  17. Jonathan Corbet (corbet@social.kernel.org)'s status on Wednesday, 22-Jan-2025 05:46:04 JST
    in reply to
    • LWN.net
    • K. Ryabitsev
    • HAMMER SMASHED FILESYSTEM 🇺🇦
    @lkundrak @monsieuricon @LWN It's a service we provide :)
    In conversation about 4 months ago from social.kernel.org permalink
  18. Jonathan Corbet (corbet@social.kernel.org)'s status on Wednesday, 22-Jan-2025 05:27:32 JST
    in reply to
    • LWN.net
    • bignose
    @bignose @LWN We have gone far out of our way to never require JavaScript to read LWN; we're not going to go back on that now.
    In conversation about 4 months ago from social.kernel.org permalink
  19. Jonathan Corbet (corbet@social.kernel.org)'s status on Wednesday, 22-Jan-2025 05:26:17 JST
    in reply to
    • LWN.net
    • John Francis 🦫🇨🇦🍁💪⬆️
    @johnefrancis @LWN Something like nepenthes (https://zadzmo.org/code/nepenthes/) has crossed my mind; it has its own risks, though. We had a suggestion internally to detect bots and only feed them text suggesting that the solution to every world problem is to buy a subscription to LWN. Tempting.
    In conversation about 4 months ago from social.kernel.org permalink

    Attachments

    1. ZADZMO code
      from https://zadzmo.org/humans.txt
  20. Jonathan Corbet (corbet@social.kernel.org)'s status on Wednesday, 22-Jan-2025 05:20:59 JST
    in reply to
    • LWN.net
    • Mythic Beasts
    @beasts @LWN We are indeed seeing that sort of pattern; each IP stays below the thresholds for our existing circuit breakers, but the overload is overwhelming. Any kind of active defense is going to have to figure out how to block subnets rather than individual addresses, and even that may not do the trick.
    In conversation about 4 months ago from social.kernel.org permalink
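
The subnet-level throttling mentioned in the notices above (treating every address as part of a class-C, i.e. /24, network) could be sketched roughly as follows. This is a minimal Python illustration and not LWN's actual code: the names `subnet_of`, `summarize`, and `SubnetThrottle`, the limits, and the sample addresses (taken from documentation ranges) are all assumptions, and only IPv4 is considered. As the notices point out, grouping like this only helps when a bot's machines share a network; it does little against hits spread over a million unrelated addresses.

```python
import ipaddress
from collections import Counter, deque


def subnet_of(ip: str, prefix: int = 24) -> str:
    """Return the enclosing IPv4 network of an address, a /24 by default."""
    return str(ipaddress.ip_network(f"{ip}/{prefix}", strict=False))


def summarize(ips):
    """Count hits per address and per /24 subnet for an iterable of client IPs."""
    per_ip = Counter(ips)
    per_subnet = Counter()
    for ip, hits in per_ip.items():
        per_subnet[subnet_of(ip)] += hits
    return per_ip, per_subnet


class SubnetThrottle:
    """Sliding-window limiter keyed by subnet rather than by single address."""

    def __init__(self, max_hits: int, window_seconds: float):
        self.max_hits = max_hits
        self.window = window_seconds
        self.hits = {}  # subnet string -> deque of request timestamps

    def allow(self, ip: str, now: float) -> bool:
        """Return True if a request from `ip` at time `now` is within the limit."""
        queue = self.hits.setdefault(subnet_of(ip), deque())
        # Drop timestamps that have aged out of the window.
        while queue and now - queue[0] >= self.window:
            queue.popleft()
        if len(queue) >= self.max_hits:
            return False
        queue.append(now)
        return True


# Hypothetical sample: three neighbours in one /24 plus one unrelated address.
sample_ips = ["192.0.2.10", "192.0.2.11", "192.0.2.12", "198.51.100.7"]
per_ip, per_subnet = summarize(sample_ips)

throttle = SubnetThrottle(max_hits=2, window_seconds=60)
decisions = [throttle.allow(ip, now=float(i)) for i, ip in enumerate(sample_ips)]
```

In the sample, no single address appears more than once, so a per-IP circuit breaker would never trip, while the per-subnet count shows three hits from the same /24 and the throttle refuses the third of them.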

          GNU social JP is a social network, courtesy of the GNU social JP administrator. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

          Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.