@mcdanlj@LWN What a lot of people are suggesting (nepenthes and such) will work great against a single abusive robot. None of it will help much when tens of thousands of sites are grabbing a few URLs each. Most of them will never step into the honeypot, and the ones that do will not be seen again regardless.
@penguin42 They don't tell me what they are doing with the data... the distributed scraping is an easily observable fact, though. Perhaps they are firehosing the data back to the mothership for training?
@smxi@monsieuricon Suggestions for these countermeasures - and how to apply them without hosing legitimate users - would be much appreciated. I'm glad they are obvious to you; please do share!
To be clear, LWN has never "crashed" as a result of this onslaught. We'll not talk about what happened after I pushed up some code trying to address it...
Most seriously, though: I'm surprised that this situation is surprising to anybody at this point. This is a net-wide problem, it surely is not limited to free-software-oriented sites. But if the problem is starting to get wider attention, that is fine with me...
Some of these bots are clearly running on a bunch of machines on the same net. I have been able to reduce the traffic significantly by aggregating addresses into class-C-sized (/24) subnets and doing the throttling at that level. That, and simply blocking a couple of them outright.
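The mechanism is nothing fancy; a toy sketch of the general idea (not the code actually running on LWN - the limits and names here are made-up, illustrative values) looks something like this:

```python
# Toy sketch of subnet-level throttling: collapse each client address to
# its /24 and rate-limit on that key instead of on the individual IP.
# WINDOW and MAX_HITS are arbitrary example numbers, not LWN's settings.
import ipaddress
import time
from collections import defaultdict, deque

WINDOW = 60        # seconds
MAX_HITS = 300     # requests allowed per /24 per window

hits = defaultdict(deque)   # /24 network -> timestamps of recent requests

def subnet_key(addr: str) -> str:
    """Collapse an IPv4 address to its containing /24."""
    return str(ipaddress.ip_network(f"{addr}/24", strict=False))

def allow_request(addr: str, now: float | None = None) -> bool:
    """Return False once this address's /24 has exceeded the window limit."""
    now = now or time.time()
    window = hits[subnet_key(addr)]
    while window and now - window[0] > WINDOW:
        window.popleft()
    window.append(now)
    return len(window) <= MAX_HITS
```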
But that leaves a lot of traffic with an interesting characteristic: there are millions of obvious bot hits (following a pattern through the site, for example) that each come from a different IP. An access log with 9M lines has over 1M IP addresses, and few of them appear more than about three times.
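Those numbers fall out of a quick pass over the log; a rough sketch of the sort of counting involved (assuming the client address is the first whitespace-separated field, as in the usual common/combined log formats):

```python
# Rough sketch: count how many distinct client IPs appear in an access
# log and how often each one shows up.  Assumes the address is the first
# field on each line; adjust for your own log format.
from collections import Counter

counts = Counter()
with open("access.log") as log:
    for line in log:
        fields = line.split(None, 1)
        if not fields:
            continue
        counts[fields[0]] += 1

print(f"{sum(counts.values())} requests from {len(counts)} addresses")
repeaters = sum(1 for n in counts.values() if n > 3)
print(f"{repeaters} addresses appeared more than three times")
```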
So these things are running on widely distributed botnets, likely on compromised computers, and they are doing their best to evade any sort of recognition or throttling. I don't think that any sort of throttling or database of known-bot IPs is going to help here... not quite sure what to do about it.
@daniel@LWN The problem with restricting reading to logged-in people is that it will surely interfere with our long-term goal to have the entire world reading LWN. We really don't want to put roadblocks in front of the people we are trying to reach.
@AndresFreundTec@LWN Yes, a lot of really silly traffic. About 1/3 of it results in redirects from bots hitting port 80; you don't see them coming back with TLS, they just keep pounding their heads against the same wall.
It is weird; somebody has clearly put some thought into creating a distributed source of traffic that avoids tripping the per-IP circuit breakers. But the rest of it is brainless.
It would appear to force readers to enable JavaScript, which we don't want to do. Plus it requires running all of our readers through Cloudflare, of course... and I suspect that the "free tier" is designed to exclude sites like ours. So probably not a solution for us, but it could well work for others.
@johnefrancis@LWN Something like nepenthes (https://zadzmo.org/code/nepenthes/) has crossed my mind; it has its own risks, though. We had a suggestion internally to detect bots and only feed them text suggesting that the solution to every world problem is to buy a subscription to LWN. Tempting.
@beasts@LWN We are indeed seeing that sort of pattern; each IP stays below the thresholds for our existing circuit breakers, but the aggregate load is overwhelming. Any kind of active defense is going to have to figure out how to block subnets rather than individual addresses, and even that may not do the trick.
Should you be wondering why @LWN is occasionally sluggish... since the new year, the DDoS onslaughts from AI-scraper bots have picked up considerably. Only a small fraction of our traffic is serving actual human readers at this point. At times, some bot decides to hit us from hundreds of IP addresses at once, clogging the works. They don't identify themselves as bots, and robots.txt is the only thing they *don't* read off the site.
This is beyond unsustainable. We are going to have to put time into deploying some sort of active defenses just to keep the site online. I think I'd even rather be writing about accounting systems than dealing with this crap. And it's not just us, of course; this behavior is going to wreck the net even more than it's already wrecked.
I bought this card in Korea some years ago after having seen this theme - a tiger and a rabbit seemingly getting stoned together - in a number of places. There must be a story behind it, but my meager search skills have never managed to turn it up. I do still love the image, though...
They informed me that a replacement system would be $700, seemingly including installation. It'll be a little while before I can generate enthusiasm for spending that money, certainly...
Some new form of SunPower resurrecting the current hardware would be nice. I'd say that the chances of them making it work again without demanding more money are pretty small, though. Such is the world we live in - we only *think* we own that device...
Rather than putting an rPi system in the box, though, I just ran the Ethernet cable to a system I had with both wireless and wired interfaces; the WiFi sits on the home net, while the wired interface does DHCP to get an address from the SunPower box, then polls it to get the data out.
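The polling itself is simple; here is a rough sketch of the approach, using the supervisor address and dl_cgi endpoint that the community projects document for the SunPower PVS (the address, command, and field names below are their values, not anything I can vouch for - verify against your own unit):

```python
# Minimal sketch of polling a SunPower PVS supervisor over its installer
# LAN port.  Community projects report that the PVS hands out an address
# via DHCP on that port and answers on 172.27.153.1; the DeviceList call
# can take quite a while, hence the long timeout.
import requests

PVS_URL = "http://172.27.153.1/cgi-bin/dl_cgi?Command=DeviceList"

def poll_pvs():
    resp = requests.get(PVS_URL, timeout=120)
    resp.raise_for_status()
    for dev in resp.json().get("devices", []):
        # Inverter entries carry a serial number and current output power;
        # these field names are the ones the community integrations use.
        if dev.get("DEVICE_TYPE") == "Inverter":
            print(dev.get("SERIAL"), dev.get("p_3phsum_kw"), "kW")

if __name__ == "__main__":
    poll_pvs()
```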
Once that was set up, getting it into Home Assistant was mostly a matter of installing the integration. Figuring out which power signals belonged to which panel took a while; if you don't have that mapping yet, use the SunPower app to make a map of each panel's serial number and its location.
I'm debating whether to stick with this system, or to take up Enphase on its offer and swap out the SunPower box entirely. The Enphase monitor would be a supported product, and it seemingly has much better Home Assistant support.