AI bros really need to learn how to write a proper web scraper before they can convince me they are smart enough to make shits like AGI happen
Conversation
Notices
-
Embed this notice
Tᴀᴋᴀɴᴀsʜɪ Hᴏʀᴏ (petercxy@comfy.social)'s status on Thursday, 13-Mar-2025 08:04:22 JST Tᴀᴋᴀɴᴀsʜɪ Hᴏʀᴏ
- Haelwenn /элвэн/ :triskell: likes this.
-
Embed this notice
Tᴀᴋᴀɴᴀsʜɪ Hᴏʀᴏ (petercxy@comfy.social)'s status on Thursday, 13-Mar-2025 08:10:54 JST Tᴀᴋᴀɴᴀsʜɪ Hᴏʀᴏ
I'm really tired of banning these stupid scrapers again and again. Like I don't even care if you used my code for training. Can you please stop scraping the diff between every single pair of commits in my repo???
iced depresso likes this. -
Embed this notice
Tᴀᴋᴀɴᴀsʜɪ Hᴏʀᴏ (petercxy@comfy.social)'s status on Thursday, 13-Mar-2025 08:10:55 JST Tᴀᴋᴀɴᴀsʜɪ Hᴏʀᴏ
I'm talking to you stupid Alibaba / Anthropic / Amazon AI teams. You ever know there's a thing called robots.txt? You ever learned there are URLs that are not supposed to be links but binary data? You ever realized that not all URLs are static and you shouldn't be scraping things that are generated on the fly and clearly excluded from robots.txt?