Guarding My Git Forge Against AI Scrapers

62 points by technetium


fanf

Worryingly, VNPT and Bunny Communications are home/mobile ISPs. I cannot ascertain for sure that their IPs are from domestic users, but it seems worrisome that these are among the top scraping sources once you remove the most obviously malicious actors.

Yeah, there’s a widespread malware infestation involving adware in bottom-scraping apps and attack-cloaking services such as Brightdata. I have extremely cynical opinions about why and how they are allowed to do such evil things out in the open.

https://lobste.rs/search?q=brightdata&what=comments&order=newest

https://lobste.rs/domains/jan.wildeboer.net

conartist6

Ah yes, the Internet as it was meant to work.

I'm about to try running a major code forge, and I am not in the least bit excited to face the core UX problem of the modern web: feeding scrapers their alternative slop content at just the right place to keep them gobbling and horfing so that 20% of your resources are still available for people.