Supercazzola - Generate spam for web scrapers
32 points by dacav
Around November 2025 I stumbled upon a tarpit for rude web scrapers: https://maurycyz.com/projects/trap_bots/ Since then I have indulged in writing my own, and I thought I'd share it with fellow hackers.
:D :D :D non-Italian speakers (and I suspect, younger Italian speakers too) might not appreciate what a cultural moment this is
No, mi permetta. No, io; eh scusi noi siamo in quattro. Come se fosse antani anche per lei soltanto in due, oppure in quattro anche scribai con cofandina; come antifurto, per esempio.
Is supercazzola the Italian equivalent of the turbo encabulator?
The script of that video omits some paragraphs from J.H. Quick’s 1944 journal article describing the turbo-encabulator in industry.
As an Italian I love the name 😂
Is there a demo site for this? Curious what it looks like.
To compile on Ubuntu I had to comment out the #include <stdckdint.h> from print_bin.c
Hi. Thanks for the feedback: I can address this problem, and I might follow up with a new release one of these days.
EDIT: it is actually not needed at all, thanks again for the info.
Bellissimo
I pretty much have the same thing (a Markov-chain-based generator of an infinite maze of pages) under an invisible link at the bottom of https://tilde.cat pointing to /target. I don't want to paste a direct link here, since I've never linked to it anywhere and I've also asked the bots not to crawl anything under /target via robots.txt.
Obviously the bots know better, so in the last week:
$ rg "GET /target" access.log | wc -l
1251175
$ rg "GET /target" access.log | awk '{print $1}' | sort | uniq -c | wc -l
817484
If you are interested, you can find the source here but be warned - all of it was written by AI for AI ;)
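For anyone wondering what a "Markov chain based generator of an infinite maze of pages" boils down to, here is a minimal Python sketch. It is not the code of either project mentioned in this thread: the corpus, the `/maze` path, and every function name are made up for illustration. The one design assumption is that each page is seeded from its URL path, so the same URL always returns the same nonsense and the same outgoing links without storing anything.

```python
import hashlib
import random
from collections import defaultdict

# Hypothetical toy corpus; a real tarpit would train on something much larger.
CORPUS = (
    "the quick brown fox jumps over the lazy dog and the lazy dog "
    "sleeps while the quick fox runs over the hill and the dog barks"
)


def build_chain(text):
    """Map each word to the list of words that follow it in the text."""
    words = text.split()
    chain = defaultdict(list)
    for current, following in zip(words, words[1:]):
        chain[current].append(following)
    return dict(chain)


def rng_for(path):
    """Seed a PRNG from the URL path so a page is stable across visits."""
    digest = hashlib.sha256(path.encode("utf-8")).digest()
    return random.Random(int.from_bytes(digest[:8], "big"))


def babble(chain, rng, length=30):
    """Walk the chain for `length` words, restarting at dead ends."""
    starts = sorted(chain)
    word = rng.choice(starts)
    out = [word]
    while len(out) < length:
        followers = chain.get(word)
        word = rng.choice(followers) if followers else rng.choice(starts)
        out.append(word)
    return " ".join(out)


def render_page(chain, path, links=3):
    """Return an HTML page of babble plus links to deeper pseudo-random pages."""
    rng = rng_for(path)
    text = babble(chain, rng)
    base = path.rstrip("/")
    hrefs = [f"{base}/{rng.getrandbits(32):08x}" for _ in range(links)]
    anchors = "\n".join(f'<a href="{h}">{h}</a>' for h in hrefs)
    return f"<html><body><p>{text}</p>\n{anchors}</body></html>"


if __name__ == "__main__":
    chain = build_chain(CORPUS)
    print(render_page(chain, "/maze"))
```

Every generated link points one level deeper under the same prefix, so a crawler that ignores robots.txt never runs out of pages, while the server keeps no state beyond the chain itself.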