repo-slopscore: Detecting AI/LLM contributions in git repositories via commit history analysis

32 points by ava


Link to source code Disclaimer: This is self-promotion, as I am the author of the tool

hungariantoast

I hate to be negative about something another person has made and chose to share here with others, but in this project's case, it feels like negativity is the point.

Something about this project feels really shitty to me. It feels like this is just a tool to automate disdain or contempt for software projects that are built with, or accept contributions built with, tools and methods that you disagree with.

The grading is not useful. nixpkgs gets a 0 (F) score because it has 228 "commit signals" that suggest AI being used for contributions. The nixpkgs repository has 1,016,046 commits right now. So 0.022% of the commits to nixpkgs having AI-related "commit signals" is enough to earn the project a zero?

Bevy gets a 97 (A+) score. It's not 100 because there was a single pull request with a single commit that had the "co-authored by Claude" annotation on it. Doesn't matter that the pull request was good. Doesn't matter that the maintainers didn't notice the "co-authored by" annotation when they chose to merge it. Doesn't matter that Bevy also has a robust and reasonable AI contributions policy.

The grading is ridiculously harsh, but it's also not the point. The point is:

This project shunts nuance out the door. It automates away the need for a person to actually look into a project when they have concerns. Now a person can just run a project through your program and either breathe a sigh of relief or have their biases confirmed. No longer is there a need to actually get a feel for a project yourself. To discover what the maintainers think. To even consider what the human beings who make the project think and feel or what their reasons might be. Put a URL in. Get a grade out.

So the project is of limited use because it throws away nuance and context. The project feels overly (and purposefully) negative because of the connotation of "slop" and the harsh grading. Unfortunately, the project also feels dehumanizing because it tries to aggregate all of the human decisions and factors that go into producing software, even when that production is done with the assistance of AI, into a single grade.

For a project that, and an author who, seem very concerned about the software other people build, and the ways they build that software, I think there is an unfortunate lack of thoughtfulness and compassion in this project, and how and why it was created.