#`repo-slopscore`: Analyzing git repos for signs of AI involvement

28 messages · Page 1 of 1 (latest)

lethal pond
#

Link to deployment | Source code

"Repo slopscore" is a CLI + optional webapp which collects evidence of AI assistance in git repositories; specifically, right now, "git repositories" means "git https:// url".

It goes through the entire commit history of a repository (upper limit is 5000 commits currently, but I have ideas to change this in the future) and detects visible signs of AI/LLM tool usage in the commit history and the source tree. The tool uses partial cloning to reduce data usage (and potentially forge server load) by skipping blobs/blob contents entirely.

Additionally, aggressive caching is used to ensure that a repo that has been analyzed before does not need to get fully cloned again. Instead, the tool remembers the precise point in time (+commit hash) of the most up-to-date commit to the repo and uses this knowledge to only clone from that point onward the next time analysis is requested.

Go to the source code page I linked to find out more about how this works internally, and feel free to throw as many git repos as you want (...within a reasonable limit) into the tool! :)

brazen gale
#

This is pretty cool

#

While the false negative rate is maybe a little high due to lack of heuristic analysis (a decent amount of AI projects disable commit trailers and don't commit their agent configs either), by the same token no heuristic analysis means you can't have false positives, which is actually quite a beneficial property to have

#

It also runs pretty dang fast (tried via the website linked on the repo)

#

This is gonna save people a lot of future git log / git greping ^^

#

I am curious, how is the score field on the website calculated, and does it hold any particular meaning?

#

I saw a repo with ~13 signals had a score of 62 (C), a repo with 0 signals got 100 (S) (obvious), and a repo with 117 signals got 0 (F)

#

Also, I think it would be really interesting to add some histograms to the scan data

#

Such as %age of AI-assisted commits per month over time

mystic ore
supple pilot
#

I mean some false positives are sadly unavoidable but I really like the idea of the project. Also bonus points for codeberg ❤️

stuck sigil
#

this is a gem ❤️

low eagle
#

we need it here badly

stuck sigil
#

@lethal pond can you make an endpoint for the hosted version that performs a slopscore from a query param or something and then shows it in discord embed

lethal pond
#

Oh my I didn't even see that this got this many responses! I'll try to get back to everyone tomorrow :)

lethal pond
lethal pond
lethal pond
lethal pond
lethal pond
lethal pond
lethal pond
stuck sigil
#

yeah

#

it would be really funy to post that in showcase channel for obviously sloperated projects

low eagle