Email or username:

Password:

Forgot your password?
Dare Obasanjo

Study of codebases over a 4 year period show Github Copilot lowers the quality of code over time by increasing the likelihood of bugs being introduced and copy & pasted code.

This is a healthy counterpoint to studies that show improved productivity.

Fast. Cheap. Good. Pick Two.

visualstudiomagazine.com/artic

19 comments
Kierkegaanks, Ο€/🦴

@carnage4life I’ll have fast and cheap please

- venture capitalists

thomastc

@carnage4life I'm as sceptical about "AI" as anyone, but it's worth noting that this is not an impartial peer-reviewed study, but rather a whitepaper published by a company that sells a product for code review by humans. They have skin in this game.

Robin Syl 🌸:blobcatreach:

@carnage4life "it easy to be fast if every step wrong" - uncle roger

Thomas πŸ”­βœ¨

@carnage4life "Save 5 minutes of developer time now, pay for a class action suit later!"

Jason Sando

@thomasfuchs @carnage4life Ugh. Even with all of us screaming "don't trust the output from AI!" some company will undoubtedly try to blame their own negligence on it. You just know it's coming at some point.

Feoh

@carnage4life It's interesting. I wonder how much of that is the advertised feature where you add a comment and copilot writes the entirety of the code for you.

I use Tab9 - tabnine.com/ and really appreciate its workflow. It just sits there silently while I code and helps with boilerplate and bases suggestions on the patterns that already exist within your code base and repository.

I find it rather helpful, but I'm also fluent enough with Python that I know where the potholes are.

I think there's room for tools like this to be used in a common sense way to boost productivity while maintaining quality, but I realize that's a minority opinion.

@carnage4life It's interesting. I wonder how much of that is the advertised feature where you add a comment and copilot writes the entirety of the code for you.

I use Tab9 - tabnine.com/ and really appreciate its workflow. It just sits there silently while I code and helps with boilerplate and bases suggestions on the patterns that already exist within your code base and repository.

Dave

@carnage4life has GitHub co-pilot been available for 4 years? It looks like it was released in 2021. Did the folks in this study have access to it before that?

San Wu

@carnage4life Even if that's true, I'm sure the statistics are very different per tiers of engineering quality. It's almost a certainty that bug rate increase with Copilot is way, way higher with low-quality software engineers (boot camp graduates) than with SDEs from top 25 schools.

Peter Amstutz

@carnage4life
Save two hours writing boilerplate, spend ten hours debugging incredibly subtle bugs in production, sounds like a win!

Yea, the fact that LLMs just inherently produce "plausible" results rather than "correct" results means producing bugs that are just that much harder to spot.

Lafncow :blobcatcoffee:

@carnage4life I would not be surprised if these conclusions are true, but this "study" is very flimsy. They compare total metrics across all measured codebases between 2023 and the prior years, assume that all differences are due to Copilot use, and then extrapolate conclusions. I am definitely an AI skeptic, but this is a marketing piece, not a research paper.

Patrick Lam

@lafncow @carnage4life the research just hasn't been done yet (and should be!) The press write-up does quote a workshop paper I was involved in but that's also not strong evidence yet either.

Lafncow :blobcatcoffee:

@va2lam @carnage4life I agree, this is an area that really should be researched! Thank you for jumping in, I hope you or others get to research these questions deeper!

Patrick Lam

@lafncow @carnage4life alas, the general productivity question is not one that I'm well equipped to investigate, but yes, specific properties of (generated) code are up my alley.

Go Up