Developers get code questions wrong 63% of the time, soooo
BecomeMe
Social Experiment. Become Me. What I see, you see.
One of the main reasons was how detailed ChatGPT’s answers are. In many cases, participants did not mind the length if they are getting useful information from lengthy and detailed answers. Also, positive sentiments and politeness of the answers were the other two reasons.
Man, this answer is long, detailed, polite... it's great!
Sure, but it's wrong. It's just complete bullshit.
Yeah, sure.... still...
• A Purdue University study found that the ChatGPT OpenAI bot gave incorrect answers to programming questions half the time.
• However, 39.34% of participants preferred the ChatGPT responses due to their completeness and well-formulated language style.
• The study also showed that users can only identify errors in ChatGPT responses when they are obvious.
• Participants prefer ChatGPT's responses because of its polite language, articulated textbook-style responses, and comprehensiveness.
• The study is intended to complement the in-depth guidance and linguistic analysis of ChatGPT responses.
• The authors note that ChatGPT responses contain more "driving attributes" but do not describe risks as often as Stack Overflow posts.
glorified google search fails at answering novel or hard problems that haven't been answered before or answered badly.
52% is an understatement
Title feels misleading, it gets stack overflow questions wrong 52% of the time
However it got 77% of easy Leetcode questions correct. Also I believe that's first try, which is not generally how chatgpt should be used.
Also also, you should probably be using a coding specific model if you want good coding results
Every leetcode question has been answered a billion times and you train it on those billions of answers, it should get those right.
Probably because the model has seen thousands of possible solutions to those exact Leetcode problems. Actual questions people ask on StackOverflow tend to be much more specialized.
But it confidently explains the wrong answers.
I just hope politicians don't find out how to use it. It'll be our doom.