OpenAI’s CriticGPT outperforms humans in catching AI-generated code bugs

Enlarge / An illustration created by OpenAI. (credit: OpenAI)

On Thursday, OpenAI researchers unveiled CriticGPT, a new AI model designed to identify mistakes in code generated by ChatGPT. It aims to enhance the process of making AI systems behave in ways humans want (called “alignment”) through Reinforcement Learning from Human Feedback (RLHF), which helps human reviewers make large language model (LLM) outputs more accurate.

As outlined in a new research paper called “LLM Critics Help Catch LLM Bugs,” OpenAI created CriticGPT to act as an AI assistant to human trainers who review programming code generated by the ChatGPT AI assistant. CriticGPT—based on the GPT-4 family of LLMS—analyzes the code and points out potential errors, making it easier for humans to spot mistakes that might otherwise go unnoticed. The researchers trained CriticGPT on a dataset of code samples with intentionally inserted bugs, teaching it to recognize and flag various coding errors.

The researchers found that CriticGPT’s critiques were preferred by annotators over human critiques in 63 percent of cases involving naturally occurring LLM errors and that human-machine teams using CriticGPT wrote more comprehensive critiques than humans alone while reducing confabulation (hallucination) rates compared to AI-only critiques.

Developing an automated critic

The development of CriticGPT involved training the model on a large number of inputs containing deliberately inserted mistakes. Human trainers were asked to modify code written by ChatGPT, introducing errors and then providing example feedback as if they had discovered these bugs. This process allowed the model to learn how to identify and critique various types of coding errors.

Read 6 remaining paragraphs | Comments

What's your reaction?

Excited

Happy

In Love

Not Sure

Silly

OpenAI’s CriticGPT outperforms humans in catching AI-generated code bugs

Developing an automated critic

What's your reaction?

High Court Upends Purdue Pharma Bankruptcy Settlement

Draghi’s Globalist EU Speech Urges Concentration of Power

AI company trolls San Francisco with billboards saying “stop hiring humans”

AMD’s trusted execution environment blown wide open by new BadRAM attack

Reddit debuts AI-powered discussion search—but will users like it?

Leave a reply Cancel reply

More in:Editor's Pick

Ten months after first tease, OpenAI launches Sora video generation publicly

Your AI clone could target your family, but there’s a simple defense

New Broadcom sales plan may be “insignificant” in deterring VMware migrations

OpenAI’s new $200 monthly ChatGPT subscription will buy you more compute time

Posts List

In Memoriam: Fred Smith

Friday Feature: SEA Homeschoolers

Charlotte Plans an Expensive New Commuter Train in the Post-Commute Era

Developing an automated critic

Share

What's your reaction?

You may also like

Leave a reply Cancel reply

More in:Editor's Pick

Posts List

Latest Posts