New OpenAI GPT-4 service will help spot errors in ChatGPT coding suggestions

In a bid to increase the usefulness of generative AI tools to developers, OpenAI has introduced CriticGPT, a new model it says can help identify errors in ChatGPT code outputs.

Based on GPT-4, OpenAI claims CriticGPT has been able to outperform unaided efforts 60% of the time, showing its ability to enhance human performance in code review tasks, rather than replace human workers.

OpenAI’s initiative aims to refine the ‘Reinforcement Learning from Human Feedback’ (RLHF) process in order to ensure higher quality and greater reliability in AI systems.

OpenAI launches new code-checking model

OpenAI’s latest GPT-4 series, which powers publicly available versions of ChatGPT, relies heavily on RLHF to ensure that its outputs are both reliable and interactive. Up until now, this process has been a manual one that has leaned on the human power of AI trainers, who have rated ChatGPT responses to improve the model’s performance.

With the launch of CriticGPT, OpenAI can now critique ChatGPT’s answers autonomously, which addresses concerns over the AI chatbot becoming too sophisticated for many human trainers.

CriticGPT was trained by trainers providing feedback after inserting intentional mistakes into ChatGPT-generated code. The results were promising, with CriticGPT’s critiques preferred by trainers around two-thirds (63%) of the time thanks to the tool’s ability to reduce nitpicks and hallucinations.

However, the project isn’t without its limitations, and AI-human collaboration continues to prove more effective compared to AI alone.

In its announcement, OpenAI summarized: “CriticGPT’s suggestions are not always correct, but we find that they can help trainers to catch many more problems with model-written answers than they would without AI help.”

The company also acknowledged that “mistakes can be spread across many parts of an answer,” which makes it more complex for an AI tool to identify the cause.

Looking ahead, OpenAI has confirmed plans to scale its work on CriticGPT and to put it into practice.

More from TechRadar Pro

Source link

Breaking News

The best free VPNs of 2025: Expert tested

The Best High-Protein Breakfast for High Cholesterol

Skeleton Crew ending explained: the truth about the Supervisor, Jod’s Jedi abilities, and more

Why I prefer this cordless stick vacuum over my Dyson – and it has nothing to do with price

Finally, a MagSafe battery pack that works flawlessly with my Pixel 9 Pro – and it has a cooling fan

These Alexa-enabled smart glasses beat the Meta Ray-Bans in key ways, and they’re $90 off right now

This low-cost Lenovo gaming PC is a desktop I recommend to most people – and it’s on sale

Harry Potter is getting a surprise collaboration with one of Japan’s longest-running anime

How the ‘ChatGPT of healthcare’ could accelerate rheumatoid arthritis treatment

The best free VPNs of 2025: Expert tested

The Best High-Protein Breakfast for High Cholesterol

Skeleton Crew ending explained: the truth about the Supervisor, Jod’s Jedi abilities, and more

Why I prefer this cordless stick vacuum over my Dyson – and it has nothing to do with price

New OpenAI GPT-4 service will help spot errors in ChatGPT coding suggestions

More From Author

The best free VPNs of 2025: Expert tested

The Best High-Protein Breakfast for High Cholesterol

Skeleton Crew ending explained: the truth about the Supervisor, Jod’s Jedi abilities, and more

+ There are no comments

Cancel reply

Paramount Plus’ Dexter prequel series is ready to slay as Buffy herself joins the cast

Game Dev Delays Its New Game To Avoid The Sequel To The Series It Used To Make

You May Also Like:

The best free VPNs of 2025: Expert tested

The Best High-Protein Breakfast for High Cholesterol

Skeleton Crew ending explained: the truth about the Supervisor, Jod’s Jedi abilities, and more

Why I prefer this cordless stick vacuum over my Dyson – and it has nothing to do with price

Finally, a MagSafe battery pack that works flawlessly with my Pixel 9 Pro – and it has a cooling fan

These Alexa-enabled smart glasses beat the Meta Ray-Bans in key ways, and they’re $90 off right now

This low-cost Lenovo gaming PC is a desktop I recommend to most people – and it’s on sale

Harry Potter is getting a surprise collaboration with one of Japan’s longest-running anime

Breaking News

Top Tagged

+ There are no comments

Paramount Plus’ Dexter prequel series is ready to slay as Buffy herself joins the cast

Game Dev Delays Its New Game To Avoid The Sequel To The Series It Used To Make