OpenAI’s new “CriticGPT” model is trained to critique GPT-4 output
An illustration created by OpenAI.

On Thursday, OpenAI researchers unveiled CriticGPT, a new AI model designed to identify bugs in code generated by ChatGPT. It’s intended to improve the process of making AI systems behave in ways that people want (called “alignment”) via reinforcement learning from human feedback (RLHF), which helps human reviewers …