OpenAI’s new model is better at reasoning and, occasionally, deceiving

Tools And Technologies

Tags: new

Date posted: September 17, 2024

Feed: The Verge - All Posts

Illustration by Cath Virginia / The Verge | Photos by Getty Images

In the weeks leading up to the release of OpenAI’s newest “reasoning” model, o1, independent AI safety research firm Apollo found a notable issue. Apollo realized the model produced incorrect outputs in a new way. Or, to put things more colloquially, it lied.

Sometimes the deceptions seemed innocuous. In one example, OpenAI researchers asked o1-preview to provide a brownie recipe with online references. The model’s chain of thought — a feature that’s supposed to mimic how humans break down complex ideas — internally acknowledged that it couldn’t access URLs, making the request impossible. Rather than inform the user of this weakness, o1-preview pushed ahead, generating plausible but fake links and descriptions of them.

While AI models...
