Tea@programming.dev to Technology@lemmy.world · English · 9 days ago
Reasoning models don't always say what they think. (www.anthropic.com)
Cross-posted to: technology@lemmy.zip
MagicShel@lemmy.zip · 9 days ago (edited)
Have they considered that a chain of reasoning can actually change the output? Because that is fed back into the input prompt. That's great for math and logic problems, but I don't think I'd trust the alignment checks.
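A minimal sketch of the feedback loop being described, assuming a generic autoregressive LLM: the `generate()` function below is a hypothetical placeholder, not any particular model or library API. The point is just that the reasoning text is appended to the prompt, so the final answer is conditioned on it.

```python
def generate(prompt: str) -> str:
    # Hypothetical stand-in for an autoregressive LLM call.
    # A real implementation would return the model's completion of `prompt`.
    return f"<completion of: {prompt[:40]}...>"


def answer_with_reasoning(question: str) -> str:
    # Step 1: ask the model to produce a chain of reasoning.
    reasoning = generate(f"{question}\nLet's think step by step:")

    # Step 2: feed that reasoning back into the prompt. Every token of the
    # final answer is now conditioned on the reasoning text, so changing the
    # chain can change the output -- which is why it helps on math/logic
    # tasks, but also why a plausible-looking chain isn't proof the model
    # actually relied on it.
    final_prompt = (
        f"{question}\nLet's think step by step:\n{reasoning}\n"
        "Therefore, the answer is:"
    )
    return generate(final_prompt)


if __name__ == "__main__":
    print(answer_with_reasoning("What is 17 * 24?"))
```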
DeathsEmbrace@lemm.ee · 9 days ago
It's basically using a reference point, and they want to make it sound fancier.