

Ultimately what matters is whether it gets the correct answer or not.
That’s… not true at all. It had the right answer, to most of the questions I asked it, just as fast as R1, and yet it kept saying “but wait! maybe I’m wrong”. It’s a huge red flag when the CoT is just trying to 1000 monkeys a problem.
While it did manage to complete the strawberry problem when I adjusted the top_p/top_k, I was using the previous values with other models I’ve tested and never had a CoT go that off kilter before. And this is considering even the 7B Deepseek model was able to get the correct answer for 1/4 of the vram.
No one uses Thunderbird anymore anyways, which doesn’t matter as the ToS changes to Firefox are a nothing burger and won’t dissuade millions of people using it daily despite what the neck beards on Lemmy would have you believe.