LLMs’ “simulated reasoning” abilities are a “brittle mirage,” researchers find

Catoblepas@piefed.blahaj.zone · 4 days ago

LLMs’ “simulated reasoning” abilities are a “brittle mirage,” researchers find

Lvxferre [he/him]@mander.xyz · 4 days ago

You don’t say.

Imagine for a moment you had a machine that allows you to throw bricks a certain distance. This shit is useful, especially if you’re a griefer; but even if you aren’t, there are some corner cases for that, like transporting construction material over a distance.

And yet whoever sold you the machine calls it a “house auto-builder”. He tells you that it can help you to build your house. Mmmh.

Can house construction be partially automated? Certainly. Perhaps even fully. But not through a brick-throwing machine.

Of course, trying to use the machine for its advertised purpose will go poorly, even if you only delegate brick placement to it (and still build the foundation, add cement etc. manually). You might save a bit of time when the machine happens to throw a brick in the right place, but you’ll waste a lot of time cleaning up broken bricks, or replacing them. But it’s still being sold as a house auto-builder.

But the seller is really, really, really invested in this auto-construction babble. Because his investors gave him money to create auto-construction tools. And he keeps babbling on about how “soon” we’re going to get fully auto house building, and how it’s an existential threat to builders and all that babble. So he tweaks the machines to include “simulated building”. All it does is tweak the force and aim of the machine, so it’s slightly less bad at throwing bricks.

It still does not solve the main problem: you don’t build a house by throwing bricks. You need to place them. But you still have some suckers saying “haha, but it’s a building machine lmao, can you prove it doesn’t build? lol”.

That’s all that “reasoning” LLMs are about.

massive_bereavement@fedia.io · 4 days ago

You don’t get it.

In the past, the brick throwing machine was always missing its target and nowadays it is almost always hitting near its target. It depends on how good you are at asking the machine to throw bricks (you need to assume some will miss and correct accordingly).

Eventually, brick throwing machines will get so good that they will rely on gravitational forces to place the bricks perfectly and auto-build houses.

Plus you can vibe build: let it throw some random bricks and start building around them. You will be surprised by what it can achieve.

#building-is-dead #autobrick-engineer

Lvxferre [he/him]@mander.xyz · edit-2 4 days ago

You don’t get it.

I do get it. And that’s why I’m disdainful towards all this “simulated reasoning” babble.

In the past, the brick throwing machine was always missing its target and nowadays it is almost always hitting near its target.

Emphasis mine: that “near” is a sleight of hand.

It doesn’t really matter if it’s hitting “near” or “far”; in both cases someone will need to stop the brick-throwing machine, get into the construction site (as if building a house manually), place the brick in the correct location (as if building a house manually), and then redo operations as usual.

In other words, “hitting near the target” = “failure to hit the target”.

And it’s obvious why it’s wrong; the idea that an auto-builder should throw bricks is silly. It should detect where the brick should be placed, and lay it down gently.

The same thing applies to those large token* models; they won’t reach anywhere close to reasoning, just like a brick-throwing machine won’t reach anywhere close to an automatic house builder.

*I’m calling it “large token model” instead of “large language model” to highlight another thing: those models don’t even model language fully, except in the brain of functionally illiterate tech bros who think language is just a bunch of words. Semantics and pragmatics are core parts of a language; you don’t have language if utterances don’t have meaning or purpose. The nearest thing LLMs do is to plop in some mislabelled “semantic supplement” - because it’s a great red herring (if you mislabel something, you’re bound to get suckers confusing it with the real thing, and saying “I dun unrurrstand, they have semantics! Y u say they don’t? I is so confusion… lol lmao”).

It depends on how good you are at asking the machine to throw bricks (you need to assume some will miss and correct accordingly).

If the machine relies on you to be an assumer (i.e. to make shit up, like a muppet), there’s already something wrong with it.

Eventually, brick throwing machines will get so good that they will rely on gravitational forces to place the bricks perfectly and auto-build houses.

To be blunt, that stinks of wishful thinking from a distance.

As I implied in the other comment (“Can house construction be partially automated? Certainly. Perhaps even fully. But not through a brick-throwing machine.”), I don’t think reasoning algorithms are impossible; but it’s clear LLMs are not the way to go.

anachronist@midwest.social · 4 days ago

I think the brick that is the point of this parody sailed right over your head.😁

Lvxferre [he/him]@mander.xyz · edit-2 4 days ago

If it is not a parody, the user got a serious answer. And if it is, I’m just playing along ;-)

(If it is a parody, it’s so good that it allows me to actually answer it as if it wasn’t.)

Mac@mander.xyz · 4 days ago

It is most definitely satire, but that doesn’t mean your comments aren’t worth reading.

massive_bereavement@fedia.io · 4 days ago

And you should see the therapeutic effects of brick throwing and the very promising health applications.

You would be amazed at what you can achieve with a well-thrown brick.

massive_bereavement@fedia.io · 4 days ago

Sorry, I just got carried away with your analogy, like the proverbial brick thrown into the air by a large machine that is always very precisely almost often sometimes hitting its target.

Lvxferre [he/him]@mander.xyz · 4 days ago

I should apologise - I didn’t catch right off the bat that you were playing along with the analogy.

TehPers@beehaw.org · 4 days ago

You will be surprised by what it can achieve.

But not by what it can’t.

massive_bereavement@fedia.io · 4 days ago

We are probably ten years away from self-building houses.

Please invest in my company.

TehPers@beehaw.org · 4 days ago

You raise a good point. Consider me in.

anachronist@midwest.social · 4 days ago

This is a great analogy