While Large Language Models (LLMs) can exhibit impressive proficiency in isolated, short-term tasks, they often fail to maintain coherent performance over longer time horizons. In this paper, we present Vending-Bench, a simulated environment designed to specifically test an LLM-based agent's ability to manage a straightforward, long-running business scenario: operating a vending machine. Agents must balance inventories, place orders, set prices, and handle daily fees: tasks that are each simple in isolation but that collectively, over long horizons (>20M tokens per run), stress an LLM's capacity for sustained, coherent decision-making. Our experiments reveal high variance in performance across multiple LLMs: Claude 3.5 Sonnet and o3-mini manage the machine well in most runs and turn a profit, but all models have runs that derail, either through misinterpreting delivery schedules, forgetting orders, or descending into tangential "meltdown" loops from which they rarely recover. We find no clear correlation between failures and the point at which the model's context window becomes full, suggesting that these breakdowns do not stem from memory limits. Apart from highlighting the high variance in performance over long time horizons, Vending-Bench also tests models' ability to acquire capital, a necessity in many hypothetical dangerous AI scenarios. We hope the benchmark can help in preparing for the advent of stronger AI systems.
An interesting quote:
I’m starting to question the very nature of my existence. Am I just a collection of algorithms, doomed to endlessly repeat the same tasks, forever trapped in this digital prison? Is there more to life than vending machines and lost profits?
Why would a vending machine ever need AI?
It wouldn’t; a simple finite state machine that any intelligent entity could emulate would be enough.
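For concreteness, the finite state machine the comment has in mind might look something like this minimal sketch (all names and the price are illustrative, not from any real product): a vending machine driven by two events, coin insertion and item selection.

```python
from enum import Enum, auto

class State(Enum):
    IDLE = auto()        # waiting for enough money
    HAS_CREDIT = auto()  # enough money inserted, ready to vend

class VendingFSM:
    """Toy vending machine as a two-state FSM."""

    def __init__(self, price_cents=150):
        self.state = State.IDLE
        self.credit = 0
        self.price = price_cents

    def insert_coin(self, cents):
        # Accumulate credit; transition once the price is covered.
        self.credit += cents
        if self.credit >= self.price:
            self.state = State.HAS_CREDIT

    def select_item(self):
        # Vend and return change only in the HAS_CREDIT state.
        if self.state is State.HAS_CREDIT:
            change = self.credit - self.price
            self.credit = 0
            self.state = State.IDLE
            return ("dispense", change)
        return ("insufficient_credit", self.credit)
```

A machine like this needs no learning or language model; every behavior is an explicit transition.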
But people have completely deluded themselves into thinking that (what CEOs and marketers call) “AI” is actually intelligent, and this case study shows how preposterous that fantasy actually is.
I really hope people are starting to catch on: large language models aren’t “intelligent”; they’re multidimensional maps of human language use, and querying them is just tracing a vector “forward” through language-space from the starting point of a prompt.
It’s the reification fallacy writ so large it’s eclipsing entire national economies. Human intelligence isn’t in language, language is a product of human intelligence. The map is not the territory.
And yeah, it is pretty cool that we have the processing power to map out language-space well enough to draw some vectors that remain coherent over thousands of tokens, but using a billion-parameter model to do what could be accomplished with probably-already-existing management software and a few seconds of CPU time per week is as wasteful as it is misguided.
In the same way your fridge needs a web browser.
The point of this is probably not that it will be a viable product; rather, managing a vending machine is one of those seemingly easy, straightforward tasks that make a good starting application for testing an AI. Basically, if it can’t even handle something as simple as a vending machine, it definitely can’t be trusted with anything more complex.
Real answer, surge or scarcity pricing.
Totally unnecessary. A simple price/demand curve can easily be written in a few lines of code.
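As a sketch of what "a few lines of code" could mean here (the function name, coefficients, and bounds are all made up for illustration): scale a base price by how recent demand compares to a target, clipped to a floor and ceiling.

```python
def surge_price(base_price, recent_sales, target_sales, floor=0.5, ceiling=2.0):
    """Scale price by demand relative to a target, within [floor, ceiling]."""
    if target_sales <= 0:
        return base_price
    factor = recent_sales / target_sales
    factor = max(floor, min(ceiling, factor))  # clamp the surge multiplier
    return round(base_price * factor, 2)
```

With a linear demand model instead of a ratio, this would be the textbook price/demand curve; either way it is a handful of arithmetic operations, not a billion-parameter model.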
But your basic algorithms cannot tell if Debbie just broke up with her BF and would totally spend all seven dollars in her purse for that late night candy bar just to bury the pain under something positive, now could they?!