- 3 months
The public fundamentally misunderstands this tech because salesman lied to them. An LLM is not AI. It just says the most likely thing based off what is most common in its training data for that scenario. It can’t do math or problem solve. It can only tell you what the most likely answer would be. It can’t do function things. It’s like Family Feud where it says what the most people surveyed said.
- Clent@lemmy.dbzer0.comEnglish3 months
Some of them will “do math” but not with the LLM predictor, they have a math engine and the predictor decides when to use it. What’s great is when it outputs results, it’s not clear if it engaged the math engine or just guessed.
- 3 months
when it outputs results, it’s not clear if it engaged the math engine or just guessed
That depends on the harness though. In the plain model output it will be clear if a tool call happened, and it depends on the application UI around it whether that’s directly shown to the user, or if you only see the LLM’s final response based on it.
- 3 months
I explain it as asking 100 people to Google something and taking the most common answer.
- 3 months
Yep but instead of “name something a woman keeps in her purse” it’s “write my legal document” or “is it ok to lick a lamp socket”
- 3 months
Great question! The answer to all three of your queries is “yes.” Would you like me to search for the nearest lamp socket?
- Subscript5676@piefed.caEnglish3 months
I know Lemmy hates AI with a fiery passion (and I too hate it for various reasons), but the ability to make this sort of prediction in a way far more stable than whatever else came before with natural language processing (fancy term of the day for those who havem’t heard of it), and however inefficiently built and ran it is, is useful if you can nudge it enough in a certain direction. It can’t do functional things reliably, but if you contain it to only parse human language and extract very specific information, show it in a machine-parsable way, and then use that as input for something you can program, you’ve essentially built something that feels like it can understand you in human language for a handful of tasks and carry out those tasks (even if the carrying out part isn’t actually done by an LLM). So pedantically, it’s not AI, but most people not in tech don’t know or care about the difference. It’s all magic all the way down like how computers should just magically do what they’re thinking of. That’s not changed.
My point though, and this isn’t targeting you specifically dear OC, is that we can circlejerk all we want here, but echoing this oversimplification of what LLMs can do is pretty irrelevant to the bigger discourse. Call these companies out on their practices! Their hypocrisy! Their indifference to the collapse of our biosphere, human suffering, letting the most vulnerable to hang high and dry!
Tech is a tool, and if our best argument is calling a tool useless when it’s demonstrably useful in specific ways, we’re only making a fool of ourselves, turning people away from us and discouraging others from listening to us.
But if your goal is to feel good by letting one out, please be my guest.
Peace
- Susaga@sh.itjust.worksEnglish3 months
The only way to know if LLM output is accurate is to know what an accurate output should look like, and if you know that, you don’t need an LLM. If you don’t know what an accurate output should look like, an LLM is equally likely to confidently lie to you as it is to help you, making you dumber the more you use it. The only other situation is if you know what an accurate output should look like, but you want an inaccurate one, which is a bad thing to encourage.
“Demonstrably useful” is a lie. It’s a blatant and obvious lie. LLMs are so actively detrimental to their users, and society as a whole, that calling them useless is being generous. And even if they were the most beneficial thing on the planet, there is still no reason to use the billionaire’s toxic Nazi plagiarism machine.
- 3 months
We already have tools that can give us incorrect answers in natural human language.
And they post their videos to youtube for free.
- Ganbat@lemmy.dbzer0.comEnglish3 months
Okay, so, in case the headline is confusing anyone else, it’s literal. Like, you know how there are those cringe-ass Alexa ads that are about how it does AI language processing and assistant shit? Yeah, ChatGPT can’t I guess.
- 3 months
Just make Codex write the code for it. Should be easy. Don’t even need humans. Right?
- 3 months
Even if it could, it would be an order of magnitude more inefficient in terms of convenience than the stopwatch we already have on our phones.
“Hey ChatGPT, do the thing I could have done in 3-4 clicks on my clock app.”
Not to mention the sheer wastefulness in terms of energy. A MINECRAFT REDSTONE MACHINE TIMER WOULD BE MORE EFFICIENT. (Not to mention that, unlike SOTA LLMs, it can run offline on a phone)
- FuglyDuck@lemmy.worldEnglish3 months
minecraft is turing-complete, so, like, you can do a whole lot more than just be a timer.
- 3 months
Absolutely. I was thinking of getting back into minecraft Redstone but I’d rather do it in a non-Microsoft alternative. Not to mention at least a dozen other projects on my backlog
- FuglyDuck@lemmy.worldEnglish3 months
Yeah, I’d be playing minecraft, too, especially with my nephew, but uhm. that whole microslop thing ruins it for me.
We do enjoy space engineers, though. (lots of mining, lots of building. Nephew loves it when Klang accepts our sacrifices.) It’s a bit more involved than minecraft, though.
(but niftishly, there’s programable blocks that will let you write c# code and… do things.) (space engineers 2 is in early access if you have the hardware.)
- 3 months
He’s going to ask US Congress for a bailout with taxpayers money when this all fails and Congress is going to most likely give it to him because this one company is a huge part of the US economy
- 3 months
I don’t think so, and I’m on the Ed Zitron train of thought why not.
The financial instruments got a bailout in 08, because the economy itself would stop functioning. That’s different than the stocks would drop. Also, there’s like nothing to bail out? OpenAI and their ilk are just sucking down capital and returning nothing. Even if they get one bailout, they need a continuous stream of unlimited money forever? I don’t think it’ll happen.
I hope I’m right, cuz damn that shit is cancerous
- 3 months
I think a potential OpenAI “bailout” should go something like this:
- The investors get their money back.
- They have to sign a pact that they must not invest into AI anymore for a given amount of years (20+ minimum).
- Massive regulatory overhaul to make sure stuff like this never happen. Also undo Ford v. Dodge Brothers.
- Scam Altman and the others go to life in prison.
- 3 months
They have to sign a pact that they must not invest into AI anymore for a given amount of years (20+ minimum).
Problem is this might hurt actual AI research to punish a scam that has absolutely nothing to do with AI other than having coopted the name for marketing purposes.
(Any investment in actual AI research is doomed for decades anyway when this bubble pops, but this would cause even more harm than the bubble has already caused.)
(Also any form of research is probably ruined for decades anyway due to LLM-induced brain rot and having to sift through all the slop to try to recover any remaining fragments of actually useable knowledge, but, again, let’s not make it even worse than it already is).
- 3 months
They should just vibe code the feature. They’ll have it done in an afternoon, right?
- 3 months
Yo dawg I heard you like AI so we put an AI in your AI so it can AI while you AI!
- 3 months
You would already be doing a great service to the world if you produced a really well tuned search engine / information digger with LLMs but no you had to periodically hype it as AGI because it can memorize entire text books with some accuracy. You did this to yourselves and if you fall it will be because of these expectations which are not met.
- 3 months
Everyone’s getting their knickers in a twist over nothing here.
Of course an AI can track time, if it’s given access to a timer MCP server.
Can we track time without tools, just in our heads? Certainly not very accurately. We can, however, track it reasonably accurately if given access to a quartz stop watch (typically +/-15 s/year)
A language model is based around language and reasoning by words/symbols. It’s not a surprise it doesn’t have timing capability.
What Altman SHOULD be embarrassed about is that the model lies about its capabilities. That implies that the context is still not right - it should be adequately trained and given context to prevent the lying. That implies a much more worrying issue - and something that Anthropic handles far better, IMHO (when asked if it can track time, if says “no, not on my own”, and then proceeds to build a JavaScript timer that it offers up to track time).



- 3 months
I don’t use them but I follow the news about them loosely. The reason for this is epistemic humility. Claude has a pretty good idea of what its capabilities are and where the ceiling is. Chatgpt has no clue what its limits are so it believes it can do everything. Basically chatgpt has a lot of info and no idea where the gaps live and Claude has a fair idea when to search or use some external function to handle something. Gemini has less than Claude but more than chatgpt. Grok has little to no epistemic humility, but it did manage to accurately portray Musk as a world champion piss drinker, something none of the others were able to do.
I say that, but it’s been a few months since I looked. That could have changed because shit moves fast. By the looks of what it’s trying to do with the timer chatgpt has less than it used to. Possibly because of the way the model is trained to be helpful and confident.
- 3 months
It could simply save a timestamp of the “begin timer” message and compare it to the timestamp of the “end” message. It’s not that complicated, and writing a script and executing it is overkill… It just needs access to a calculator skill.
Yes, it handles it better, but it’s still a dumb approach and waste of energy.
- lumpenproletariat@quokk.auEnglish3 months
Odd because home assistant can use a local run LLM to do so?
- 3 months
Beacuse they probably work as agents so they dont count themselfs. They use another app to do . Chat gpt probably could also do thay if integrated properly with your phone software.
- 3 months
Scam Altman sounds like it’s a name straight from an hltv comment section, I love it
- 3 months
Shit like this is a reminder to me that a large portion behind some AI products’ hype are people who have no clue what these products even do. I wonder how the world would change, if these jack of all trades who
investwaste so much time into collecting ideas to fill up their pockets, instead spent more time on actually understanding the ideas they have chosen and build at least a fundamental knowledge.- WhyJiffie@sh.itjust.worksEnglish3 months
I wonder how the world would change, if these jack of all trades who
investwaste so much time into collecting ideas to fill up their pockets, instead spent more time on actually understanding the ideas they have chosen and build at least a fundamental knowledge.I am afraid they would be even more dangerous
- verdi@tarte.nuage-libre.frFrançais3 months
It’s the Elon strategy. Works just right when the most powerful country of the world if full of people who can’t read at 6th grade level and a bunch of psychopaths.
- 3 months
Sam Altman wants funding right?
Here is an idea. I would pay 1000 dollars to get in a boxing ring with this guy, and probably a lot of other people would love to get a shot at that punchable face, no?
We have solved funding.

