Using SimpleQA, its own fact-checking benchmark, OpenAI acknowledged that its new large language model (LLM) GPT-4.5 "hallucinates", that is, makes things up, in 37% of cases.

The AI model from a company valued at hundreds of billions of dollars gives a false answer to more than one in three questions. OpenAI tries to spin GPT-4.5's "lying" problem as good news, claiming that the chatbot does not hallucinate as often as models from other companies.
The graph shows how often the new AI model lies. It is also known that GPT-4o, the allegedly improved "reasoning" model, hallucinates in 61.8% of cases, as SimpleQA fact-checking established. o3-mini from OpenAI, a cheaper, scaled-down model, turned out to hallucinate in 80.3% of cases.
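For readers unfamiliar with how such figures are produced, a hallucination rate on a question-answering benchmark like SimpleQA is typically derived from graded answers. The Python sketch below is purely illustrative and assumes a simple grading scheme ("correct", "incorrect", "not_attempted"); it is not OpenAI's actual scoring code, and the exact definition OpenAI uses may differ.

```python
from collections import Counter

def hallucination_rate(grades: list[str]) -> float:
    """Share of attempted answers that were graded incorrect (confidently wrong).

    Assumes each answer was graded as "correct", "incorrect", or "not_attempted".
    """
    counts = Counter(grades)
    attempted = counts["correct"] + counts["incorrect"]
    return counts["incorrect"] / attempted if attempted else 0.0

# Hypothetical example: 1000 graded answers from a benchmark run
grades = ["correct"] * 600 + ["incorrect"] * 370 + ["not_attempted"] * 30
print(f"Hallucination rate: {hallucination_rate(grades):.1%}")  # ~38.1%
```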
Of course, this problem is not unique to OpenAI, the outlet writes.
"At present, even the best models can generate hallucination-free text only about 35% of the time," said Wenting Zhao, a doctoral student at Cornell University. "The most important takeaway from our fact-checking results is that we cannot yet fully trust the output these models generate."
Beyond the distrust it breeds toward a company that attracts hundreds of billions of dollars in investment for products with such problems, this says a lot about the AI industry and about what exactly is being sold to us: expensive, resource-intensive systems that are supposed to approach human-level intelligence yet still cannot get basic facts right, the authors of the piece complain.
With OpenAI's language models no longer growing in performance, the company is clearly grasping at straws to revive the interest in its chatbot that ran high back when ChatGPT first appeared. But for that, we will probably need to see a real breakthrough rather than even more lies, the authors conclude.