14 Oct, 2024
1 min read

A recent study by Apple's AI research team has revealed critical weaknesses in the logical reasoning capabilities of large language models (LLMs), including those from OpenAI and Meta.

Published on arXiv, the study found that slight changes in the wording of mathematical questions can cause significant variations in the models' answers, undermining their reliability in tasks requiring logical consistency.
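The idea behind that finding is easy to illustrate. The sketch below is a minimal, hypothetical version of such a rewording probe; the template, names, and numbers are stand-ins, not the study's actual benchmark data or code.

```python
# A minimal sketch of a rewording probe: generate surface-level variants
# of one grade-school problem and check whether answers stay consistent.
# The template, names, and numbers are hypothetical stand-ins.
import random

TEMPLATE = (
    "{name} picks {fri} apples on Friday and {sat} apples on Saturday. "
    "How many apples does {name} have in total?"
)

def make_variant(rng: random.Random) -> tuple[str, int]:
    """Build one surface-level rewording and its ground-truth answer."""
    name = rng.choice(["Sophie", "Liam", "Noor", "Mateo"])
    fri, sat = rng.randint(10, 90), rng.randint(10, 90)
    return TEMPLATE.format(name=name, fri=fri, sat=sat), fri + sat

if __name__ == "__main__":
    rng = random.Random(0)
    # Every variant asks the same question; a model that truly reasons
    # should get all of them right, not just the familiar phrasings.
    for prompt, truth in (make_variant(rng) for _ in range(5)):
        print(f"{prompt}  (expected: {truth})")
```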

The study showed that LLMs tend to rely on pattern matching rather than genuine reasoning. In one test, adding irrelevant information to a math problem caused the models to miscalculate, even though the extra detail had no bearing on the correct answer.
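As a rough illustration of that kind of probe (the wording below is a paraphrase, not taken from the study's test set), a clean problem can be paired with a twin that adds a numerically irrelevant clause; a reliable solver should give the same answer to both.

```python
# A minimal sketch of the distractor probe: pair a clean problem with a
# twin that adds a numerically irrelevant clause. The wording is an
# illustrative paraphrase, not the study's actual test data.
BASE = ("Oliver picks 44 kiwis on Friday and 58 kiwis on Saturday. "
        "How many kiwis does he have?")
DISTRACTOR = " Five of the Saturday kiwis were a bit smaller than average."

EXPECTED = 44 + 58  # the distractor must not change this

def build_probe(base: str, distractor: str) -> tuple[str, str]:
    """Return the clean problem and its distractor-augmented twin."""
    return base, base.rstrip() + distractor

if __name__ == "__main__":
    clean, noisy = build_probe(BASE, DISTRACTOR)
    print("clean:", clean)
    print("noisy:", noisy)
    print("expected answer for both:", EXPECTED)
    # A model that subtracts the "smaller" kiwis has been distracted by
    # surface patterns rather than what the question actually asks.
```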

As the researchers put it: "We found no evidence of formal reasoning in language models. Their behavior is better explained by sophisticated pattern matching—so fragile, in fact, that changing names can alter results by ~10%."

Apple suggests that combining neural networks with traditional symbol-based reasoning, an approach known as neurosymbolic AI, may be necessary to improve AI's problem-solving abilities.
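As a toy-level sketch of what such a combination could look like (the pipeline and the extract_plan stand-in below are assumptions for illustration, not Apple's proposal), the neural side would translate the word problem into a formal expression and a symbolic evaluator would do the arithmetic, so wording changes cannot perturb the calculation.

```python
# Toy neurosymbolic pipeline: a (hypothetical) neural step handles the
# language, and a deterministic symbolic step does the arithmetic.
import ast
import operator

# Only plain arithmetic is evaluated symbolically; anything else is rejected.
_OPS = {ast.Add: operator.add, ast.Sub: operator.sub,
        ast.Mult: operator.mul, ast.Div: operator.truediv}

def evaluate(expr: str) -> float:
    """Deterministically evaluate an arithmetic expression string."""
    def walk(node: ast.AST) -> float:
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        raise ValueError("unsupported expression")
    return walk(ast.parse(expr, mode="eval").body)

def extract_plan(problem: str) -> str:
    """Hypothetical stand-in for a neural model that turns a word problem
    into an arithmetic expression; a real system would call an LLM here."""
    return "44 + 58"

if __name__ == "__main__":
    problem = "Oliver picks 44 kiwis on Friday and 58 on Saturday. How many in total?"
    print(evaluate(extract_plan(problem)))  # -> 102.0
```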