Business

Apple researchers question AI’s reasoning ability in mathematics

Sat, Oct 12 2024 10:52:03 AM

New Delhi, Oct 12 (IANS): A team of Apple researchers has questioned the formal reasoning capabilities of large language models (LLMs), particularly in mathematics.

They found that LLMs exhibit noticeable variance when responding to different instantiations of the same question.

The researchers noted that existing literature suggests the reasoning process in LLMs is probabilistic pattern-matching rather than formal reasoning.

Although LLMs can match more abstract reasoning patterns, they fall short of true logical reasoning. Small changes in input tokens can drastically alter model outputs, indicating a strong token bias and suggesting that these models are highly sensitive and fragile.

“Additionally, in tasks requiring the correct selection of multiple tokens, the probability of arriving at an accurate answer decreases exponentially with the number of tokens or steps involved, underscoring their inherent unreliability in complex reasoning scenarios,” said Apple researchers in their paper titled “GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models.”
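The exponential decline described in the quote can be illustrated with a small calculation. The sketch below is only an illustration, not the paper's own model: it assumes each step is correct independently with the same probability, and the function name `chain_success_probability` and the 95 per cent per-step accuracy are hypothetical values chosen for the example.

```python
def chain_success_probability(per_step_accuracy: float, num_steps: int) -> float:
    """Probability that every step in a chain is correct.

    Assumption (for illustration only): steps succeed independently,
    each with the same probability `per_step_accuracy`.
    """
    if not 0.0 <= per_step_accuracy <= 1.0:
        raise ValueError("per_step_accuracy must be between 0 and 1")
    if num_steps < 0:
        raise ValueError("num_steps must be non-negative")
    # Independent events multiply, so the chance of a fully correct
    # chain shrinks exponentially as the number of steps grows.
    return per_step_accuracy ** num_steps


if __name__ == "__main__":
    # Hypothetical per-step accuracy of 95 per cent.
    for steps in (1, 5, 10, 20):
        probability = chain_success_probability(0.95, steps)
        print(f"{steps:>2} steps: {probability:.3f}")
```

Under these assumptions, even a model that gets each individual step right 95 per cent of the time completes a 20-step chain correctly only about a third of the time.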

The ‘GSM8K’ benchmark is widely used to assess the mathematical reasoning of models on grade-school level questions.

While the performance of LLMs on GSM8K has significantly improved in recent years, it remains unclear whether their mathematical reasoning capabilities have genuinely advanced, raising questions about the reliability of the reported metrics.

To address these concerns, the researchers conducted a large-scale study on several state-of-the-art open and closed models.

“To overcome the limitations of existing evaluations, we introduce GSM-Symbolic, an improved benchmark created from symbolic templates that allow for the generation of a diverse set of questions,” the authors wrote.
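The idea of generating many questions from one symbolic template can be sketched as follows. This is a minimal illustration under stated assumptions, not the benchmark's actual code: the template text, the name list, the value ranges, and the names `Instance`, `generate_instance` and `generate_dataset` are all hypothetical.

```python
import random
from dataclasses import dataclass

# Hypothetical template: names and numbers are placeholders to be filled in.
TEMPLATE = (
    "{name} has {x} apples and buys {y} more. "
    "{name} then gives away {z} apples. "
    "How many apples does {name} have left?"
)
NAMES = ["Asha", "Ben", "Chen", "Dara"]  # illustrative names only


@dataclass(frozen=True)
class Instance:
    question: str
    answer: int


def generate_instance(rng: random.Random) -> Instance:
    """Fill the template with random values and compute the ground truth."""
    name = rng.choice(NAMES)
    x = rng.randint(5, 50)
    y = rng.randint(1, 30)
    # Constraint on the variables: never give away more than is held,
    # so the answer stays non-negative.
    z = rng.randint(1, x + y)
    question = TEMPLATE.format(name=name, x=x, y=y, z=z)
    return Instance(question=question, answer=x + y - z)


def generate_dataset(size: int, seed: int = 0) -> list[Instance]:
    """Generate `size` instantiations of the same underlying question."""
    rng = random.Random(seed)  # seeded so the set is reproducible
    return [generate_instance(rng) for _ in range(size)]


if __name__ == "__main__":
    for instance in generate_dataset(3, seed=42):
        print(instance.question, "->", instance.answer)
```

Because every instance shares the same reasoning steps and differs only in names and numbers, a model's accuracy can be compared across instantiations of what is logically the same question.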

GSM-Symbolic enables more controllable evaluations, providing key insights and more reliable metrics for measuring the reasoning capabilities of models.

“Our findings reveal that LLMs exhibit noticeable variance when responding to different instantiations of the same question,” the researchers said, adding that overall, “our work provides a more nuanced understanding of LLMs’ capabilities and limitations in mathematical reasoning.”
