Skip to content
Christian Bühlmann
  • À propos / About
    • About (ENG)
    • Au sujet de ce site
    • Hello
    • Follow
      • Follow me with RSS
      • Follow me on the fediverse
    • Contact
    • Curriculum Vitae
      • Employment History
      • Education and skills
      • Volunteering
  • Articles
    • Profession
    • Personnel
  • Recherches et publications
    • Thèse de doctorat
    • Blogue Recherche / Research Blog
    • Liste des publications / Publication list
    • Commentaires
    • Présentations
    • Détail des publications
    • Video Trailers
  • Photos
    • Galerie
    • Instagraphe
  • Ramage
  • Webmentions
note

Llm

User Avatar of Christian Bühlmann Christian Bühlmann
· 31 octobre 2024 · 1 minute to read
Llm

The LLM Reasoning Debate Heats Up

Three recent papers examine the robustness of reasoning and problem-solving in large language models


Melanie Mitchell Oct 21, 2024

One of the fieriest debates in AI these days is whether or not large language models can reason.

In May 2024, OpenAI released GPT-4o (omni), which, they wrote, “can reason across audio, vision, and text in real time.”  And last month they released the GPT-o1 model, which they claim performs “complex reasoning”, and which achieves record accuracy on many “reasoning-heavy” benchmarks.

But others have questioned the extent to which LLMs (or even enhanced models such as GPT-4o and o1) solve problems by reasoning abstractly, or whether their success is due, at least in part, to matching reasoning patterns memorized from their training data, which limits their ability to solve problems that differ too much from what has been seen in training.

Melanie Mitchell https://aiguide.substack.com/p/the-llm-reasoning-debate-heats-up

Sharing is caring ❤️

User Avatar of Christian Bühlmann Christian Bühlmann
Subscribe to author feed
Categories
  • Ramage
Syndication Links
Licence Creative Commons
Ce(tte) œuvre est mise à disposition selon les termes de la Licence Creative Commons Attribution - Pas d'Utilisation Commerciale - Pas de Modification 4.0 International.
I Don’t Track You Here, But Others Might
-----