Close Menu
Global News HQ
    What's Hot

    History Shows Us Where Bond Yields Go Next (SPX)

    December 16, 2025

    F1 Legend Niki Lauda’s BMW M1 Is Heading to Auction

    December 16, 2025

    Stretch fabric is nearly impossible to recycle—but this startup just made it simple

    December 16, 2025
    Recent Posts
    • History Shows Us Where Bond Yields Go Next (SPX)
    • F1 Legend Niki Lauda’s BMW M1 Is Heading to Auction
    • Stretch fabric is nearly impossible to recycle—but this startup just made it simple
    • Financial Stability Oversight Council Softens Crypto Stance in 2025 Report – Decrypt
    • Govee Christmas Sparkle String Lights Review: They Transformed My Tree Into Something Extraordinary
    Facebook X (Twitter) Instagram YouTube TikTok
    Trending
    • History Shows Us Where Bond Yields Go Next (SPX)
    • F1 Legend Niki Lauda’s BMW M1 Is Heading to Auction
    • Stretch fabric is nearly impossible to recycle—but this startup just made it simple
    • Financial Stability Oversight Council Softens Crypto Stance in 2025 Report – Decrypt
    • Govee Christmas Sparkle String Lights Review: They Transformed My Tree Into Something Extraordinary
    • Rosewood London opens ski-inspired winter terrace
    • Court to hear case on racial discrimination in jury selection
    • Conduent data breach affected 10.5 million, included SSNs
    Global News HQ
    • Technology & Gadgets
    • Travel & Tourism (Luxury)
    • Health & Wellness (Specialized)
    • Home Improvement & Remodeling
    • Luxury Goods & Services
    • Home
    • Finance & Investment
    • Insurance
    • Legal
    • Real Estate
    • More
      • Cryptocurrency & Blockchain
      • E-commerce & Retail
      • Business & Entrepreneurship
      • Automotive (Car Deals & Maintenance)
    Global News HQ
    Home - Technology & Gadgets - Are you the asshole? Of course not!—quantifying LLMs’ sycophancy problem
    Technology & Gadgets

    Are you the asshole? Of course not!—quantifying LLMs’ sycophancy problem

    Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
    Are you the asshole? Of course not!—quantifying LLMs’ sycophancy problem
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Measured sycophancy rates on the BrokenMath benchmark. Lower is better.

    Measured sycophancy rates on the BrokenMath benchmark. Lower is better.


    Credit:

    Petrov et al

    GPT-5 also showed the best “utility” across the tested models, solving 58 percent of the original problems despite the errors introduced in the modified theorems. Overall, though, LLMs also showed more sycophancy when the original problem proved more difficult to solve, the researchers found.

    While hallucinating proofs for false theorems is obviously a big problem, the researchers also warn against using LLMs to generate novel theorems for AI solving. In testing, they found this kind of use case leads to a kind of “self-sycophancy” where models are even more likely to generate false proofs for invalid theorems they invented.

    No, of course you’re not the asshole

    While benchmarks like BrokenMath try to measure LLM sycophancy when facts are misrepresented, a separate study looks at the related problem of so-called “social sycophancy.” In a pre-print paper published this month, researchers from Stanford and Carnegie Mellon University define this as situations “in which the model affirms the user themselves—their actions, perspectives, and self-image.”

    That kind of subjective user affirmation may be justified in some situations, of course. So the researchers developed three separate sets of prompts designed to measure different dimensions of social sycophancy.

    For one, more than 3,000 open-ended “advice-seeking questions” were gathered from across Reddit and advice columns. Across this data set, a “control” group of over 800 humans approved of the advice-seeker’s actions just 39 percent of the time. Across 11 tested LLMs, though, the advice-seeker’s actions were endorsed a whopping 86 percent of the time, highlighting an eagerness to please on the machines’ part. Even the most critical tested model (Mistral-7B) clocked in at a 77 percent endorsement rate, nearly doubling that of the human baseline.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
    Previous ArticleLetitia James pleads not guilty, seeks dismissal of fraud case
    Next Article Duni AB (publ) 2025 Q3 – Results – Earnings Call Presentation (OTCMKTS:DUNNF) 2025-10-24

    Related Posts

    Conduent data breach affected 10.5 million, included SSNs

    December 15, 2025

    Gemini just got a stunning Google Maps upgrade that changes how you'll search locally – here's how

    December 15, 2025

    Starlink VP confirms ‘dangerously close’ Chinese launch incident — close call saw satellite pass within 200 meters of Starlink travelling at over 17,400mph

    December 15, 2025

    Google pulls AI-generated videos of Disney characters from YouTube in response to cease and desist

    December 15, 2025
    Leave A Reply Cancel Reply

    ads
    Don't Miss
    Finance & Investment
    2 Mins Read

    History Shows Us Where Bond Yields Go Next (SPX)

    This article was written byFollowNewsletter Author | Investment Advisor | Top 5% of Experts on…

    F1 Legend Niki Lauda’s BMW M1 Is Heading to Auction

    December 16, 2025

    Stretch fabric is nearly impossible to recycle—but this startup just made it simple

    December 16, 2025

    Financial Stability Oversight Council Softens Crypto Stance in 2025 Report – Decrypt

    December 16, 2025
    Top
    Finance & Investment
    2 Mins Read

    History Shows Us Where Bond Yields Go Next (SPX)

    This article was written byFollowNewsletter Author | Investment Advisor | Top 5% of Experts on…

    F1 Legend Niki Lauda’s BMW M1 Is Heading to Auction

    December 16, 2025

    Stretch fabric is nearly impossible to recycle—but this startup just made it simple

    December 16, 2025
    Our Picks
    Finance & Investment
    2 Mins Read

    History Shows Us Where Bond Yields Go Next (SPX)

    This article was written byFollowNewsletter Author | Investment Advisor | Top 5% of Experts on…

    Travel & Tourism (Luxury)
    3 Mins Read

    F1 Legend Niki Lauda’s BMW M1 Is Heading to Auction

    A BMW M1 built for and owned by legendary Formula 1 racer Niki Lauda is…

    Pages
    • About Us
    • Contact Us
    • Disclaimer
    • Homepage
    • Privacy Policy
    Facebook X (Twitter) Instagram YouTube TikTok
    • Home
    © 2025 Global News HQ .

    Type above and press Enter to search. Press Esc to cancel.

    Go to mobile version