Global News HQ
    AI isn’t ready to replace human coders for debugging, researchers say



    Agents using debugging tools drastically outperformed those that didn’t, but their success rate still wasn’t high enough. Credit: Microsoft Research

    This approach is much more successful than relying on the models as they’re usually used, but when your best case is a 48.4 percent success rate, you’re not ready for prime time. The shortfall is likely because the models don’t fully understand how best to use the tools, and because their current training data is not tailored to this use case.

    “We believe this is due to the scarcity of data representing sequential decision-making behavior (e.g., debugging traces) in the current LLM training corpus,” the blog post says. “However, the significant performance improvement… validates that this is a promising research direction.”

    This initial report is just the start of the efforts, the post claims. The next step is to “fine-tune an info-seeking model specialized in gathering the necessary information to resolve bugs.” If the model is large, the best move to save inference costs may be to “build a smaller info-seeking model that can provide relevant information to the larger one.”

    This isn’t the first time we’ve seen outcomes that suggest some of the ambitious ideas about AI agents directly replacing developers are pretty far from reality. There have been numerous studies already showing that even though an AI tool can sometimes create an application that seems acceptable to the user for a narrow task, the models tend to produce code laden with bugs and security vulnerabilities, and they aren’t generally capable of fixing those problems.

    This is an early step on the path to AI coding agents, but most researchers agree it remains likely that the best outcome is an agent that saves a human developer a substantial amount of time, not one that can do everything they can do.


