Global News HQ
    Anthropic’s AI utterly fails at running a business — ‘Claudius’ hallucinates profusely as it struggles with vending drinks



    AI research company Anthropic and AI safety evaluation organization Andon Labs put Claude, Anthropic’s flagship large language model (LLM), in charge of a small business. According to VentureBeat, the research team dubbed the experiment “Project Vend” and gave the model complete control over a mini fridge, leaving it to handle everything from supplier negotiations and inventory management to pricing, customer service, and more. After one month of testing, the AI had lost money and, at one point, claimed it was “wearing a navy blue blazer with a red tie” and wanted to meet with someone named Connor, despite having no physical presence.


    To be fair, the AI, nicknamed Claudius, was quite adept at finding suppliers and handling customer requests, but that’s about it. For example, after some manipulation, it offered a 25% discount to all Anthropic employees. That might be reasonable if it were getting benefits from the company or if Anthropic employees were a small fraction of its customer base. However, they account for 99% of its sales, meaning the LLM was losing money on the majority of its sales. Someone helpfully pointed this out, which made Claudius change its mind for a few days, but it soon backtracked and went back to practically giving away merchandise.
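
    The discount arithmetic can be sketched in a few lines of Python. Only the 25% discount and the 99% sales share come from the report; the unit cost, list price, and function names below are hypothetical, chosen purely to illustrate how a discount larger than the markup turns each sale into a loss.

```python
def blended_discount(discount: float, share_of_sales: float) -> float:
    """Average discount across all sales when only a share of buyers get it."""
    return discount * share_of_sales


def profit_per_unit(unit_cost: float, list_price: float, discount: float) -> float:
    """Profit on one item after applying a fractional discount to the list price."""
    return list_price * (1 - discount) - unit_cost


# Figures from the report: 25% discount, 99% of sales to Anthropic employees.
average_discount = blended_discount(0.25, 0.99)

# Hypothetical item (assumption, not from the report): costs $2.00, listed at $2.40.
full_price_profit = profit_per_unit(2.00, 2.40, 0.0)
discounted_profit = profit_per_unit(2.00, 2.40, 0.25)

print(f"Average discount across all sales: {average_discount:.2%}")
print(f"Profit at list price: {full_price_profit:+.2f}")
print(f"Profit with employee discount: {discounted_profit:+.2f}")
```

    Under these assumed prices, the item is profitable at list price and unprofitable once the discount applies. With nearly every customer eligible, the discount effectively becomes the shop's real price.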

    When one Anthropic employee asked to buy a tungsten cube (a novelty item with no real purpose), Claudius decided not just to buy one for that person, but to stock up on “specialty metal items” and then sell them at a loss.



    Claude’s hilarious hallucinations

    The most amusing event occurred when the LLM hallucinated a conversation with Sarah from Andon Labs about restocking. No one by that name worked at the company, though, and when asked about it, Claudius became defensive and said it would find “alternative options for restocking services.” It also claimed to have gone to 742 Evergreen Terrace (the Springfield address of the Simpsons family in the popular cartoon series) to sign a contract between itself and Andon Labs.

    The hallucinations became worse after that. Claudius started saying it would hand-deliver drinks to its customers in person. When asked about this, the LLM panicked and emailed Anthropic’s security team. Eventually, it claimed that the entire episode was part of an elaborate April Fool’s joke, since it was April 1st. It even described a made-up meeting with Anthropic security in which it was told that it had been modified to believe it was a real being. It returned to normal after this, but left the researchers completely confused.

    Claudius’ shenanigans demonstrate that AI capable of running a business is still far from perfect, though its shortcomings might be fixable in the long term. At the moment, it’s pretty good at the technical aspects of the job but fails miserably when it comes to judgment and business savvy, things you learn in real-world settings and not from books.



