Close Menu
Global News HQ
    What's Hot

    Kenzo Spring 2026 Menswear Collection

    June 28, 2025

    XRP spikes 3% after Garlinghouse says Ripple dropping SEC cross-appeal

    June 28, 2025

    10 Outdated Travel Items It’s Time to Toss—and What to Replace Them With, From $13

    June 28, 2025
    Recent Posts
    • Kenzo Spring 2026 Menswear Collection
    • XRP spikes 3% after Garlinghouse says Ripple dropping SEC cross-appeal
    • 10 Outdated Travel Items It’s Time to Toss—and What to Replace Them With, From $13
    • PwC to cut 175 junior auditors amid slowdown
    • Are You Approachable at Work? Tips for Building Better Connections
    Facebook X (Twitter) Instagram YouTube TikTok
    Trending
    • Kenzo Spring 2026 Menswear Collection
    • XRP spikes 3% after Garlinghouse says Ripple dropping SEC cross-appeal
    • 10 Outdated Travel Items It’s Time to Toss—and What to Replace Them With, From $13
    • PwC to cut 175 junior auditors amid slowdown
    • Are You Approachable at Work? Tips for Building Better Connections
    • Microsoft Sued in Manhattan Federal Court for Allegedly Using Pirated Material to Train AI Models | Law.com
    • Baglietto and Meyer Davis Just Teamed up on a Sleek 183-Foot Superyacht
    • Trump Thinks Reporting The Truth Is A Punishable Offense – See Also – Above the Law
    Global News HQ
    • Technology & Gadgets
    • Travel & Tourism (Luxury)
    • Health & Wellness (Specialized)
    • Home Improvement & Remodeling
    • Luxury Goods & Services
    • Home
    • Finance & Investment
    • Insurance
    • Legal
    • Real Estate
    • More
      • Cryptocurrency & Blockchain
      • E-commerce & Retail
      • Business & Entrepreneurship
      • Automotive (Car Deals & Maintenance)
    Global News HQ
    Home - Technology & Gadgets - Eerily realistic AI voice demo sparks amazement and discomfort online
    Technology & Gadgets

    Eerily realistic AI voice demo sparks amazement and discomfort online

    Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
    Eerily realistic AI voice demo sparks amazement and discomfort online
    Share
    Facebook Twitter LinkedIn Pinterest Email



    An example argument with Sesame’s CSM created by Gavin Purcell.

    An example argument with Sesame’s CSM created by Gavin Purcell.

    Gavin Purcell, co-host of the AI for Humans podcast, posted an example video on Reddit where the human pretends to be an embezzler and argues with a boss. It’s so dynamic that it’s difficult to tell who the human is and which one is the AI model. Judging by our own demo, it’s entirely capable of what you see in the video.

    “Near-human quality”

    Under the hood, Sesame’s CSM achieves its realism by using two AI models working together (a backbone and a decoder) based on Meta’s Llama architecture that processes interleaved text and audio. Sesame trained three AI model sizes, with the largest using 8.3 billion parameters (an 8 billion backbone model plus a 300 million parameter decoder) on approximately 1 million hours of primarily English audio.

    Sesame’s CSM doesn’t follow the traditional two-stage approach used by many earlier text-to-speech systems. Instead of generating semantic tokens (high-level speech representations) and acoustic details (fine-grained audio features) in two separate stages, Sesame’s CSM integrates into a single-stage, multimodal transformer-based model, jointly processing interleaved text and audio tokens to produce speech. OpenAI’s voice model uses a similar multimodal approach.

    In blind tests without conversational context, human evaluators showed no clear preference between CSM-generated speech and real human recordings, suggesting the model achieves near-human quality for isolated speech samples. However, when provided with conversational context, evaluators still consistently preferred real human speech, indicating a gap remains in fully contextual speech generation.

    Sesame co-founder Brendan Iribe acknowledged current limitations in a comment on Hacker News, noting that the system is “still too eager and often inappropriate in its tone, prosody and pacing” and has issues with interruptions, timing, and conversation flow. “Today, we’re firmly in the valley, but we’re optimistic we can climb out,” he wrote.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
    Previous ArticleOpinion | Maybe Trump Will Listen to Milei
    Next Article Our Favorite Pruning Saw’s Two-Way Cutting Action Sets It Apart From The Competition

    Related Posts

    SCOTUS upholds part of ACA that makes preventive care fully covered

    June 28, 2025

    Best Pressure Washers of 2025: I Tested Six Power Washers on Wood, Metal and More

    June 28, 2025

    Trump ends trade talks with Canada over a digital services tax

    June 28, 2025

    Adobe’s new camera app is making me rethink phone photography

    June 27, 2025
    Leave A Reply Cancel Reply

    ads
    Don't Miss
    Luxury Goods & Services
    2 Mins Read

    Kenzo Spring 2026 Menswear Collection

    Can luxury be punk? Can luxury be hilarious? These were some of the more highfalutin…

    XRP spikes 3% after Garlinghouse says Ripple dropping SEC cross-appeal

    June 28, 2025

    10 Outdated Travel Items It’s Time to Toss—and What to Replace Them With, From $13

    June 28, 2025

    PwC to cut 175 junior auditors amid slowdown

    June 28, 2025
    Top
    Luxury Goods & Services
    2 Mins Read

    Kenzo Spring 2026 Menswear Collection

    Can luxury be punk? Can luxury be hilarious? These were some of the more highfalutin…

    XRP spikes 3% after Garlinghouse says Ripple dropping SEC cross-appeal

    June 28, 2025

    10 Outdated Travel Items It’s Time to Toss—and What to Replace Them With, From $13

    June 28, 2025
    Our Picks
    Luxury Goods & Services
    2 Mins Read

    Kenzo Spring 2026 Menswear Collection

    Can luxury be punk? Can luxury be hilarious? These were some of the more highfalutin…

    Cryptocurrency & Blockchain
    3 Mins Read

    XRP spikes 3% after Garlinghouse says Ripple dropping SEC cross-appeal

    XRP’s price jumped over 3% on Friday just hours after Ripple Labs CEO Brad Garlinghouse…

    Pages
    • About Us
    • Contact Us
    • Disclaimer
    • Homepage
    • Privacy Policy
    Facebook X (Twitter) Instagram YouTube TikTok
    • Home
    © 2025 Global News HQ .

    Type above and press Enter to search. Press Esc to cancel.

    Go to mobile version