Close Menu
Global News HQ
    What's Hot

    Tom Clancy’s Widow Just Dropped $21.5 Million on a 3-Story Penthouse in Lower Manhattan

    June 18, 2025

    Nintendo just revealed Pauline as a surprise character in Donkey Kong Bananza

    June 18, 2025

    The Best Phone Deals from the Best Buy Android Savings Event

    June 18, 2025
    Recent Posts
    • Tom Clancy’s Widow Just Dropped $21.5 Million on a 3-Story Penthouse in Lower Manhattan
    • Nintendo just revealed Pauline as a surprise character in Donkey Kong Bananza
    • The Best Phone Deals from the Best Buy Android Savings Event
    • How I Made Partner: ‘Seize the Initiative From Day One,’ Says Jesse Green of White & Case | Law.com
    • How Trump’s disruption of the crypto supply chain could be a security risk for the U.S.
    Facebook X (Twitter) Instagram YouTube TikTok
    Trending
    • Tom Clancy’s Widow Just Dropped $21.5 Million on a 3-Story Penthouse in Lower Manhattan
    • Nintendo just revealed Pauline as a surprise character in Donkey Kong Bananza
    • The Best Phone Deals from the Best Buy Android Savings Event
    • How I Made Partner: ‘Seize the Initiative From Day One,’ Says Jesse Green of White & Case | Law.com
    • How Trump’s disruption of the crypto supply chain could be a security risk for the U.S.
    • ‘Global Response’ to Crypto Regulation Needed as US Advances GENIUS Act: FCA – Decrypt
    • City providing $90M in subsidies for Coney Island affordable housing project
    • The 5 Best Linen Sheet Sets for a Cool Night’s Rest
    Global News HQ
    • Technology & Gadgets
    • Travel & Tourism (Luxury)
    • Health & Wellness (Specialized)
    • Home Improvement & Remodeling
    • Luxury Goods & Services
    • Home
    • Finance & Investment
    • Insurance
    • Legal
    • Real Estate
    • More
      • Cryptocurrency & Blockchain
      • E-commerce & Retail
      • Business & Entrepreneurship
      • Automotive (Car Deals & Maintenance)
    Global News HQ
    Home - Technology & Gadgets - Eerily realistic AI voice demo sparks amazement and discomfort online
    Technology & Gadgets

    Eerily realistic AI voice demo sparks amazement and discomfort online

    Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
    Eerily realistic AI voice demo sparks amazement and discomfort online
    Share
    Facebook Twitter LinkedIn Pinterest Email



    An example argument with Sesame’s CSM created by Gavin Purcell.

    An example argument with Sesame’s CSM created by Gavin Purcell.

    Gavin Purcell, co-host of the AI for Humans podcast, posted an example video on Reddit where the human pretends to be an embezzler and argues with a boss. It’s so dynamic that it’s difficult to tell who the human is and which one is the AI model. Judging by our own demo, it’s entirely capable of what you see in the video.

    “Near-human quality”

    Under the hood, Sesame’s CSM achieves its realism by using two AI models working together (a backbone and a decoder) based on Meta’s Llama architecture that processes interleaved text and audio. Sesame trained three AI model sizes, with the largest using 8.3 billion parameters (an 8 billion backbone model plus a 300 million parameter decoder) on approximately 1 million hours of primarily English audio.

    Sesame’s CSM doesn’t follow the traditional two-stage approach used by many earlier text-to-speech systems. Instead of generating semantic tokens (high-level speech representations) and acoustic details (fine-grained audio features) in two separate stages, Sesame’s CSM integrates into a single-stage, multimodal transformer-based model, jointly processing interleaved text and audio tokens to produce speech. OpenAI’s voice model uses a similar multimodal approach.

    In blind tests without conversational context, human evaluators showed no clear preference between CSM-generated speech and real human recordings, suggesting the model achieves near-human quality for isolated speech samples. However, when provided with conversational context, evaluators still consistently preferred real human speech, indicating a gap remains in fully contextual speech generation.

    Sesame co-founder Brendan Iribe acknowledged current limitations in a comment on Hacker News, noting that the system is “still too eager and often inappropriate in its tone, prosody and pacing” and has issues with interruptions, timing, and conversation flow. “Today, we’re firmly in the valley, but we’re optimistic we can climb out,” he wrote.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
    Previous ArticleOpinion | Maybe Trump Will Listen to Milei
    Next Article Our Favorite Pruning Saw’s Two-Way Cutting Action Sets It Apart From The Competition

    Related Posts

    Nintendo just revealed Pauline as a surprise character in Donkey Kong Bananza

    June 18, 2025

    Mortgage Rates and the Federal Reserve: Everything to Know Before Today's Decision

    June 18, 2025

    Senate passes GENIUS stablecoin bill in a win for the crypto industry

    June 18, 2025

    Far-Right ‘Appeal to Heaven’ Flag Seen at January 6 Riot Flown Above Government Agency in DC

    June 18, 2025
    Leave A Reply Cancel Reply

    ads
    Don't Miss
    Travel & Tourism (Luxury)
    4 Mins Read

    Tom Clancy’s Widow Just Dropped $21.5 Million on a 3-Story Penthouse in Lower Manhattan

    The case is solved! Turns out the mystery buyer who recently doled out $21.5 million…

    Nintendo just revealed Pauline as a surprise character in Donkey Kong Bananza

    June 18, 2025

    The Best Phone Deals from the Best Buy Android Savings Event

    June 18, 2025

    How I Made Partner: ‘Seize the Initiative From Day One,’ Says Jesse Green of White & Case | Law.com

    June 18, 2025
    Top
    Travel & Tourism (Luxury)
    4 Mins Read

    Tom Clancy’s Widow Just Dropped $21.5 Million on a 3-Story Penthouse in Lower Manhattan

    The case is solved! Turns out the mystery buyer who recently doled out $21.5 million…

    Nintendo just revealed Pauline as a surprise character in Donkey Kong Bananza

    June 18, 2025

    The Best Phone Deals from the Best Buy Android Savings Event

    June 18, 2025
    Our Picks
    Travel & Tourism (Luxury)
    4 Mins Read

    Tom Clancy’s Widow Just Dropped $21.5 Million on a 3-Story Penthouse in Lower Manhattan

    The case is solved! Turns out the mystery buyer who recently doled out $21.5 million…

    Technology & Gadgets
    3 Mins Read

    Nintendo just revealed Pauline as a surprise character in Donkey Kong Bananza

    Nintendo just dropped a ton of details about the next major Switch 2 first-party game.…

    Pages
    • About Us
    • Contact Us
    • Disclaimer
    • Homepage
    • Privacy Policy
    Facebook X (Twitter) Instagram YouTube TikTok
    • Home
    © 2025 Global News HQ .

    Type above and press Enter to search. Press Esc to cancel.

    Go to mobile version