Close Menu
Global News HQ
    What's Hot

    Year of the stablecoin: The GENIUS Act, Wall Street, and the dollar’s digital leap

    July 27, 2025

    Why Small Business Must Adopt AI

    July 27, 2025

    Trump Wants Cane Sugar Coke: Will Soda Fans Pay Higher Prices and Taxes?

    July 27, 2025
    Recent Posts
    • Year of the stablecoin: The GENIUS Act, Wall Street, and the dollar’s digital leap
    • Why Small Business Must Adopt AI
    • Trump Wants Cane Sugar Coke: Will Soda Fans Pay Higher Prices and Taxes?
    • Citi Rewards+ Card rebrands as Citi Strata Card – The Points Guy
    • Wall Street Week Ahead
    Facebook X (Twitter) Instagram YouTube TikTok
    Trending
    • Year of the stablecoin: The GENIUS Act, Wall Street, and the dollar’s digital leap
    • Why Small Business Must Adopt AI
    • Trump Wants Cane Sugar Coke: Will Soda Fans Pay Higher Prices and Taxes?
    • Citi Rewards+ Card rebrands as Citi Strata Card – The Points Guy
    • Wall Street Week Ahead
    • 5 Predictions for 2025 Holiday Shopping
    • These Neuroprotective Nutrients Can Help Lower Your Dementia Risk
    • 10 Must-Know Tips for Growing Sweeter, Juicier Watermelons
    Global News HQ
    • Technology & Gadgets
    • Travel & Tourism (Luxury)
    • Health & Wellness (Specialized)
    • Home Improvement & Remodeling
    • Luxury Goods & Services
    • Home
    • Finance & Investment
    • Insurance
    • Legal
    • Real Estate
    • More
      • Cryptocurrency & Blockchain
      • E-commerce & Retail
      • Business & Entrepreneurship
      • Automotive (Car Deals & Maintenance)
    Global News HQ
    Home - Technology & Gadgets - Microsoft’s new AI agent can control software and robots
    Technology & Gadgets

    Microsoft’s new AI agent can control software and robots

    Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
    Microsoft’s new AI agent can control software and robots
    Share
    Facebook Twitter LinkedIn Pinterest Email


    On Wednesday, Microsoft Research introduced Magma, an integrated AI foundation model that combines visual and language processing to control software interfaces and robotic systems. If the results hold up outside of Microsoft’s internal testing, it could mark a meaningful step forward for an all-purpose multimodal AI that can operate interactively in both real and digital spaces.

    Microsoft claims that Magma is the first AI model that not only processes multimodal data (like text, images, and video) but can also natively act upon it—whether that’s navigating a user interface or manipulating physical objects. The project is a collaboration between researchers at Microsoft, KAIST, the University of Maryland, the University of Wisconsin-Madison, and the University of Washington.

    We’ve seen other large language model-based robotics projects like Google’s PALM-E and RT-2 or Microsoft’s ChatGPT for Robotics that utilize LLMs for an interface. However, unlike many prior multimodal AI systems that require separate models for perception and control, Magma integrates these abilities into a single foundation model.

    A combined graphic that shows off various capabilities of the Magma model.


    Credit:

    Microsoft Research

    Microsoft is positioning Magma as a step toward agentic AI, meaning a system that can autonomously craft plans and perform multistep tasks on a human’s behalf rather than just answering questions about what it sees.

    “Given a described goal,” Microsoft writes in its research paper. “Magma is able to formulate plans and execute actions to achieve it. By effectively transferring knowledge from freely available visual and language data, Magma bridges verbal, spatial, and temporal intelligence to navigate complex tasks and settings.”

    Microsoft is not alone in its pursuit of agentic AI. OpenAI has been experimenting with AI agents through projects like Operator that can perform UI tasks in a web browser, and Google has explored multiple agentic projects with Gemini 2.0.

    Spatial intelligence

    While Magma builds off of Transformer-based LLM technology that feeds training tokens into a neural network, it’s different from traditional vision-language models (like GPT-4V, for example) by going beyond what they call “verbal intelligence” to also include “spatial intelligence” (planning and action execution). By training on a mix of images, videos, robotics data, and UI interactions, Microsoft claims that Magma is a true multimodal agent rather than just a perceptual model.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
    Previous ArticleHow to Get Every Stain Out of Leather for Your Cleanest Sofa Ever
    Next Article Buy Frontier Airlines’ summer-long all-you-can-fly pass for $399 for limited time – The Points Guy

    Related Posts

    Your Comic-Con 2025 News: 'Peacemaker,' 'Starfleet Academy' and More Thrills

    July 27, 2025

    DOGE is reportedly pushing an AI tool that would put half of all federal regulations on a ‘delete list’

    July 27, 2025

    Astronomer taps Gwyneth Paltrow as ‘temporary’ spokesperson’ after Coldplay kiss cam scandal

    July 27, 2025

    Here are the laptops I’d tell any parent to consider for their back-to-school student

    July 26, 2025
    Leave A Reply Cancel Reply

    ads
    Don't Miss
    Cryptocurrency & Blockchain
    11 Mins Read

    Year of the stablecoin: The GENIUS Act, Wall Street, and the dollar’s digital leap

    Welcome to Slate Sundays, CryptoSlate’s new weekly feature showcasing in-depth interviews, expert analysis, and thought-provoking op-eds…

    Why Small Business Must Adopt AI

    July 27, 2025

    Trump Wants Cane Sugar Coke: Will Soda Fans Pay Higher Prices and Taxes?

    July 27, 2025

    Citi Rewards+ Card rebrands as Citi Strata Card – The Points Guy

    July 27, 2025
    Top
    Cryptocurrency & Blockchain
    11 Mins Read

    Year of the stablecoin: The GENIUS Act, Wall Street, and the dollar’s digital leap

    Welcome to Slate Sundays, CryptoSlate’s new weekly feature showcasing in-depth interviews, expert analysis, and thought-provoking op-eds…

    Why Small Business Must Adopt AI

    July 27, 2025

    Trump Wants Cane Sugar Coke: Will Soda Fans Pay Higher Prices and Taxes?

    July 27, 2025
    Our Picks
    Cryptocurrency & Blockchain
    11 Mins Read

    Year of the stablecoin: The GENIUS Act, Wall Street, and the dollar’s digital leap

    Welcome to Slate Sundays, CryptoSlate’s new weekly feature showcasing in-depth interviews, expert analysis, and thought-provoking op-eds…

    Business & Entrepreneurship
    1 Min Read

    Why Small Business Must Adopt AI

    With a little curiosity and the right guidance, AI might just become your most powerful…

    Pages
    • About Us
    • Contact Us
    • Disclaimer
    • Homepage
    • Privacy Policy
    Facebook X (Twitter) Instagram YouTube TikTok
    • Home
    © 2025 Global News HQ .

    Type above and press Enter to search. Press Esc to cancel.

    Go to mobile version