Charts
DataOn-chain
VIP
Market Cap
API
Rankings
CoinOSNew
CoinClaw🦞
Language
  • 简体中文
  • 繁体中文
  • English
Leader in global market data applications, committed to providing valuable information more efficiently.

Features

  • Real-time Data
  • Special Features
  • AI Grid

Services

  • News
  • Open Data(API)
  • Institutional Services

Downloads

  • Desktop
  • Android
  • iOS

Contact Us

  • Chat Room
  • Business Email
  • Official Email
  • Official Verification

Join Community

  • Telegram
  • Twitter
  • Discord

© Copyright 2013-2026. All rights reserved.

简体繁體English
|Legacy

Meta Launches Muse Spark, Its Most Capable AI Yet—But Gemini 3.1 Pro Still Leads the Pack

CN
Decrypt
Follow
3 hours ago
AI summarizes in 5 seconds.

Meta launched Muse Spark on Wednesday, marking the first model built by Meta Superintelligence Labs—the team assembled nine months ago under Chief AI Officer Alexandr Wang after Meta's $14 billion Scale AI acquisition. It's live now at meta.ai and the Meta AI app, with a rollout to Facebook, Instagram, and WhatsApp coming in the next few weeks.


This isn't just another chatbot upgrade or a new version of Llama. Muse Spark is natively multimodal—it processes images, text, and voice from the ground up, rather than bolting vision onto an existing text model. It comes with visual chain-of-thought, tool-use support, and something Meta is calling "Contemplating mode": a setup that runs multiple AI agents in parallel to tackle harder problems. That's Meta's answer to the extended thinking modes from Google’s Gemini Deep Think and OpenAI’s GPT Pro.


“Muse Spark is the first step on our scaling ladder and the first product of a ground-up overhaul of our AI efforts,” Meta wrote in an official announcement. “To support further scaling, we are making strategic investments across the entire stack—from research and model training to infrastructure, including the Hyperion data center.”





The company worked with more than 1,000 physicians to curate training data for Muse Spark's medical reasoning. The results on HealthBench Hard—an open-ended health queries benchmark—are striking: Muse Spark scored 42.8, compared to 40.1 for GPT 5.4 and just 20.6 for Gemini 3.1 Pro. That's not a marginal difference.


On agentic search (DeepSearchQA), Muse Spark also leads with 74.8, beating Gemini (69.7) and GPT 5.4 (73.6). On CharXiv Reasoning—figure understanding from scientific papers—it scored 86.4, the highest across the models in the comparison.


For those into jailbreaking AI, the model was cracked open within minutes:



But good isn’t the same as great. The overall benchmark picture shows Gemini 3.1 Pro still running ahead on most categories. The gap is most visible on ARC AGI 2, the abstract reasoning puzzle benchmark: Gemini scored 76.5 to Muse Spark's 42.5.


On coding (LiveCodeBench Pro), Gemini's 82.9 outpaces Meta's 80.0. On MMMU Pro—multimodal understanding—Gemini scored 83.9 versus 80.4. Meta's own blog acknowledges current performance gaps in long-horizon agentic systems and coding workflows.




There's also a notable strategic shift baked into this launch. Muse Spark is a closed model—its architecture and weights won't be made public. That's a sharp departure from Llama, which built Meta's reputation in open AI circles. After Llama 4's underwhelming reception earlier this year, Meta appears to have decided the next chapter needs to be written differently.


The company says it hopes to open-source future versions of Muse, but for now the code stays inside Meta. The tech giant’s stock climbed nearly 9% on Wednesday following the announcement, and finished the trading day up 6.5% to a price of $612.42.


“Contemplating mode” uses parallel agent orchestration to push the model's ceiling higher. In that configuration, Muse Spark hit 58% on Humanity's Last Exam and 38% on FrontierScience Research—territory that makes it competitive with the most capable versions of Gemini and GPT, rather than their standard releases.


Meta is also rolling out a shopping assistant that compares products and links directly to purchases, and plans to bring Muse Spark to Facebook, Instagram, and WhatsApp in the coming weeks—following the same script implemented since Llama 3, putting it in front of more than 3.5 billion users. A private API preview is opening to select developers.


The model was built in nine months, internally codenamed Avocado, with Meta claiming that its new pretraining stack can reach the same capability level as Llama 4 Maverick using over 10 times less compute.


Muse Spark is described internally as a "small and fast" first step in the Muse family. A more capable version is already in development.


免责声明:本文章仅代表作者个人观点,不代表本平台的立场和观点。本文章仅供信息分享,不构成对任何人的任何投资建议。用户与作者之间的任何争议,与本平台无关。如网页中刊载的文章或图片涉及侵权,请提供相关的权利证明和身份证明发送邮件到support@aicoin.com,本平台相关工作人员将会进行核查。

美伊停火,合约党速领5000U
广告
|
|
APP
Windows
Mac
Share To

X

Telegram

Facebook

Reddit

CopyLink

|
|
APP
Windows
Mac
Share To

X

Telegram

Facebook

Reddit

CopyLink

Selected Articles by Decrypt

2 hours ago
Bitcoin Depot ATM Operator Says $3.6 Million in BTC Stolen in Corporate Hack
2 hours ago
Researchers Propose New Way to Manage Financial Risk When AI Agents Fumble Trades
3 hours ago
Bitcoin Miner Cango Sells $143 Million in BTC, Slashes Production Costs
View More

Table of Contents

|
|
APP
Windows
Mac
Share To

X

Telegram

Facebook

Reddit

CopyLink

Related Articles

avatar
avatarbitcoin.com
49 minutes ago
Canary Capital Files PEPE ETF as Wall Street Tests Institutional Demand for Meme Coins
avatar
avatarbitcoin.com
1 hour ago
Crypto ETFs Turn Red: Bitcoin Loses $159 Million, Ether Drops $64 Million
avatar
avatarbitcoin.com
2 hours ago
UK’s £120 Million Gambling Levy Delivers First OHID Prevention Grants Amid Sector Turmoil
avatar
avatarDecrypt
2 hours ago
Bitcoin Depot ATM Operator Says $3.6 Million in BTC Stolen in Corporate Hack
avatar
avatarDecrypt
2 hours ago
Researchers Propose New Way to Manage Financial Risk When AI Agents Fumble Trades
APP
Windows
Mac

X

Telegram

Facebook

Reddit

CopyLink