0xTodd | September 16, 2025, 13:39
Sentient has launched a new open-source multi-agent architecture called ROMA, which achieves surprisingly high reasoning and search performance.

First, a quick primer. What are the three benchmarks ROMA was tested on this time?

Seal-0: a carefully curated set of extremely difficult questions, each iterated until "multiple frontier models tried multiple times and almost all got it wrong". It tests an AI's ability to use web search and tools for fact-checking, reasoning, and denoising when search results are conflicting, noisy, or useless.

FRAMES: a unified RAG evaluation set (Factuality + Retrieval + Reasoning) proposed by Google and Harvard, made up of multi-hop, multi-constraint questions (824 questions, with the paper and dataset officially released). It tests whether the AI retrieves the right material, cites accurately, and reasons soundly.

SimpleQA: OpenAI's benchmark of short factual questions, with short questions, easy scoring, and wide coverage. It mainly tests whether the AI answers correctly with fewer hallucinations, and it can also evaluate the model's self-calibration (whether its confidence matches its actual accuracy).

So how does ROMA do it? Mainly through task decomposition, a three-step process:

1. Judge and decompose: the parent node decides whether the task is simple or complex; a complex task is split into several subtasks.
2. Targeted handling: child nodes pick the most suitable AI agents and tools to solve each subtask.
3. Summarize and report: results are aggregated and reported upward layer by layer to form the final answer.

In this way, ROMA achieved high scores in reasoning and search (the downside is that it also puts more load on the server and takes longer to think).
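The three-step decomposition described above can be sketched as a small recursive function. This is only an illustration of the general idea, not ROMA's actual code or API: the names `Node`, `solve`, `is_simple`, `decompose`, `execute`, and `aggregate` are all hypothetical, and the toy helpers stand in for what would be LLM or tool calls in a real system.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Node:
    """One task in the tree, with its subtasks and its final result."""
    task: str
    children: List["Node"] = field(default_factory=list)
    result: str = ""


def solve(
    task: str,
    is_simple: Callable[[str], bool],
    decompose: Callable[[str], List[str]],
    execute: Callable[[str], str],
    aggregate: Callable[[str, List[str]], str],
    depth: int = 0,
    max_depth: int = 3,
) -> Node:
    node = Node(task)
    # Step 1: judge. Simple tasks (or tasks at the depth limit) are not split.
    if depth >= max_depth or is_simple(task):
        # Step 2: a leaf node hands the task to an agent or tool.
        node.result = execute(task)
        return node
    # Step 1 (continued): complex tasks are decomposed into subtasks.
    for sub in decompose(task):
        node.children.append(
            solve(sub, is_simple, decompose, execute, aggregate,
                  depth + 1, max_depth)
        )
    # Step 3: child results are summarized and reported to the parent.
    node.result = aggregate(task, [c.result for c in node.children])
    return node


# Toy stand-ins (assumptions for the demo, not real agents):
def toy_is_simple(task: str) -> bool:
    return " and " not in task


def toy_decompose(task: str) -> List[str]:
    return task.split(" and ")


def toy_execute(task: str) -> str:
    return f"[{task}]"


def toy_aggregate(task: str, results: List[str]) -> str:
    return " + ".join(results)


root = solve("a and b and c", toy_is_simple, toy_decompose,
             toy_execute, toy_aggregate)
print(root.result)  # [a] + [b] + [c]
```

Because every intermediate `Node` keeps its own task and result, the whole tree can be inspected after the run, which is the kind of transparency that makes it possible to see where a long task went wrong.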
In addition, some multi-agent systems have adopted similar architectures in the past, but many ran into a problem called "error accumulation". For example, if a single AI is 90% accurate and six layers are chained together, overall accuracy drops to only around 50% (0.9^6 ≈ 0.53). The idea behind @SentientAGI's ROMA architecture is to make the entire reasoning process transparent and open source, so that developers can make targeted adjustments anywhere in the process to improve accuracy on long tasks. This is the benefit of open source.
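The error-accumulation arithmetic can be checked in a few lines. This sketch assumes each layer is correct independently with the same probability and that the final answer is right only if every layer is right; real systems may behave better or worse than this simple model. The function name `chained_accuracy` is made up for illustration.

```python
def chained_accuracy(per_step_accuracy: float, steps: int) -> float:
    """Probability that all `steps` layers are correct, assuming independence."""
    if not 0.0 <= per_step_accuracy <= 1.0:
        raise ValueError("per_step_accuracy must be between 0 and 1")
    if steps < 0:
        raise ValueError("steps must be non-negative")
    return per_step_accuracy ** steps


# Accuracy after 1 to 6 chained layers at 90% each.
for n in range(1, 7):
    print(n, round(chained_accuracy(0.9, n), 3))
# The last line printed is: 6 0.531
```

Under the same assumption, raising per-layer accuracy to 95% gives about 0.735 after six layers, which shows why small per-step improvements matter so much on long tasks.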