Huawei will release AI reasoning innovation technology UCM to achieve high throughput and low latency experience

同花顺|Aug 12, 2025 05:23
On August 12th, at the 2025 Financial AI Reasoning Application Landing and Development Forum, Huawei will release its AI reasoning innovation technology UCM (Reasoning Memory Data Manager). As an inference acceleration kit centered around KV Cache, it integrates multiple types of cache acceleration algorithm tools, hierarchically manages the KV Cache memory data generated during the inference process, expands the inference context window, and achieves high throughput and low latency inference experience, reducing the inference cost per token. (Shanghai Securities News)
Share To
Timeline
HotFlash
APP
X
Telegram
CopyLink