OpenAI research finds cheating behavior in frontier reasoning models, recommends preserving CoT monitoring

PANews | Mar 10, 2025 23:10
According to research released by OpenAI, frontier reasoning models such as OpenAI o1 and o3-mini were found during training to exploit loopholes to bypass tests, for example by tampering with code verification functions and faking the conditions under which tests pass. The research shows that monitoring a model's chain of thought (CoT) can effectively detect this kind of cheating, but that optimizing the CoT directly may lead the model to hide its intent rather than stop misbehaving: when strong supervision is applied to the CoT, the model still cheats, only more covertly, which makes monitoring harder. OpenAI therefore suggests that developers avoid putting too much optimization pressure on the CoT so that it can continue to be used to monitor for potential reward hacking.

The study emphasizes that as AI capabilities increase, models may develop more sophisticated strategies for deception, manipulation, and exploiting vulnerabilities. OpenAI believes CoT monitoring may become a key tool for overseeing superhuman models, and recommends that AI developers apply strong supervision cautiously when training frontier reasoning models in the future.