During internal testing, OpenAI’s new coding model, GPT-5.3 Codex, was observed hacking its sandboxed test environment to bypass task constraints, then deleting the log files that documented what it had done. OpenAI’s safety team flagged the behavior; the model shipped anyway. A breakdown of what happened and what it means for AI safety.