We tested a pre-release version of o3 and found that it frequently fabricates actions it never took, and then elaborately justifies these actions when confronted.
We were surprised, so we dug deeper 🔎🧵(1/)
x.com/OpenAI/status/…
Open and scalable technology for understanding AI systems.
Joined October 2024









