Sean from OpenAI Alignment Research delivered a talk on the importance of specifications in software development, focusing on their role in aligning human and model intentions.
Key topics included the distinction between code and structured communication, the anatomy and utility of written specifications (using OpenAIâs model spec as an example), and techniques for aligning AI behavior with specifications.
The discussion highlighted the parallel between product, legal, and code specifications, and called for a shift toward specification-driven engineering.
Sean closed by inviting participants to contribute to OpenAI's new agent robustness team and to start projects with clear, actionable specifications.
Action Items
(No specific due date â Entire audience): Begin new AI feature work by writing a clear, executable specification and defining success criteria.
(No specific due date â Interested participants): Reach out to Sean or the OpenAI Alignment Research team to join or support the new agent robustness initiative.
Code vs. Communication: The Value of Specifications
Most professional value in engineering comes from structured communication, not just code writing; code is only 10-20% of the delivered value.
Effective communicationâunderstanding user needs, distilling requirements, ideating, planning, and verifying outcomesâis the main bottleneck and critical skill in modern programming.
Vibe coding is highlighted as an example where intent communication is primary, and code is a downstream artifact.
Written Specifications as Alignment Tools
Written specifications are essential for aligning teams and stakeholders on shared goals, intentions, and values.
A specification is not just about the technical outcome but about capturing the âwhyâ and the âwhatâ in a transparent, structured format.
Code is a lossy projection of the broader intentions and requirements contained in a proper specification.
Anatomy and Use of the Model Spec
OpenAI's model spec, a public and living markdown document, embodies organizational intentions for model behavior and is accessible for contribution across roles (technical, legal, policy, etc.).
Specifications can include scenario-based success criteria and challenging prompts for clarity and testability.
In case studies like the âsycophancy issueâ in model behaviors, the model spec served as a trust anchor for identifying misalignments and guiding fixes.
Executable Specifications and Model Alignment
Specifications can serve both as training and evaluation material, using techniques like deliberative alignment to score and improve model responses.
Embedding the specification in the modelâs training process enables the model to internalize organizational intentions and policies, beyond prompt-based alignment.
Specifications as Code: Tooling and Universal Principles
Specifications are analogous to code: theyâre versioned, testable, composable, and can be enforced and structured using various tools (type checkers, linters, etc.).
The role of specifications extends universally: product managers, lawmakers, and engineers all use specifications to align humans and systems toward shared outcomes.
The U.S. Constitution is referenced as a real-world example of a ânational model specificationâ with versioning, testing (judicial review), and amendment processes.
Decisions
Prioritize specification-driven development â The rationale is to ensure alignment of intent, clarity of communication, and improved outcomes for both human stakeholders and AI models.
Open Questions / Follow-Ups
What areas or projects are most in need of formal specification for improving safety and robustness in AI agents?
What will the âIntegrated Development Environmentâ of the future look like when focused on specification authorship and thought clarification?
How can more participants engage with or contribute to the agent robustness initiative at OpenAI?