🤖

AI Frontier Models and Safety Testing

Dec 22, 2024

Lecture Notes: Frontier Models and AI Development

Introduction

Event celebrating AI advancements over 12 days.
Launch of 01 model; introduction to the next phase of AI reasoning.
Announcement of new models: 03 and 03 Mini.
Emphasis on safety testing before public release.

New Models: 03 and 03 Mini

03 Model:
- Very smart model aimed at complex tasks requiring reasoning.
- Not publicly launched yet; available for safety testing.
- Researchers can apply for early access.
03 Mini Model:
- Cost-effective with strong performance.
- Safety and security researchers granted access for testing.
- Supports adaptive thinking time: low, medium, high.

Performance and Benchmarks

03 Performance:
- Software benchmarks: 03 achieves 71.7% accuracy in coding, 20% better than 01.
- Competition coding site: 03's ELO score is 2727 compared to 01's 1891.
- Mathematics benchmarks: 03 scores 96.7% on competitive math exams.
- GPQ Diamond Benchmark: 03 achieves 87.7% accuracy, surpassing PhD level experts.
Epic AI’s Frontier Math Benchmark:
- 03 scores 25% on the toughest mathematical benchmark.

Collaboration and New Benchmarks

Arc Benchmark:
- Developed in part by Francois Cholle.
- Tests AI's ability to learn new skills on the fly.
- 03 has set a new state-of-the-art score, outperforming previous models.

Technical Demonstrations

Live Demo of 03 Mini:
- Demonstrated adaptive thinking time in coding tasks.
- Cost-efficient performance.
- Supports function calling, structured outputs, developer messages.

Safety Testing and Deliberative Alignment

Safety Testing:
- Extending safety testing to external researchers.
- Application process open until January 10th.
Deliberative Alignment:
- Advanced technique leveraging model's reasoning capabilities for safety.
- Helps in determining safe and unsafe prompts more accurately.

Conclusion

Timeline for public launch: 03 Mini by end of January and 03 afterwards.
Call to action for researchers to help with safety testing.
Excitement for future advancements in AI.

Remember to apply for early access if interested in safety testing these models. Engage with the community for updates and further developments in AI models.

Full transcript