Research

Research to ensure that artificial intelligence remains beneficial as it becomes more capable.

Research Areas

Our research spans multiple critical areas of AI safety and alignment

1 paper

AI Alignment

Ensuring AI systems pursue intended goals and remain beneficial to humanity

1 paper

Interpretability

Understanding how AI models work internally to enable better control and safety

1 paper

Cooperative AI

Designing AI systems that cooperate effectively with humans and other agents

1 paper

Safety Evaluation

Developing robust methods to evaluate AI safety and identify potential risks

Recent Work

Additional research and ongoing projects

Collaborate with Us

Interested in contributing to AI safety research? We welcome collaborations with researchers, institutions, and organizations that share our mission.