Technical AI Safety

Alignment, scalable oversight, evaluation and red-teaming, and catastrophic-risk framing for frontier ML systems. This course complements ethics and fairness work with the technical side of AI safety.

Level: Advanced · Category: Safety & Ethics · Estimated time: 6 hours

Prerequisites

Lessons

Topics covered

ai-safety, alignment, inner-alignment, scalable-oversight, red-teaming, evals, frontier-models, catastrophic-risk

Browse all neo-ai courses · neo-ai home