RLHF & Alignment

Reinforcement learning from human feedback, reward modeling, and AI alignment.

Part of Transformers & NLP on neo-ai.
