RLHF & Alignment
Reinforcement learning from human feedback, reward modeling, and AI alignment.
Part of Transformers & NLP on neo-ai.