Evaluation & Benchmarking

Perplexity, task metrics, and human evaluation.

Part of Fine-Tuning LLMs on neo-ai.

Browse all neo-ai courses · Back to course overview