Artificial intelligence: Performance on knowledge tests vs. dataset size

Performance on knowledge tests is measured with the MMLU benchmark, here in a 5-shot setting, which gauges a model’s accuracy after it has been shown only five examples for each task. Training dataset size refers to the volume of text used to train a model.
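
To make the 5-shot setup concrete, here is a minimal sketch of how such an evaluation could be assembled: five solved multiple-choice examples are placed ahead of the unsolved question, and accuracy is the share of questions answered correctly. The names `format_item`, `build_few_shot_prompt`, and `accuracy`, and the prompt layout, are illustrative assumptions. They are not the benchmark's official code or the method used to produce this chart.

```python
from typing import List, Optional, Tuple

# Hypothetical item layout: (question, four answer choices, index of correct choice).
Item = Tuple[str, List[str], int]
LETTERS = "ABCD"


def format_item(question: str, choices: List[str], answer: Optional[int] = None) -> str:
    """Render one multiple-choice question; include the answer letter if given."""
    lines = [question]
    lines += [f"{LETTERS[i]}. {choice}" for i, choice in enumerate(choices)]
    lines.append("Answer:" + (f" {LETTERS[answer]}" if answer is not None else ""))
    return "\n".join(lines)


def build_few_shot_prompt(examples: List[Item], question: str,
                          choices: List[str], k: int = 5) -> str:
    """Put the first k solved examples ahead of the unsolved target question."""
    shots = [format_item(q, c, a) for q, c, a in examples[:k]]
    return "\n\n".join(shots + [format_item(question, choices)])


def accuracy(predicted: List[str], correct: List[str]) -> float:
    """Fraction of predicted answer letters that match the correct ones."""
    if not correct or len(predicted) != len(correct):
        raise ValueError("need two non-empty lists of equal length")
    return sum(p == c for p, c in zip(predicted, correct)) / len(correct)
```

In this sketch the model would be asked to continue the prompt after the final "Answer:", and the letter it produces would be compared with the correct one for that question.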
