Epidemiology of Large Language Models: A Benchmark for Observational Distribution Knowledge

Автори: Drago Plecko, Patrik Okanovic, Shreyas Havaldar, Torsten Hoefler, Elias Bareinboim

Опубліковано: 2025-11-25

Переглянути на arXiv →

Анотація

This paper introduces a benchmark to evaluate the epidemiology of Large Language Models, specifically focusing on their observational distribution knowledge, which is crucial for understanding and improving their real-world applicability.

Epidemiology of Large Language Models: A Benchmark for Observational Distribution Knowledge

Автори: Drago Plecko, Patrik Okanovic, Shreyas Havaldar, Torsten Hoefler, Elias Bareinboim

Опубліковано: 2025-11-25

Переглянути на arXiv →

Анотація

This paper introduces a benchmark to evaluate the epidemiology of Large Language Models, specifically focusing on their observational distribution knowledge, which is crucial for understanding and improving their real-world applicability.

FEEDBACK

Проекти

Немає проектів

Epidemiology of Large Language Models: A Benchmark for Observational Distribution Knowledge | ArXiv Intelligence