References & Citations
Computer Science > Machine Learning
Title: Forecasting Future World Events with Neural Networks
(Submitted on 30 Jun 2022 (v1), last revised 9 Oct 2022 (this version, v2))
Abstract: Forecasting future world events is a challenging but valuable task. Forecasts of climate, geopolitical conflict, pandemics and economic indicators help shape policy and decision making. In these domains, the judgment of expert humans contributes to the best forecasts. Given advances in language modeling, can these forecasts be automated? To this end, we introduce Autocast, a dataset containing thousands of forecasting questions and an accompanying news corpus. Questions are taken from forecasting tournaments, ensuring high quality, real-world importance, and diversity. The news corpus is organized by date, allowing us to precisely simulate the conditions under which humans made past forecasts (avoiding leakage from the future). Motivated by the difficulty of forecasting numbers across orders of magnitude (e.g. global cases of COVID-19 in 2022), we also curate IntervalQA, a dataset of numerical questions and metrics for calibration. We test language models on our forecasting task and find that performance is far below a human expert baseline. However, performance improves with increased model size and incorporation of relevant information from the news corpus. In sum, Autocast poses a novel challenge for large language models and improved performance could bring large practical benefits.
Submission history
From: Dan Hendrycks [view email][v1] Thu, 30 Jun 2022 17:59:14 GMT (496kb,D)
[v2] Sun, 9 Oct 2022 23:32:19 GMT (609kb,D)
Link back to: arXiv, form interface, contact.