Using Large Language Models to Forecast Local Government Revenue
Keywords
Artificial Intelligence, ChatGPT, Forecasting, Government Revenue, Large Language Model
Abstract
We examine the use of a publicly accessible large language model (LLM) to forecast local government revenue. ChatGPT is an LLM that is not specifically designed for quantitative analysis, but it can complete a wide range of tasks. The goals of this article are to determine the forecast accuracy that ChatGPT can achieve and to examine its potential bias. The study is based on a government revenue dataset from the Government Finance Officers Association (GFOA). Establishing the accuracy and bias of LLM forecasts could provide a low-cost forecasting method for small- and medium-sized governments and enable external observers to validate forecasts made by official sources. Identifying the limitations of ChatGPT and similar LLMs, as well as the specific conditions required to use them wisely, may help localities avoid adverse outcomes. We find that combining LLM output with human input provides a viable alternative forecasting method for small- and medium-sized governments and a means for external observers to validate official forecasts. With a human in the loop, forecast errors can be as low as 9.9 percent at the aggregated annual level. Forecasts based on ChatGPT results alone can have high errors and may not be reliable.
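As a rough illustration of what measuring accuracy and bias can involve, the sketch below computes three common quantities for a set of forecasts: the mean absolute percentage error (accuracy), the mean signed percentage error (bias), and the percentage error of the aggregated total. The abstract does not state which error measures the study uses, so the choice of metrics, the function names, and the revenue figures are all assumptions made for illustration only.

```python
# Minimal sketch: accuracy and bias measures for revenue forecasts.
# The metrics and all figures below are illustrative assumptions,
# not the measures or data used in the study.

def percentage_errors(actual, forecast):
    """Signed percentage error per item: 100 * (forecast - actual) / actual."""
    if len(actual) != len(forecast):
        raise ValueError("actual and forecast must have the same length")
    return [100.0 * (f - a) / a for a, f in zip(actual, forecast)]

def mean_absolute_percentage_error(actual, forecast):
    """Accuracy: average size of the percentage errors, ignoring sign."""
    errors = percentage_errors(actual, forecast)
    return sum(abs(e) for e in errors) / len(errors)

def mean_percentage_error(actual, forecast):
    """Bias: positive means over-forecasting on average, negative means under."""
    errors = percentage_errors(actual, forecast)
    return sum(errors) / len(errors)

def aggregate_percentage_error(actual, forecast):
    """Absolute percentage error of the summed (aggregated) forecast."""
    return 100.0 * abs(sum(forecast) - sum(actual)) / sum(actual)

# Hypothetical revenue figures for three revenue sources in one year.
actual = [100.0, 200.0, 400.0]
forecast = [110.0, 180.0, 440.0]

print(mean_absolute_percentage_error(actual, forecast))
print(mean_percentage_error(actual, forecast))
print(aggregate_percentage_error(actual, forecast))
```

Because over- and under-forecasts partly cancel when summed, the error of an aggregated total can be smaller than the average error of its components, which is one reason the level of aggregation matters when reading an error figure.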