Not all AI prompts are equal. Some emit 50x more carbon than others. Here’s why.

When researchers delved into the tradeoff between AI sustainability and accuracy, they uncovered strategies for greener chatbots.

By Sarah DeWeerdt

June 24, 2025 in Anthropocene magazine

Some AI prompts result in 50 times more carbon emissions than others, according to a new study. The findings suggest that large language models (LLMs)—the technology behind advanced chatbots such as ChatGPT—face a tradeoff between sustainability and accuracy when answering questions or responding to prompts from human users.

Generative AI models, a category that includes LLMs, consume an estimated 29.3 terawatt hours of electricity every year, roughly equivalent to Ireland’s annual energy consumption. But relatively little scientific research has focused on the environmental impacts of LLMs.

In the new study, researchers measured the carbon emissions generated by each of 14 different LLMs that were asked a series of 1,000 standard questions—500 multiple-choice and 500 free-response—covering philosophy, high school world history, international law, abstract algebra, and high school mathematics.

The LLMs in the study ranged in size from 7 billion to 72 billion parameters—the internal settings, learned during training, that determine how a model responds. The study included both concise models, which produce short answers to prompts quickly and with minimal intermediate steps, and “reasoning-enabled” models, which explicitly describe the step-by-step logic underpinning their answers.

Larger models and reasoning-enabled models tend to yield more accurate answers, the researchers report in the journal Frontiers in Communication. But they also have a greater climate impact.

Cogito-70B, a reasoning-enabled model with 70 billion parameters, answered 84.9% of the questions correctly and generated 1.34 kilograms of carbon dioxide emissions. Qwen-7B, a concise model with just 7 billion parameters, emitted just 27.7 grams of carbon dioxide—but got only about one-third of the answers right.

People speak in words, while computers speak in a code of ones and zeroes. To translate between the two, LLMs generate “tokens”—words or parts of words that are converted into a string of numbers. Every token requires energy to produce and results in carbon emissions.
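
To make the token idea concrete, here is a minimal sketch using the open-source Hugging Face transformers library (the study does not specify any particular tooling, and the GPT-2 tokenizer is chosen purely because it is freely available). It shows how a prompt and two possible answers break into integer tokens, and how a verbose answer consumes many more of them:

```python
# Minimal tokenization sketch. The tokenizer choice is illustrative only;
# the study does not name a specific tokenizer or library.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")

prompt = "What is the capital of Ireland?"
short_answer = "Dublin."
verbose_answer = ("Let me think step by step. Ireland is a country in Europe, "
                  "and its capital city is Dublin. Therefore the answer is Dublin.")

for label, text in [("prompt", prompt),
                    ("short answer", short_answer),
                    ("verbose answer", verbose_answer)]:
    ids = tokenizer.encode(text)  # text -> list of integer token IDs
    print(f"{label}: {len(ids)} tokens -> {ids[:8]} ...")
```

Every one of those integers must be generated by the model, which is why a longer, more discursive answer carries a larger energy and carbon cost.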

“Every extra point of accuracy usually comes with a markedly higher carbon cost, because larger or ‘reasoning-enabled’ models generate far more tokens per answer,” says study team member Maximilian Dauner, a graduate student at the Munich University of Applied Sciences.

The reasoning models generated an average 543.5 “thinking” tokens per question while the concise models generated just 37.7. These “thinking” tokens are used just to arrive at the answer; the models also produce additional “response” tokens to report their final answers.
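
As a back-of-the-envelope illustration of why those extra “thinking” tokens matter, the sketch below converts the article’s average token counts into rough emissions estimates. The per-token energy use and grid carbon intensity are placeholder assumptions for illustration, not figures from the study:

```python
# Back-of-the-envelope sketch: how the thinking-token gap between reasoning
# and concise models translates into an emissions gap. Energy-per-token and
# grid intensity below are HYPOTHETICAL placeholders, not study values.
AVG_THINKING_TOKENS = {"reasoning": 543.5, "concise": 37.7}  # from the article
ENERGY_PER_TOKEN_WH = 0.002        # assumed watt-hours per generated token
GRID_INTENSITY_G_PER_KWH = 480.0   # assumed grams of CO2 per kWh of electricity

def grams_co2(tokens: float) -> float:
    """Estimate emissions for generating a given number of tokens."""
    kwh = tokens * ENERGY_PER_TOKEN_WH / 1000.0
    return kwh * GRID_INTENSITY_G_PER_KWH

for mode, tokens in AVG_THINKING_TOKENS.items():
    print(f"{mode}: {tokens:.1f} thinking tokens/question "
          f"-> ~{grams_co2(tokens):.3f} g CO2 (under the assumptions above)")

ratio = AVG_THINKING_TOKENS["reasoning"] / AVG_THINKING_TOKENS["concise"]
print(f"Reasoning mode generates ~{ratio:.0f}x more thinking tokens per question.")
```

Whatever the true per-token figures are, the roughly 14-fold gap in thinking tokens scales the footprint before a single word of the final answer is produced.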

Response tokens could be a surprising source of climate impact, Dauner adds. “Reasoning variants occasionally produced excessive verbosity despite being prompted for a simple answer,” he says. One model’s answer to an abstract algebra problem ran to 37,575 tokens, while another droned on for 14,187 tokens in response to a high school math question.

Meanwhile, LLMs are getting bigger and more complex. Additional studies will be necessary to understand the tradeoffs between accuracy and climate impact for LLMs like GPT-4 with hundreds of billions or even trillions of parameters.

The accuracy and climate impact of AI also depends on the subject matter, the researchers found. “Abstract algebra consistently forced all models, large or small, to think longer while still achieving the lowest accuracy,” says Dauner. “In contrast, factual history questions were answered quickly and correctly.”

Understanding these patterns can help people minimize the climate impact of their LLM use. “Ask for short answers (‘one word only,’ ‘bullet list,’ or ‘keep it under two sentences’) to reduce unnecessary response tokens, especially in reasoning mode,” Dauner suggests. “Use the smallest model that meets your quality needs.” And turn to LLMs for real information needs, not out of boredom or joking around.
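
As a concrete illustration of that advice, here is a sketch of how a short-answer request might look through a chat-style API, using the OpenAI Python client purely as a familiar example (the study did not evaluate hosted APIs); the model name and token cap are placeholders, not recommendations from the researchers:

```python
# Sketch of the prompting advice applied through a chat-style API.
# Model name and token cap are placeholders chosen for illustration.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

response = client.chat.completions.create(
    model="gpt-4o-mini",   # "use the smallest model that meets your quality needs"
    max_tokens=30,         # hard cap on response tokens
    messages=[
        {"role": "system",
         "content": "Answer in one word or a single short sentence."},
        {"role": "user",
         "content": "In what year did the Berlin Wall fall?"},
    ],
)
print(response.choices[0].message.content)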

Dauner and his collaborators are developing an algorithm to select the leanest LLM likely to yield a reliable answer to a given question, which could reduce emissions by an order of magnitude, he says. “Choosing ‘just-big-enough’ models and trimming superfluous reasoning is the fastest way to greener AI,” says Dauner.
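
The team’s algorithm is not yet published, but the general idea (route each question to the smallest model expected to answer it reliably, and escalate only when necessary) might look something like the sketch below, with hypothetical model names, emissions figures, and accuracy estimator:

```python
# Generic sketch of a "just-big-enough" model router. This is NOT the authors'
# algorithm; model names, emission figures, and the accuracy estimator are
# hypothetical placeholders.
MODELS = [  # ordered smallest and cheapest first
    {"name": "small-7b",  "g_co2_per_answer": 0.03},
    {"name": "mid-32b",   "g_co2_per_answer": 0.4},
    {"name": "large-70b", "g_co2_per_answer": 1.3},
]

def expected_accuracy(model_name: str, question: str) -> float:
    """Placeholder: in practice this would be a learned predictor that
    estimates how likely a model is to answer this question correctly."""
    hardness = 0.9 if "algebra" in question.lower() else 0.2
    size_bonus = {"small-7b": 0.0, "mid-32b": 0.2, "large-70b": 0.35}[model_name]
    return max(0.0, min(1.0, 1.0 - hardness + size_bonus))

def route(question: str, target_accuracy: float = 0.8) -> str:
    """Return the leanest model expected to meet the accuracy target,
    falling back to the largest model if none qualifies."""
    for model in MODELS:
        if expected_accuracy(model["name"], question) >= target_accuracy:
            return model["name"]
    return MODELS[-1]["name"]

print(route("When did the Thirty Years' War end?"))    # stays on the small model
print(route("Prove this abstract algebra identity."))  # escalates to the largest
```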

Source: Dauner M. and G. Socher. “Energy costs of communicating with AI.” Frontiers in Communication 2025.
