top of page

The Energy Cost of Intelligence: AI and Environmental Impact

  • ShiftQuality Contributor
  • Aug 10, 2025
  • 5 min read

A single training run of GPT-3 consumed an estimated 1,287 MWh of electricity and emitted roughly 552 tons of CO2 — equivalent to the lifetime emissions of five average American cars. The models that followed are larger, trained on more data, for longer, with more compute.

AI has a carbon footprint. It has a water footprint — data centers use millions of gallons of water for cooling. It has an electronic waste footprint from the specialized hardware that's obsolete within a few years. These costs are real, growing, and largely invisible to the people using and building AI products.

This post isn't about whether AI is worth the environmental cost. It's about making that cost visible so decisions can be informed, not ignorant.

Where the Energy Goes

AI's energy consumption breaks into three phases, and they're not equally expensive.

Training

Training is the most energy-intensive phase and the one that gets the most attention. It involves running massive datasets through billions of parameters over thousands of GPU-hours. Training a large foundation model requires a cluster of thousands of high-end GPUs running for weeks or months.

The energy consumption of training scales with model size, dataset size, and the number of training runs. Hyperparameter tuning — the process of finding the best model configuration — often requires training dozens or hundreds of models to find the best one. The final model's energy cost includes all the failed experiments.

Research from the University of Massachusetts Amherst estimated that training a large transformer model produces five times the lifetime carbon emissions of an average car. That was in 2019. Models have gotten significantly larger since then.

Inference

Inference — actually using a trained model to make predictions or generate text — consumes less energy per query than training but adds up because it runs continuously at scale. Every ChatGPT conversation, every Google search that uses AI ranking, every recommendation served on Netflix is an inference call.

A single ChatGPT query consumes an estimated 10x the energy of a standard Google search. When millions of people make dozens of queries per day, inference energy consumption can rival or exceed training costs over time.

This is the cost that scales with adoption. As AI becomes embedded in more products and used by more people, inference energy consumption grows proportionally.

Data Center Infrastructure

The GPUs and servers that run AI workloads sit in data centers that require cooling, networking, power distribution, and redundancy. Power Usage Effectiveness (PUE) — the ratio of total data center energy to computing energy — means that for every watt of computation, roughly 0.1-0.5 additional watts go to cooling and infrastructure.

Water consumption is significant. A 2023 study estimated that training GPT-3 in Microsoft's US data centers consumed approximately 700,000 liters of freshwater for cooling. As data centers expand into water-stressed regions, this becomes a real resource conflict.

The Scale of the Problem

The International Energy Agency estimated that data centers consumed about 1-1.5% of global electricity in 2022. AI workloads are the fastest-growing component of that demand. Some projections suggest AI could account for 3-4% of global electricity consumption by 2030.

To put that in context: that's comparable to the energy consumption of a mid-sized country. And it's growing at a rate that's outpacing the expansion of renewable energy capacity in most regions.

The hardware lifecycle compounds the problem. AI-specific chips (GPUs, TPUs) have intense manufacturing processes and relatively short useful lifespans due to the pace of improvement. Hardware that's cutting-edge today is economically obsolete in 3-5 years. The embodied energy in manufacturing these chips — the energy consumed in producing the hardware itself — is substantial and rarely included in AI carbon footprint calculations.

What Individual Builders Can Do

You probably aren't training a foundation model. But you're making decisions about how AI is used in your products, and those decisions have environmental consequences at scale.

Choose the Right Model Size

A 7-billion parameter model that handles your use case is dramatically more efficient than a 175-billion parameter model. Not every application needs the largest model available. Text classification, sentiment analysis, and structured extraction often work well with small, fine-tuned models at a fraction of the compute cost.

The principle: use the smallest model that meets your quality requirements. The gap between "good enough" and "state of the art" is often imperceptible to users but massive in compute costs.

Cache and Batch

If the same query produces the same result, cache it. If you're processing many similar inputs, batch them. These are standard engineering practices that apply directly to AI workloads.

A product that calls an LLM API for every user request when 40% of those requests have identical or near-identical answers is wasting energy (and money). A semantic cache that matches similar queries to cached responses can dramatically reduce inference volume.

Run Inference in Low-Carbon Regions

If you have a choice of data center region, carbon intensity varies significantly. A data center powered by hydroelectric in Norway has a very different carbon footprint than one powered by coal in the central United States.

Cloud providers are increasingly transparent about the carbon intensity of their regions. Google Cloud, AWS, and Azure all publish sustainability data by region. Choosing a low-carbon region for non-latency-sensitive workloads (batch processing, training, offline analysis) is a concrete action.

Measure and Report

You can't improve what you don't measure. Tools like CodeCarbon (Python), ML CO2 Impact, and cloud provider carbon dashboards let you estimate the energy consumption and carbon emissions of your AI workloads.

Including energy metrics alongside accuracy and latency in your model evaluation creates accountability. If Model A is 2% more accurate than Model B but uses 5x the energy, that trade-off should be visible and deliberate.

The Systemic Level

Individual optimization matters but doesn't solve the systemic problem. The systemic issues require industry and policy responses.

Transparency: AI companies should disclose the energy consumption and carbon emissions of training and operating their models. Some do — Google, Meta, and others publish research papers with energy data. Many don't. Without transparency, there's no accountability and no basis for comparing alternatives.

Efficiency research: Investment in making models more efficient — achieving the same capability with less compute — has a multiplied environmental benefit. Techniques like distillation (training small models to mimic large ones), quantization (reducing numerical precision), and efficient architectures (mixture of experts) can dramatically reduce energy requirements.

Renewable energy commitments: Major cloud providers have pledged to run on 100% renewable energy. The timelines and definitions vary. "Matching" energy consumption with renewable energy purchases isn't the same as running on renewables in real time. The pledges are a start, not a solution.

Hardware lifecycle management: The industry needs better practices for extending hardware life, recycling components, and accounting for embodied energy in total cost calculations.

The Honest Position

AI has genuine environmental costs. Those costs are growing. Ignoring them because the technology is exciting or economically valuable doesn't make them disappear — it makes them someone else's problem.

The responsible position isn't anti-AI. It's informed AI. Use the right model size. Cache aggressively. Choose low-carbon infrastructure when possible. Measure your energy consumption. And support transparency and efficiency research in the broader ecosystem.

AI should be a beacon that amplifies human capability. That mission is harder to justify if the energy required to power it comes at the expense of the environment that humans live in.

Key Takeaway

AI's environmental impact — energy, water, hardware lifecycle — is real and growing. Builders can reduce their footprint by choosing appropriate model sizes, caching results, selecting low-carbon infrastructure, and measuring energy consumption. Systemically, the industry needs transparency, efficiency research, and genuine renewable commitments. Informed decisions require visible costs.

Comments


bottom of page