AI Progress So Far and a 10-Year Forecast

AI progress has become hard to read because people keep trying to force five different stories into one debate. One story is about model capability. Another is about how cheap intelligence becomes. A third is about whether companies can actually rebuild work around it. Then come the physical limits: chips, power, water, grids, regulation, trust, and public tolerance.

That is the better way to understand where artificial intelligence stands in 2026. The field is not simply racing toward one magic date called AGI. It is becoming an operating layer for software, science, media, education, security, and business process design. The next decade will be decided less by who has the flashiest chatbot and more by who can make AI reliable enough to run real work without losing control of quality, cost, or accountability.

This forecast uses current evidence from the Stanford 2026 AI Index, OECD scenario work on AI trajectories through 2030, Epoch AI trend data, METR research on long-task autonomy, the IEA report on energy and AI, enterprise surveys from McKinsey, workplace data from Microsoft, usage research from OpenAI, consumer adoption data from OpenAI Signals, economic task analysis from Anthropic, research milestones from Google DeepMind, and governance references from NIST and the EU AI Act implementation timeline. The point is not to pretend the future is settled. The point is to separate what has already changed from what still has to prove itself.

Key takeaways

AI progress is no longer one curve. Capability, adoption, infrastructure, governance, and economics are moving at different speeds.
The most important shift is from chat-style answers to reliable task completion inside real workflows.
AI is becoming cheaper for ordinary users while frontier systems become more capital intensive to build.
Science, software, energy, and enterprise operations are likely to feel the deepest changes before fully general autonomy arrives.
The next decade will reward organizations that can give AI context, controls, evaluation, and human accountability.

The real scorecard for AI progress

The most important AI progress so far is not that models became better at sounding fluent. That was the opening act. The real step change is that AI systems now combine language, images, code, tools, memory, search, structured data, and multi-step reasoning in a way that starts to resemble work.

Stanford’s 2026 AI Index says AI capability is still advancing rather than plateauing. It notes that industry produced more than 90% of notable frontier models in 2025, that several models reached or exceeded human baselines on advanced science, math, and multimodal reasoning tasks, and that performance on SWE-bench Verified moved sharply higher in a single year. Those benchmark gains do not mean every AI product is ready for mission-critical use. They do mean the old idea of AI as a clever autocomplete box is already outdated.

The better question is no longer, “Can the model answer?” It is, “Can the system complete useful work with enough reliability, context, security, and auditability that a human team will trust it?” That is where the next 10 years begin.

From answers to durable work

The first stage of generative AI was conversational. People asked questions, drafted emails, summarized documents, and generated images. The next stage is operational. AI agents will be asked to plan, use tools, check outputs, call APIs, write code, query databases, update systems, and come back when a task is done.

That shift is visible in research. METR’s work on long software tasks introduced the idea of a 50% task-completion time horizon: roughly, the length of task a model can complete with 50% reliability when measured against human expert time. In its paper, METR estimated that frontier AI time horizons had been doubling roughly every seven months since 2019, with clear caveats about external validity. That last phrase matters. A software benchmark is not the same as running a hospital, a newsroom, or a supply chain. Still, it is a useful signal because many economic tasks fail for the same reason agents fail: they lose context, make unchecked assumptions, or cannot recover cleanly from mistakes.

Over the next decade, the meaningful progress will come from longer reliable task chains, not just bigger context windows or higher benchmark scores. By 2028, many knowledge workers should expect AI systems that can handle multi-hour tasks with human review. By the early 2030s, some well-instrumented enterprise processes may be run by agent teams with people reviewing exceptions. By 2036, the most advanced organizations may treat AI agents less like tools and more like managed process layers.

That does not mean humans disappear from the workflow. It means the human job moves toward setting intent, validating judgment, handling ambiguity, and deciding when the system should stop.

TECHi has already tracked the first public wave of AI agents becoming product strategy. The important next step is quieter: agents moving from demos into permissions, logs, budget controls, and failure handling.

AI is becoming both cheaper and more expensive

The economics of AI look contradictory because two things are true at the same time. Running a fixed level of intelligence is getting cheaper. Building the frontier is getting more expensive.

Epoch AI’s trend dashboard estimates that frontier language-model training compute has been growing around 5x per year since 2020, while pre-training compute efficiency has been improving rapidly as well. This is the central economic paradox of the field. The same progress that makes ordinary AI features cheaper also encourages labs to spend more on the next frontier model, the next reasoning stack, the next data center, and the next custom chip system.

For users, this means AI will feel increasingly abundant. Summaries, search, drafting, translation, spreadsheet help, document comparison, customer-service triage, basic coding, image editing, and routine analysis will become cheap enough to be bundled into almost everything. For the companies building frontier systems, the race will remain capital intensive and concentrated.

That split will shape the next decade. Commodity intelligence will spread widely. Frontier intelligence will be fought over through cloud contracts, power access, chip supply, model talent, proprietary data, and distribution. The winners will not only have better models. They will have better systems around the models.

Adoption is mainstream. Transformation is not.

AI is already mainstream in a way earlier enterprise technologies were not. OpenAI said in late 2025 that ChatGPT served more than 800 million weekly users, and its enterprise report described sharp growth in workplace usage, structured workflows, and reasoning-heavy tasks. In May 2026, OpenAI’s Signals work described ChatGPT adoption broadening across age groups, geographies, and recurring work use cases.

But adoption is not the same as transformation. McKinsey’s 2025 survey found that 88% of respondents said their organizations used AI in at least one business function, yet most were still experimenting or piloting, and only 39% reported enterprise-level EBIT impact. That gap is the whole market in miniature. Everyone has access to AI. Far fewer organizations have redesigned work around it.

This is why the next decade will reward boring work that does not look like a keynote demo: data cleanup, permission design, retrieval systems, human review queues, quality metrics, escalation rules, and workflow redesign. Anthropic’s Economic Index points in the same direction. Its enterprise API data showed automation-heavy usage patterns, but it also emphasized that sophisticated deployment depends on giving models the right context. AI cannot automate what the company itself cannot describe, access, or measure.

For businesses, the hidden AI advantage will be institutional memory. The model is only one part of the system. The harder asset is knowing what the model needs to know, who is allowed to act on its output, and what counts as a good result.

The science story is bigger than chatbots

The public debate often treats AI as a workplace automation story, but the deeper long-term impact may be scientific. AlphaFold is the cleanest example. Google DeepMind says the AlphaFold database expanded to more than 200 million protein structures, and AlphaFold 3 moved beyond proteins to predict structures and interactions across a wider range of biological molecules. DeepMind also says AlphaFold was being used by more than 3 million researchers across more than 190 countries by late 2025.

That is not just a product milestone. It is a change in the research loop. AI can help generate hypotheses, search design spaces, propose molecules, write simulation code, interpret experimental results, and decide what to test next. TECHi has seen a similar pattern in coverage of AI weather forecasting models, where the point is not that AI replaces physics, but that it changes the speed and cost of prediction.

The 10-year forecast here is strong but not magical. AI will accelerate parts of biology, chemistry, materials science, chip design, climate modeling, and medical research. It will not erase clinical trials, safety validation, peer review, manufacturing constraints, or regulatory approval. The breakthrough will be cycle time. More ideas will be tested, more dead ends will be killed early, and more scientific work will become software-mediated.

By 2036, the leading labs may look less like groups of people manually driving every experiment and more like supervised discovery systems. Humans will still set the questions. AI will increasingly map the search space.

Work changes before jobs vanish

The job debate is usually framed as a binary: AI either replaces workers or does not. That framing misses how technology normally changes labor. Work is decomposed first. Some tasks are automated, some become more valuable, some move to different roles, and some new work appears because the system itself needs supervision.

Microsoft’s 2025 Work Trend Index argued that companies are moving toward human-agent teams, with many leaders expecting agents to become part of AI strategy in the near term. McKinsey found a more cautious version of the same story: broad use, real experimentation, but uneven scaling. Anthropic’s data adds another layer, showing that API-based business use is more automation-heavy than casual consumer use.

For workers, the next 10 years will feel uneven. People who already know how to define good work, check quality, manage exceptions, and use domain context will become much more productive. People whose roles consist mainly of producing first drafts, routine summaries, basic reporting, or simple ticket handling will face pressure. The most exposed roles are not always the lowest-paid ones. Many entry-level white-collar tasks were designed around information assembly, and AI is getting good at exactly that.

The hardest institutional question is training. If AI removes junior tasks, where do senior professionals come from later? Law firms, newsrooms, software teams, finance departments, design studios, and consulting groups will all have to rebuild apprenticeship paths. A company that automates the bottom rung without creating a new learning path may save money for a while and then discover it has no bench.

TECHi’s recent look at Amazon’s AI workforce strategy fits into this wider picture. The real story is not one employer. It is that every large organization is trying to decide which human skills become scarce when routine cognitive labor becomes cheap.

Energy becomes an AI policy question

AI is digital, but its constraints are physical. The IEA estimates that data centers used about 415 TWh of electricity in 2024, around 1.5% of global electricity consumption, and projects that data center electricity consumption will more than double to roughly 945 TWh by 2030. The agency also notes that AI is the most important driver of that growth, alongside demand for other digital services.

This is where many AI forecasts become too airy. They talk about intelligence without talking about substations, transformers, cooling, grid queues, gas turbines, renewables, nuclear, water, land, and local politics. The next decade of AI will be shaped by geography as much as by algorithms. Regions that can deliver power, permits, fiber, water, and chip logistics will attract compute. Regions that cannot will complain about AI from the sidelines or buy access from those that can.

Efficiency improvements will help, but they will not automatically reduce total demand. Cheaper inference usually creates more use. Better models create new use cases. More capable agents run longer tasks. The likely result is not less compute, but a larger and more carefully managed compute economy.

That matters for tech investors too. AI chip demand, cloud capex, memory supply, power contracts, and data-center buildouts are already connected. TECHi’s coverage of TSMC and AI chip demand is part of the same chain. AI progress is now an infrastructure story.

Regulation will not stop AI. It will decide where it can be trusted.

The next decade will also be shaped by rules. The EU AI Act is being phased in through a multi-year timeline, while the United States has leaned more heavily on agency guidance, standards, procurement rules, litigation, and frameworks such as the NIST Generative AI Profile. Different regions will not regulate AI the same way, but they will all move toward a similar demand: prove what the system did, why it acted, who approved it, and how harm is handled.

This will frustrate companies that want frictionless deployment. It will also create a market for safer deployment. Audit logs, model evaluations, provenance, incident reporting, red-team results, data lineage, and permission controls will become product features. In high-stakes sectors, the ability to explain and govern AI may matter as much as raw model quality.

The public trust question is not a side issue. If AI systems produce fake media, biased decisions, unsafe recommendations, or opaque automated denials at scale, political backlash will slow deployment. If companies build systems that are useful, reviewable, and accountable, AI will become boring in the best sense: infrastructure people rely on.

What AI may look like by 2036

The most realistic 10-year forecast is not one dramatic moment. It is a layered transition.

By 2036, most professional software will likely have an agent layer. Users will not simply click through menus. They will ask systems to prepare work, compare options, draft actions, update records, and monitor outcomes. Some of this will still look like chat. Much of it will disappear into buttons, workflows, alerts, and background automations.

Personal AI assistants will become more useful, but not omnipotent. They will manage calendars, travel, documents, shopping, inboxes, learning plans, health records, and financial paperwork with varying degrees of permission. The high-stakes parts will still need strong identity, audit, and human approval.

Enterprise AI will move from copilots to process ownership in narrow domains. An AI system may manage invoice exceptions, security triage, contract review prep, sales research, customer support routing, compliance monitoring, software maintenance, or procurement analysis. The key word is narrow. Broad autonomy will be harder than marketing suggests.

Scientific AI will become one of the decade’s biggest upside cases. The best systems will combine literature search, code generation, simulation, lab automation, and experimental design. The impact will be largest where experiments can be digitized, measured, repeated, and fed back into models.

Education will have to move away from homework that AI can trivially produce. Oral defense, live problem solving, projects, lab work, portfolios, and process evidence will matter more. The question will become not whether a student used AI, but whether the student can reason with it and defend the result.

Media will split between cheap synthetic volume and trusted brands with provenance. Anyone will be able to generate passable text, images, audio, and video. That will make trust, sourcing, bylines, editorial standards, and verification more valuable, not less.

Robotics will improve, but more slowly than software agents. The physical world is unforgiving. Homes, hospitals, roads, warehouses, factories, and farms vary in ways that browsers and spreadsheets do not. AI will make robots smarter, but hardware cost, safety, maintenance, and liability will keep the curve slower.

What could break this forecast

The optimistic version of AI progress assumes that scaling continues, inference gets cheaper, agents become more reliable, energy buildouts happen fast enough, and institutions adapt. Any one of those can disappoint.

Models may hit harder limits in reasoning, data quality, or reliability than current trend lines imply. The economics may tighten if frontier systems cost too much to train and serve. Energy bottlenecks may delay data centers. Regulation may slow deployment in sensitive sectors. Security failures may make companies cautious. Users may also become more skeptical if AI tools keep overpromising and underdelivering.

The biggest risk is not that AI suddenly stops working. It is that AI remains impressive but uneven, good enough for demos and isolated tasks, not dependable enough for deep transformation in the places that matter most.

The bottom line

AI progress to date is real. It is visible in benchmarks, adoption, coding, agents, scientific tools, enterprise workflows, and the physical buildout of compute. But the next 10 years will not be won by vague excitement. They will be won by reliability, context, infrastructure, governance, and human judgment.

The future of AI is not just smarter models. It is the redesign of systems around models. That is a harder story than hype, and it is also a much bigger one.

M	T	W	T	F	S	S
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31