Who actually picks the AI coding tool in an enterprise?

In the engagements I have run, the answer is rarely the CTO and almost never procurement. The right decision sits with the VP Engineering or the platform lead, advised by a working group of three or four senior engineers who will actually use the tool every day. The CIO signs the licence and the CISO clears the data path, but the choice itself should not be made above the engineering organisation. Tools chosen by people who do not write code are tools that get worked around within a quarter.

Is Cursor still ahead of Claude Code in 2026?

It depends on the workflow, not the vendor. Cursor leads on IDE-bound work and on team rollouts where junior and mid-level engineers need a deeply integrated surface. Claude Code leads on autonomous multi-step work and on senior engineers who already live in the terminal. Both lead on different axes. The market has stopped having a single leader and split into workflow-specific leaders, which is the procurement signal that matters more than any benchmark.

Should we wait for the AI coding tools market to settle before buying?

No, and the assumption behind the question is wrong. The market is not converging — it is fragmenting along workflow lines that are stable enough to commit to. Buying the cheapest defensible thing on a one-year contract and assuming you will reassess in twelve months is the correct posture. Three-year enterprise licences in this category are a procurement mistake; the per-seat or usage-based pricing on all the serious tools is precisely what lets you churn without sunk-cost guilt.

What is the actual cost of an AI coding tool rollout — licence, or all-in?

The licence is between 20% and 40% of the all-in cost in the first year and a smaller share thereafter. The other costs are the platform-engineering time to integrate, evaluate, and maintain the tool; the code-review capacity expansion that the tool's volume increase requires; the testing infrastructure investment that catches the correctness categories AI tools miss; and the lost productivity during the rollout's first three months while engineers learn what the tool reliably does and does not do. The teams that budget only the licence cost are the teams that report disappointment at month nine.

Capabilities

AI Coding Tools in 2026: The Procurement Frame No Vendor Will Draw For You

The serious AI coding tools market has compressed to four enterprise contenders, two emerging agents, and a handful of IDE plays. The strategic question is not which is best — it is who decides, what is actually being procured, and what the licence-versus-API split does to the budget twelve months in. Written for the VPE, the CIO, and the platform lead who will own the consequences.

39 briefs · updated continuously Capabilities

Essay

Claude Code vs Windsurf: Terminal-First Versus IDE-First, and the Workflow That Decides

Updated 2026-06-17 · 10 min

Essay

Claude Code Pricing: A Procurement-Decision Read for CTOs and Finance

Updated 2026-06-17 · 14 min

Essay

Cursor Pricing: The Procurement Model Behind the Flat Per-Seat Number

Updated 2026-06-17 · 12 min

Essay

Cursor vs Claude Code: An Honest Practitioner Comparison in 2026

Updated 2026-06-17 · 14 min

Essay

Cursor vs Windsurf: Two VS Code Forks, One Failed OpenAI Deal, and a Cognition Ownership Question

Updated 2026-06-17 · 11 min

Essay

Enterprise AI Copilots: The Procurement Reality of Glean, Moveworks, M365 Copilot and the Rest

Updated 2026-06-17 · 17 min

Essay

How to Rank in ChatGPT: The Honest Operator's Read

Updated 2026-06-17 · 12 min

Essay

How to Rank in Google AI Overviews: The Operator's Read

Updated 2026-06-17 · 11 min

Essay

Agentic Commerce Strategy: What the Board Needs to Decide Before AI Agents Buy

Updated 2026-06-10 · 12 min

Essay

AI for Engineering Teams: What the VP of Engineering Actually Needs to Know

Updated 2026-06-06 · 12 min

Essay

AI Agent Security: The Runtime Tooling View of Prompt Injection, Data Exfil, and Output Trust

Updated 2026-05-30 · 15 min

Essay

AI Brand Visibility Tools in 2026: The Honest Procurement Read

Updated 2026-05-30 · 15 min

Essay

AI Data Readiness: The Data Layer That Pre-Dates Your Strategy Approval

Updated 2026-05-30 · 14 min

Essay

AI Orchestration Frameworks in 2026: The Practitioner's Comparison

Updated 2026-05-30 · 14 min

Essay

AIOps Platforms in 2026: Buy Standalone, Consolidate, or Skip

Updated 2026-05-30 · 16 min

Essay

GPU and Inference Platforms in 2026: The Procurement Read

Updated 2026-05-30 · 14 min

Essay

Legal AI Platforms in 2026: Harvey, Spellbook, Robin AI, Lexion, Casetext — The Procurement Read

Updated 2026-05-30 · 13 min

Essay

ML Platforms in 2026: SageMaker vs Vertex AI vs Azure ML vs Databricks vs MLflow vs W&B

Updated 2026-05-30 · 14 min

Essay

RAG Architecture in 2026: The Four Choices That Decide Payback

Updated 2026-05-30 · 16 min

Essay

Voice AI in 2026: Four Procurement Categories, Named Verdicts

Updated 2026-05-30 · 16 min

Essay

Agentic AI Architecture Patterns: The Four Shapes That Stabilised in 2026

Updated 2026-05-29 · 14 min

Essay

AI Orchestration Architecture in 2026: The Smallest Thing That Works

Updated 2026-05-29 · 11 min

Essay

AI-SRE Tools in 2026: The Honest Buyer's Read

Updated 2026-05-29 · 14 min

Essay

AI-SRE Vendor Comparison 2026: Ten Tools, Four Criteria, Named Verdicts

Updated 2026-05-29 · 18 min

Essay

The Cost of Failed AI Projects: A Failure-Mode Catalogue From Forty Engagements

Updated 2026-05-29 · 11 min

Essay

Enterprise AI Literacy: A Role-Conditional Programme, Not a Compliance Seminar

Updated 2026-05-29 · 13 min

Essay

Enterprise AI Readiness Assessment: The Four-Page Version

Updated 2026-05-29 · 11 min

Essay

Enterprise AI Red Teaming: The Operational Version

Updated 2026-05-29 · 12 min

Essay

Langsmith vs Langfuse vs Helicone: The Three-Archetype Procurement Read

Updated 2026-05-29 · 14 min

Essay

Resolve AI Alternatives: Four Honest Options for Incident-Response Procurement

Updated 2026-05-29 · 12 min

Essay

Scalable AI Adoption: The Twenty-First Workload Problem

Updated 2026-05-29 · 12 min

Essay

Traversal AI vs Resolve AI: The Autonomy Ceiling Decides the Procurement

Updated 2026-05-29 · 12 min

The procurement review I walked into in February covered four AI coding tools across a 220-engineer organisation. The deck recommended a single-vendor standardisation on the strongest brand in the room. Three of the four senior engineers on the working group had quietly disagreed in the previous week’s prep call. None of them said so in the review because the CTO had already signalled which vendor he preferred, and the senior engineers had read the room. The deal closed on the recommended tool. Six months later the same engineers were running a parallel evaluation of a competitor, because the productivity gains the licence had promised had not arrived. The wrong tool had been picked by the wrong person against the wrong criteria, and the organisation was now paying twice — for the licence and for the shadow evaluation of the tool it should have chosen.

That is the structural problem this hub exists to address. The AI coding tools market in 2026 has compressed to a small enough number of serious contenders that the procurement question looks simple. It is not. The question of which tool wins is downstream of three earlier questions — who picks, what is actually being procured, and how the cost trajectory bends past the first year — and the published comparisons mostly answer the wrong one. They benchmark code generation quality. They rank features. They count integrations. They do not draw the procurement frame that determines whether the tool, once chosen, actually pays back. That frame is what this page is for.

The deep-dive comparisons under this hub each take one of the head-to-head decisions seriously. Cursor vs Claude Code is the most-searched comparison in the category and the one with the cleanest workflow split. Cursor vs Windsurf is the IDE-versus-IDE decision now reshaped by Windsurf’s failed OpenAI deal, Google licensing event, and Cognition acquisition. Claude Code vs Windsurf is the terminal-versus-IDE decision that surfaces when the engineering organisation has the senior density to consider both. Claude Code vs GitHub Copilot is the enterprise-procurement default comparison most teams are running — Copilot Enterprise as the safe procurement choice against Claude Code as the senior-engineer expense-it-anyway tool. The pricing pages — Claude Code pricing and Cursor pricing — are the cost-model deep dives that the comparison pieces touch on but cannot exhaust.

The market in 2026, named honestly

The serious enterprise contenders are four. Cursor, Claude Code, GitHub Copilot, and the Cognition-owned Windsurf/Devin line. Each one has a defensible workflow that the others either do not serve or serve worse. Each one has procurement-grade controls — SSO, audit logging, data-handling commitments, on-prem or in-cloud routing options — that put them above the threshold an enterprise procurement team can clear in writing, although the exact control set still has to be checked against the current contract. None of the four is the universal answer. The vendor decks that imply otherwise are selling against your interests, which is the predictable consequence of the structural point made on the AI-SRE page: consultancy-authored and vendor-authored comparison documents reliably favour the billing surface.

Two emerging contenders sit just below the broad-enterprise threshold and are worth watching but not yet worth standardising on for most organisations. Devin, from Cognition, is the most aggressive autonomous-agent play in the category — closer to a junior engineer than to a tool, with a workflow model that demands an organisation rethink what its on-call rotation, its code-review process, and its mid-level hiring strategy actually do. The product is real, but the operating-model change is larger than most procurement teams price correctly. OpenHands, the open-source agent framework from All Hands AI, sits at the same threshold for the opposite reason — strong technical foundation, no commercial wrapper that an enterprise procurement team can sign against without internal platform work.

The IDE plays — Continue, Roo Code, and a handful of smaller VS Code extensions — are the right answer for teams that want to maintain editor neutrality and bring their own models. They are not the right answer for organisations that need a vendor on the hook for SLA, data residency, and the kind of support call you make at 3 a.m. on a Tuesday. The distinction matters because the engineering organisation that picks the IDE play has implicitly committed to operating the AI tooling itself as a platform capability, which is a real platform-engineering commitment, not a free choice.

Six tools in the serious bracket, plus a fringe. The procurement question is not which is best. It is which workflow yours actually maps to.

The four questions that decide procurement

The published comparisons spend most of their pages on features and benchmarks. The procurement question they should be answering instead has four parts.

One. Who decides, and what authority does that person actually have. In a 50-engineer organisation the right decider is the engineering lead, possibly the CTO if the organisation is small enough that the CTO writes code. In a 200-engineer organisation the right decider is the VP Engineering with a working group. In a 1,000-engineer organisation the right decider is the platform lead with input from the senior engineering management line. The wrong decider — in every size — is procurement, the CIO acting alone, or any executive who has not written code in the last two years. The decision is technical, the consequences are technical, and the only way to read the trade-offs honestly is to have done the work the tool will accelerate. This connects directly to the AI for engineering teams piece, where the throughput-versus-velocity gap I named in that piece is the consequence of decisions made by people who measured the activity rather than the outcome.

Two. What is actually being procured — a tool, a workflow change, or an organisational redesign. A team that buys Cursor and changes nothing else about its delivery process has bought a tool. A team that buys Claude Code and reorganises its senior engineering rotation around multi-step agent work has bought a workflow change. A team that adopts Devin and rethinks its mid-level hiring posture has signed up for an organisational redesign. The licence cost is similar across all three. The total cost — including the operational changes, the code-review capacity scaling, the platform-engineering investment in the tooling itself — varies by an order of magnitude. The vendor will price the licence. You have to price the rest.

Three. What is the licence-versus-API cost split, projected at twentyfold scale. The cost models in this category are genuinely different. Cursor and GitHub Copilot are seat-based, with included usage that most teams stay inside. Claude Code is hybrid — a subscription with included usage and an API path for higher-volume work, with the cost trajectory bending sharply if your engineers are heavy users. Windsurf should be modelled as an IDE seat with usage allowances and an ownership-roadmap risk check after the Cognition acquisition, not as an OpenAI Enterprise extension. Devin is consumption-shaped at a level where a single autonomous task can cost more than a day of an engineer’s seat-based licence elsewhere. Run the math at three times your current usage. The shape of the cost curve at twenty workloads is the procurement question; the shape at one is marketing.

Four. What is the security and compliance surface — and does the regulated-industry version of your enterprise actually pass it. The data path matters. Where does the prompt go. Where does the context go. Where do the responses get logged. Who else can read them. Does the vendor’s data-handling commitment match the regulatory posture you owe your auditors. The serious contenders all have answers that pass procurement at most enterprises, but the answers differ. Cursor’s enterprise tier has the most explicit on-prem routing story. Claude Code routes through Anthropic’s API with the same data-handling commitments as any other Anthropic enterprise deployment. GitHub Copilot for Business has the deepest integration with existing GitHub enterprise security policies. Windsurf’s check is now a Cognition/Windsurf contract check, including which model providers are enabled and how enterprise logs and retained context are handled. For a regulated enterprise — financial services, health, the EU AI Act’s high-risk categories — the security surface is procurement-determining, not a footnote. The governance hub covers the broader policy context; this hub covers the procurement-specific check.

Four questions. The published comparisons answer none of them with the seriousness they require. The deep-dive pages under this hub answer them per pair.

The throughput-versus-velocity reality

The procurement decision is downstream of the operational reality the AI for engineering teams piece names. Individual engineers using AI coding tools generate code faster — measurably, reproducibly, in the 15-35% range for routine work. Team-level shipping velocity rises by a smaller amount — typically 5-15% in the first year — because the bottleneck moves downstream to code review, integration, and on-call. The tool choice does not change this reality. No tool in the market in 2026 collapses the team-level gap to match the individual-task gain, because the gap is not a tool problem. It is an organisational problem.

Goodhart’s law applies precisely here. The moment lines of code or pull requests merged become the measure of AI coding tool success, the measure stops measuring the thing. Engineers learn to produce code volume the tool can take credit for; reviewers approve faster to clear queues that grow with the tool’s output; the visible number rises while the underlying delivery system absorbs the cost. The CTO who reports the visible number to the board is the CTO who will have a credibility problem six months later when shipping velocity has not followed. The procurement decision that does not budget for the code-review capacity scaling and the testing infrastructure investment is the procurement decision that produces this outcome.

The implication for the procurement frame is that the tool choice and the operational investment have to be priced together. A cheaper tool with the right operational investments outperforms an expensive tool without them, every time. The vendors do not budget the operational investments for you; their decks assume the tool is the whole answer. It is not. This is the single most important thing to internalise before signing any AI coding tool licence in 2026.

The licence-versus-API split, in practice

The cost models deserve their own paragraph because they are genuinely different, and the difference matters more than the comparison pieces usually acknowledge.

Seat-based pricing — GitHub Copilot, Cursor’s base tier — is the model that maps best to an enterprise procurement team’s mental model. Predictable, per-engineer, scales linearly with headcount. The cost trajectory is flat at the workload level. The trap is that engineers using the tool at high intensity hit usage caps inside the seat tier and either stop using the tool or push for the higher tier, which is roughly two to three times the cost per seat.

Hybrid subscription-plus-usage pricing — Claude Code, parts of Cursor’s higher tier — combines a predictable subscription with usage-based overage. The cost trajectory bends with how heavily individual engineers use the tool. For senior engineers running multi-step autonomous workflows, the usage component can dominate the licence component within six months. This is not a vendor trick; it reflects the genuinely higher value of the autonomous work, but it does mean the budget conversation is different than the seat-based conversation. The CFO needs to be told this in advance, not in arrears.

Pure consumption-based pricing — Devin, much of OpenHands when wrapped commercially, the autonomous-agent end of the market — prices per task or per token. The cost trajectory is the workflow’s cost trajectory; predicting it requires knowing your workload mix, which most teams do not until they have operated the tool for a quarter. The procurement strategy that works is a small budget commitment with explicit ceilings and a 60-day reassessment. The strategy that does not work is signing a year for a category you have not yet measured your usage of.

The pricing pages — Claude Code pricing and Cursor pricing — take this further, with the actual cost trajectories I have seen at engagement scale and the budgeting templates that survive the first year.

The security and compliance surface, briefly

The regulated-enterprise procurement check has four parts. Data residency — where does the prompt go, where does the response come from, are both inside the regulatory boundary your data classification requires. Audit logging — every prompt, response, and code modification logged with a retention policy that matches the auditor’s expectations. Model-vendor commitments — what the upstream model vendor has signed up to in terms of training-data exclusion, content retention, and breach notification. Identity and access — SSO integration, role-based access, the kill-switch procedure for revoking access when an engineer leaves.

All four serious contenders pass procurement at most enterprises on these checks. The differences are at the margins, and the margins matter for specific regulatory postures. Financial services in the EU have to clear a different bar than a US-based SaaS company; health and life sciences have to clear a different bar than financial services; defence and aerospace have to clear a different bar than any of them. The CISO boundary work in the governance hub covers the broader policy context. The procurement-specific check here is that the vendor’s commitments, in writing, match the regulatory posture you owe. If they do not, no licence at any price clears the deal.

How to read the deep-dives under this hub

Start with the comparison closest to your actual decision. Most enterprise procurement engagements in 2026 are between Cursor and Claude Code at the senior end, between Cursor and Copilot at the broader rollout, or between Claude Code and Windsurf at the platform-engineering edge. The Cursor vs Claude Code page is the most-read for a reason — the workflow split between the two is the cleanest in the category. Cursor vs Windsurf covers the IDE-versus-IDE decision and the post-Cognition ownership implications. Claude Code vs Windsurf covers the terminal-versus-IDE decision that surfaces when your engineering organisation is mature enough to consider both.

The pricing pages are the cost-model deep dives. Read them after the comparison if the procurement question has reached the budget conversation. They will save the CFO a quarter of surprise.

The AI for engineering teams piece is the operational context for any of these tool decisions. Read it before the procurement conversation if you have not. The decisions on this hub make sense only against the throughput-versus-velocity reality that piece names; without it, every comparison reads as a benchmark contest rather than a procurement frame.

How this hub fits the rest of the site

The parent capabilities hub covers the broader capability layer that sits underneath the AI strategy. This hub is the capability-layer treatment of one specific procurement category. The orchestration architecture work at orchestration architecture is the technical context for tools that integrate with the broader AI stack — Claude Code and Cursor both connect to the same model-vendor surfaces that orchestration architectures route through, and the architectural decisions in that piece constrain which AI coding tools work well inside an existing platform.

The strategy root hub is upstream of all of this. If your engineering organisation has been asked to standardise on an AI coding tool without a clear strategy underneath, the four-question diagnostic on the root hub is the place to start. The procurement decisions here only make sense against a strategy that has answered the posture question — leader, follower, or absentee — and the budget-ceiling question. Tools chosen without those answers are tools chosen against vendor narratives, which is the failure mode this site exists to help you avoid.

The hub will get updated quarterly. The market is moving fast enough that any specific tool recommendation has a six-to-twelve-month half-life; the procurement frame above has a longer one. Bookmark the frame; reread the comparisons each refresh.

Sources & methodology

Anthropic — Claude Code documentation — primary vendor reference for the Claude Code surface and pricing model
Cursor — Documentation and pricing — primary vendor reference for the Cursor surface and tier structure
GitHub Copilot for Business — Documentation — primary vendor reference for the GitHub Copilot enterprise surface
Cognition — Windsurf acquisition announcement — primary reference for Windsurf’s July 2025 ownership change
TechCrunch — Windsurf’s CEO goes to Google; OpenAI’s acquisition falls apart — public reference for the failed OpenAI transaction and Google licensing event
Latent Space — AI coding agents coverage, 2025-2026 — third-party engineering coverage of the category’s evolution
Simon Willison’s blog — AI coding tool reviews — independent practitioner-voice reviews that have held up across multiple model releases
METR — Developer productivity studies — methodologically careful work on the perception-versus-measurement gap that constrains tool-procurement claims
Methodology: procurement frame drawn from fractional CTO engagements (2024-2026) covering tool selection and rollout across roughly twenty engineering organisations ranging from 40 to 1,200 engineers. The four-question frame is the one I use on every procurement engagement; the deep-dive pages report the scoring on the specific pairs.

If a number disagrees with your own organisation’s measurement, send the disagreement and I will publish it with attribution. The scoring sheets are CC-BY-4.0; fork them and publish a different verdict if your weights produce one.

Across the guide

AI Strategy Frameworks: A Practitioner Comparison AI Maturity Models: A Strategic-Posture-Conditional Reading AI Governance: A Practitioner's Map of the 2026 Landscape AI Roadmap and Transformation: Sequencing the Work When Budget is Real Teams and Roles: The CAIO Question, Honestly AI Software Engineering: An Engineering Leader's Map

AI Coding Tools in 2026: The Procurement Frame No Vendor Will Draw For You

Communities of Practice for AI: An Operating Model

Measuring AI Adoption: A Leading and Lagging Indicator Set

How to Ring-Fence AI Learning Time Without Killing Delivery

Workforce AI Readiness: Why Training Fails and What Replaces It

24 AI Mini-Challenges: An Operator's Library for Building Real Proficiency

What AI-Native ERP Actually Means: A CFO/CTO Evaluation Framework

Claude Code vs GitHub Copilot: The Procurement Read, Not the Feature Matrix

Claude Code vs Windsurf: Terminal-First Versus IDE-First, and the Workflow That Decides

Claude Code Pricing: A Procurement-Decision Read for CTOs and Finance

Cursor Pricing: The Procurement Model Behind the Flat Per-Seat Number

Cursor vs Claude Code: An Honest Practitioner Comparison in 2026

Cursor vs Windsurf: Two VS Code Forks, One Failed OpenAI Deal, and a Cognition Ownership Question

Enterprise AI Copilots: The Procurement Reality of Glean, Moveworks, M365 Copilot and the Rest

How to Rank in ChatGPT: The Honest Operator's Read

How to Rank in Google AI Overviews: The Operator's Read

Agentic Commerce Strategy: What the Board Needs to Decide Before AI Agents Buy

AI for Engineering Teams: What the VP of Engineering Actually Needs to Know

AI Agent Security: The Runtime Tooling View of Prompt Injection, Data Exfil, and Output Trust

AI Brand Visibility Tools in 2026: The Honest Procurement Read

AI Data Readiness: The Data Layer That Pre-Dates Your Strategy Approval

AI Orchestration Frameworks in 2026: The Practitioner's Comparison

AIOps Platforms in 2026: Buy Standalone, Consolidate, or Skip

GPU and Inference Platforms in 2026: The Procurement Read

Legal AI Platforms in 2026: Harvey, Spellbook, Robin AI, Lexion, Casetext — The Procurement Read

ML Platforms in 2026: SageMaker vs Vertex AI vs Azure ML vs Databricks vs MLflow vs W&B

RAG Architecture in 2026: The Four Choices That Decide Payback

Voice AI in 2026: Four Procurement Categories, Named Verdicts

Agentic AI Architecture Patterns: The Four Shapes That Stabilised in 2026

AI Orchestration Architecture in 2026: The Smallest Thing That Works

AI-SRE Tools in 2026: The Honest Buyer's Read

AI-SRE Vendor Comparison 2026: Ten Tools, Four Criteria, Named Verdicts

The Cost of Failed AI Projects: A Failure-Mode Catalogue From Forty Engagements

Enterprise AI Literacy: A Role-Conditional Programme, Not a Compliance Seminar

Enterprise AI Readiness Assessment: The Four-Page Version

Enterprise AI Red Teaming: The Operational Version

Langsmith vs Langfuse vs Helicone: The Three-Archetype Procurement Read

Resolve AI Alternatives: Four Honest Options for Incident-Response Procurement

Scalable AI Adoption: The Twenty-First Workload Problem

Traversal AI vs Resolve AI: The Autonomy Ceiling Decides the Procurement

The market in 2026, named honestly

The four questions that decide procurement

The throughput-versus-velocity reality

The licence-versus-API split, in practice

The security and compliance surface, briefly

How to read the deep-dives under this hub

How this hub fits the rest of the site

Sources & methodology

Across the guide

Frequently asked questions