Editorial · agents tools

Databricks plus GPT-5.5 makes agent location a data-gravity call

Most CTOs are architecting enterprise agents away from their data — and the Databricks/GPT-5.5 integration makes that a 12-month cost problem.

May 18, 2026· 5 min read· Domani AI

Databricks has embedded GPT-5.5 directly into enterprise agent workflows, and GPT-5.5 now holds state-of-the-art on the OfficeQA Pro benchmark. For most CTOs, that headline reads as a model upgrade. It isn't — it's a forcing function on where your agents live relative to your data, and the window to make that call cheaply is closing.

What changed

OpenAI and Databricks announced that GPT-5.5 is now available natively within Databricks' enterprise agent workflows. The integration means teams already on the Databricks lakehouse can point GPT-5.5 at their existing catalogs, governance rules, and compute without exporting data to a separate agent platform. GPT-5.5 reached the top position on the OfficeQA Pro benchmark, a test designed to approximate the document-heavy, multi-step reasoning work that characterizes enterprise back-office agents — contract review, financial analysis, cross-system lookups.

The practical result is that a Databricks customer can now run a reasoning-grade model inside the same security perimeter as their Delta tables and Unity Catalog policies. Previously, connecting a capable frontier model to enterprise data meant either accepting data egress to an external API or building substantial middleware to enforce governance at the boundary. That middleware cost — in engineering time, latency, and audit surface — was often the hidden argument against enterprise agent adoption.

This is not a research preview. Databricks has a large installed base among companies in the 200–2,000 FTE range, and those companies typically already have structured and semi-structured data in the platform. GPT-5.5's benchmark performance means the model is capable enough to handle the reasoning complexity those use cases actually require, not just simple retrieval.

Why this reframes the build-vs-buy decision for your stack

The conventional enterprise agent architecture debate has three positions: build a custom agent stack on raw APIs, buy a vertical SaaS agent tool, or use a general orchestration framework like LangChain or CrewAI sitting in front of your own infrastructure. Each of those assumes the agent layer is separate from the data layer, connected by retrieval pipelines you own and maintain.

The Databricks/GPT-5.5 integration breaks that assumption. If your data already lives in Databricks, running agents natively in that environment eliminates a retrieval pipeline, a governance enforcement layer, and a separate compute bill. The architectural win is not model quality alone — it's that the model runs where the data is, which removes the latency and compliance overhead that kills enterprise agent projects in production. Data gravity — the principle that compute moves toward data, not the other way around — now applies directly to agent orchestration.

The risk is symmetric. If you have built, or are building, an agent stack outside Databricks while your data warehouse sits inside it, you now face a harder comparison. The standalone stack has to be meaningfully better on your specific use case, or cheaper at scale, or more flexible in ways you can actually name. "We own the architecture" is not sufficient justification if the cost of owning it includes continuous ETL overhead and a governance gap. CTOs who made that call 18 months ago, before a model of this reasoning caliber was available natively in the platform, should rerun the math.

There are legitimate reasons to stay off-platform: multi-cloud neutrality requirements, use cases that span multiple data stores not consolidated in Databricks, or agent complexity that requires orchestration primitives Databricks doesn't yet expose. But those reasons need to be explicit in your architecture docs, not implicit.

What to do Monday morning

The decision tree here has five inputs. Work through them before your next infrastructure review.

Data location: What percentage of the data your agents need is already in Databricks? If it's above 70%, the native path deserves a formal evaluation, not a casual dismissal.
Existing Databricks spend: Are you already paying for a Databricks tier that includes agent tooling? If yes, the marginal cost of the native path may be lower than your current standalone stack's API and infrastructure costs combined.
Governance posture: Does your security or compliance team require that model inference stay within a specific data boundary? Native-in-platform is often the shortest path to satisfying that requirement.
Agent complexity: Are your agents doing multi-step reasoning over structured enterprise data, or simple retrieval-augmented generation on documents? GPT-5.5's OfficeQA Pro performance is most relevant to the former.
Team ownership: Who currently owns your agent stack? If it's a small data engineering team already running Databricks, consolidation reduces their operational surface. If it's a product team building differentiated logic, they may need the flexibility of a standalone orchestration layer.

If your answers point toward consolidation, run a 2-week proof of concept on one existing agent use case — pick something in production with measurable latency and cost baselines. You need real numbers, not benchmark extrapolations, before you commit to a migration.

What it costs and what it saves

The cost of the native path is platform dependency. Databricks is not a neutral infrastructure layer; it has pricing power, and consolidating your agent compute there increases your exposure to future price changes. You are also accepting the platform's orchestration primitives, which may lag behind open-source frameworks on features like complex multi-agent coordination. For teams that have built significant internal tooling on top of LangGraph or similar, the migration cost is real engineering time — likely 4 to 8 weeks for a non-trivial agent surface.

The savings case is less abstract. Removing a retrieval pipeline between your agent layer and your data layer typically cuts p95 latency by a measurable margin on data-heavy queries. Governance enforcement at the data layer rather than the API boundary reduces the audit surface your security team has to maintain. And if you are currently running a separate vector store and embedding pipeline for enterprise retrieval, the native lakehouse approach can collapse that infrastructure entirely. For a company with 3 or 4 active agent workflows and a modest Databricks footprint, the annual infrastructure savings can justify the migration cost inside a single quarter — but that math depends entirely on your current stack's actual cost, which most teams have not fully accounted for.

Talk to Domani AI about building this →

Source: https://openai.com/index/databricks

Have a similar build in mind? → Start the conversation

Start the conversation →

← Back to Insights