Introducing Deep Max: State-of-the-Art Agentic Search

Apr 20, 2026

Introducing Deep Max

Today we're launching Deep Max: our highest-quality agentic search endpoint. Deep Max combines frontier LLMs with dozens of parallel calls to Exa Search to answer the hardest research questions on the web.

It hits state-of-the-art accuracy on every popular agentic search benchmark, and does it up to 20x faster than the closest competitor. Deep Max releasing soon, reach out to our team about usage and pricing.

Deep Search QA accuracy. Exa Deep Max leads at 90%, ahead of You Frontier, Parallel Ultra 8x, Perplexity Deep Research, Gemini 3.1 Pro, GPT 5.4, and Claude Opus 4.7

Evals

We benchmarked Deep Max against every major agentic search system (Parallel Ultra, You.com Frontier, Perplexity Deep Research), as well as frontier LLMs (GPT 5.4, Gemini 3.1 Pro, Claude Opus 4.7) running their own native search tools.

On all three evals, Deep Max is up and to the right: higher accuracy, lower latency.

Deep Search QA: accuracy vs latency. Exa Deep Max at 90% / 64s beats You Frontier (84% / 5908s) and Parallel Ultra 8x (82% / 1703s)

FRAMES: accuracy vs latency. Exa Deep Max at 94% / 11s beats Parallel Ultra (88% / 1457s) and all native LLM searchers

HLE-Search: accuracy vs latency. Exa Deep Max at 80% / 25s, tied with GPT 5.4 on quality but at half the latency

Why it's so fast

A typical Deep Max query finishes in tens of seconds, not tens of minutes. Three things make that possible:

Parallel tool calls. Modern LLM SDKs fan out search and contents calls in parallel, each targeting a different angle of the question. The model aggregates as results come back.

Token-efficient contents. Exa returns page text compact enough that the model spends its context on reasoning, not on re-reading headers and nav bars. Highlights guide the model to the right pages; full crawls back the final answer.

Fast in-house search.Every tool call hits Exa's own search stack, which returns results in under a second. At dozens of calls per query, that compounds into a very different user experience than orchestration layers built on older, slower search APIs.

Reach out to our team about usage and pricing for Deep Max.

Where search is going

A primary bottleneck in agentic search is the search tool itself: how broad the index is, how clean the page text is, how fast the results come back.

AIs will soon search the web more than humans, and those agents need search that is fast, accurate, and honest about what's on the page. Deep Max is the most advanced, highest compute version of search on Exa.

We're hiring. Come help us build it.

Cheers,

The Exa Team

SOTA Search Over Academic Publications

The Exa Team

July 23, 2026

Introducing Exa Agent

The Exa Team

June 16, 2026

Exa raises $250M Series C to build the search engine for AIs

Will Bryk

May 20, 2026

Introducing Deep Max: State-of-the-Art Agentic Search

Introducing Deep Max

Evals

Why it's so fast

Where search is going

Cheers,

The Exa Team

SOTA Search Over Academic Publications

Introducing Exa Agent

Exa raises $250M Series C to build the search engine for AIs

Products

Company

Developers

Resources

Connect