Frequently asked questions

What is Exa?

Exa is a new search engine offering both proprietary neural search and industry-standard keyword search. It excels in finding precise web content, retrieving clean/rich web content, and can even identify similar pages based on input URLs. These technologies make Exa ideal for enhancing RAG pipelines, automating research, and creating niche datasets.

What's different about Exa's Neural Search?

We use a transformer-based model to understand your query and return the most relevant links. Exa has embedded large portions of the web so you can make extremely specific and complex queries, and get only the highest quality results.

How is Neural Search different from Google?

Google search is mostly keyword-based, matching query words to webpage words. For example, a Google search for "companies working on AI for finance" typically returns links like "Top 10 companies developing AI for financial services".

In contrast, Exa's neural search understands meaning, returning actual company URLs. Additionally, Exa's results are not influenced by SEO, unlike Google/other engines, which can be affected by optimized content. This allows Exa to provide more precise and relevant results based on the query's intent rather than by keywords alone.

How is Exa different from LLMs?

Exa is a new search engine built from the ground up. LLMs are models built to predict the next piece of text. Exa predicts specific links on the web given their relevance to a query. LLMs have intelligence, and are getting smarter over time as new models are trained. Exa connects these intelligences to the web.

How can Exa be used in an LLM?

Exa enhances LLMs by supplying high-quality, relevant web content, minimizing hallucination and outdated responses. An LLM can take a user's query, use Exa to find pertinent web content, and generate answers based on reliable, up-to-date information.

How does Exa compare to other search APIs?

Exa.ai offers unique capabilities:

  • Neural Search Technology: Uses transformers for semantic understanding, handling complex queries based on meaning.
  • Natural Language Queries: Processes and understands natural language queries for more accurate results.
  • Instant Content Retrieval: Instantly returns clean and parsed content for any page in its index.
  • Large-scale Searches: Capable of returning thousands of results for automatic processing, ideal for batch use cases.
  • Content Highlights: Extracts relevant excerpts or highlights from retrieved content for targeted information.
  • Optimized for AI Applications: Specifically designed for enhancing AI models, chatbots, and research automation.
  • Auto Search: Automatically selects the best search type (neural or keyword) based on the query for optimal results.

How often is the index updated?

We update our index every two minutes, and are constantly adding batches of new links. We target the highest quality web pages. Our clients often request specific domains to be more deeply covered - if there is a use-case we can unlock by additional domain coverage in our index, please contact us.

How does similarity search work?

When you search using a URL, Exa crawls the URL, parses the main content from the HTML, and searches the index with that parsed content.

The model chooses webpages which it predicts are talked about in similar ways to the prompt URL. That means the model considers a range of factors about the page, including the text style, the domain, and the main ideas inside the text.

Similarity search is natural extension for a neural search engine like Exa, and something that's difficult with keyword search engines like google.

What security measures does Exa take?

We have robust policies and everything we do is either in standard cloud services, or built in house (e.g., we have our own vector database that we serve in house, our own GPU cluster, our own query model and our own SERP solution). In addition to this, we can offer unique security arrangements like zero data retention as part of a custom enterprise agreement -- just chat with us!

What's our roadmap?

  • Support arbitrarily complex queries
  • Build a (much) larger index
  • Solve search. No, really.