Deep Search

Search inside your content, not just about it. Find the exact paragraph, the exact moment, the exact frame.

Frontier search that works out of the box.

AI-native file storage for agents and apps. Files, notes, bookmarks, documents.

Full CRUD, semantic search, folders and tags.

Read the docs

// Query
{ "text": "quarterly revenue breakdown" }

// Response (one hit)
{
  "score": 0.94,
  "chunks": [
    {
      "text": "Q3 revenue reached $4.2M, up 18% from Q2...",
      "pageNumber": 7,
      "score": 0.94
    }
  ],
  "name": "Q3 Financial Report.pdf",
  "kind": "document"
}

What is Deep Search?

Exabase Deep Search is a multi-modal, hybrid search API built into Exabase's storage layer.

It searches inside your content at the sub-document level: paragraphs in PDFs, timestamps in audio and video, objects and colors in images.

Content is indexed automatically when you store it. No embedding pipelines, no chunking logic, no search infrastructure to build or maintain.

Problem

Months building search infrastructure.

You need your agent to find the right passage in a PDF, the right moment in a recording, the right image in a library.

So you build it. Chunking pipelines, embedding models, PDF parsers, transcription services, citation logic, reranking.

You stitch it together, tune the chunk sizes, handle edge cases for every file format. Then you maintain all of it.

And you still don't get the results you want.

Solution

Frontier deep search, ready to go.

State-of-the-art deep search that works out of the box, baked into Exabase's storage.

Everything you store becomes searchable at the sub-document level: paragraphs in PDFs, timestamps in audio and video, objects and colors in images.

Search with multi-modal input. Get back precise chunks with location references and relevance scores. No pipeline to build. No models to manage.

Up and running in 3 minutes.

Production-ready

Private by design

Security-first

Scalable

Search modes

Text queries

hybrid semantic and keyword search, automatically balanced

Image queries

find visually similar content across your resources

Color search

find resources by dominant color or palette

File or resource similarity

use any stored resource as a query

Filtered search

narrow by tag, folder, resource type, or date range

Multi-query search

fan out multiple queries in a single call

Precision

The precision parameter (0 to 1) lets you tune the tradeoff between recall and relevance. At 0.1, you get a wide net of loosely related results. At 0.9, you get only the closest matches.

For an agent doing research, set it low to surface everything potentially relevant. For an agent answering a specific question, set it high to get the one right chunk. Same endpoint, one parameter change.
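As a rough sketch of that one-parameter switch, the helper below builds a request body for two agent modes. The `text` and `precision` fields come from the examples on this page; the `AgentTask` type, the helper names, and the specific values 0.1 and 0.9 are illustrative assumptions rather than recommended settings.

```typescript
// Hypothetical task labels for an agent; not part of the Exabase API.
type AgentTask = "research" | "answer";

// Request body fields taken from the examples on this page.
interface SearchBody {
  text: string;
  precision: number; // 0 to 1
}

// Illustrative mapping: a wide net for research, closest matches for a direct answer.
function precisionFor(task: AgentTask): number {
  return task === "research" ? 0.1 : 0.9;
}

// Build the body, rejecting values outside the documented 0 to 1 range.
function buildSearchBody(text: string, precision: number): SearchBody {
  if (precision < 0 || precision > 1) {
    throw new RangeError("precision must be between 0 and 1");
  }
  return { text, precision };
}

const broad = buildSearchBody("quarterly revenue breakdown", precisionFor("research"));
const narrow = buildSearchBody("quarterly revenue breakdown", precisionFor("answer"));
```

Both bodies go to the same endpoint; only `precision` differs.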

Results that deliver

Matched chunks with relevance scores, both per-hit and per-chunk

Page numbers on document chunks, so you know exactly where in the PDF the match came from

Timestamp ranges on audio and video chunks (timeStart, timeEnd), so you can link to the exact moment

Text content in each chunk, ready to pass as context to your LLM or display to users

Works across every resource type in your Base: documents, images, audio, video, notes, bookmarks
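To make the location fields concrete, here is a minimal sketch that turns a chunk into a human-readable source reference. The field names follow the sample response and the list above; the `Hit` and `Chunk` type names, the helper functions, and the assumption that `timeStart` and `timeEnd` are in seconds are not confirmed by this page.

```typescript
// Field names follow the sample response and result list on this page.
interface Chunk {
  text: string;
  score: number;
  pageNumber?: number; // present on document chunks
  timeStart?: number;  // present on audio/video chunks; seconds is an assumption
  timeEnd?: number;
}

interface Hit {
  name: string;
  kind: string;
  score: number;
  chunks: Chunk[];
}

// Format a number of seconds as m:ss for display.
function clock(seconds: number): string {
  const whole = Math.floor(seconds);
  const m = Math.floor(whole / 60);
  const s = whole % 60;
  return m + ":" + (s < 10 ? "0" + s : String(s));
}

// Describe where a chunk was found, based on which location fields it carries.
function locate(hit: Hit, chunk: Chunk): string {
  if (chunk.pageNumber !== undefined) {
    return hit.name + ", p. " + chunk.pageNumber;
  }
  if (chunk.timeStart !== undefined && chunk.timeEnd !== undefined) {
    return hit.name + ", " + clock(chunk.timeStart) + "-" + clock(chunk.timeEnd);
  }
  return hit.name;
}

const firstChunk: Chunk = {
  text: "Q3 revenue reached $4.2M, up 18% from Q2...",
  pageNumber: 7,
  score: 0.94,
};
const report: Hit = {
  name: "Q3 Financial Report.pdf",
  kind: "document",
  score: 0.94,
  chunks: [firstChunk],
};

console.log(locate(report, firstChunk));
// prints "Q3 Financial Report.pdf, p. 7"
```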

How it works

Store a resource. Upload files, save notes, or bookmark links through the Resources API into any Base. Content is indexed automatically at write time. No embedding pipeline to configure.

Search with anything. Send a text query, an image, a color, or a reference to an existing resource. Add filters to narrow by file type, folder, tag, date range, or user. Combine multiple queries in a single call.

Get precise results. Each hit includes the resource metadata, a relevance score, and the matched chunks with their location. For documents, that's the page number. For audio and video, that's the timestamp range. Ready to pass to your LLM or display to users.

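// `api` is assumed to be an already-initialized Exabase SDK client.
const results = await api.resources.search({
  text: "quarterly revenue breakdown",
  filters: { kinds: ["document"] }, // documents only
  precision: 0.5,
});

// Print each hit's name with its first matched chunk and page number.
for (const hit of results.hits) {
  const [first] = hit.chunks;
  if (first) console.log(hit.name, first.text, first.pageNumber);
}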

API at a glance

Endpoint      Method   Description
/v2/search    POST     Search by text, image, color, or similarity
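For readers calling the endpoint without the SDK, the sketch below assembles a plain description of the HTTP request. The `/v2/search` path, the POST method, and the `X-Exabase-Base-Id` header (described in the FAQs) come from this page. The host is supplied by the caller because this page does not state it, and the bearer-token `Authorization` header is an assumption, so check the docs for the real authentication scheme.

```typescript
interface HttpRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

interface SearchOptions {
  baseUrl: string; // supplied by the caller; the API host is not stated on this page
  apiKey: string;  // assumption: bearer-token authentication
  baseId?: string; // optional Base to target
}

// Assemble the request without sending it, so it can be inspected or logged first.
function buildSearchRequest(query: Record<string, unknown>, opts: SearchOptions): HttpRequest {
  const headers: Record<string, string> = {
    "Content-Type": "application/json",
    Authorization: "Bearer " + opts.apiKey, // assumed auth scheme
  };
  if (opts.baseId !== undefined) {
    headers["X-Exabase-Base-Id"] = opts.baseId;
  }
  return {
    url: opts.baseUrl.replace(/\/+$/, "") + "/v2/search",
    method: "POST",
    headers,
    body: JSON.stringify(query),
  };
}

const searchRequest = buildSearchRequest(
  { text: "quarterly revenue breakdown", precision: 0.5 },
  { baseUrl: "https://api.example.test", apiKey: "YOUR_API_KEY", baseId: "base_123" }
);
```

The result can be handed to any HTTP client, for example `fetch(searchRequest.url, searchRequest)`.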

Use cases

RAG pipelines

Search your knowledge base for relevant chunks, pass them as context to your LLM. Each chunk includes a page number or timestamp for source citations.

Agent knowledge retrieval

Your agent searches its Base before responding. It finds the exact paragraph, not just the right document.

Customer support

Search across support docs, past tickets, and product guides to find the answer your agent needs.

Research and analysis

Search across a library of papers, reports, and articles. Find every mention of a topic across all documents.

Media search

Find the exact moment in a recording by searching with text. Results include timestamp ranges you can link to directly.

Visual search

Upload an image and find similar images across your library. Filter by color for design, e-commerce, or creative workflows.

Internal knowledge base

Employees search across all company documents, notes, and bookmarks from a single endpoint.
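For the RAG use case in particular, one possible way to turn search hits into numbered, citable context for an LLM prompt is sketched below. The chunk fields match the sample response on this page; `buildContext`, the ranking by chunk score, and the output format are illustrative choices, not part of the Exabase SDK.

```typescript
interface ContextChunk {
  text: string;
  score: number;
  pageNumber?: number;
  timeStart?: number;
  timeEnd?: number;
}

interface ContextHit {
  name: string;
  chunks: ContextChunk[];
}

// Page number for documents, raw timestamp range for media, otherwise a placeholder.
function locationOf(chunk: ContextChunk): string {
  if (chunk.pageNumber !== undefined) return "p. " + chunk.pageNumber;
  if (chunk.timeStart !== undefined && chunk.timeEnd !== undefined) {
    return chunk.timeStart + "-" + chunk.timeEnd;
  }
  return "no location";
}

// Flatten hits into numbered passages, highest-scoring chunks first, capped at `limit`.
function buildContext(hits: ContextHit[], limit: number): string {
  const passages: { source: string; chunk: ContextChunk }[] = [];
  for (const hit of hits) {
    for (const chunk of hit.chunks) {
      passages.push({ source: hit.name, chunk });
    }
  }
  passages.sort((a, b) => b.chunk.score - a.chunk.score);
  return passages
    .slice(0, limit)
    .map((p, i) => "[" + (i + 1) + "] " + p.source + " (" + locationOf(p.chunk) + "): " + p.chunk.text)
    .join("\n");
}

const context = buildContext(
  [
    { name: "a.pdf", chunks: [{ text: "alpha", score: 0.5, pageNumber: 2 }] },
    { name: "b.mp3", chunks: [{ text: "beta", score: 0.9, timeStart: 10, timeEnd: 20 }] },
  ],
  2
);
// context is "[1] b.mp3 (10-20): beta\n[2] a.pdf (p. 2): alpha"
```

The numbered markers give the model something to cite, and each one maps back to a page or timestamp.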

Ready for scale

Fast

Deliver speed to your users. Infrastructure that won't slow your agent down.

Secure

Encrypted in transit (SSL) and at rest (AES-256). CASA certified.

Reliable

99.9% uptime. Built on Exabase's infrastructure, already proven at consumer scale.

Why Exabase

No pipeline to build

Most search APIs give you semantic search over vectors you create. You still build the ingestion pipeline, handle every file format, manage chunking, and figure out citations yourself. Exabase indexes content automatically when you store it. No external parsers, no embedding models, no chunking logic to maintain.

Sub-document precision

Results aren't just documents. They're specific paragraphs in PDFs and timestamps in audio and video. Each chunk comes with a relevance score and a location reference, so your agent can cite exactly where it found the answer.

Hybrid by default

Every text query combines semantic search with typo-tolerant keyword matching. No configuration needed. Wrap a phrase in double quotes to force exact match when you need it.

Multi-modal in one call

Search with text, an image, a color, or a reference to an existing resource. Combine multiple query types in a single request with multi-query search. One endpoint handles all of it.

Customizable precision

The precision parameter (0 to 1) controls the tradeoff between recall and relevance. Set it low for broad research. Set it high for precise answers. Default is 0.3.

Production-tested at scale

Deep Search runs on the same infrastructure as Fabric, where hundreds of thousands of users store and search their files, notes, and links every day. It's not a research prototype.

Part of the full platform

Search works across everything in your Base: resources you uploaded, content you extracted, bookmarks Workers saved. Memory, Extract, and Deep Search all share the same storage layer. No sync, no glue code, no separate search service to manage.

Part of a full platform

Memory

Self-managing advanced memory system.

Bases

Isolated per-tenant instances with version rollback.

Resources

Files, notes, links. Your portable context server.

Deep Search

Sub-document multi-modal hybrid search.

Extract

Structured data from PDFs, websites, images, and more.

Workers

Autonomous agents and self-enriching knowledge bases.

Works with everything

Model-agnostic. Framework-agnostic.

SDK

TypeScript SDK. A convenient toolkit to move faster.

API

Simple, clean API. Up and running in under a minute.

FAQs

What is Exabase?

Exabase is infrastructure for AI agents. It gives your agents memory, versioned file storage, AI deep search, and context automation through a set of APIs. Store what your agent learns, search inside any content type, and keep knowledge bases current automatically. Built for production use.

Who uses Exabase?

Developers and teams building AI agents, copilots, and RAG applications. If your agent needs to remember things between sessions, store and retrieve files, search across documents and media, or stay up to date without manual maintenance, Exabase handles that infrastructure so you can focus on your product.

What search modes are supported?

Text (hybrid semantic + keyword), image (visual similarity), color (dominant color or palette), resource similarity (find content similar to something you already have), and multi-query (combine several of these in one request).

Is it semantic search or keyword search?

Both. Text search is hybrid by default, combining typo-tolerant keyword matching with semantic search. This means it works for exact phrases and natural language questions. Wrap a phrase in double quotes to force exact match.
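A small sketch of the two styles of text query follows. The double-quote convention is taken from the answer above. The `exactPhrase` helper is hypothetical, and dropping embedded quotes is a choice made here because this page does not say how nested or escaped quotes are handled.

```typescript
// Wrap a phrase in double quotes so the text query asks for an exact match.
// Embedded double quotes are dropped; that is a simplification of this sketch.
function exactPhrase(phrase: string): string {
  const cleaned = phrase.replace(/"/g, "").trim();
  return '"' + cleaned + '"';
}

// Natural-language question: left to the default hybrid matching.
const naturalQuery = { text: "how did revenue change last quarter" };

// Exact phrase: the same `text` field, with the phrase quoted.
const exactQuery = { text: exactPhrase("quarterly revenue breakdown") };
// exactQuery.text is "\"quarterly revenue breakdown\""
```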

What file types can I search across?

Anything stored in your Base: PDFs, images, audio, video, notes, bookmarks, documents. Content is indexed automatically when you store it.

Does it search inside documents or just titles?

Inside. Deep Search finds specific paragraphs in PDFs, timestamps in audio and video, objects and colors in images. Results include the exact chunk that matched, with page numbers, timestamps, or image regions.

How do I control result precision?

Pass the precision parameter (0 to 1). Higher values return fewer, more relevant results. Lower values return more results that may be less precise. Default is 0.3.

Can I filter results by folder, tag, date, or file type?

Yes. The filters object supports kinds (file type), parentIds (folder), ancestorIds (nested folders), tagIds, userIds, createdAfter, and createdBefore.
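As an illustration, the sketch below builds a `filters` object for recent documents anywhere under one folder. The field names come from the answer above. The value types (string arrays for kinds and IDs, ISO 8601 strings for dates), the `recentDocumentsIn` helper, and the folder ID are assumptions.

```typescript
// Field names come from the answer above; the value types are assumptions.
interface SearchFilters {
  kinds?: string[];
  parentIds?: string[];
  ancestorIds?: string[];
  tagIds?: string[];
  userIds?: string[];
  createdAfter?: string;  // assumed ISO 8601
  createdBefore?: string; // assumed ISO 8601
}

// Documents anywhere under `folderId`, created within the last `days` days.
function recentDocumentsIn(folderId: string, days: number, now: Date): SearchFilters {
  const since = new Date(now.getTime() - days * 24 * 60 * 60 * 1000);
  return {
    kinds: ["document"],
    ancestorIds: [folderId], // nested folders, per the answer above
    createdAfter: since.toISOString(),
  };
}

const filteredBody = {
  text: "quarterly revenue breakdown",
  filters: recentDocumentsIn("folder_finance", 90, new Date()),
};
```

Using `parentIds` instead of `ancestorIds` would restrict the match to the folder itself rather than its whole subtree.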

Can I search with an image?

Yes. Pass a base64-encoded image in the image field. The API finds visually similar images across your resources. You can combine image search with color filtering.
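A minimal sketch of preparing an image query body is shown below. Only the `image` field and its base64 encoding come from the answer above. Accepting a data URL and stripping its prefix is an assumption of this helper, and the color option is left out because this page does not name its field.

```typescript
// Accept bare base64 or a data URL and return a search body with bare base64.
// Stripping the "data:...;base64," prefix is an assumption of this sketch.
function imageQuery(encoded: string, precision?: number): { image: string; precision?: number } {
  const marker = ";base64,";
  const at = encoded.indexOf(marker);
  const image = at === -1 ? encoded : encoded.slice(at + marker.length);
  // Reject input that is clearly not standard base64.
  if (!/^[A-Za-z0-9+/]+={0,2}$/.test(image)) {
    throw new Error("image must be base64-encoded");
  }
  return precision === undefined ? { image } : { image, precision };
}

const imageBody = imageQuery("data:image/png;base64,iVBORw0KGgo=", 0.5);
// imageBody.image is "iVBORw0KGgo="
```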

Can I search using an existing resource as the query?

Yes. Pass a resourceId and the API finds content similar to that resource. Works across all resource types.
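A "more like this" lookup might be sketched as follows. The `resourceId` field comes from the answer above. The `id` field on hits, the helper names, and the sample IDs are hypothetical, and filtering out the query resource is a precaution in case it is returned as its own match, which this page does not specify.

```typescript
// `id` is an assumed metadata field; this page only says hits carry resource metadata.
interface ResourceHit {
  id: string;
  name: string;
  score: number;
}

// Request body for a similarity query, using the documented `resourceId` field.
function similarTo(resourceId: string, precision: number): { resourceId: string; precision: number } {
  return { resourceId, precision };
}

// Remove the query resource from the hits, in case it comes back as its own match.
function withoutSelf(hits: ResourceHit[], resourceId: string): ResourceHit[] {
  return hits.filter((hit) => hit.id !== resourceId);
}

const similarBody = similarTo("res_1", 0.5);
const sampleHits: ResourceHit[] = [
  { id: "res_1", name: "Q3 Financial Report.pdf", score: 1.0 },
  { id: "res_2", name: "Q2 Financial Report.pdf", score: 0.81 },
];
const related = withoutSelf(sampleHits, similarBody.resourceId);
```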

What do results look like?

Each hit includes the resource metadata, a relevance score, and matched chunks. Each chunk contains the matched text plus a page number (for documents) or a timestamp range (for audio/video).

Does it support pagination?

Yes. Pass pagination.page and pagination.pageSize. The response includes total and hasMore.
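One way to walk through every page is sketched below. `pagination.page`, `pagination.pageSize`, `total`, and `hasMore` are named in the answer above. The `hits` array alongside them, 1-based page numbering, and the injected `search` function are assumptions, and the in-memory `fakeSearch` stands in for a real client so the loop can run on its own.

```typescript
interface Page<T> {
  hits: T[];
  total: number;
  hasMore: boolean;
}

type SearchFn<T> = (body: {
  text: string;
  pagination: { page: number; pageSize: number };
}) => Promise<Page<T>>;

// Fetch pages until `hasMore` is false or `maxPages` is reached.
async function collectAll<T>(
  search: SearchFn<T>,
  text: string,
  pageSize: number,
  maxPages: number
): Promise<T[]> {
  const all: T[] = [];
  for (let page = 1; page <= maxPages; page++) { // 1-based pages are an assumption
    const res = await search({ text, pagination: { page, pageSize } });
    all.push(...res.hits);
    if (!res.hasMore) break;
  }
  return all;
}

// In-memory stand-in for the real search call, used only to exercise the loop.
const corpus = ["a", "b", "c", "d", "e"];
const fakeSearch: SearchFn<string> = async ({ pagination }) => {
  const start = (pagination.page - 1) * pagination.pageSize;
  const hits = corpus.slice(start, start + pagination.pageSize);
  return { hits, total: corpus.length, hasMore: start + hits.length < corpus.length };
};
```

The `maxPages` cap keeps an agent from paging through an unexpectedly large result set.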

Can I sort results?

Yes. By relevance (default) or by createdAt in ascending or descending order.

Do I need to build an embedding pipeline?

No. Content is indexed automatically when you store it through the Resources API. No external parsers, embedding models, or chunking logic. Exabase handles the entire path from raw file to searchable index.

Can I search across multiple Bases?

Search runs against one Base at a time. Pass the X-Exabase-Base-Id header to target a specific Base. If omitted, the parent workspace is used.

Is search history stored?

By default, yes. Pass incognito: true to prevent the query from being saved to search history.

Related posts:

What is semantic search?

Semantic search vs keyword search for AI agents

Semantic collapse: Why vector search breaks at scale

Ship your first app in minutes.

Start for free

Read the docs
