Introduction

Vectorless is a reasoning-based document-retrieval engine. Instead of splitting a document into fixed-size chunks and searching them by vector similarity, Vectorless parses the document into a hierarchical tree and lets an LLM agent navigate it — returning answers with precise, path-correct citations.

The one-line version

Vectorless parses a document into a tree, an agent walks it with the treewalk strategy, and you get an answer plus the exact nodes it came from.

Why no chunking, embeddings, or vector DB?

Classic RAG turns retrieval into a nearest-neighbor problem: chunk the text, embed every chunk, and hope the question's embedding lands near the right one. That throws away the document's structure and makes citations approximate.

Vectorless takes the opposite bet:

Structure is signal. A document already has sections, sub-sections, tables, and ordering. Vectorless preserves that as a tree.
Navigation beats similarity. An LLM agent reasons about where an answer lives and walks the tree to it — the treewalk strategy.
Citations are structural. Because the agent ends on real nodes, every answer maps back to exact locations in the source.

Why no chunking, embeddings, or vector DB?

What's in these docs

Getting Started

Core Concepts

API Reference

SDKs

Self-hosting

On this page