Document RAG Guide
Query documents using vector embeddings and semantic similarity search using the Workbench
Beginner
20 min
- TrustGraph deployed (Quick Start)
- Understanding of Core Concepts
Load documents into TrustGraph, create Document RAG flows, and query using vector similarity search while understanding its limitations.
Query documents using vector embeddings and semantic search
Document RAG (also called “basic RAG”, “naive RAG”, or simply “RAG”) is the original retrieval-augmented generation approach that uses vector embeddings to find relevant document chunks and provides them as context to an LLM for generating responses.

Despite being introduced in 2020, many practitioners in 2023 treated basic RAG as if it were a revolutionary breakthrough, seemingly unaware of its well-documented limitations. The approach is straightforward: chunk your documents, embed them, perform similarity search, and hope the LLM can make sense of whatever fragments get retrieved.
While document RAG can work for simple use cases, it struggles with multi-hop reasoning, fails to capture document structure, and has no mechanism for handling contradictory information across chunks. The “naive” in “naive RAG” is there for a reason—yet it remains surprisingly popular among those who stopped reading the literature after the first paper.
Document RAG is the most basic information retrieval flow. It can prove useful for some limited cases, but you should consider GraphRAG or Ontology RAG for real-world information retrieval use-cases.
What is Document RAG?
The essential Document RAG ingest flow consists of:
- Chunking documents into smaller pieces
- Embedding each chunk as a vector
- Storing vectors in a vector database along with the chunks
- Retrieving similar chunks based on query embedding
- Generating responses using retrieved context and an LLM
The pros and cons of this approach:
- ✅ Pro: Quicker ingest time compared to knowledge extraction flows
- ✅ Pro: No token consumption on document ingest
- ⚠️ Con: Sentence embeddings are imprecise and limited for retrieval where documents contain diverse concepts
- ⚠️ Con: Ineffective for resolving complex questions
When to Use Document RAG
✅ Use Document RAG when:
- You need semantic search over documents
- Questions can be answered from isolated passages
- You want simple, fast implementation
- Document context is self-contained in paragraphs or chunks
⚠️ Consider alternatives when:
- You need to understand relationships between entities → Use Graph RAG
- You need structured schema-based extraction → Use Ontology RAG
- Answers require connecting information across documents → Use Graph RAG
Prerequisites
Before starting:
- ✅ TrustGraph deployed (Quick Start)
- ✅ Understanding of Core Concepts
Step-by-Step Guide
Step 1: Load Your Document
TrustGraph supports multiple document formats:
- PDF files (
.pdf) - Text files (
.txt) - Markdown (
.md) - HTML (
.html)
We’re going to start by using a fictional maritime tracking report which you can download at this URL:
- Download the document
- Go the ‘Library’ page
- Click ‘Upload documents’
- Set the title: PHANTOM CARGO
- Set the Comments to: Intelligence report: Operation PHANTOM CARGO
- Set keywords: maritime, intelligence, cargo, grey arms
- Select ‘Text’ for the upload operation
- Click ‘Select text files’
- Add the document you just downloaded
- Click Submit

Step 2: Create a Collection
A collection is used to organise a set of related documents or data sources into a single unit. Retrieval operations operate across a single collection.
We’ll create an ‘intelligence’ collection:
- Go to the ‘Library’ page
- Select the ‘Collections’ tab
- Click ‘Create Collection’
- Set the ID: intelligence
- Set the name: Intelligence
- Set the description to: Intelligence analysis
- Click ‘Submit’
Step 3: Create the Flow
A flow describes the collection of processing operations. We’re going to create a single flow for Document RAG processing.
We’ll create a ‘doc-rag’ flow:
- Go to the ‘Flows’ page
- Click ‘Create’
- Select the flow blueprint ‘Document RAG’
- Set the ID: doc-rag
- Set the description: Document RAG
- Click ‘Create’
Step 4: Submit the Document for Processing
This pushes the document into the flow input.
There is a selection widget top right of the screen with an database icon top left.

Click that to open the collection/flow selector, and select the Intelligence collection, and Document RAG, both of which you created earlier.

You are ready to submit the document:
- Go to the ‘Library’ page
- Select the PHANTOM CARGO document so that the tick box is selected
- Click ‘Submit’ at the bottom of the page
- Change the Processing flow to Document RAG
- Click Submit
Step 5: Monitoring
If you want to see the document loading, you can go to Grafana at http://localhost:3000. The default login user is admin, password admin. Grafana is configured with a single dashboard, which has a ‘pub/sub backlog’ graph.

The document we loaded is small, and will process very quickly, so you should only see a ‘blip’ on the backlog showing that chunks were loaded and cleared quickly.
Step 6: Retrieval
Presently, there is no Document RAG query interface in the Workbench.
For querying Document RAG using the command line, see the Document RAG CLI Guide.
The CLI guide also includes examples demonstrating Document RAG’s limitations with complex queries and multiple documents.
Document RAG vs. Other Approaches
| Aspect | Document RAG | Graph RAG | Ontology RAG |
|---|---|---|---|
| Retrieval | Vector similarity | Graph relationships | Schema-based |
| Context | Isolated chunks | Connected entities | Connected objects, properties and types |
| Best for | Semantic search | Complex relationships | Complex relationships + precise types |
| Setup | Simple | Simple | Complex |
| Speed | Fast | Medium | Medium |
Use multiple approaches: The processing flow defines the extraction and retrieval mechanisms, so you can use multiple approaches on the same data.
Next Steps
Using the CLI
For command-line workflows, see the Document RAG CLI Guide.
Explore Other RAG Types
- Graph RAG - Leverage knowledge graph relationships
- Ontology RAG - Use structured schemas for extraction
Related Resources
- Core Concepts - Understanding embeddings and chunks
- Vector Search - How semantic search works