Vercel Knowledge Agents: Building AI That Understands Your Data Without Embeddings

Frequently Asked Questions

What is the Vercel Knowledge Agent Template?+

It is an open-source template from Vercel Labs for building AI knowledge agents that search your data using grep, find, and cat commands instead of embeddings or vector databases. It runs in isolated Vercel Sandboxes and can be deployed as web chat, GitHub, Discord, or Slack bots.

How does it work without embeddings or a vector database?+

Sources are synced to a filesystem snapshot. When a query arrives, the agent runs bash commands like grep -r, find, and cat to search through files, read content, and synthesize answers. Modern LLMs are already trained on code and terminal interactions, so they use these tools effectively.

What cost savings does this approach offer?+

Vercel reports that one deployment reduced sales call agent costs from 1.00 dollar to 0.25 dollars per call, a 75 percent reduction, while also improving answer quality. Savings come from eliminating vector database infrastructure and using a complexity router to match query difficulty to model cost.

When should I still use RAG with embeddings?+

RAG remains better for very large corpora of hundreds of millions of documents, queries requiring fuzzy semantic matching without exact keywords, and use cases needing nuanced understanding of relationships between concepts not co-located in source documents.

How do I fix wrong answers from the knowledge agent?+

Debugging is transparent. You can see exactly which grep commands the agent ran and which files it read. Fix wrong answers by editing source files or improving content structure, rather than retuning embeddings or adjusting chunking strategies.

What platforms can I deploy the knowledge agent to?+

The template supports deployment as a web chat interface, GitHub bot, Discord bot, or Slack integration through Vercel's Chat SDK adapters. It is built as a Nuxt application using the AI SDK.

Is the template free to use?+

The template itself is open source. Costs come from the underlying Vercel Sandbox and AI SDK usage, which are usage-based. There is no separate pricing for the template.

How does the complexity router work?+

The complexity router classifies incoming queries by difficulty and routes them to appropriate AI models via Vercel's AI Gateway. Simple factual lookups use cheaper, faster models while complex analytical questions get routed to more capable models, optimizing cost without sacrificing quality.

Vercel Knowledge Agents: Building AI That Understands Your Data Without Embeddings

Soizic

The Case Against Embeddings

How It Works: Files and Bash Commands

The Architecture in Detail

Source Management

Sandboxed Execution

Complexity Router

Admin and Observability

Multi-Platform Deployment

Why This Approach Beats RAG for Many Use Cases

Debugging is Transparent

No Silent Failures

Simpler Maintenance

Cost Efficiency

When RAG Still Makes Sense

Building Your Own Knowledge Agent

Step 1: Define Your Sources

Step 2: Configure the Complexity Router

Step 3: Deploy and Test

Step 4: Iterate on Sources, Not Embeddings

Practical Applications

Internal Knowledge Management

Developer Documentation Bots

Customer Support

Email and Communication Intelligence

The Bigger Trend: Simplicity Over Complexity

Frequently Asked Questions

Ready to get started?