Overview
KnowledgeBase enables you to build Retrieval-Augmented Generation (RAG) systems by automatically processing documents, creating embeddings, and storing them in vector databases. It integrates seamlessly with Agent and Task to provide relevant context for AI-powered queries.Key Features
- Automatic Processing: Loads documents, chunks text, creates embeddings, and stores in vector databases
- Multiple Formats: Supports PDFs, Markdown, DOCX, CSV, JSON, HTML, and more
- Intelligent Chunking: Auto-detects optimal text splitting strategies
- Flexible Storage: Works with Chroma, Milvus, Qdrant, Pinecone, Weaviate, FAISS, and PGVector
- Hybrid Search: Combines dense vector search with full-text search for better results
- Tool Integration: Can be used as a tool, allowing agents to actively search and retrieve information
Installation
To use KnowledgeBase, you’ll need to install the required dependencies for your chosen vector database, document loaders, and embedding providers.Example: Setting up KnowledgeBase with ChromaFor a complete RAG setup using Chroma as the vector database, PDF loader, and OpenAI embeddings:What each optional group provides:
[chroma]- ChromaDB vector database client[pdf-loader]- PDF document loader (PyPDF)[embeddings]- Embedding providers (OpenAI, Anthropic, etc.)
chroma with qdrant, milvus, weaviate, pinecone, faiss, or pgvector. For other loaders, see the Loaders documentation.Example
Create a KnowledgeBase from documents and use it with an Agent:Navigation
- Attributes - Configuration options for KnowledgeBase
- Putting Files - How to add documents to your knowledge base
- Using as Tool - Use KnowledgeBase as a tool in Agent or Task
- Storage Providers - Vector database providers
- Embedding Providers - Embedding model providers
- Splitters - Text chunking strategies
- Loaders - Document loading strategies

