RAG is still in BETA, if you encounter issues let us know. Its currently free for users to try
How to use RAG?
1
Head over to the RAG tab from the left menu
This will open up the 2 Step RAG UI, ready to be used
2
Select the Files
You can either click to select or even drag & drop files in the window to get started
File types: .pdf, .txt, .doc, .docx, .md, .png, .jpg, .jpeg, .mp3, .wav are supported & Total size of all files uploaded can be upto 16 MB
3
Click the Upload button
This will open up the 2 Step RAG UI, ready to be used
4
Head over to the Chat UI
Now you can ask whatever questions you might have, the answers will be from the reference documents uploaded only
If you want to delete the files, simply press the ❌ button beside the documents. To clear the chat & start a new conversation you can use the
Clear Chat
buttonThe answers provided by the RAG Agent will also have citations & will mention the PDF source for you to crosscheck if needed
Why Use Multimodal RAG?
- Accuracy Across Formats: Reduces hallucinations by grounding models in trusted enterprise data, not just text.
- Custom Context: Aligns responses with your organization’s knowledge base, images, videos, or audio transcripts.
- Freshness: Works with frequently updated multimodal datasets (e.g., meeting recordings, surveillance images, reports).
- Multimodal Use Cases: Go beyond text-only AI; integrate visual, audio, and structured data reasoning.
Supported Modalities
- Text: Documents, FAQs, manuals, knowledge bases.
- Images: Diagrams, charts, product images.
- PDFs: Reports, invoices, academic papers.
- Audio: Meeting recordings, call center conversations.
- Video: Training videos, tutorials, surveillance streams.
Coming Soon
- Structured Data: CSVs, relational tables, analytics exports.
Coming Soon
Example Scenarios
- Customer Support: Retrieve text FAQs + product images to answer queries.
- Compliance & Legal: Retrieve legal PDFs + annotated charts for audits.
- Healthcare: Ground AI in radiology images + doctor’s notes.
- Research: Combine academic papers (PDFs) with charts/images from experiments.
- Enterprise Training: Use video transcripts + slides to answer employee questions.