Sunbelt Computer Software

🧠 Ragify

Build, configure, and chat with your own Retrieval-Augmented Generation bots.

Features · Quick Start · Architecture · Tech Stack · Security · Roadmap

✨ Features

🔑 Bring-Your-Own-Key — Use your own API keys for OpenAI, Anthropic, Google Gemini, or Mistral. Keys are encrypted at rest with AES-256-GCM. Zero LLM cost for the platform.
💬 Real-Time Streaming Chat — Live token-by-token responses powered by the Vercel AI SDK, with markdown rendering and source citations.
📂 Document Upload & Retrieval — Upload documents (TXT, MD, PDF, DOCX, CSV) during RAG creation or directly mid-chat via the 📎 button. Vector retrieval, with an SQLite FTS5 keyword fallback, surfaces relevant context to the LLM.
🎛️ In-Chat Model Switcher — Switch between 20+ models from all providers on the fly from a dropdown — no need to leave the conversation.
⚙️ Live Bot Tuning — Adjust temperature, max tokens, top-p, and system prompt from a slide-out panel without leaving the chat.
🛡️ Granular Error Handling — Distinct, actionable error banners for invalid keys, exhausted credits, rate limits, model-not-found, and context overflow.
🏗️ Multi-Step Creation Wizard — A guided 6-step wizard: name → model → retrieval → safety → upload → review.
🔒 Platform API Keys — Generate rag_-prefixed keys for programmatic access, hashed with bcrypt. Revealed once, never stored in plaintext.
🌙 Dark/Light Themes — Full theme support via CSS custom properties. No Tailwind — pure vanilla CSS Modules.

🚀 Quick Start

🐳 Running with Docker (Recommended)

The easiest way to run Ragify is with Docker: you don't need to manage Node.js versions, install native build tools, or set up Ollama yourself. Docker Compose spins up both the Ragify application and a local Ollama instance automatically.

Clone the repository:

```
git clone https://github.com/bhoomik-codes/ragify.git
cd ragify
```

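Configure Environment: Create your .env file from the template, then set secure values for AUTH_SECRET and ENCRYPTION_KEY. The Troubleshooting section shows a command that generates a valid ENCRYPTION_KEY; alternatively, npm run setup (Step 2 of the manual setup) generates both keys automatically.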
```
cp .env.example .env
```
Start with Docker Compose:
```
docker compose up --build -d
```
Open http://localhost:3000 when the build finishes.

💻 Manual Local Setup

Prerequisites

Requirement	Version
Node.js	`≥ 20.x`
npm	`≥ 10.x` (ships with Node 20)

Step 1 — Clone & Install

```
git clone https://github.com/bhoomik-codes/ragify.git
cd ragify
npm install
```

Step 2 — Automatic Setup & Initialization

Run the included setup script. This will automatically generate your secure .env cryptographic keys, initialize the SQLite database, generate the Prisma client, and verify your local Ollama connection.

```
npm run setup
```

Step 3 — Start the Dev Server

```
npm run dev
```

Open http://localhost:3000, register an account, and create your first RAG.

🧪 Running Without API Keys (Mock Mode)

Don't have LLM API keys yet? Set `MOCK_MODE="true"` in `.env` to run with simulated responses. This lets you explore the full UI, upload documents, and test the creation wizard without spending a cent.

🔑 Adding Your LLM Keys

Once running, navigate to Settings → Provider Keys in the app. Add your API key for any provider (OpenAI, Anthropic, Google, Mistral). Keys are encrypted with AES-256-GCM before touching the database — the raw key is never stored.
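As a rough illustration of what "encrypted with AES-256-GCM" can look like in Node.js, here is a minimal sketch built on the standard `node:crypto` module. The function names (`encryptSecret`, `decryptSecret`) and the `iv:tag:ciphertext` storage format are assumptions made for this example; the actual `lib/crypto.ts` may be organized differently.

```typescript
import { createCipheriv, createDecipheriv, randomBytes } from "node:crypto";

// Hypothetical helpers for illustration; not taken from Ragify's lib/crypto.ts.
const IV_BYTES = 12; // 96-bit IV, the size commonly recommended for GCM

function parseKey(hexKey: string): Buffer {
  if (!/^[0-9a-fA-F]{64}$/.test(hexKey)) {
    throw new Error("ENCRYPTION_KEY must be exactly 64 hex characters");
  }
  return Buffer.from(hexKey, "hex"); // 32 bytes = 256-bit key
}

function encryptSecret(plaintext: string, hexKey: string): string {
  const iv = randomBytes(IV_BYTES); // fresh random IV for every encryption
  const cipher = createCipheriv("aes-256-gcm", parseKey(hexKey), iv);
  const ciphertext = Buffer.concat([cipher.update(plaintext, "utf8"), cipher.final()]);
  const tag = cipher.getAuthTag();
  // Assumed storage format: hex-encoded iv, auth tag, and ciphertext joined by ":"
  return [iv, tag, ciphertext].map((part) => part.toString("hex")).join(":");
}

function decryptSecret(payload: string, hexKey: string): string {
  const parts = payload.split(":");
  if (parts.length !== 3) throw new Error("Malformed encrypted payload");
  const [ivHex, tagHex, dataHex] = parts;
  const decipher = createDecipheriv("aes-256-gcm", parseKey(hexKey), Buffer.from(ivHex, "hex"));
  decipher.setAuthTag(Buffer.from(tagHex, "hex")); // final() throws if data was tampered with
  const plain = Buffer.concat([decipher.update(Buffer.from(dataHex, "hex")), decipher.final()]);
  return plain.toString("utf8");
}
```

Because the IV is random per call, encrypting the same key twice yields different stored values, which is what "unique IV per key" in the Security table relies on.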

🏛️ Architecture

Retrieval flow (high level)

Vector search first: if an embedding model is available, Ragify embeds the user query and retrieves the top-K most similar chunks.
Keyword fallback: if vector retrieval is unavailable or returns no results, Ragify falls back to SQLite FTS5 keyword search.
No-context response: if both return no matches, Ragify responds neutrally instead of injecting arbitrary chunks into the context window.
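The three-stage flow above can be sketched as a single function. This is only an illustration under stated assumptions: the `Chunk`, `Search`, and `retrieveContext` names are hypothetical, and the two search functions are passed in rather than wired to real embedding or FTS5 code.

```typescript
// Hypothetical types and names, for illustration only.
interface Chunk {
  id: string;
  text: string;
}

type Search = (query: string, topK: number) => Promise<Chunk[]>;

interface RetrievalResult {
  source: "vector" | "keyword" | "none";
  chunks: Chunk[];
}

async function retrieveContext(
  query: string,
  vectorSearch: Search | null, // null when no embedding model is available
  keywordSearch: Search,
  topK = 5,
): Promise<RetrievalResult> {
  if (vectorSearch) {
    try {
      const hits = await vectorSearch(query, topK);
      if (hits.length > 0) return { source: "vector", chunks: hits };
    } catch {
      // Treat a failed vector lookup as "unavailable" and fall through to keywords.
    }
  }
  const hits = await keywordSearch(query, topK);
  if (hits.length > 0) return { source: "keyword", chunks: hits };
  // Nothing matched: return no chunks so the caller can answer neutrally
  // instead of padding the prompt with unrelated text.
  return { source: "none", chunks: [] };
}
```

Returning the `source` alongside the chunks lets the caller decide how to phrase the no-context response without re-running either search.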

```
ragify/
├── app/
│   ├── (auth)/               # Login, Signup, Forgot/Reset Password
│   ├── (app)/                # Protected routes (requires session)
│   │   ├── dashboard/        # RAG cards grid
│   │   │   └── new/          # 6-step creation wizard
│   │   ├── rags/[ragId]/
│   │   │   └── chat/         # Chat UI (model switcher, params panel, upload)
│   │   └── settings/         # BYOK & Platform key management
│   ├── (marketing)/          # Public landing page
│   └── api/
│       ├── auth/             # NextAuth handlers + credential flows
│       ├── rags/             # CRUD, streaming chat, document upload
│       │   └── [id]/
│       │       ├── chat/     # POST — streaming chat endpoint
│       │       └── documents/ # POST — in-chat file upload
│       └── users/me/
│           ├── provider-keys/ # BYOK key CRUD
│           └── platform-keys/ # Platform API key CRUD
│
├── components/
│   ├── layout/               # AppShell, Sidebar, TopBar, ThemeToggle
│   ├── settings/             # ProviderKeyManager, PlatformKeyManager
│   ├── shared/               # ConfirmDialog, EmptyState, OnboardingTour
│   └── ui/                   # Button, Card, Input, Modal, Badge, Spinner
│
├── lib/
│   ├── auth.ts               # NextAuth v5 config (credentials provider)
│   ├── crypto.ts             # AES-256-GCM encrypt/decrypt + bcrypt
│   ├── llm.ts                # Provider-agnostic streaming + error classification
│   ├── pipeline.ts           # Document parse → chunk → embed pipeline
│   ├── vector.ts             # Cosine similarity, serialize, searchChunks
│   ├── validators.ts         # Zod schemas for all API payloads
│   ├── types.ts              # SSoT for enums, DTOs, interfaces
│   ├── mappers.ts            # Prisma row → safe DTO mapping
│   ├── db.ts                 # Prisma client singleton
│   └── mail.ts               # Email transport (password reset)
│
├── prisma/
│   └── schema.prisma         # Database schema
│
├── middleware.ts              # Auth route protection
└── .env.example               # Template for environment variables
```
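The tree lists `lib/vector.ts` as the home of cosine similarity and chunk search. As a sketch of that technique (not the file's actual contents), the following ranks stored chunks against a query embedding. The `EmbeddedChunk` shape and the `topKChunks` name are assumptions for this example.

```typescript
// Hypothetical shapes; the real lib/vector.ts may store embeddings differently.
interface EmbeddedChunk {
  id: string;
  embedding: number[];
}

function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("Vectors must have equal length");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  const denom = Math.sqrt(normA) * Math.sqrt(normB);
  return denom === 0 ? 0 : dot / denom; // define similarity with a zero vector as 0
}

function topKChunks(
  query: number[],
  chunks: EmbeddedChunk[],
  k: number,
): { id: string; score: number }[] {
  return chunks
    .map((chunk) => ({ id: chunk.id, score: cosineSimilarity(query, chunk.embedding) }))
    .sort((x, y) => y.score - x.score) // highest similarity first
    .slice(0, k);
}
```

This brute-force scan is linear in the number of chunks, which is consistent with the changelog's note about capping embeddings per query and its pointer to a pgvector migration.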

🛠 Tech Stack

Layer	Technology
Framework	Next.js 14 (App Router)
Language	TypeScript 5.9 (strict)
Runtime	Node.js 22
Auth	NextAuth v5 (Auth.js)
ORM	Prisma 7
Database	SQLite (dev) → PostgreSQL (prod)
AI / LLM	Vercel AI SDK v3.4.x
Validation	Zod
Styling	Vanilla CSS Modules
Encryption	AES-256-GCM + bcrypt

Supported LLM Providers

Provider	Models
OpenAI	GPT-4o, GPT-4o mini, o4-mini, o3-mini, o1, GPT-4 Turbo, GPT-3.5 Turbo
Anthropic	Claude Opus 4.5, Sonnet 4.5, 3.7 Sonnet, 3.5 Sonnet, 3.5 Haiku, 3 Opus
Google Gemini	Gemini 2.5 Flash, 2.5 Pro, 2.0 Flash, 1.5 Pro, 1.5 Flash
Mistral	Mistral Large, Medium, Small, Codestral, Mixtral 8x22B

🔒 Security

Concern	Implementation
Provider API keys	AES-256-GCM with unique IV per key
Platform API keys	bcrypt hashed; raw key shown exactly once
Route authorization	IDOR check (`userId` match) on every API route
Input validation	Zod schemas on all API payloads
DTO mapping	Raw Prisma objects never returned to clients
Error classification	LLM errors mapped to specific codes (no stack leaks)
Secrets	`.env` excluded from git; `ENCRYPTION_KEY` required
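To make the "Error classification" row more concrete, here is a hedged sketch of mapping provider failures to a small set of client-safe codes. The code names, the `ProviderError` shape, and the status and message heuristics are all assumptions for illustration; real providers differ in how they signal each condition, and Ragify's `lib/llm.ts` may classify errors differently.

```typescript
// Hypothetical error codes and heuristics; providers vary in status codes and wording.
type LlmErrorCode =
  | "INVALID_KEY"
  | "CREDITS_EXHAUSTED"
  | "RATE_LIMITED"
  | "MODEL_NOT_FOUND"
  | "CONTEXT_OVERFLOW"
  | "UNKNOWN";

interface ProviderError {
  status?: number;
  message?: string;
}

function classifyLlmError(err: ProviderError): LlmErrorCode {
  const msg = (err.message ?? "").toLowerCase();
  if (err.status === 401 || err.status === 403) return "INVALID_KEY";
  // Checked before the plain 429 case because some providers report
  // exhausted quota with a rate-limit status.
  if (msg.includes("quota") || msg.includes("billing") || msg.includes("credit")) {
    return "CREDITS_EXHAUSTED";
  }
  if (err.status === 429) return "RATE_LIMITED";
  if (err.status === 404) return "MODEL_NOT_FOUND";
  if (msg.includes("context length") || msg.includes("too many tokens")) {
    return "CONTEXT_OVERFLOW";
  }
  return "UNKNOWN";
}

// Only the code reaches the client; the raw message and stack stay server-side.
function toClientError(err: ProviderError): { code: LlmErrorCode } {
  return { code: classifyLlmError(err) };
}
```

Keeping the client payload down to a code is what allows the UI to show a distinct banner per failure without leaking stack traces.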

📋 Environment Variables

Variable	Required	Default	Description
`DATABASE_URL`	✅	`file:./dev.db`	Database connection string
`AUTH_SECRET`	✅	—	NextAuth session signing key
`AUTH_URL`	✅	`http://localhost:3000`	App base URL for NextAuth callbacks
`ENCRYPTION_KEY`	✅	—	64-char hex string for AES-256-GCM
`MOCK_MODE`	❌	`"false"`	Set `"true"` to bypass real LLM calls
`MOCK_PIPELINE_DELAY_MS`	❌	`500`	Simulated pipeline delay (ms)
`OPENAI_API_KEY`	❌	—	Platform-level OpenAI fallback key
`ANTHROPIC_API_KEY`	❌	—	Platform-level Anthropic fallback key
`GOOGLE_API_KEY`	❌	—	Platform-level Google fallback key
`MISTRAL_API_KEY`	❌	—	Platform-level Mistral fallback key
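A startup check along the following lines can catch misconfigured variables early. This is a dependency-free sketch that uses the defaults from the table above; the `AppEnv` shape and `loadEnv` name are hypothetical, and the project itself may validate configuration differently (for example with Zod, which it already uses for API payloads).

```typescript
// Hypothetical config loader; field names are illustrative.
interface AppEnv {
  databaseUrl: string;
  authSecret: string;
  authUrl: string;
  encryptionKey: string;
  mockMode: boolean;
  mockPipelineDelayMs: number;
}

function loadEnv(env: Record<string, string | undefined>): AppEnv {
  const authSecret = env.AUTH_SECRET;
  if (!authSecret) throw new Error("AUTH_SECRET is required");

  const encryptionKey = env.ENCRYPTION_KEY ?? "";
  if (!/^[0-9a-fA-F]{64}$/.test(encryptionKey)) {
    throw new Error("ENCRYPTION_KEY must be exactly 64 hex characters");
  }

  const delay = Number(env.MOCK_PIPELINE_DELAY_MS ?? "500");
  if (!Number.isFinite(delay) || delay < 0) {
    throw new Error("MOCK_PIPELINE_DELAY_MS must be a non-negative number");
  }

  return {
    databaseUrl: env.DATABASE_URL ?? "file:./dev.db", // default from the table
    authSecret,
    authUrl: env.AUTH_URL ?? "http://localhost:3000", // default from the table
    encryptionKey,
    mockMode: env.MOCK_MODE === "true", // anything else counts as false
    mockPipelineDelayMs: delay,
  };
}
```

In a Node.js process this would typically be called once as `loadEnv(process.env)` during startup.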

📜 Available Scripts

🗺️ Roadmap

📝 Changelog

v0.2.2 - Stability & UX Hardening

Memory Stabilization: Implemented batch-processing in the ingestion pipeline to prevent Heap Out of Memory (OOM) errors during large document processing.
Improved UX: Added a "Retry" button and detailed error messaging (e.g., "Rate limited") for document uploads in the creation wizard.
Rate Limit Optimization: Increased document upload rate limits from 10 to 50 per minute to better support large batch uploads.
Mermaid Diagram Reliability:
- Updated system prompts with strict syntax rules for modern Mermaid (flowchart TD, quoted labels).
- Enhanced the Mermaid component to capture and display actual parser errors instead of hanging on failures.
Infrastructure & Testing:
- Expanded test suite with comprehensive integration tests for API routes and pipeline logic.
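The batch-processing change mentioned under Memory Stabilization follows a common pattern: handle a bounded slice of work at a time so that only one batch's intermediate data is alive at once. The helper below is a generic sketch of that pattern; `processInBatches` and its parameters are hypothetical names, not Ragify's actual pipeline code.

```typescript
// Hypothetical helper illustrating sequential batch processing.
async function processInBatches<T>(
  items: readonly T[],
  batchSize: number,
  handleBatch: (batch: T[]) => Promise<void>,
): Promise<number> {
  if (!Number.isInteger(batchSize) || batchSize < 1) {
    throw new Error("batchSize must be a positive integer");
  }
  let processed = 0;
  for (let start = 0; start < items.length; start += batchSize) {
    const batch = items.slice(start, start + batchSize);
    // Await each batch before starting the next so that only one batch's
    // embeddings or parsed text is held in memory at a time.
    await handleBatch(batch);
    processed += batch.length;
  }
  return processed;
}
```

In an ingestion pipeline, `handleBatch` would typically embed the chunks in the batch and write them to the database before returning, rather than accumulating results in an array.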

v0.2.1 - Recent Updates

Ingestion reliability: Upload ingestion no longer detaches a background promise that can be killed in serverless runtimes; temp files are always cleaned up.
No-context handling: Removed the “stuff the first 3 chunks” fallback; unrelated questions now return a neutral “no relevant information found” response.
Upload security:
- Filename sanitization + upload-dir containment to prevent path traversal
- Strict allowlist validation (415) for: .txt / .md / .pdf / .docx / .csv
- 10MB max upload size (413) + basic per-user upload rate limit (429)
Performance:
- Vector retrieval yields to the event loop during similarity scoring and caps embeddings per query (with pgvector migration guidance in FUTURE_PLAN.md)
- Keyword fallback upgraded to SQLite FTS5 (with raw SQL migration + safe fallback if not applied)
Pipeline quality & resilience:
- Semantic chunking (paragraph → line → sentence → word) with overlap preservation
- Extraction failures mark the document as FAILED with a human-readable errorMessage
Tests & maintenance:
- Added unit tests for vector utilities + chunking
- Standardized module imports and added a minimal Vitest runner (npm test)
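The upload-security items in this release (filename sanitization, upload-dir containment, extension allowlist, size cap) can be sketched as one validation function. The allowlist, the 10MB limit, and the 413/415 statuses come from the notes above; the function names, the character set kept by the sanitizer, and the 400 status for unusable filenames are assumptions for this example.

```typescript
import * as path from "node:path";

// Hypothetical validation helper; not Ragify's actual upload route code.
const ALLOWED_EXTENSIONS = new Set([".txt", ".md", ".pdf", ".docx", ".csv"]);
const MAX_UPLOAD_BYTES = 10 * 1024 * 1024; // 10MB

type UploadCheck =
  | { ok: true; safePath: string }
  | { ok: false; status: 400 | 413 | 415; reason: string };

function sanitizeFilename(name: string): string {
  // Keep only the last path segment (handles both "/" and "\" separators),
  // replace unusual characters, and strip leading dots.
  const base = name.split(/[\\/]/).pop() ?? "";
  return base.replace(/[^A-Za-z0-9._-]/g, "_").replace(/^\.+/, "");
}

function checkUpload(uploadDir: string, originalName: string, sizeBytes: number): UploadCheck {
  const safeName = sanitizeFilename(originalName);
  if (!safeName) return { ok: false, status: 400, reason: "Empty filename" };

  const ext = path.extname(safeName).toLowerCase();
  if (!ALLOWED_EXTENSIONS.has(ext)) {
    return { ok: false, status: 415, reason: "Unsupported file type" };
  }
  if (sizeBytes > MAX_UPLOAD_BYTES) {
    return { ok: false, status: 413, reason: "File exceeds 10MB" };
  }

  // Containment check: the resolved target must stay inside the upload directory.
  const root = path.resolve(uploadDir);
  const target = path.resolve(root, safeName);
  if (!target.startsWith(root + path.sep)) {
    return { ok: false, status: 400, reason: "Path escapes upload directory" };
  }
  return { ok: true, safePath: target };
}
```

The containment check is redundant once the name is sanitized, but keeping both means a future change to the sanitizer cannot silently reopen a path-traversal hole.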

v0.2.0 - List of Updates

Enhanced Chat Experience: Introduced a collapsible sidebar for managing conversation history, allowing users to seamlessly resume previous chats or start new ones.
Bot Lifecycle Management: Added Edit and Delete functionality directly from the dashboard bot cards.
Improved RAG Creation Wizard: Integrated dropdowns for model selection based on providers, and added an interactive emoji picker for bot avatars.
Local Model Support: Added support for local Ollama models (including Qwen3 and Deepseek) alongside cloud providers.
Secure File Uploads: Users can now upload various documents (.txt, .md, .csv, .pdf, .docx, .pptx) directly into an active chat context.

🐛 Troubleshooting

Prisma: "Cannot find module '@prisma/client'"

Run `npx prisma generate` to regenerate the Prisma client after any schema change.

Hydration mismatch errors

The Modal component uses a mounted state pattern to avoid server/client mismatch. If you see hydration errors after adding a new modal, ensure it returns null on the first render pass.

"Invalid payload" when saving provider keys

The validator trims whitespace automatically. If the error persists, check that the key format matches the provider's expected pattern (e.g., sk-... for OpenAI).

ENCRYPTION_KEY errors on startup

The key must be exactly 64 hex characters. Generate one with:

```
node -e "console.log(require('crypto').randomBytes(32).toString('hex'))"
```

📄 License

This project is licensed under the MIT License.

Built with ❤️ by bhoomik-codes

Command	Description
`npm run dev`	Start Next.js development server on port 3000
`npm run build`	Create an optimized production build
`npm start`	Run the production build
`npm run lint`	Run ESLint checks
`npm test`	Run unit tests (Vitest)
`npx prisma studio`	Open Prisma's visual database browser
`npx prisma db push`	Push schema changes to the database
`npx tsc --noEmit`	Type-check the project without emitting files
