AI-Powered Study Tutor Application with Retrieval-Augmented Generation (RAG) capabilities for document analysis and conversational learning.
# 1. Clone the repository
git clone <repo-url>
cd AI-Study-Tutor
# 2. Frontend (cognify-frontend)
cd cognify-frontend && npm install && npm run dev
# Runs on http://localhost:5173
# 3. Node.js Backend (node-backend)
cd ../node-backend && npm install
cp .env.example .env # Configure your .env
npm run dev
# Runs on http://localhost:4000
# 4. FastAPI Backend (ai-engine)
cd ../ai-engine
# Create virtual environment
python -m venv venv
source venv/bin/activate # macOS/Linux
# Windows: venv\Scripts\activate
# Install and run
pip install -r requirements.txt
cp .env.example .env # Configure your .env
uvicorn app.main:app --reload
# Runs on http://localhost:8000
Why MongoDB Atlas?
Setup Steps:
# Example connection string:
mongodb+srv://username:password@cluster0.xxxxx.mongodb.net/rag_app
# Add to node-backend/.env:
MONGO_URI=mongodb+srv://username:password@cluster0.xxxxx.mongodb.net/rag_app
node-backend/.env
MONGO_URI=mongodb+srv://...
JWT_SECRET=your-secret
GOOGLE_CLIENT_ID=xxx
EMAIL_USER=email@zoho.in
EMAIL_PASSWORD=xxx
ai-engine/.env
LLM_PROVIDER=gemini
GOOGLE_API_KEY=your-key
JWT_SECRET=your-secret
See Section 5 for complete environment variable reference.
Frontend (cognify-frontend)

| Technology | Purpose |
|---|---|
| React 19 | Modern UI framework with latest features |
| TypeScript | Type-safe JavaScript for scalable development |
| Vite | Lightning-fast build tool and dev server |
| TailwindCSS 4 | Utility-first CSS framework |
| Redux Toolkit | State management with redux-persist |
| React Router v7 | Client-side routing |
| Framer Motion | Smooth animations and transitions |
| Mermaid | Diagram rendering for concept maps |
| react-markdown | Markdown rendering with remark-gfm |
| Ant Design | UI component library |
Why these choices?
Node.js Backend (node-backend)

| Technology | Purpose |
|---|---|
| Express.js | Fast, minimalist web framework |
| TypeScript | Type safety for backend code |
| MongoDB + Mongoose | NoSQL database for flexible schemas |
| JWT (jsonwebtoken) | Stateless authentication |
| AWS S3 SDK | Cloud file storage integration |
| Multer | File upload handling |
| bcryptjs | Password hashing |
| Nodemailer | Email services |
| Google Auth Library | OAuth2 social login |
Why these choices?
FastAPI Backend (ai-engine)

| Technology | Purpose |
|---|---|
| FastAPI | High-performance async Python framework |
| Uvicorn | ASGI server for async processing |
| ChromaDB | Vector database for embeddings |
| Sentence Transformers | Text embedding generation |
| PyMuPDF (fitz) | PDF text extraction |
| OpenAI/DeepInfra/Gemini | LLM provider integrations |
| BeautifulSoup4 | Web scraping for URLs |
| boto3 | AWS S3 integration |
| Pydantic Settings | Configuration management |
Why these choices?
Embeddings are generated with the all-MiniLM-L6-v2 model.
The application supports dual storage providers with a unified abstraction layer:
Configuration:
STORAGE_PROVIDER=local
LOCAL_STORAGE_PATH=../application-data
How it works:
Files are stored in the application-data/ folder at the project root and served at {BACKEND_URL}/storage/{key}.

File organization:
application-data/
└── {userEmail}/
    └── workspaces/
        └── {workspaceId}/
            ├── pdfs/
            │   └── {timestamp}-{filename}.pdf
            └── texts/
                └── {timestamp}-{title}.txt
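For illustration, the layout above can be expressed as a key-builder helper; `build_pdf_key` is a hypothetical name for this sketch, not a function from the codebase:

```python
import time
from pathlib import PurePosixPath


def build_pdf_key(user_email: str, workspace_id: str, filename: str) -> str:
    """Build a storage key following the directory layout above."""
    timestamp = int(time.time() * 1000)  # millisecond timestamp, like Date.now()
    return str(PurePosixPath(
        user_email, "workspaces", workspace_id, "pdfs", f"{timestamp}-{filename}"
    ))


key = build_pdf_key("alice@example.com", "ws123", "notes.pdf")
# e.g. "alice@example.com/workspaces/ws123/pdfs/1700000000000-notes.pdf"
```

The same key works for both providers: under local storage it becomes a filesystem path, under S3 an object key.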
Configuration:
STORAGE_PROVIDER=s3
AWS_S3_BUCKET=your-bucket-name
AWS_REGION=us-east-1
AWS_ACCESS_KEY_ID=your-access-key
AWS_SECRET_ACCESS_KEY=your-secret-key
How it works:
S3 URL format:
https://{bucket}.s3.{region}.amazonaws.com/{key}
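Composing that URL is a one-liner; the helper name here is illustrative:

```python
def s3_object_url(bucket: str, region: str, key: str) -> str:
    # Virtual-hosted-style URL, matching the format shown above
    return f"https://{bucket}.s3.{region}.amazonaws.com/{key}"


url = s3_object_url("my-bucket", "us-east-1", "ws1/pdfs/1700000000000-notes.pdf")
# → "https://my-bucket.s3.us-east-1.amazonaws.com/ws1/pdfs/1700000000000-notes.pdf"
```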
Simply change the STORAGE_PROVIDER environment variable:
| Provider | Value | Use Case |
|---|---|---|
| Local | `local` | Development, self-hosted deployments |
| AWS S3 | `s3` | Production, scalable deployments |
Both backends (Node.js and FastAPI) use the same abstraction, ensuring consistent file access across the stack.
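A minimal sketch of such an abstraction, assuming a `save`/`read` interface (the real code's method names may differ); only the local provider is implemented here, with the S3 branch left as a stub:

```python
from pathlib import Path
from typing import Protocol


class StorageProvider(Protocol):
    def save(self, key: str, data: bytes) -> None: ...
    def read(self, key: str) -> bytes: ...


class LocalStorage:
    """Stores files under a root directory (LOCAL_STORAGE_PATH)."""

    def __init__(self, root: str) -> None:
        self.root = Path(root)

    def save(self, key: str, data: bytes) -> None:
        path = self.root / key
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_bytes(data)

    def read(self, key: str) -> bytes:
        return (self.root / key).read_bytes()


def make_storage(provider: str, root: str = "../application-data") -> StorageProvider:
    # An S3-backed class (boto3 on the Python side, AWS SDK on Node.js)
    # would be selected here when STORAGE_PROVIDER=s3
    if provider == "local":
        return LocalStorage(root)
    raise NotImplementedError(f"{provider!r} not implemented in this sketch")
```

Callers only see `StorageProvider`, so switching `STORAGE_PROVIDER` never changes application code.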
Switching LLM providers requires updating only two environment variables in ai-engine/.env:
# Just change these two lines to switch providers:
LLM_PROVIDER=gemini # Options: gemini, openai, deepinfra, ollama, bedrock
GOOGLE_API_KEY=your-api-key # Add the corresponding API key
Available Providers:
| Provider | LLM_PROVIDER Value | API Key Variable | Example Model |
|---|---|---|---|
| Google Gemini | `gemini` | GOOGLE_API_KEY | gemini-2.0-flash |
| OpenAI | `openai` | OPENAI_API_KEY | gpt-4-turbo |
| DeepInfra | `deepinfra` | DEEPINFRA_API_KEY | Various OSS models |
| Ollama (Local) | `ollama` | None (uses OLLAMA_HOST) | llama2, mistral |
| AWS Bedrock | `bedrock` | AWS credentials | Claude, Titan |
Example Configurations:
# Use Google Gemini (default)
LLM_PROVIDER=gemini
GOOGLE_API_KEY=AIzaSy...
# Use OpenAI
LLM_PROVIDER=openai
OPENAI_API_KEY=sk-...
# Use local Ollama (no API key needed)
LLM_PROVIDER=ollama
OLLAMA_HOST=http://localhost:11434
After editing .env, restart the backend for the change to take effect.
Configuration:
CHROMA_PERSIST_DIR=./chroma_db
EMBEDDING_MODEL=all-MiniLM-L6-v2
CHUNK_SIZE=500
CHUNK_OVERLAP=100
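A paragraph-aware chunker honoring these settings might look like this (an illustration of the strategy, not the actual ai-engine implementation):

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 100) -> list[str]:
    """Split on blank lines, then greedily pack paragraphs into chunks of at
    most chunk_size characters, carrying the tail of each chunk forward as
    overlap so context is not lost at chunk boundaries."""
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks: list[str] = []
    current = ""
    for para in paragraphs:
        if current and len(current) + len(para) + 1 > chunk_size:
            chunks.append(current)
            current = current[-overlap:]  # overlap carried into the next chunk
        current = (current + "\n" + para) if current else para
    if current:
        chunks.append(current)
    return chunks
```

Note that a single paragraph longer than `chunk_size` is kept whole in this sketch; a production chunker would split it further.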
Key Operations:
| Operation | Description |
|---|---|
| add_documents | Store document chunks with embeddings |
| query | Semantic similarity search with doc_id filtering |
| delete_document | Remove all chunks for a document |
| embed_text | Generate embedding for single text |
Semantic Chunking Strategy:

Text is split on paragraph boundaries (\n\n), and chunks respect the configured CHUNK_SIZE and CHUNK_OVERLAP.

Node.js Backend API (localhost:4000)

| Endpoint | Method | Purpose |
|---|---|---|
| /api/auth/register | POST | User registration |
| /api/auth/login | POST | JWT authentication |
| /api/auth/google | POST | Google OAuth login |
| /api/workspaces | CRUD | Workspace management |
| /api/workspaces/:id/sources | CRUD | Source (document) management |
| /api/chat/sessions | CRUD | Chat session storage |
| /health | GET | Health check |
FastAPI Backend API (localhost:8000)

| Endpoint | Method | Purpose |
|---|---|---|
| /api/process | POST | Document processing + embedding |
| /api/generate | POST | Deep content generation (SSE) |
| /api/chat | POST | RAG query (non-streaming) |
| /ws/rag_chat | WebSocket | Real-time RAG chat |
| /api/scrape | POST | URL content scraping |
| /docs | GET | Swagger documentation |
| /health | GET | Health check |
node-backend/.env

# Server
NODE_ENV=development
PORT=4000
CORS_ORIGINS=http://localhost:5173
# Database
MONGO_URI=mongodb://localhost:27017/rag_app
# JWT (shared with FastAPI)
JWT_SECRET=your-super-secret-key
JWT_EXPIRES_IN=7d
# Storage
STORAGE_PROVIDER=local
LOCAL_STORAGE_PATH=../application-data
AWS_S3_BUCKET=your-bucket
AWS_REGION=us-east-1
AWS_ACCESS_KEY_ID=xxx
AWS_SECRET_ACCESS_KEY=xxx
# OAuth
GOOGLE_CLIENT_ID=xxx.apps.googleusercontent.com
# Email
EMAIL_USER=your-email@zoho.in
EMAIL_PASSWORD=your-zoho-app-password

ai-engine/.env

# LLM
LLM_PROVIDER=gemini
GOOGLE_API_KEY=your-api-key
DEEPINFRA_API_KEY=xxx
OPENAI_API_KEY=xxx
# Vector Store
CHROMA_PERSIST_DIR=./chroma_db
EMBEDDING_MODEL=all-MiniLM-L6-v2
# Storage (same as Node.js)
STORAGE_PROVIDER=local
LOCAL_STORAGE_PATH=../application-data
# JWT (shared)
JWT_SECRET=your-super-secret-key
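Because JWT_SECRET is shared, a token signed by the Node.js backend verifies in the FastAPI backend. A minimal HS256 illustration using only the standard library (the real services use proper JWT libraries; this only shows why the shared secret works):

```python
import base64
import hashlib
import hmac
import json


def b64url(data: bytes) -> str:
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode()


def sign_jwt(payload: dict, secret: str) -> str:
    """Sign a payload with HS256, producing header.payload.signature."""
    header = b64url(json.dumps({"alg": "HS256", "typ": "JWT"}).encode())
    body = b64url(json.dumps(payload).encode())
    sig = hmac.new(secret.encode(), f"{header}.{body}".encode(),
                   hashlib.sha256).digest()
    return f"{header}.{body}.{b64url(sig)}"


def verify_jwt(token: str, secret: str) -> dict:
    """Recompute the signature with the shared secret and compare."""
    header, body, sig = token.split(".")
    expected = hmac.new(secret.encode(), f"{header}.{body}".encode(),
                        hashlib.sha256).digest()
    if not hmac.compare_digest(b64url(expected), sig):
        raise ValueError("bad signature")
    pad = "=" * (-len(body) % 4)
    return json.loads(base64.urlsafe_b64decode(body + pad))
```

Any service holding the same `secret` can verify a token minted by the other, which is exactly how the two backends trust each other's sessions.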
The RAG system uses a multi-stage retrieval pipeline to improve answer accuracy:
| Technique | Description | Why It Helps |
|---|---|---|
| HyDE | LLM generates hypothetical answer, used for vector search | Better semantic alignment with document chunks |
| Hybrid Search | Combines vector similarity + BM25 keyword search | Catches both semantic and exact matches |
| RRF Fusion | Reciprocal Rank Fusion combines multiple result lists | Leverages strengths of different search methods |
| Cross-Encoder Re-Ranking | Neural model scores query-document pairs | More accurate relevance scoring than bi-encoder |
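Reciprocal Rank Fusion itself is small: each result list contributes 1/(k + rank) per document, so documents ranked well by several retrievers rise to the top. A sketch with the conventional k = 60:

```python
def rrf_fuse(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse multiple ranked lists of doc IDs via Reciprocal Rank Fusion."""
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)


vector_hits = ["d1", "d2", "d3"]  # from vector similarity search
bm25_hits = ["d3", "d1", "d4"]    # from BM25 keyword search
fused = rrf_fuse([vector_hits, bm25_hits])
# d1 (ranks 1 and 2) and d3 (ranks 3 and 1) outrank the single-list hits
```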
The re-ranking stage uses the cross-encoder/ms-marco-MiniLM-L-6-v2 model.
# Graceful LLM failure handling
try:
    response = await self.llm.generate(messages)
    return response.content
except Exception as e:
    print(f"[RAGAgent] Error: {e}")
    return "I encountered an error. Please try again."
The DeepContentGenerator includes robust JSON parsing, and storage failures are likewise handled gracefully on the Node.js side:
// Node.js: Graceful storage errors
async getFileBuffer(key: string): Promise<Buffer> {
  try {
    return await readFile(this.getFullPath(key));
  } catch (error) {
    logger.error(`Failed to read file: ${key}`, error);
    throw new NotFoundError(`File not found: ${key}`);
  }
}
The application uses Nodemailer for sending OTP verification emails during user registration.
Add these to your node-backend/.env file:
# Email Configuration (Zoho Mail - Default)
EMAIL_USER=your-email@zoho.in
EMAIL_PASSWORD=your-zoho-app-password
Default Setup (Zoho Mail India):
// services/emailService.ts
const transporter = nodemailer.createTransport({
  host: "smtp.zoho.in", // Zoho India SMTP server
  port: 465,            // SSL port
  secure: true,         // Implicit TLS (SSL) on port 465
  auth: {
    user: config.emailUser,     // EMAIL_USER from .env
    pass: config.emailPassword, // EMAIL_PASSWORD from .env
  },
});
| Setting | Value |
|---|---|
| Host | smtp.zoho.in |
| Port | 465 (SSL) |
| Secure | true |
| Authentication | Email + App Password |
To switch providers, modify emailService.ts:
Gmail:
const transporter = nodemailer.createTransport({
  host: "smtp.gmail.com",
  port: 587,
  secure: false,
  auth: {
    user: config.emailUser,
    pass: config.emailPassword, // Use App Password (not regular password)
  },
});
SendGrid (Recommended for Production):
const transporter = nodemailer.createTransport({
  host: "smtp.sendgrid.net",
  port: 587,
  auth: {
    user: "apikey", // literal username "apikey", required by SendGrid SMTP
    pass: process.env.SENDGRID_API_KEY,
  },
});
| Feature | Implementation |
|---|---|
| OTP Generation | 6-digit random code via Math.random() |
| HTML Templates | Professional styled emails with gradient design |
| Plain Text Fallback | For email clients without HTML support |
| Custom Headers | X-Priority and X-Mailer for deliverability |
| Branding | "Cognify" branded email templates |
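In Python terms, a 6-digit OTP generator looks like this (the Node.js service uses Math.random(); `secrets` is the cryptographically stronger stdlib equivalent shown here as a sketch):

```python
import secrets


def generate_otp() -> str:
    """Return a zero-padded 6-digit one-time code."""
    return f"{secrets.randbelow(1_000_000):06d}"


otp = generate_otp()  # e.g. "042917"
```

Zero-padding matters: without `:06d`, codes below 100000 would render with fewer than six digits.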
Both EMAIL_USER and EMAIL_PASSWORD must be set in node-backend/.env for OTP delivery to work.

Troubleshooting:
| Issue | Solution |
|---|---|
| Authentication failed | Use App Password, not regular password |
| Connection timeout | Check firewall allows port 465/587 |
| Email not received | Check spam folder, verify sender domain |
| Rate limiting | Use production provider (SendGrid/SES) |