Architecture

This page describes the system topology for the Oversight beta release, including all major components, data flows, and external dependencies.

Architecture Diagram

System Topology

System Overview

Oversight is a full-stack Next.js application deployed on Vercel. The application uses the App Router for both the frontend pages and API routes, communicates with external LLM providers for analysis, and stores data in a PostgreSQL database hosted on Neon.

Core Components

Component	Technology	Purpose
Frontend	Next.js 16 App Router, React 19, Tailwind CSS 4	All user-facing pages (upload, dashboard, chat, monitor, trends, settings)
API Layer	Next.js API Routes (serverless)	18 API endpoints handling auth, uploads, chat, analysis, monitoring, settings
Core Logic	TypeScript modules in `lib/`	Analysis pipeline, live monitoring, chat reply generation, email alerts, rate limiting
Database	PostgreSQL via Neon	All persistent data (users, uploads, analyses, chat sessions, settings)
ORM	Prisma 7 with Neon adapter	Database access with connection pooling
Authentication	NextAuth.js v5 (beta)	Email/password auth, JWT sessions, bcrypt password hashing
File Storage	Vercel Blob	Uploaded JSON conversation files
Email	Resend	Analyst alert notifications

External LLM Providers

Provider	Models	Used For
Google Gemini	`gemini-2.5-flash`, `gemini-3.1-flash-lite` (fallback)	Hallucination, bias, and toxicity analysis in Gemini and Both modes
Groq	`openai/gpt-oss-120b`, Kimi K2 variants (fallback)	Analysis in Groq and Both modes, live monitoring, chatbot reply generation

Data Flow

File Upload Flow: User → Upload Page → POST /api/upload → Vercel Blob (file storage) → lib/run-analysis.ts → Gemini/Groq API → Analysis results → PostgreSQL
Chat Flow: Customer → Chat Page → POST /api/chat → Groq (bot reply) → lib/live-monitor.ts → Groq (monitoring) → PostgreSQL (message + monitoring data)
Alert Flow: Chat session completes → lib/run-analysis.ts (full analysis) → lib/send-alert-email.ts → Resend API → Analyst inbox

Database Schema

The database consists of 9 models:

Model	Purpose	Key Relations
User	Analyst accounts (email, password hash, name)	Has many: Uploads, Feedbacks, GroundTruths; Has one: UserPreferences
UserPreferences	Per-user settings (default mode, alert email, bias threshold, T&C acceptance)	Belongs to: User
Upload	A conversation file upload or chat-sourced analysis	Belongs to: User (optional), GroundTruth (optional); Has many: Analyses; Has one: ChatSession
Analysis	A single analysis result (e.g., hallucination-gemini)	Belongs to: Upload
ChatSession	A live chatbot conversation session	Has many: ChatMessages; Has one: Upload
ChatMessage	A single message in a chat session (with optional monitoring data)	Belongs to: ChatSession
GroundTruth	A reference document for factual verification	Belongs to: User (optional); Has many: Uploads
Feedback	In-app user feedback (bug/feature/general)	Belongs to: User
RateLimit	Per-user/IP rate limiting counters	Unique on: (identifier, type)

Architecture Rationale

The system topology is unchanged since the alpha release. The same components — Next.js, Neon PostgreSQL, Vercel Blob, Google Gemini, Groq Llama, Resend, and NextAuth — remain in the same architectural positions. No components were added, removed, or re-positioned, and no new external services were introduced.

The alpha release served as a validation milestone for the full architecture. Each component kept its position in beta because alpha testing produced concrete evidence that the current choice is correct — not merely adequate — and that the load, latency, and correctness characteristics of the system hold inside acceptable bounds. Specifically:

The dual-LLM analysis pipeline (Gemini + Groq with cross-checking) produced agreement between providers on the vast majority of conversations analyzed during alpha. The cases where providers disagreed were the cases most worth surfacing to an analyst, which validated the design intent of running both in "Both" mode rather than collapsing to a single provider for cost savings.
Vercel serverless functions completed every alpha analysis — including "Both" mode on the largest permitted input (5 MB JSON) running Gemini and Groq sequentially across all three categories — inside the 120-second function timeout with headroom. This ruled out the need to migrate analysis to a long-running worker or queue architecture for beta.
Neon connection pooling via the Prisma adapter absorbed concurrent uploads from multiple analysts during alpha testing without connection exhaustion or cold-start pool starvation, validating the pooled-adapter approach over a self-managed pool.
Live chat monitoring kept per-message Groq round-trip latency low enough that the customer-facing chatbot remained responsive with monitoring on the request path. This is important: if latency had been unacceptable, monitoring would have had to move off the request path into a background job, which would have delayed violation detection and broken the automatic-escalation flow. Alpha showed we can keep monitoring synchronous.
Resend + NextAuth + Vercel Blob each carried their expected traffic during alpha without operational issues or per-service limits being approached, confirming no need to self-host or replace any of the hosted dependencies for beta.

Alpha testing also helped us identify and document known issues. A formal bug severity audit classified all discovered bugs by severity (Critical, High, Medium, Low). The key finding was that zero Critical or High severity bugs remained open — all were either resolved or classified as Medium/Low with documented mitigations:

Bug ID	Severity	Description	Status
BUG-001	Medium	Simulated progress bar does not reflect actual backend progress	Documented; deferred to GA
BUG-002	High	No rate limiting on public chat API	Resolved — enforces 5/min, 40/day per IP
BUG-003	Medium	Session completion endpoint lacks ownership check	Mitigated by CUID non-enumerability
BUG-004	Low	Chat-originated uploads have null userId	Accepted trade-off for internal analyst scope
BUG-005	Low	Vercel Blob URLs are publicly accessible if known	Accepted; non-enumerable filenames

Taken together, the alpha evidence — latency inside the serverless budget, pool saturation not reached, synchronous monitoring viable, dual-provider disagreement concentrated on the cases worth surfacing, and zero Critical/High bugs remaining open — is what gave us confidence to ship the same topology into beta rather than re-architect. The next architectural decision points (queue-based analysis, provider fallback routing, self-hosted email) are deferred to GA, where the traffic profile will determine whether they are actually needed.

Architecture Diagram​

System Overview​

Core Components​

External LLM Providers​

Data Flow​

Database Schema​

Architecture Rationale​