PersonalCompleted

RAG API Support AI Chatbot

A conversational AI system that helps developers and support teams answer questions about APIs by retrieving relevant documentation and generating contextual, grounded responses with source citations. Users can ask natural language questions like "How do I authenticate?" or "Why am I getting a 401 error?" and receive accurate answers pulled directly from ingested API docs. Built with Django REST Framework, PostgreSQL with pgvector for vector similarity search, and Ollama for local LLM inference — no cloud API keys required.

Problem Statement

Developers frequently struggle navigating complex API documentation, jumping between docs, examples, and support tickets to find answers. There was a need for a conversational interface that centralizes API knowledge and provides grounded, cited responses.

My Approach

Built a RAG pipeline where API documentation is ingested, chunked semantically, and converted to embedding vectors stored in PostgreSQL with pgvector. User queries are embedded and matched via vector similarity search to retrieve the most relevant doc chunks, which are then compiled into a prompt for the LLM (Ollama) to generate grounded answers with source citations. The entire stack is containerized with Docker for easy deployment.

Key Outcomes

  • RAG-based question-answering over custom API documentation with source citations
  • Vector-based semantic search across document chunks using pgvector
  • Conversation tracking with message history for multi-turn interactions
  • Fully local LLM inference via Ollama — no cloud API dependency
  • Dockerized stack for simplified one-command deployment

Tech Stack

Python 3.11Django 5Django REST FrameworkPostgreSQL 16pgvectorOllamaDockerdocker-compose

Tags

PythonDjangoPostgreSQLpgvectorRAGOllamaDockerREST APIVector SearchLLM

Project Info

Status
Completed
Category
Personal
Created
1 year ago
Ended
Aug 2024

Links

Related Projects

Sales Data Agent - Enterprise AI with Snowflake Intelligence
In Development
Sony
S

Sales Data Agent - Enterprise AI with Snowflake Intelligence

Currently leading development of an enterprise conversational AI agent that enables business users to query Snowflake sales data in plain English — no SQL required. Built on Snowflake Cortex Intelligence with RAG, Text-to-SQL, and AI Observability.

SnowflakeCortex AICortex AnalystCortex Search+14
Back to All Projects