// AIINFRA 301 · Semester 3
Production RAG & LLMOps — Observability and Evaluation
Building, Evaluating, and Operating Retrieval-Augmented LLM Systems in Production
This course prepares learners to design, evaluate, and operate retrieval-augmented generation systems using modern vector databases, hybrid search, and reranking techniques. Students build RAG evaluation pipelines with RAGAS and DeepEval as automated quality gates, then instrument production systems with tracing, prompt versioning, and drift monitoring using industry-standard LLMOps tooling. Emphasis is placed on hands-on labs that mirror real workplace tasks, from indexing strategy selection through cost-aware, observable deployment. Learners exit ready to support or own the retrieval and evaluation layer of an enterprise AI application.
Outcomes
Course objectives
- Select and configure a vector database (Qdrant, Milvus, Weaviate, or pgvector) based on scale, filtering, and indexing tradeoffs
- Design chunking and hybrid retrieval pipelines combining dense embeddings, BM25, and cross-encoder reranking
- Build automated RAG evaluation suites with RAGAS and DeepEval to gate retrieval and generation quality in CI/CD
- Instrument LLM applications with tracing and observability platforms to diagnose latency, cost, and quality regressions
- Implement production monitoring, prompt versioning, and feedback loops to detect and respond to model and data drift
Student learning outcomes
- Select and configure a vector database (Qdrant, Milvus, Weaviate, or pgvector) based on scale, filtering, and indexing tradeoffs.
- Design chunking and hybrid retrieval pipelines combining dense embeddings, BM25, and cross-encoder reranking.
- Build automated RAG evaluation suites with RAGAS and DeepEval to gate retrieval and generation quality in CI/CD.
- Instrument LLM applications with tracing and observability platforms to diagnose latency, cost, and quality regressions.
- Implement production monitoring, prompt versioning, and feedback loops to detect and respond to model and data drift.