All coupons / IT & Software

Advanced RAG: Build & Deploy Production GenAI Apps

Course Description

Retrieval-Augmented Generation (RAG) is at the core of every serious AI application today. But basic RAG pipelines quickly hit their limits when documents are large, queries are complex, or your application needs to run reliably in production. In this course, you will build RAGWire — a production-grade RAG toolkit built on LangChain, Qdrant, and LangGraph — from the ground up. You will start with a simple hybrid search pipeline and progressively add advanced retrieval, metadata filtering, agentic RAG, multi-agent frameworks, a full chat UI, and multi-cloud deployment. By the end of this course you will know how to: Build a hybrid RAG pipeline with BM25 sparse + dense retrieval and Reciprocal Rank Fusion (RRF) Configure RAGWire with OpenAI GPT, Groq, Google Gemini, Ollama, and HuggingFace embeddings Implement LLM-driven auto metadata filtering over complex, nested document structures Build agentic RAG pipelines with LangChain agent tools, memory, and reasoning Build a self-correcting RAG agent that grades its own retrieval and rewrites queries when quality is low Build supervisor multi-agent systems that route queries to specialist agents using LangGraph Build multi-agent document analysts with CrewAI, Microsoft AutoGen, and Microsoft Agent Framework Build a production Chainlit chat UI with authentication, chat history, and document upload Build a FastAPI backend with OpenAI-compatible /v1/chat/completions endpoints and SSE streaming Deploy RAG agents to Render, Railway, AWS ECS Fargate, GCP Cloud Run, and Azure Secure production APIs with API keys and protect credentials with Docker .dockerignore This is a hands-on, code-first course. Every section produces working, runnable code that you can adapt to your own documents and use cases.