Open-source RAG interview guide covers designs and common failures

An open-source GitHub repository collects RAG interview questions and answers based on production experience. It covers more than 200 questions across 12 RAG designs, including naive RAG, agentic RAG, graph RAG, self-checking RAG, speculative RAG, and multimodal RAG. The questions are grouped by difficulty: 13 basic, 58 intermediate, and 129 advanced.

It also explains six common production failure modes: wrong answers even when the right context is retrieved, failed retrieval, embedding mismatch, stale indexes, context window overflow, and rerankers pushing the right answer down. The repository includes concept notes on chunking, embeddings, vector databases, reranking, evaluation metrics, and prompt injection. It also has a one-table cheatsheet comparing all 12 RAG types, plus study paths for one-week preparation, phone screens, and system design interviews.

Planned additions include labs, an interview simulator, and evaluation tools.

Key points

  • The repository includes more than 200 RAG questions and answers.
  • It compares 12 RAG designs in one cheatsheet.
  • It separates questions into basic, intermediate, and advanced levels.
  • It covers production failures such as stale indexes and context window overflow.
  • Labs, an interview simulator, and evaluation tools are planned.

Quick term guide

open-source
Software whose code is shared publicly so others can inspect, use, or change it.
GitHub repository
An online project folder on GitHub where software code and related files are stored.
context window
The maximum amount of text an AI model can take in and use at one time, including the question and any retrieved documents.
vector databases
Databases that store text as lists of numbers so related text can be found quickly by meaning, not just by exact words. They are commonly used to give AI tools a searchable memory.
prompt injection
A trick where hidden instructions in text make an AI do something the user did not ask for.
system design
The plan for how different parts of a website or app work together to handle users and data.
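
To make the "vector databases" entry concrete, here is a minimal sketch of searching by meaning. It is not taken from the repository. The three-number vectors are hand-made stand-ins for real embeddings, and the names `documents`, `cosine_similarity`, and `search` are hypothetical. A real system would get its vectors from an embedding model and would likely use an indexed vector store rather than a plain dictionary.

```python
import math

def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Assumption: toy 3-dimensional "embeddings" written by hand for illustration.
documents = {
    "How to reset a password": [0.9, 0.1, 0.0],
    "Steps to recover account access": [0.8, 0.2, 0.1],
    "Best pizza toppings": [0.0, 0.1, 0.9],
}

def search(query_vector, top_k=2):
    """Rank every stored text by similarity to the query and keep the top_k."""
    scored = [
        (cosine_similarity(query_vector, vector), text)
        for text, vector in documents.items()
    ]
    scored.sort(reverse=True)  # highest similarity first
    return [text for _, text in scored[:top_k]]

# Assumption: a pretend embedding for a query such as "I forgot my login".
query = [0.85, 0.15, 0.05]
results = search(query)
print(results)
```

The query shares no exact words with the two account-related texts, yet both rank above the pizza text because their vectors point in a similar direction. That is the sense in which a vector database finds text "by meaning".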