Play 01
Enterprise RAG Q&A
Medium✅ Ready
Production RAG with hybrid search, semantic reranking, and pre-tuned guardrails.
Build a production-grade Retrieval-Augmented Generation system. AI Search indexes your documents, GPT-4o generates grounded answers with citations, and Container Apps hosts the API. Pre-tuned with temp=0.1, hybrid 60/40 search, top-k=5, and semantic reranker for optimal quality.
Architecture Pattern
RAG: hybrid search, chunking, semantic reranking
Azure Services
AI SearchAzure OpenAI (gpt-4o)Container AppsBlob Storage
DevKit (.github Agentic OS)
- agent.md — citation rules, abstention, few-shot examples
- instructions.md — system prompt + guardrails
- copilot-instructions.md — project context
- mcp/index.js — search, indexer, evaluator, config loader plugins
- .vscode/mcp.json — agent tooling
TuneKit (AI Config)
- config/openai.json — temp=0.1, seed=42, JSON schema
- config/search.json — hybrid 60/40, top-k=5, threshold=0.78
- config/chunking.json — 512 tokens, semantic, 10% overlap
- config/guardrails.json — safety, PII redaction
- evaluation/eval.py — Faithfulness >0.90, Relevance >0.85, Groundedness >0.95
Tuning Parameters
temperature (0.1→0.3)top-k (5→10)chunk_size_tokens (512→256)severity_threshold (2→1)
Estimated Cost
Dev/Test
$150–300/mo
Production
$2K–8K/mo