Play 01

Enterprise RAG Q&A

Medium✅ Ready

Production RAG with hybrid search, semantic reranking, and pre-tuned guardrails.

Build a production-grade Retrieval-Augmented Generation system. AI Search indexes your documents, GPT-4o generates grounded answers with citations, and Container Apps hosts the API. Pre-tuned with temp=0.1, hybrid 60/40 search, top-k=5, and semantic reranker for optimal quality.

Architecture Pattern

RAG: hybrid search, chunking, semantic reranking

Azure Services

AI SearchAzure OpenAI (gpt-4o)Container AppsBlob Storage

DevKit (.github Agentic OS)

agent.md — root orchestrator with builder→reviewer→tuner handoffs
3 agents — RAG Builder (gpt-4o), Reviewer (gpt-4o-mini), Tuner (gpt-4o-mini)
3 skills — deploy (106 lines), evaluate (153 lines), tune (167 lines)
4 prompts — /deploy, /test, /review, /evaluate with agent routing
.vscode/mcp.json — FrootAI MCP with Azure key inputs + envFile

TuneKit (AI Config)

config/openai.json — temp=0.1, seed=42, JSON schema
config/search.json — hybrid 60/40, top-k=5, threshold=0.78
config/chunking.json — 512 tokens, semantic, 10% overlap
config/guardrails.json — safety, PII redaction
evaluation/eval.py — Faithfulness >0.90, Relevance >0.85, Groundedness >0.95

Tuning Parameters

temperature (0.1→0.3)top-k (5→10)chunk_size_tokens (512→256)severity_threshold (2→1)

Estimated Cost

Dev/Test

$150–300/mo

Production

$2K–8K/mo

User Guide Open in VS Code View on GitHub Setup Guide Configurator Ask Agent FAI Back to FrootAI