Learn how to adapt models to domain data and ground outputs with retrieval for factual, high-precision generation.
# Pseudo setup for PEFT config lora_r = 16 lora_alpha = 32 dropout = 0.05 batch_size = 4 grad_accum = 8
| Component | Decision | Impact |
|---|---|---|
| Chunk Size | 500-900 tokens | Precision versus context coverage |
| Retriever Top-K | 4-8 docs | Cost and hallucination balance |
| Reranker | Cross-encoder | Higher relevance, extra latency |
Build a Knowledge Copilot for one domain (for example: HR policy, product docs, SOP manuals, or legal templates). The assistant must answer with citations and decline when confidence is low.
| Category | Expectation | Weight |
|---|---|---|
| Groundedness | Answers map to source content without made-up facts | 35% |
| Style consistency | Response format stays stable across prompts | 20% |
| Evaluation rigor | You track metrics and failure patterns clearly | 25% |
| Product readiness | Usable UI/API with basic error handling | 20% |