S3: Agentic RAG
Claude Sonnet 4.6, accessed via OpenRouter, with autonomous tool use, self-reflection, and critic scoring.
Pipeline: 1. Retrieve → 2. Reason → 3. Translate → 4. Reflect → 5. Critic
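The five stages above can be read as a simple sequential loop over shared state. The following is a minimal sketch under stated assumptions, not the page's actual implementation: `AgentState`, `run_pipeline`, the stage functions, the `llm` callable (prompt in, text out), and the `score_fn` callable are all hypothetical names introduced for illustration, and retrieval is reduced to a substring lookup in an in-memory glossary.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical interfaces: the real system calls a hosted model; here any
# callable that maps a prompt string to a completion string will do.
LLM = Callable[[str], str]
ScoreFn = Callable[[str, str], float]  # (source, draft) -> score in [0, 1]


@dataclass
class AgentState:
    source: str
    context: List[str] = field(default_factory=list)  # retrieved glossary hits
    plan: str = ""
    draft: str = ""
    score: float = 0.0
    trace: List[str] = field(default_factory=list)     # one entry per stage


def retrieve(state: AgentState, glossary: Dict[str, str]) -> None:
    # Assumption: retrieval is a case-insensitive substring match on terms.
    lowered = state.source.lower()
    state.context = [
        f"{en} = {ja}" for en, ja in glossary.items() if en.lower() in lowered
    ]
    state.trace.append(f"retrieve: {len(state.context)} glossary hit(s)")


def reason(state: AgentState) -> None:
    # Turn the retrieved context into an instruction for the translator.
    if state.context:
        state.plan = "Use these terms: " + "; ".join(state.context)
    else:
        state.plan = "No glossary constraints."
    state.trace.append("reason: plan built")


def translate(state: AgentState, llm: LLM) -> None:
    state.draft = llm(f"{state.plan}\nTranslate into Japanese:\n{state.source}")
    state.trace.append("translate: draft produced")


def reflect(state: AgentState, llm: LLM) -> None:
    # Self-reflection: the model reviews and possibly revises its own draft.
    state.draft = llm(f"Review this draft and return a corrected version:\n{state.draft}")
    state.trace.append("reflect: draft reviewed")


def critic(state: AgentState, score_fn: ScoreFn) -> None:
    state.score = score_fn(state.source, state.draft)
    state.trace.append(f"critic: score={state.score:.2f}")


def run_pipeline(source: str, glossary: Dict[str, str],
                 llm: LLM, score_fn: ScoreFn) -> AgentState:
    state = AgentState(source=source)
    retrieve(state, glossary)
    reason(state)
    translate(state, llm)
    reflect(state, llm)
    critic(state, score_fn)
    return state
```

Passing the model and the scorer in as callables keeps the orchestration testable with stubs; the `trace` list plays the role of a per-stage log.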
BLEU: 68.75
chrF++: 61.1
Coverage: 0.83
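The page does not say how Coverage is computed. One plausible reading, offered here only as an assumption, is glossary-term coverage: of the glossary terms that occur in the English source, the fraction whose required Japanese rendering appears in the output. The function name `glossary_coverage` and the glossary entries in the usage note are hypothetical.

```python
from typing import Dict


def glossary_coverage(source: str, output: str, glossary: Dict[str, str]) -> float:
    """Fraction of applicable glossary terms whose target rendering is in `output`.

    Assumption: a term is "applicable" if it occurs (case-insensitively) in the
    source, and "covered" if its target string occurs verbatim in the output.
    """
    lowered = source.lower()
    applicable = {en: ja for en, ja in glossary.items() if en.lower() in lowered}
    if not applicable:
        # Nothing to cover; treated as full coverage by convention in this sketch.
        return 1.0
    hits = sum(1 for ja in applicable.values() if ja in output)
    return hits / len(applicable)
```

For example, with a glossary mapping "password" to "パスワード" and "two-factor authentication" to "二要素認証", an output that contains only "パスワード" covers one of the two applicable terms. Verbatim matching is deliberately strict; a production critic would likely need to tolerate inflection and spelling variants.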
Source — English
Please update your password in the account settings page and enable two-factor authentication for better security.
Output — Japanese
Coverage (critic score)
Tool Calls (glossary + kb)
Latency (end to end)
Model: Claude Sonnet 4.6
Agent Trace