S3: Agentic RAG

Claude Sonnet 4.6 with autonomous tool use, self-reflection, and critic scoring via OpenRouter.

1Retrieve 2Reason 3Translate 4Reflect 5Critic
BLEU 68.75
ChrF++ 61.1
Coverage 0.83
Source — English
Output — Japanese
Coverage
critic score
Tool Calls
glossary + kb
Latency
end to end
Model
Sonnet
claude 4.6