ENJA

Terminology-aware English-to-Japanese translation, comparing four approaches: prompt-only, fine-tuned, RAG, and agentic.

S0 Baseline: Prompt-only Qwen 2.5-0.5B with no adaptation.
S1 Fine-Tuned: QLoRA adapters on Qwen 2.5-0.5B, trained on the translation task.
S2 RAG + Reranker: Multi-strata retrieval followed by cross-encoder reranking.
S3 Agentic RAG: Claude Sonnet 4.6 with ReAct, tool use, and self-audit.

Tech Stack
Base Model: Qwen 2.5-0.5B
Agentic Model: Claude Sonnet 4.6
Vector Store: AWS S3 Vectors
Embeddings: multilingual-e5-small
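
The retrieve-then-rerank flow named in S2 can be sketched roughly as follows. This is a minimal illustration and not the project's implementation. It assumes that "strata" means separate retrieval pools (here a hypothetical glossary and a hypothetical translation memory). A bag-of-words cosine stands in for multilingual-e5-small embeddings, a token-overlap score stands in for the cross-encoder, and an in-memory dict stands in for the vector store. All names and sample entries are made up for the example.

```python
import math
import re
from collections import Counter

def embed(text):
    """Stand-in for an embedding model: bag-of-words token counts."""
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

# Hypothetical strata: each pool holds (English source, Japanese target) pairs.
STRATA = {
    "glossary": [
        ("load balancer", "ロードバランサー"),
        ("access key", "アクセスキー"),
    ],
    "translation_memory": [
        ("Restart the load balancer.", "ロードバランサーを再起動します。"),
        ("Rotate the access key every 90 days.",
         "アクセスキーは90日ごとにローテーションします。"),
    ],
}

def retrieve(query, per_stratum=1):
    """First stage: take the top matches from every stratum independently."""
    q = embed(query)
    candidates = []
    for name, entries in STRATA.items():
        ranked = sorted(entries, key=lambda e: cosine(q, embed(e[0])), reverse=True)
        candidates += [(name, src, tgt) for src, tgt in ranked[:per_stratum]]
    return candidates

def rerank_score(query, source):
    """Stand-in for a cross-encoder: share of candidate tokens found in the query."""
    q_tokens = set(re.findall(r"\w+", query.lower()))
    s_tokens = set(re.findall(r"\w+", source.lower()))
    return len(q_tokens & s_tokens) / len(s_tokens) if s_tokens else 0.0

def retrieve_and_rerank(query, per_stratum=1, top_k=2):
    """Second stage: score the merged candidates jointly and keep the best."""
    candidates = retrieve(query, per_stratum)
    ranked = sorted(candidates, key=lambda c: rerank_score(query, c[1]), reverse=True)
    return ranked[:top_k]

if __name__ == "__main__":
    for stratum, src, tgt in retrieve_and_rerank("Configure the load balancer timeout"):
        print(stratum, "|", src, "|", tgt)
```

Retrieving per stratum before merging keeps a short glossary entry from being crowded out by longer translation-memory sentences. The reranker then orders the merged candidates against the query with a single scoring function.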