Publications

Mitigating Reward Hacking in RLHF via Advantage Sign Robustness. arXiv preprint, 2026.
AIRoA MoMa Dataset: A Large-Scale Hierarchical Dataset for Mobile Manipulation. arXiv preprint, 2025.
A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP. IJCNLP-AACL 2025, 2025.