Weihao XUAN
Tokyo · 東京 35.71°N / 139.76°E
Ph.D. Candidate · NLP · Vision · AI for Science

Weihao Xuan宣 偉豪

The University of Tokyo RIKEN AIP

I'm a Ph.D. candidate at Machine Learning and Statistical Data Analysis Lab (杉山・横矢・石田研究室), The University of Tokyo (東京大学), where I'm very fortunate to be advised by Prof. Naoto Yokoya. I'm also under the Junior Research Associate (JRA) program at RIKEN Center for Advanced Intelligence Project.

My research builds reliable multimodal AI systems: models that know what they know, and that generalize from the lab to the world. On the foundations side, I study the reliability of large language models, vision-language models, and increasingly, agentic systems: their calibration, hallucinations, reasoning under uncertainty, and multilingual behavior. On the applications side, I use these models to understand the physical world at scale. Most of my fieldwork is in earth observation, with active collaborations in AI for Science across medicine, biomedicine, and biology.

If you are interested in working with me or joining our lab, feel free to reach out via email.

01News

02Publications

* co-first author · † corresponding author
Filter
No publications match this filter.

Preprints

  1. Say Something Else: Rethinking Contextual Privacy as Information Sufficiency Yunze Xiao*, Wenkai Li*, Xiaoyuan Wu, Ningshan Ma, Yueqi Song, & Weihao Xuan† arXiv preprint arXiv:2604.06409.
  2. OpenEarth-Agent: From Tool Calling to Tool Creation for Open-Environment Earth Observation Sijie Zhao, Feng Liu, Xueliang Zhang, Hao Chen, Xinyu Gu, Zhe Jiang, Fenghua Ling, Ben Fei, Wenlong Zhang, Junjue Wang, Weihao Xuan, Pengfeng Xiao, Naoto Yokoya, & Lei Bai arXiv preprint arXiv:2603.22148.
  3. Experience-Driven Multi-Agent Systems Are Training-free Context-aware Earth Observers Pengyu Dai, Weihao Xuan, Junjue Wang, Hongruixuan Chen, Jian Song, Yafei Ou, & Naoto Yokoya arXiv preprint arXiv:2602.02559.
  4. Towards Valid Student Simulation with Large Language Models Zhihao Yuan*, Yunze Xiao*, Ming Li*, Weihao Xuan, Richard Jiarui Tong, Mona Diab, & Tom Mitchell† arXiv preprint arXiv:2601.05473.
  5. Toward Global Large Language Models in Medicine Rui Yang, Huitao Li, Weihao Xuan†, Heli Qi, Xin Li, Kunyu Yu, Yingjian Chen, Rongrong Wang, Jacques Behmoaras, Tianxi Cai, Bibhas Chakraborty, Qingyu Chen, Lionel Tim-Ee Cheng, Marie-Louise Damwanza, Chido Dzinotyiwei, Aosong Feng, Chuan Hong, Yusuke Iwasawa, Yuhe Ke, Linah Kitala, Taehoon Ko, Jisan Lee, Irene Li, Jonathan Chong Kai Liew, Hongfang Liu, Lian Leng Low, Edison Marrese-Taylor, Yutaka Matsuo, Isheanesu Misi, Yilin Ning, Jasmine Chiat Ling Ong, Marcus Eng Hock Ong, Enrico Petretto, Hossein Rouhizadeh, Abiram Sandralegar, Oren Schreier, Iain Bee Huat Tan, Patrick Tan, Daniel Shu Wei Ting, Junjue Wang, Chunhua Weng, Matthew Yu Heng Wong, Fang Wu, Yunze Xiao, Xuhai Xu, Qingcheng Zeng, Zhuo Zheng, Yifan Peng†, Douglas Teodoro†, & Nan Liu† arXiv preprint arXiv:2601.02186. (under review)
  6. TeamPath: Building MultiModal Pathology Experts with Reasoning AI Copilots Tianyu Liu*, Weihao Xuan*, Hao Wu, Peter Humphrey, Marcello DiStasio, Heli Qi, Rui Yang, Simeng Han, Tinglin Huang, Fang Wu, Nan Liu, Irene Li, Hua Xu, & Hongyu Zhao† arXiv preprint arXiv:2511.17652. (under review)
  7. Retrieval-Augmented Generation in Medicine: A Scoping Review of Technical Implementations, Clinical Applications, and Ethical Considerations Rui Yang*, Matthew Yu Heng Wong*, Huitao Li*, Xin Li, Wentao Zhu, Jingchi Liao, Kunyu Yu, Jonathan Chong Kai Liew, Weihao Xuan, Yingjian Chen, Yuhe Ke, Jasmine Chiat Ling Ong, Douglas Teodoro, Chuan Hong, Daniel Shu Wei Ting, & Nan Liu† arXiv preprint arXiv:2511.05901. (under review)
  8. HF Daily Top 3 · 2025.07.22 The Invisible Leash: Why RLVR May Not Escape Its Origin Fang Wu*, Weihao Xuan*, Ximing Lu, Zaid Harchaoui, & Yejin Choi† arXiv preprint arXiv:2507.14843 (also in ICML 2025 AI4MATH Workshop).
  9. HF Daily Top 2 · 2025.08.07 VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking Shunyu Liu, Minghao Liu, Huichi Zhou, Zhenyu Cui, Yang Zhou, Yuhao Zhou, Jialiang Gao, Heng Zhou, Yunhao Yang, Wendong Fan, Puzhen Zhang, Ge Zhang, Jiajun Shi, Weihao Xuan, Jiaxing Huang, Shuang Luo, Fang Wu, Heli Qi, Qingcheng Zeng, Junjie Wang, Aosong Feng, Jindi Lv, Sicong Jiang, Ziqi Ren, Wangchunshu Zhou, Zhenfei Yin, Wenlong Zhang, Guohao Li, Wenhao Yu, Lei Ma, Lei Bai, Qunshu Lin, Mingli Song, Dacheng Tao arXiv preprint arXiv:2508.04026.
  10. PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents Junjie Wang†, Yuxiang Zhang, Minghao Liu, Yin Zhang, Yatai Ji, Weihao Xuan, Nie Lin, Kang Zhu, Zhiqiang Lin, Yiming Ren, Chunyang Jiang, Yiyao Yu, Zekun Wang, Tiezhen Wang, Wenhao Huang, Jie Fu, Qunshu Liu, Yujiu Yang, Ge Zhang, Ruibin Yuan†, Bei Chen†, & Wenhu Chen† arXiv preprint arXiv:2406.13923.
  11. HF Daily Top 3 · 2024.08.20 Segment Anything With Multiple Modalities Aoran Xiao*, Weihao Xuan*, Heli Qi, Yun Xing, Naoto Yokoya†, & Shijian Lu† arXiv preprint arXiv:2408.09085.

Conference Papers

  1. The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents Weihao Xuan*, Qingcheng Zeng*, Heli Qi, Yunze Xiao, Junjue Wang, & Naoto Yokoya† In The 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026 Main Conference).
  2. The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards Fang Wu*, Aaron Tu*, Weihao Xuan*, Heli Qi*, Xu Huang, Qingcheng Zeng, Shayan Talaei, Yijia Xiao, Peng Xia, Xiangru Tang, Yuchen Zhuang, Bing Hu, Hanqun Cao, Wenqi Shi, Rui Yang, Nan Liu, Huaxiu Yao, Ge Liu, Li Erran Li, Amin Saberi, Naoto Yokoya, Jure Leskovec, Yejin Choi† In The 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026 Main Conference).
  3. How to Improve LLMs' Performance on Specific Languages: A Perspective on LLM-Derived Language Similarity Xinhe Shi, Qingcheng Zeng, Weihao Xuan, & Linchao Zhu† In The 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026 Main Conference).
  4. Code-Switching Information Retrieval: Benchmarks, Analysis, and the Limits of Current Retrievers Qingcheng Zeng*, Yuheng Lu*, Zeqi Zhou, Heli Qi, Puxuan Yu, Fuheng Zhao, Hitomi Yanaka, Weihao Xuan†, Naoto Yokoya In The 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026 Findings).
  5. Sentipolis: Emotion-Aware Agents for Social Simulations Chiyuan Fu*, Lyuhao Chen*, Yunze Xiao*, Weihao Xuan, Carlos Busso, & Mona T. Diab† In The 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026 Findings).
  6. Direction-aware 3D Large Multimodal Models Quan Liu, Weihao Xuan, Junjue Wang, Naoto Yokoya, Ling Shao, & Shijian Lu† In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2026).
  7. HF Daily Top 1 · 2025.10.02 DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Fang Wu*, Weihao Xuan*, Heli Qi*, Ximing Lu, Aaron Tu, Li Erran Li, Yejin Choi† In The Fourteenth International Conference on Learning Representations (ICLR 2026).
  8. Is Pre-Training Applicable to the Decoder for Dense Prediction Chao Ning, Wanshui Gan, Weihao Xuan, & Naoto Yokoya† In 2026 IEEE International Conference on Robotics & Automation (ICRA 2026).
  9. Taming Object Hallucinations with Verified Atomic Confidence Estimation Jiarui Liu, Weihao Xuan, Zhijing Jin, Mona Diab† In The 19th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2026).
  10. The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models Kefan Yu*, Qingcheng Zeng*, Weihao Xuan, Wanxin Li, Jingyi Wu, & Rob Voigt† In The 19th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2026) (also in COLM 2025 PragLM Workshop).
  11. LandCraft: Designing the Structured 3D Landscapes via Text Guidance Zhihao Liu*, Fang Liu*, Weihao Xuan, & Naoto Yokoya† In The 40th Annual AAAI Conference on Artificial Intelligence (AAAI 2026).
  12. DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding Weihao Xuan*, Junjue Wang*, Heli Qi, Zihang Chen, Zhuo Zheng, Yanfei Zhong, Junshi Xia, & Naoto Yokoya† In The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025).
  13. DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response Junjue Wang*, Weihao Xuan*, Heli Qi, Zhihao Liu, Kunyi Liu, Yuhan Wu, Hongruixuan Chen, Jian Song, Junshi Xia, Zhuo Zheng, & Naoto Yokoya† In The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025).
  14. MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation Weihao Xuan†, Rui Yang, Heli Qi, Qingcheng Zeng, Yunze Xiao, Aosong Feng, Dairui Liu, Yun Xing, Junjue Wang, Fan Gao, Jinghui Lu, Yuang Jiang, Huitao Li, Xin Li, Kunyu Yu, Ruihai Dong, Shangding Gu, Yuekang Li, Xiaofei Xie, Felix Juefei-Xu, Foutse Khomh, Osamu Yoshie, Qingyu Chen, Douglas Teodoro, Nan Liu, Randy Goebel, Lei Ma, Edison Marrese-Taylor, Shijian Lu, Yusuke Iwasawa, Yutaka Matsuo, & Irene Li† In The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025 Main Conference).
  15. Thinking Out Loud: Do Reasoning Models Know When They're Right? Qingcheng Zeng*, Weihao Xuan*, Leyang Cui, & Rob Voigt† In The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025 Main Conference).
  16. Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery Mai Tsujimoto, Junjue Wang, Weihao Xuan, & Naoto Yokoya† In The IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2026).
  17. LR2Depth: Large-Region Aggregation at Low Resolution for Efficient Monocular Depth Estimation Chao Ning, Weihao Xuan, Wanshui Gan, & Naoto Yokoya† In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2025)
  18. 3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds Aoran Xiao, Jiaxing Huang, Weihao Xuan, Ruijie Ren, Kangcheng Liu, Dayan Guan, Abdulmotaleb El Saddik, Shijian Lu†, & Eric Xing In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023) (pp. 9382–9392).
  19. On a Network SIS Model With Opinion Dynamics Weihao Xuan, Ruijie Ren, Philip E. Paré, Mengbin Ye, Sebastian Ruf, & Ji Liu† IFAC-PapersOnLine, 53(2), 2582–2587.

Journal Papers

  1. AI for Earthquake Response: Outcomes & Insights from a Global Spaceborne Rapid Mapping Challenge Patrick Ebel, Mounia El Baz, Junjue Wang, Weihao Xuan, Heli Qi, Zhuo Zheng, Naoto Yokoya, Junghwan Park, Jaewan Park, Arthur Elskens, Eléonore Charles, Zachary Foltz, Iacopo Modica, Philippe Bally, Christian Bossung, Marco Chini, Nicolas Longépé, & Gabriele Meoni IEEE Geoscience and Remote Sensing Magazine.
  2. CityVLM: Towards Sustainable Urban Development via Multi-View Coordinated Vision–Language Model Junjue Wang*, Weihao Xuan*, Heli Qi, Zihang Chen, Hongruixuan Chen, Zhuo Zheng, Junshi Xia, Yanfei Zhong, & Naoto Yokoya† ISPRS Journal of Photogrammetry and Remote Sensing.
  3. BRIGHT: A Globally Distributed Multimodal Building Damage Assessment Dataset With Very-High-Resolution for All-Weather Disaster Response Hongruixuan Chen, Jian Song, Olivier Dietrich, Clifford Broni-Bediako, Weihao Xuan, Junjue Wang, Xinlei Shao, Yimin Wei, Junshi Xia, Cuiling Lan, Konrad Schindler, & Naoto Yokoya† Earth System Science Data (ESSD) (also in ICCV 2025 SEA Workshop, IGARSS 2025).
  4. Foundation Models for Remote Sensing and Earth Observation: A Survey Aoran Xiao, Weihao Xuan, Junjue Wang, Jiaxing Huang, Dacheng Tao, Shijian Lu†, & Naoto Yokoya† IEEE Geoscience and Remote Sensing Magazine.
  5. TSG-Seg: Temporal-selective guidance for semi-supervised semantic segmentation of 3D LiDAR point clouds Weihao Xuan, Heli Qi, & Aoran Xiao ISPRS Journal of Photogrammetry and Remote Sensing.

03Education

The University of Tokyo (東京大学) Ph.D. Candidate in Complexity Science and Engineering
Junior Research Associate (JRA) at RIKEN Center for Advanced Intelligence Project
Waseda University (早稲田大学) M.Eng. in Information
Okuma Memorial Scholarship · Top Student
University of Leeds B.Eng. in Mechanical Engineering
First-Class Honours

04Professional Activities

Reviewer

Conference
NeurIPSICLRICMLCVPRICCVECCVAAAIACL Rolling ReviewCOLMICCVWACMMMBMVCICRAIROSICDLSIICPHS
Journal
ISPRS Journal of Photogrammetry and Remote SensingPattern RecognitionIEEE Robotics and Automation Letters (RA-L)

Editorial Board

Guest Editor
Special Issue: Advancement of Multi-Source Remote Sensing Data Fusion in Environmental Monitoring, Remote Sensing (JCR Q1)

Organization

Organizer
BRIGHT Challenge, CVPR 2026 MONTI Workshop
Session Co-Chair
ER3: System Integration, SII 2022

05Funding & Awards

  • RIKEN BAIHO Award (RIKEN Excellent Achievement Award) Feb 2026
  • NVIDIA Academic Grant Dec 2024
  • RIKEN Junior Research Associate Dec 2023
  • Okuma Memorial Scholarship (Top Student) Dec 2022
  • Monbukagakusho Honors Scholarship, JASSO Apr 2022