Weihao XUAN

Weihao XUAN (宣 偉豪)

I'm a Ph.D. candidate at Machine Learning and Statistical Data Analysis Lab (杉山・横矢・石田研究室), The University of Tokyo (東京大学), where I'm very fortunate to be advised by Prof. Naoto Yokoya. I'm also under the Junior Research Associate (JRA) program at RIKEN Center for Advanced Intelligence Project.

My main research focuses on natural language understanding, particularly in post-training for LLMs and VLMs. I'm also actively engaged in AI for Social Good & AI for Science, applying NLP techniques and foundation models to Earth Observation and Medical domains through collaborative research. I collaborate very closely with my friends Heli Qi and Junjue Wang, as well as brilliant researchers in LLM from the United States, Singapore, and Japan.

News

[08/2025] Three papers were accepted by EMNLP 2025 Main Conference, one with a Full Meta Score (10/10).

[06/2025] One paper was accepted by IROS 2025.

[09/2024] One paper was accepted by NeurIPS 2024 and selected as Spotlight Paper.

Publications

Preprints

  1. The Invisible Leash: Why RLVR May Not Escape Its Origin Fang Wu*, Weihao Xuan*, Ximing Lu, Zaid Harchaoui, & Yejin Choi† (* indicates co-first authors) arXiv preprint arXiv:2507.14843 (also in ICML 2025 AI4MATH Workshop).
  2. DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding Weihao Xuan*, Junjue Wang*, Heli Qi, Zihang Chen, Zhuo Zheng, Yanfei Zhong, Junshi Xia, & Naoto Yokoya† (* indicates co-first authors) arXiv preprint arXiv:2505.21076.
  3. DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response Junjue Wang*, Weihao Xuan*, Heli Qi, Zhihao Liu, Kunyi Liu, Yuhan Wu, Hongruixuan Chen, Jian Song, Junshi Xia, Zhuo Zheng, & Naoto Yokoya† (* indicates co-first authors) arXiv preprint arXiv:2505.21089.
  4. The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models Kefan Yu, Qingcheng Zeng, Weihao Xuan, Wanxin Li, Jingyi Wu, & Rob Voigt† arXiv preprint arXiv:2505.18497 (also in COLM 2025 PragLM Workshop).
  5. VeriGUI: Verifiable Long-Chain GUI Dataset Shunyu Liu, Minghao Liu, Huichi Zhou, Zhenyu Cui, Yang Zhou, Yuhao Zhou, Wendong Fan, Ge Zhang, Jiajun Shi, Weihao Xuan, Jiaxing Huang, Shuang Luo, Fang Wu, Heli Qi, Qingcheng Zeng, Ziqi Ren, Jialiang Gao, Jindi Lv, Junjie Wang, Aosong Feng, Heng Zhou, Wangchunshu Zhou, Zhenfei Yin, Wenlong Zhang, Guohao Li, Wenhao Yu, Irene Li, Lei Ma, Lei Bai, Qunshu Lin, Mingli Song†, & Dacheng Tao† arXiv preprint arXiv:2508.04026.
  6. BRIGHT: A Globally Distributed Multimodal Building Damage Assessment Dataset With Very-High-Resolution for All-Weather Disaster Response Hongruixuan Chen, Jian Song, Olivier Dietrich, Clifford Broni-Bediako, Weihao Xuan, Junjue Wang, Xinlei Shao, Yimin Wei, Junshi Xia, Cuiling Lan, Konrad Schindler, & Naoto Yokoya† arXiv preprint arXiv:2501.06019.
  7. Is Pre-Training Applicable to the Decoder for Dense Prediction Chao Ning, Wanshui Gan, Weihao Xuan, & Naoto Yokoya† arXiv preprint arXiv:2503.07637.
  8. Segment Anything With Multiple Modalities Aoran Xiao*, Weihao Xuan*, Heli Qi, Yun Xing, Naoto Yokoya†, & Shijian Lu† (* indicates co-first authors) arXiv preprint arXiv:2408.09085.

Conference Papers

  1. Seeing Is Believing, But How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models Weihao Xuan*, Qingcheng Zeng*, Heli Qi, Junjue Wang, & Naoto Yokoya† (* indicates co-first authors) The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025 Main Conference). Full Meta Score (10/10) [Top 1%]
  2. MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation Weihao Xuan†, Rui Yang, Heli Qi, Qingcheng Zeng, Yunze Xiao, Aosong Feng, Dairui Liu, Yun Xing, Junjue Wang, Fan Gao, Jinghui Lu, Yuang Jiang, Huitao Li, Xin Li, Kunyu Yu, Ruihai Dong, Shangding Gu, Yuekang Li, Xiaofei Xie, Felix Juefei-Xu, Foutse Khomh, Osamu Yoshie, Qingyu Chen, Douglas Teodoro, Nan Liu, Randy Goebel, Lei Ma, Edison Marrese-Taylor, Shijian Lu, Yusuke Iwasawa, Yutaka Matsuo, & Irene Li† The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025 Main Conference).
  3. Thinking Out Loud: Do Reasoning Models Know When They're Right? Qingcheng Zeng*, Weihao Xuan*, Leyang Cui, & Rob Voigt† (* indicates co-first authors) The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025 Main Conference).
  4. LR2Depth: Large-Region Aggregation at Low Resolution for Efficient Monocular Depth Estimation Chao Ning, Weihao Xuan, Wanshui Gan, & Naoto Yokoya† In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2025)
  5. SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding From Monocular Remote Sensing Imagery Jian Song, Hongruixuan Chen, Weihao Xuan, Junshi Xia, & Naoto Yokoya† In The Thirty-eight Conference on Neural Information Processing Systems (NeurIPS 2024). Spotlight Paper [Top 3.1%]
  6. Cat-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model Aoran Xiao*, Weihao Xuan*, Heli Qi, Yun Xing, Ruijie Ren, Xiaoqin Zhang, Ling Shao & Shijian Lu† (* indicates co-first authors) In European Conference on Computer Vision (ECCV 2024) (pp. 189-206). Oral Paper [Top 2.3%, 200/8585]
  7. 3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds Aoran Xiao, Jiaxing Huang, Weihao Xuan, Ruijie Ren, Kangcheng Liu, Dayan Guan, Abdulmotaleb El Saddik, Shijian Lu†, & Eric Xing In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023) (pp. 9382-9392).
  8. MaskVO: Self-Supervised Visual Odometry With a Learnable Dynamic Mask Weihao Xuan, Ruijie Ren, Siyuan Wu, & Changhao Chen In 2022 IEEE/SICE International Symposium on System Integration (SII) (pp. 225-231). IEEE.
  9. On a Discrete-Time Network SIS Model With Opinion Dynamics Yixuan Lin, Weihao Xuan, Ruijie Ren, & Ji Liu† In 2021 60th IEEE Conference on Decision and Control (CDC) (pp. 2098-2103). IEEE.
  10. On a Network SIS Model With Opinion Dynamics Weihao Xuan, Ruijie Ren, Philip E. Paré, Mengbin Ye, Sebastian Ruf, & Ji Liu† IFAC-PapersOnLine, 53(2), 2582-2587.

Journal Papers

  1. Foundation Models for Remote Sensing and Earth Observation: A Survey Aoran Xiao, Weihao Xuan, Junjue Wang, Jiaxing Huang, Dacheng Tao, Shijian Lu†, & Naoto Yokoya† IEEE Geoscience and Remote Sensing Magazine. Accepted.

Education

The University of Tokyo (東京大学)
Ph.D. in Complexity Science and Engineering
Machine Learning and Statistical Data Analysis Lab (杉山・横矢・石田研究室), Advisor: Prof. Naoto Yokoya
Junior Research Associate (JRA) program at RIKEN Center for Advanced Intelligence Project

Waseda University (早稲田大学)
M.Eng. in Computer Science
Okuma Memorial Scholarship (Top Student)

University of Leeds
B.Eng. in Mechanical Engineering
First-Class Honours

Professional Activities

Reviewer

Conference: NeurIPS, CVPR, ICCV, AAAI, ICCVW, ACMMM, IROS, ICDL, SII, CPHS
Journal: Pattern Recognition, ISPRS Journal of Photogrammetry and Remote Sensing, IEEE Transactions on Geoscience and Remote Sensing

Organization

Session Co-Chair: ER3: System Integration, IEEE/SICE International Symposium on System Integration (SII 2022)

Editorial Board

Guest Editor Assistant: Special Issue: Advancement of Multi-Source Remote Sensing Data Fusion in Environmental Monitoring, Remote Sensing

Funding and Awards

NVIDIA Academic GrantDec 2024

RIKEN Junior Research AssociateDec 2023

Okuma Memorial Scholarship (Top Student)Dec 2022

Monbukagakusho Honors Scholarship, JASSOApr 2022