Selected Highlights

MedTutor-R1: Advancing Scalable and Robust One-to-Many Alignment in Clinical Socratic Education
ICML'26 (spotlight). [arXiv][code]

Webwatcher: Breaking new frontiers of vision-language deep research agent
ICLR'26. [arXiv] [code]

AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting
TMLR. [arXiv] [code]

XSkill: Continual Learning from Experience and Skills in Multimodal Agents
🔥ICML🔥. [arXiv] [code]

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers
🔥Preprint🔥. [arXiv] [code]

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents
🚀Preprint🚀. [arXiv] [code]

Other Publications

R-Tuning: Instructing Large Language Models to Say ‘I Don’t Know’
NAACL 2024 (Outstanding Paper). [arXiv]

Word Embeddings Are Steers for Language Models
ACL 2024 (Outstanding Paper). [arXiv]

COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation
NAACL 2021 (Best Demo). [arXiv]