Publications
Please check out here for the full publication list.
-
MedTutor-R1: Advancing Scalable and Robust One-to-Many Alignment in Clinical Socratic Education
ICML'26 (spotlight). [arXiv][code]
-
Webwatcher: Breaking new frontiers of vision-language deep research agent
ICLR'26. [arXiv] [code]
-
AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting
TMLR. [arXiv] [code]
-
XSkill: Continual Learning from Experience and Skills in Multimodal Agents
🔥ICML🔥. [arXiv] [code]
-
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers
🔥Preprint🔥. [arXiv] [code]
Other Publications
-
Scaling Laws of Synthetic Data for Language Models
COLM, 2025 [arXiv]
-
R-Tuning: Instructing Large Language Models to Say ‘I Don’t Know’
NAACL 2024 (Outstanding Paper). [arXiv]
-
Word Embeddings Are Steers for Language Models
ACL 2024 (Outstanding Paper). [arXiv]
-
Tool Learning with Foundation Models
ACM Computing Survey. [arXiv]
-
COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation
NAACL 2021 (Best Demo). [arXiv]