Publications

Selected Highlights

Please check out here for the full publication list.
  1. MedTutor-R1: Advancing Scalable and Robust One-to-Many Alignment in Clinical Socratic Education
    ICML'26 (spotlight). [arXiv][code]

  2. Webwatcher: Breaking new frontiers of vision-language deep research agent
    ICLR'26. [arXiv] [code]

  3. AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting
    TMLR. [arXiv] [code]

  4. XSkill: Continual Learning from Experience and Skills in Multimodal Agents
    🔥ICML🔥. [arXiv] [code]

  5. Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers
    🔥Preprint🔥. [arXiv] [code]
Other Publications
  • Scaling Laws of Synthetic Data for Language Models
    COLM, 2025 [arXiv]

  • R-Tuning: Instructing Large Language Models to Say ‘I Don’t Know’
    NAACL 2024 (Outstanding Paper). [arXiv]

  • Word Embeddings Are Steers for Language Models
    ACL 2024 (Outstanding Paper). [arXiv]

  • Tool Learning with Foundation Models
    ACM Computing Survey. [arXiv]

  • COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation
    NAACL 2021 (Best Demo). [arXiv]