Junli Wang | 汪隽立

I'm a senior undergraduate student at Department of Computer Science and Technology, Tsinghua University, affiliated with Xinya College. Currently, I am a research intern at Alibaba Qwen Team, advised by Binyuan Hui.

I was very fortunate to be advised by Prof. Tao Yu during my internship at XLang Lab, working on multimodal computer use agents.

I am looking for a Ph.D. position starting in 2025 Fall. Feel free to drop me an email if you think I will be a good fit.

Email  /  CV  /  Scholar  /  X  /  Github

profile photo

Research

My research interests lie in (multimodal) large language models and their applications in digital/embodied agents. I hope to scale up the performance of LLMs solving complex tasks through enhancing their reasoning ability and interaction skills.

I am also dedicated to enhancing my skills in Machine Learning Systems.

Publication

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
Yiheng Xu*, Zekun Wang*, Junli Wang*, Dunjie Lu, Tianbao Xie, Amrita Saha, Doyen Sahoo, Tao Yu, Caiming Xiong
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials
Yiheng Xu*, Dunjie Lu*, Zhennan Shen*, Junli Wang, Zekun Wang, Yuchen Mao, Caiming Xiong, Tao Yu,
ICLR 2025

Research Experience

  • 2024.04 - 2024.12: Research Intern at XLANG Lab, advised by Prof. Tao Yu
  • Internships

  • 2024.11 - now: Qwen Team, Alibaba Group.
  • Misc

    During my time at XLang Lab, I was very fortunate to work with Yiheng Xu and Tianbao Xie. I am very grateful for their guidance and support. We have a lot of fun working together.



    The source code is stolen from Jon Barron. Thanks for his sharing! 🙏🏻