Agentic Intelligence Lab

Research

For the most up-to-date list of publications, please visit our Google Scholar profile. Below, we highlight our representative works in each active research direction. Following that, we provide a categorized list of our publications based on their type.

A word cloud of publication titles

All

T Theory RL Reinforcement Learning LLM Large Language Model RO Robotics FU Fusion VLA Vision-Language-Action Models CL Continual Learning HU Humanoid WM World Model

Preprints and In-Submission Works:

  1. Yang Fu, Haomin Bao, Rohit Sonker, Xiaoyan Hu, Aravind Venugopal, Jeff Schneider, and Jiayu Chen, “Offline Reinforcement Learning for Plasma Control in Nuclear Fusion: Codebase and Benchmark”, submitted to Conference on Neural Information Processing Systems (NeurIPS) Evaluations and Datasets Track, 2026. FU RL

  2. Rohit Sonker, Xiaoyan Hu, Hiro Josep Farre Kaga, Andrew Rothstein, Jiayu Chen, Egemen Kolemen, and Jeff Schneider, “Dynamics-Informed Adaptive Offline RL for Real-Time Tokamak Plasma Control”, submitted to Conference on Neural Information Processing Systems (NeurIPS), 2026. FU RL

  3. Ziyi Ding, Xiao-Ping Zhang, and Jiayu Chen, “Quantifying the Generalization Advantage of Causal World Models”, submitted to Conference on Neural Information Processing Systems (NeurIPS), 2026. WM

  4. Guoqing Ma, Mingqi Yuan, Chen Gao, Jiayu Chen, and Shan Yu, “HiWE: Hierarchical World Knowledge Model with Visual Keypoint Enhancement for Zero-Shot 3D Path Planning”, submitted to Conference on Neural Information Processing Systems (NeurIPS), 2026. VLA RO

  5. Haixiang Sun, Yang Xu, Jiefu Zhang, Xudong Wu, Zihan Zhou, Jun He, and Jiayu Chen, “Core-Halo Decomposition: Decentralizing Large-Scale Fixed-Point Problem”, submitted to Conference on Neural Information Processing Systems (NeurIPS), 2026. T

  6. Xudong Wu, Jian Qian, Pangpang Liu, Vaneet Aggarwal, and Jiayu Chen, “Distributionally Robust Listwise Preference Optimization”,submitted to Conference on Neural Information Processing Systems (NeurIPS), 2026. LLM T

  7. Xianxin Lai, Ziyi Ding, Weiyu Chen, and Jiayu Chen, “When Expert Rollouts Mislead: Planner-Selected Reliability in JEPA Latent World Models”,submitted to Conference on Neural Information Processing Systems (NeurIPS), 2026. WM

  8. Yiheng Zhang, Yiming Wang, Kaiyan Zhao, Zhenglin Wan, Jiayu Chen, and Leong Hou U, “ANO: A Unified Framework for Robust Policy Optimization”, submitted to Conference on Neural Information Processing Systems (NeurIPS), 2026. RL LLM RO

  9. Yuxuan Li, Qijun He, Mingqi Yuan, Xiaoquan Sun, Wentse Chen, Jeff Schneider, and Jiayu Chen, “Continual Policy Distillation From Distributed Reinforcement Learning Teachers”, submitted to Conference on Neural Information Processing Systems (NeurIPS), 2026. RO CL RL

  10. Wenqi Ge, Junde Guo, Zhen Fu, Shunpeng Yang, Jiayu Chen, and Hua Chen, “CWI: Composite Humanoid Whole-Body Imitation System for Loco-manipulation”, submitted to IEEE Robotics and Automation Letters, 2026. RO HU RL

  11. Xiaoquan Sun, Zetian Xu, Chen Cao, Zonghe Liu, Yihan Sun, Jingrui Pang, Ruijian Zhang, Zhen Yang, Kang Pang, Dingxin He, Mingqi Yuan, and Jiayu Chen, “AtomVLA: Scalable Post-Training for Robotic Manipulation via Predictive Latent World Models”, submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2026. RO VLA RL

  12. Ziyi Ding, Xianxin Lai, Weiyu Chen, Xiao-Ping Zhang, and Jiayu Chen, “CausalVAE as a Plug-in for World Models: Towards Reliable Counterfactual Dynamics”, submitted to The European Conference on Computer Vision (ECCV), 2026. WM

  13. Caihao Sun, Mingqi Yuan, Shiyuan Wang, and Jiayu Chen, “Vision Transformers that Never Stop Learning”, submitted to The European Conference on Computer Vision (ECCV), 2026. CL

  14. Xiaoquan sun, Ruijian Zhang, Kang Pang, Bingchen Miao, Yuxiang Tan, Zhen Yang, Ming Li, and Jiayu Chen, “RoboTidy : A 3D Gaussian Splatting Household Tidying Benchmark for Embodied Navigation and Action”, submitted to The European Conference on Computer Vision (ECCV), 2026. RO VLA

  15. Xudong Wu, Pangpang Liu, Vaneet Aggarwal, and Jiayu Chen, “On the Convergence of Self-Improving Online LLM Alignment”, submitted to Uncertainty in Artificial Intelligence (UAI), 2026. T LLM

  16. Mingqi Yuan, Qi Wang, Guozheng Ma, Caihao Sun, Bo Li, Xin Jin, Yunbo Wang, Xiaokang Yang, Wenjun Zeng, Dacheng Tao, and Jiayu Chen, “Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning”, submitted to JMLR Machine Learning Open Source Software. CL RL

  17. Jiayu Chen, Zhekai Wang, Xudong Wu, and Vaneet Aggarwal, “Hierarchical Deep Counterfactual Regret Minimization”, in progress. T RL

  18. Xudong Wu, Pangpang Liu, Vaneet Aggarwal, and Jiayu Chen, “SAIL-TRPO: A Trust-Region Online RLHF Algorithm with Guaranteed Fast Convergence”, in progress. T LLM

  19. Wentse Chen, Jiayu Chen, Hao Zhu, Fahim Tajwar, Ruslan Salakhutdinov, and Jeff Schneider, “Verlog: An Efficient Synchronized Multi-turn RL Framework for LLM Agents”, in progress. LLM RA

Workshop Papers (without proceedings):

  1. Ricardo Shousha, Hiro Farre, Peter Steiner, Jaemin Seo, Rohit Sonker, Jiayu Chen, Keith Erickson, Andy Rothstein, SangKyeun Kim, Z.A. Xing, Suk-Ho Hong, Yilun Zhu, and Egemen Kolemen, “Toward ITER-Relevant Robust Plasma Profile Control Using AI-Based RTCAKENN in Diagnostically Challenged Environments”, Annual Meeting of the APS Division of Plasma Physics (APS-DPP), 2025. FU

  2. Rohit Sonker, Hiro Farre, Jiayu Chen, Andy Rothstein, Aravind Venugopal, Namrata Deka, Ian Char, Ricardo Shousha, Egemen Kolemen, and Jeff Schneider, “Offline Reinforcement Learning for Rotation and Density Profile Control in Tokamaks”, Annual Meeting of the APS Division of Plasma Physics (APS-DPP), 2025. FU RL

  3. Wentse Chen, Jiayu Chen, Hao Zhu, and Jeff Schneider, “Context-lite Multi-turn Reinforcement Learning for LLM Agents”, accepted in ICML Workshop on Efficient Systems for Foundation Models, 2025. LLM RL

  4. Wenyan Xu, Jiayu Chen, Chen Li, Yonghong Hu, and Zhonghua Lu, “Mining Intraday Risk Factor Collections via Hierarchical Reinforcement Learning based on Transferred Options”, accepted in AAAI 2025 Workshop on AI for Social Impact (Oral presentation). RL

  5. Chang-Lin Chen, Jiayu Chen, Tian Lan, Elaine Zhao, Hongbo Dong, and Vaneet Aggarwal, “Rack Position Optimization in Large-Scale Heterogeneous Data Centers”, accepted in AAAI/INFORMS Bridge: AI+ORMS, Dec 2024 (Full presentation). RL

  6. Wentse Chen, Jiayu Chen, Fahim Tajwar, Hao Zhu, Xintong Duan, Russ Salakhutdinov, and Jeff Schneider, “Fine-tuning LLM Agents with Retrospective In-Context Online Learning”, accepted in NeurIPS Adaptive Foundation Models Workshop, Oct 2024 (Oral presentation). LLM RL

  7. Jiayu Chen, Marina Wagdy Wadea Haliem, Tian Lan, and Vaneet Aggarwal, “Multi-agent Deep Covering Option Discovery”, accepted in ICML Reinforcement Learning for Real Life Workshop, Jul 2021. RL

Conference Papers (with proceedings):

  1. Jiayu Chen, Le Xu, Aravind Venugopal, and Jeff Schneider, “Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning”, accepted in International Conference on Machine Learning (ICML), 2026. RL FU WM

  2. Jiayu Chen, Le Xu, Wentse Chen, and Jeff Schneider, “Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning”, accepted in International Conference on Learning Representations (ICLR), 2026. RL FU WM

  3. Aravind Venugopal, Jiayu Chen, Xudong Wu, Chongyi Zheng, Benjamin Eysenbach, and Jeff Schneider, “Occupancy Reward Shaping: Improving Credit Assignment for Offline Goal-Conditioned Reinforcement Learning”, accepted in International Conference on Learning Representations (ICLR), 2026. RL FU

  4. Rohit Sonker, Hiro Josep Farre Kaga, Jiayu Chen, Andrew Rothstein, Ian Char, Ricardo Shousha, Egemen Kolemen, and Jeff Schneider, “Offline Reinforcement Learning for Rotation Profile Control in Tokamaks”, accepted in Annual Learning for Dynamics and Control Conference (L4DC), 2026. RL FU

  5. Wentse Chen, Yuxuan Li, Shiyu Huang, Jiayu Chen, and Jeff Schneider, “ME-IGM: Individual-Global-Max in Maximum Entropy Multi-Agent Reinforcement Learning”, accepted in International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2026 (Oral Presentation). RL

  6. Yiming Wang, Kaiyan Zhao, Ming Yang, Yan Li, Furui Liu, Jiayu Chen, and Leong Hou U, “DSAP: Enhancing Generalization in Goal-Conditioned Reinforcement Learning”, accepted in Annual AAAI Conference on Artificial Intelligence (AAAI), 2026. RL T

  7. Yiming Wang, Kaiyan Zhao, Xu Li, Yan Li, Jiayu Chen, Steven Morad, and Leong Hou U, “Explore to Learn: Latent Exploration through Disentangled Synergy Patterns for Reinforcement Learning in Overactuated Control”, accepted in Annual AAAI Conference on Artificial Intelligence (AAAI), 2026 (Oral Presentation). RL RO

  8. Le Xu and Jiayu Chen, “Enhancing Robustness of Offline Reinforcement Learning Under Data Corruption via Sharpness-Aware Minimization”, accepted in AAAI Student Abstract and Poster Program, 2026 (Oral Presentation). RL

  9. Wentse Chen, Jiayu Chen, Fahim Tajwar, Hao Zhu, Xintong Duan, Russ Salakhutdinov, and Jeff Schneider, “Retrospective In-Context Learning for Temporal Credit Assignment with Large Language Models”, accepted in Conference on Neural Information Processing Systems (NeurIPS), 2025. LLM RL

  10. Sizhe Tang, Jiayu Chen, and Tian Lan, “MALinZero: Efficient Low-Dimensional Search for Mastering Complex Multi-Agent Planning”, accepted in Conference on Neural Information Processing Systems (NeurIPS), 2025. RL

  11. Swetha Ganesh, Jiayu Chen, Washim Uddin Mondal, and Vaneet Aggarwal, “Order-Optimal Global Convergence for Average Reward Actor-Critic with General Policy and Neural Critic Parametrization”, accepted in Uncertainty in Artificial Intelligence (UAI), 2025. T

  12. Jiayu Chen, Tian Lan, and Vaneet Aggarwal, “Variational Offline Multi-agent Skill Discovery”, accepted in International Joint Conferences on Artificial Intelligence (IJCAI), 2025 (Oral Presentation). RL

  13. Chang-Lin Chen, Jiayu Chen, Tian Lan, Elaine Zhao, Hongbo Dong, and Vaneet Aggarwal, “Rack Position Optimization in Large-Scale Heterogeneous Data Centers”, accepted in International Conference on Automated Planning and Scheduling (ICAPS), 2025 (Archival and Oral Presentation). RL

  14. Jiayu Chen, Vaneet Aggarwal, and Tian Lan, “A Unified Algorithm Framework for Unsupervised Discovery of Skills based on Determinantal Point Process”, accepted in Conference on Neural Information Processing Systems (NeurIPS), Dec 2023. RL

  15. Jiayu Chen, Dipesh Tamboli, Tian Lan, and Vaneet Aggarwal, “Multi-task Hierarchical Adversarial Inverse Reinforcement Learning”, accepted in International Conference on Machine Learning (ICML), Jul 2023. RL

  16. Jiayu Chen, Tian Lan, and Vaneet Aggarwal, “Option-Aware Adversarial Inverse Reinforcement Learning for Robotic Control”, accepted in IEEE International Conference on Robotics and Automation (ICRA), Jun 2023. RO RL

  17. Jiayu Chen, Jingdi Chen, Tian Lan, and Vaneet Aggarwal, “Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs”, accepted in Conference on Neural Information Processing Systems (NeurIPS), Dec 2022. RL

  18. Jiayu Chen, Abhishek K. Umrawal, Tian Lan, and Vaneet Aggarwal, “DeepFreight: A Model-free Deep-reinforcement-learning-based Algorithm for Multi-transfer Freight Delivery”, accepted in International Conference on Automated Planning and Scheduling (ICAPS), Aug 2021. RL

  19. Pin Wang, Dapeng Liu, Jiayu Chen, Hanhan Li, and Ching-Yao Chan, “Decision Making for Autonomous Driving via Augmented Adversarial Inverse Reinforcement Learning”, accepted in IEEE International Conference on Robotics and Automation (ICRA), Jun 2021. RO RL

  20. Jilin Mei, Jiayu Chen, Wen Yao, Xijun Zhao, and Huijing Zhao, “Supervised Learning for Semantic Segmentation of 3D LiDAR Data”, accepted in IEEE Intelligent Vehicles Symposium (IV), Jun 2019. RO

Journal Papers:

  1. Mingqi Yuan, Tao Yu, Wenqi Ge, Xiuyong Yao, Dapeng Li, Huijiang Wang, Jiayu Chen, Xin Jin, Bo Li, Hua Chen, Wei Zhang, and Wenjun Zeng, “Behavior Foundation Model: Towards Next-Generation Whole-Body Control System of Humanoid Robots”, accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2026. RO HU

  2. Hyunsoo Choi, Jiayu Chen, Vaneet Aggarwal, and Zubin Jacob, “TURBO-RL: Turbulence Mitigation Using Reinforcement Learning for Severe Optical Aberrations”, accepted in Optical Society of America A, 2025. RL

  3. Swetha Ganesh, Jiayu Chen, Gugan Thoppe, and Vaneet Aggarwal, “Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries”, accepted in Transactions on Machine Learning Research (TMLR), Oct 2024. T

  4. Chang-Lin Chen, Hanhan Zhou, Jiayu Chen, Mohammad Pedramfar, Vaneet Aggarwal, Tian Lan, Zheqing Zhu, Chi Zhou, Pol Mauri Ruiz, Neeraj Kumar, and Hongbo Dong, “Learning-based Two-tiered Online Optimization of Region-wide Datacenter Resource Allocation”, accepted in IEEE Transactions on Network and Service Management (TNSM), Oct 2024. RL

  5. Jiayu Chen, Bhargav Ganguly, Yang Xu, Yongsheng Mei, Tian Lan, and Vaneet Aggarwal, “Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions”, accepted in Transactions on Machine Learning Research (TMLR), Aug 2024 (Survey Certification). RL

  6. Dipesh Tamboli, Jiayu Chen, Kiran Pranesh Jotheeswaran, Denny Yu, and Vaneet Aggarwal, “Reinforced Sequential Decision-Making for Sepsis Treatment: The PosNegDM Framework with Mortality Classifier and Transformer”, accepted in IEEE Journal of Biomedical and Health Informatics (JBHI), Mar 2024. RL

  7. Jiayu Chen, Vaneet Aggarwal, and Tian Lan, “Hierarchical Adversarial Inverse Reinforcement Learning”, accepted in IEEE Transactions on Neural Networks and Learning Systems (TNNLS), Aug 2023. RL

  8. Jiayu Chen, Jingdi Chen, Tian Lan, and Vaneet Aggarwal, “Learning Multi-agent Options for Tabular Reinforcement Learning using Factor Graphs”, accepted to IEEE Transactions on Artificial Intelligence (TAI), Jul 2022. RL