I am currently at Alibaba Group, building a deterministic and scalable agentic AI ecosystem to maximize rewards in both business and social value.
Prior to this, I was the Chief Scientist at a generative AI research and product startup (raised $50M+), where I led the R&D of foundation models and launched AI-native products like Music Generator and Agentic Design Engine.
Previously, I was a Research Scientist and led the NLP research group at JD Explore Academy, where I was a member of the Doctoral Management Trainee (DMT) program (a top-tier talent program at JD.com, Inc.).
I received my Ph.D. from The University of Sydney, supervised by Prof. Dacheng Tao (IEEE/ACM Fellow).
I have published over 100 papers in top-tier AI/NLP venues (e.g., NeurIPS, ICLR, ICML, ACL, EMNLP, TPAMI), with a recent focus on large language models (training, alignment, evaluations, multilinguality, multimodality) and their agentic applications in the real world.
My work has been recognized with several honors, including the WAIC SAIL Award (highest honor at World AI Conference), the JD Technology Golden Award (highest tech award at JD.com Inc.), and an ACL Best Paper Nomination.
I also led the development of models that secured 1st place in world-renowned challenges, including SuperGLUE (surpassing human performance), GLUE, WMT (2019-2022), and IWSLT 2021.
I am an IEEE Senior Member (elevated in 2025). I actively serve the community as an Area Chair for major AI/NLP conferences, including NeurIPS, ACL, EMNLP, and NAACL.
I was also recognized as a Distinguished Reviewer for ACM TWEB and an Outstanding Reviewer for KDD 2025.
I served as a researcher at SIAS, Zhejiang University.
I am always open to collaborations!
📣 NEWS: I have several full-time and intern positions for pushing the boundaries of Agentic AI into the physical economy. Frontier topics include Agent-to-Agent (A2A) Post-training (GRPO/PPO), Industrial VLMs (Agent Perception), Knowledge Reasoning, and 2D/3D AIGC (Agent Action). Please reach out if you're interested in joining.
📣 I have several internship positions. Self-motivated students with experience in NLP and LLM are welcome.
News
May 2026: 🎉 Eight papers about {self-improvement, diffusion agent, agent eval., model editing, controllable gen., zeroth-order optimizer, multimodal evaluation} of foundation models are accepted by ICML 2026 and ACL 2026 (2 main, 3 findings, 1 workshop), respectively.
Mar. 2026: Invited to serve as the Area Chair for NeurIPS 2026.
Jan. 2026: Three papers about RLHF (adv. clipping), RAG (efficient reranking), and multimodal (ambiguity) of language models are accepted by AAAI 2026, WWW 2026, and CPAL 2026 respectively.
Sept. 2025: Three papers about {adversarial robustness, compression, self-evolution learning on forgetting} of language models are accepted by NeurIPS 2025, congrats to my students and coauthors.
Aug. 2025: 🎉 Six papers about {knowledge editing, dynamic KV caching, safety, agent-initialization, agent-early-exiting} of language models are accepted by EMNLP 2025, congrats to my interns and coauthors.
Aug. 2025: Invited to serve as the Senior PC (meta reviewer) for AAAI 2026.
Jul. 2025: I have been elevated to an IEEE Senior Member.
Jul. 2025: One paper about Healthcare Copilot is accepted by Nature Partner Journal npj Artificial Intelligence.
May 2025: 🎉 Six papers about {enhancing in-context learning, domain alignment, multilingual synchronization, multimodal reasoning, multi-agent, and eye-tracking-based intervention} of language models are accepted by ACL 2025, congrats to my interns and coauthors.
May 2025: Two papers about multimodality (retrieval-augmented perception) and RLHF (mitigating reward hacking) of language models are accepted by ICML 2025, with one oral, congrats to my interns.
Jan. 2025: A paper about improving the lexical choice of non-autoregressive translation is accepted by Computer Speech & Language.
Dec. 2024: Two papers about {multimodal benchmark on high-resolution images and complex reasoning} of language models are accepted by AAAI 2025, congrats to my interns.
Nov. 2024: Three papers about {distillation for translation, jailbreak defense, translation evaluation} of language models are accepted by COLING 2025, congrats to my interns.
Oct. 2024: Invited to serve as the Area Chair for NAACL 2025.
Sept. 2024: One paper about mitigating reward hacking in RLHF is accepted by NeurIPS 2024, congrats to my intern Yuchun.
Sept. 2024: Four papers about {catastrophic forgetting, distillation for CodeGen, speech modality expansion, and watermark} of language models are accepted by EMNLP 2024, congrats to my interns and coauthors.
Aug. 2024: One paper about understanding multimodal alignment for MLLM is accepted by ACM ToMM.
Jul. 2024: One paper about multimodal fusion for MLLM is accepted by ACM MM 2024.
Jul. 2024: One paper about an orthogonal optimizer for MoE is accepted by ECAI 2024.
Dec. 2023: Invited to serve as the Area Chair for EMNLP 2024.
May 2024: 🎉 Ten papers about {alignment, in-context learning, compression, evaluation, safety, and downstream adaptations} of language models are accepted by ACL 2024.
Mar. 2024: One paper about sparse graph Transformer is accepted by Neural Networks.
Oct. 2023: One paper about training LM with adaptive sharpness-aware optimizer is accepted by Neural Networks.
Oct. 2023: Five papers about {high (data & model) efficiency, cross-modal alignment in speech translation, LLM quantization, and ChatGPT for machine translation} are accepted by EMNLP 2023, congrats to my interns and coauthors.
Jul. 2023: Three papers about {cross-modal contrastive learning, knowledge alignment, and federated optimizer} of model training are accepted by ECAI 2023, IEEE TASLP, and TPAMI, respectively.
May 2023: 🎉 Nine papers about {training, evaluation, robustness, and downstream adaptation} of large models are accepted by ACL 2023, including two oral papers and one best paper nomination, congrats to my interns and coauthors.
Mar. 2023: 🥂 I led the R&D of the Vega series Large Language Models (织女系列自然语言大模型), which won the 2022 Technology Golden Award ("京东集团技术金项奖", the highest tech award at JD.com, Inc.), see internal media coverage.
Jan. 2023: Invited to serve as the Session Chair for AAAI 2023.
Jan. 2023: One paper about federated learning is accepted by ICLR 2023.
Jan. 2023: One paper about dynamic contrastive distillation is accepted by IEEE Transactions on Multimedia, congrats to my intern Jun.
Nov. 2022: One paper about memory-efficient pipeline parallelism of mixture-of-experts is accepted by IPDPS 2023, congrats to my intern Zheng.
Nov. 2022: Invited talk at China National Computer Congress 2022 (CNCC'22), check out the schedule.
Nov. 2022: One paper about simultaneous translation is accepted by AAAI 2023, congrats to my intern Hexuan.
Oct. 2022: 🏆 Our Vega v2 got 1st place on SuperGLUE, one of the most difficult general language understanding leaderboards! Check out the tech report.
Oct. 2022: Invited talk about Towards Efficient NLP Foundation Models -- Pretrain, Downstream Adaptation, and Beyond at Nankai Univ. and Univ. Chinese Academy of Sciences.
Oct. 2022: Two papers are accepted by EMNLP 2022, congrats to my interns Qihuang and Shwai.
Sep. 2022: 📖 Co-authored "White Paper on Artificial Intelligence Generated Content" is published, check out the [Chinese version]&[media coverage].
Aug. 2022: Two papers are accepted by COLING 2022, congrats to my interns Changtong and Bing.
Jan. 2022: 🏆 Our Vega v1 got 1st place on The General Language Understanding Evaluation (GLUE) benchmark! Check out the [tech report]&[media coverage].
Dec. 2021: Invited to serve as the Area Chair for ACL 2022.
Dec. 2021: Our Vega (织女) achieved the SOTA performance in two tasks @GLUE, surpassing human performance.
Aug. 2021: Two papers are accepted by EMNLP 2021 and its findings.
Aug. 2021: We organized a course "Advanced topics of AI" at the School of Gifted Young, USTC. I was the lecturer for the NLP part.
Jul. 2021: 🏆 Ranked 1st in Swahili-English Speech Translation Task in IWSLT 2021.
Learning to Control Summaries with Score Ranking.
†Hongye Liu, Liang Ding, and Ricardo Henao. Findings of The Annual Meeting of the Association for Computational Linguistics, 2026 (ACL 2026). (CORE Rank A*)
VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search.
Yikun Wang, Siyin Wang, Qinyuan Cheng, Zhaoye Fei, Liang Ding, Qipeng Guo, Dacheng Tao, and Xipeng Qiu. The Annual Meeting of the Association for Computational Linguistics, 2025 (ACL 2025). (CORE Rank A*)
Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models.
†Qingyue Wang, Yanhe Fu, Yanan Cao, Shuai Wang, Zhiliang Tian, and Liang Ding. arXiv preprint, 2023. & Neurocomputing, 2025. (CORE Rank B)
(We proposed this strategy in 2023, which has recently become very popular in agent building and attracted lots of attention, e.g., on Y Combinator's Hacker News.)
Uncertainty Aware Learning for Language Model Alignment.
†Yikun Wang, Rui Zheng, Liang Ding✉️, Qi Zhang, Dahua Lin, and Dacheng Tao. The Annual Meeting of the Association for Computational Linguistics, 2024 (ACL 2024). (CORE Rank A*)
DB-LLM: Accurate Dual-Binarization for Efficient LLMs.
†Hong Chen, Chengtao Lv, Liang Ding, Haotong Qin, Xiabin Zhou, Yifu Ding, Xuebo Liu, Min Zhang, Jinyang Guo, Xianglong Liu, and Dacheng Tao. Findings of The Annual Meeting of the Association for Computational Linguistics, 2024 (ACL 2024). (CORE Rank A*)
Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models.
†Qingyu Lu, †Baopu Qiu, Liang Ding, Kanjian Zhang, Tom Kocmi, and Dacheng Tao. Technical report & arXiv preprint, 2023. & Findings of The Annual Meeting of the Association for Computational Linguistics, 2024 (ACL 2024). (CORE Rank A*)
(🎁A present for the MT evaluation community to better understand and harness the powerful ChatGPT)
3AM: An Ambiguity-Aware Multimodal Machine Translation Dataset.
Xinyu Ma, Xuebo Liu, Derek F. Wong, Jun Rao, Bei Li, Liang Ding, Lidia S. Chao, Dacheng Tao, and Min Zhang. The International Conference on Computational Linguistics, 2024 (COLING 2024). (CORE Rank A)
Towards Making the Most of ChatGPT for Machine Translation.
†Keqin Peng, Liang Ding✉️, Qihuang Zhong, Li Shen, Xuebo Liu, Min Zhang, Yuanxin Ouyang, and Dacheng Tao. Technical report & arXiv preprint, 2023. & Findings of the Conference on Empirical Methods in Natural Language Processing, 2023 (EMNLP 2023). (CORE Rank A*)
(🎁A present for the MT community to better understand and harness the powerful ChatGPT)
Token-Level Self-Evolution Training for Sequence-to-Sequence Learning.
†Keqin Peng, Liang Ding(co-first author), Qihuang Zhong, Yuanxin Ouyang, Wenge Rong, Zhang Xiong, and Dacheng Tao. The Annual Meeting of the Association for Computational Linguistics, 2023 (ACL 2023). (CORE Rank A*)
(best paper nomination)
PAD-Net: An Efficient Framework for Dynamic Networks.
†Shwai He, Liang Ding✉️, Daize Dong, Boan Liu, Fuqiang Yu, and Dacheng Tao. arXiv preprint, 2022. & The Annual Meeting of the Association for Computational Linguistics, 2023 (ACL 2023). (CORE Rank A*)
TransGEC: Improving Grammatical Error Correction with Translationese.
Tao Fang, Xuebo Liu, Derek F. Wong, Runzhe Zhan, Liang Ding, Lidia S. Chao, Dacheng Tao, and Min Zhang. Findings of The Annual Meeting of the Association for Computational Linguistics, 2023 (ACL 2023). (CORE Rank A*)
Vega-MT: The JD Explore Academy Translation System for WMT22.
†Changtong Zan, †Keqin Peng, Liang Ding✉️(co-first author), Baopu Qiu, Boan Liu, Shwai He, Qingyu Lu, Zheng Zhang, Chuang Liu, Weifeng Liu, Yibing Zhan, and Dacheng Tao. The Conference on Machine Translation, 2022 (WMT 2022).
(Among all constrained high-resource tracks, Vega-MT took 7 first places, 2 second places, and 1 third place w.r.t. BLEU, and 8 first places and 2 second places w.r.t. COMET.)
On the Complementarity between Pre-training and Back-Translation.
Xuebo Liu, Longyue Wang, Derek F. Wong, Liang Ding, Lidia S. Chao, Shuming Shi, and Zhaopeng Tu. Findings of the Conference on Empirical Methods in Natural Language Processing, 2021 (EMNLP 2021). (CORE Rank A*)
The USYD-JD Speech Translation System for IWSLT2021.
Liang Ding, Di Wu, and Dacheng Tao. The International Conference on Spoken Language Translation, 2021 (IWSLT 2021).
(Winning submission out of 42 teams to Sw-En speech translation, exceeding the 2nd place by more than 10 BLEU points)
EcomBench, ranked 1st with our shopping agent Alphashop with an average score of 69 (since Dec. 26 2025).
SuperGLUE Benchmark, ranked 1st with an average score of 91.3 (since Oct. 8 2022).
WMT 2022, ranked 1st on Chinese<=>English, German<=>English, Czech<=>English, and English=>Russian, 2nd on Russian=>English and Japanese=>English, and 3rd on English=>Japanese General Translation Tasks, respectively.
GLUE Benchmark, ranked 1st with an average score of 91.3 (since Jan. 1 2022).
IWSLT 2021, ranked 1st on Swahili-English speech translation task.
WMT 2020, ranked 2nd on German-to-English chat translation shared task.