Baosong Yang
I am currently a Senior Algorithm Expert in the Language Technology Lab at Alibaba Tongyi Lab. I received my Ph.D. from the NLP2CT Lab at the University of Macau, advised by Prof. Derek F. Wong. I have worked across a broad range of natural language processing (NLP) areas, including Chinese word segmentation, named entity recognition, semantic role labeling, question answering, and machine translation. My current research focuses on the multilingualism of large language models. I have published over 50 papers in leading NLP/AI journals and conferences such as ACL, EMNLP, AAAI, and NAACL.
We have some research intern positions available in the Qwen Team. If you are interested in LLMs, please feel free to contact us: nlp2ct.baosong[AT]gmail.com.
News
2024-06-07: We release Qwen2, which boasts comparable multilingual capabilities to GPT4-Turbo.
2024-05-16: Two papers are accepted by ACL-2024.
2024-02-20: One paper is accepted by COLING-2024.
2024-02-06: We release Qwen1.5, which boasts comparable multilingual capabilities to GPT3.5-Turbo.
2023-09-22: One paper is accepted by NeurIPS-2023.
2023-07-21: We release PolyLM, a 13B multilingual large language model trained on 600 billion tokens (Demo, Model, Paper, Code).
Interests
- Multilingualism
- Large Language Model
- Machine Translation
Experience
- Ph.D. 2019. University of Macau, Macau, China. Supervised by Prof. Derek F. Wong and Prof. Lidia S. Chao.
- Research Intern. 2018. Tencent AI Lab, Shenzhen, China. Supervised by Dr. Zhaopeng Tu.
- M.S. 2015. Waseda University, Fukuoka, Japan. Supervised by Prof. Yves Lepage.
- B.S. 2014. Sichuan University, Sichuan, China.
Selected Publications
- MoNMT: Modularly Leveraging Monolingual and Bilingual Knowledge for Neural Machine Translation
Jianhui Pang, Baosong Yang*, Derek F Wong, Dayiheng Liu, Xiangpeng Wei, Jun Xie, Lidia S Chao
COLING 2024
- Rethinking the Exploitation of Monolingual Data for Low-Resource Neural Machine Translation
Jianhui Pang, Baosong Yang*, Derek Fai Wong*, Yu Wan, Dayiheng Liu, Lidia Sam Chao, Jun Xie
Computational Linguistics
- MMNMT: Modularizing Multilingual Neural Machine Translation with Flexibly Assembled MoE and Dense Blocks
Shangjie Li, Xiangpeng Wei, Shaolin Zhu, Jun Xie, Baosong Yang*, Deyi Xiong
EMNLP 2023
- EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning
Ping Guo, Xiangpeng Wei, Yue Hu, Baosong Yang, Dayiheng Liu, Fei Huang, Jun Xie
NeurIPS 2023
- PolyLM: An Open Source Polyglot Large Language Model
Xiangpeng Wei, Haoran Wei, Huan Lin, Tianhao Li, Pei Zhang, Xingzhang Ren, Mei Li, Yu Wan, Zhiwei Cao, Binbin Xie, Tianxiang Hu, Shangjie Li, Binyuan Hui, Bowen Yu, Dayiheng Liu, Baosong Yang*, Fei Huang, Jun Xie
arXiv preprint
Professional Services
- Area Chair: EMNLP, AACL; PC Member/Reviewer: ACL, AAAI, NAACL, NLPCC, CCL, etc.