Baosong Yang
I am currently an Algorithm Expert in the Language Technology Lab at Alibaba DAMO Academy. I received my Ph.D. from the NLP2CT Lab at the University of Macau, advised by Prof. Derek F. Wong. I have studied and worked across a broad range of natural language processing (NLP) tasks, such as Chinese word segmentation, named entity recognition, semantic role labeling, question answering, and machine translation. My current research focuses on deep learning for NLP, including neural machine translation (NMT), cross-lingual language models (XLM), and cross-lingual information retrieval (CLIR). I have published over 50 papers in leading NLP/AI journals and conferences such as ACL, EMNLP, AAAI, and NAACL.
We have research intern positions available at Alibaba DAMO Academy. If you are interested in NLP and ML, please feel free to contact us: nlp2ct.baosong[AT]gmail.com.
News
2023-09-22: One paper is accepted by NeurIPS-2023.
2023-07-21: We release PolyLM, a 13B multilingual large language model trained on 600 billion tokens (Demo, Model, Paper, Code).
2023-05-03: Four papers are accepted by ACL-2023.
2022-10-07: Two papers are accepted by EMNLP-2022.
Interests
- Machine Learning
- Natural Language Processing
- Machine Translation
Experience
- Ph.D. 2019. University of Macau, Macau, China. Supervised by Prof. Derek F. Wong and Prof. Lidia S. Chao.
- Research Intern. 2018. Tencent AI Lab, Shenzhen, China. Supervised by Dr. Zhaopeng Tu.
- M.S. 2015. Waseda University, Fukuoka, Japan. Supervised by Prof. Yves Lepage.
- B.S. 2014. Sichuan University, Sichuan, China.
Selected Publications
- PolyLM: An open source polyglot large language model
Xiangpeng Wei, Haoran Wei, Huan Lin, Tianhao Li, Pei Zhang, Xingzhang Ren, Mei Li, Yu Wan, Zhiwei Cao, Binbin Xie, Tianxiang Hu, Shangjie Li, Binyuan Hui, Bowen Yu, Dayiheng Liu, Baosong Yang*, Fei Huang, Jun Xie
arXiv Preprint
- Bridging the Domain Gaps in Context Representations for k-Nearest Neighbor Neural Machine Translation
Zhiwei Cao, Baosong Yang, Huan Lin, Suhang Wu, Xiangpeng Wei, Dayiheng Liu, Jun Xie, Min Zhang, Jinsong Su
ACL 2023
- Towards energy-preserving natural language understanding with spiking neural networks
Rong Xiao, Yu Wan, Baosong Yang*, Haibo Zhang, Huajin Tang, Derek F. Wong, Boxing Chen
IEEE/ACM Transactions on Audio, Speech, and Language Processing
- Competency-Aware Neural Machine Translation: Can Machine Translation Know its Own Translation Quality?
Pei Zhang, Baosong Yang, Hao-Ran Wei, Dayiheng Liu, Kai Fan, Luo Si and Jun Xie
EMNLP 2022
- Effective Approaches to Neural Query Language Identification
Xingzhang Ren, Baosong Yang*, Dayiheng Liu, Haibo Zhang, Xiaoyu Lv, Liang Yao, Jun Xie
Computational Linguistics
- Bridging the Gap between Training and Inference: Multi-Candidate Optimization for Diverse Neural Machine Translation
Huan Lin, Baosong Yang, Liang Yao, Dayiheng Liu, Haibo Zhang, Jun Xie, Min Zhang, Jinsong Su
NAACL 2022 (Findings)
- UniTE: Unified Translation Evaluation
Yu Wan, Dayiheng Liu, Baosong Yang, Haibo Zhang, Boxing Chen, Derek F. Wong, Lidia S. Chao
ACL 2022
- Unsupervised Preference-Aware Language Identification
Xingzhang Ren, Baosong Yang, Dayiheng Liu, Haibo Zhang, Xiaoyu Lv, Liang Yao, Jun Xie
ACL 2022 (Findings)
- Attention Mechanism with Energy-Friendly Operations
Yu Wan, Baosong Yang, Dayiheng Liu, Rong Xiao, Derek F. Wong, Haibo Zhang, Boxing Chen, Lidia S. Chao
ACL 2022 (Findings)
- Challenges of Neural Machine Translation for Short Texts
Yu Wan, Baosong Yang*, Derek Wong*, Lidia Chao, Liang Yao, Haibo Zhang and Boxing Chen
Computational Linguistics
- Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation
Xin Liu, Baosong Yang, Dayiheng Liu, Haibo Zhang, Weihua Luo, Min Zhang, Haiying Zhang and Jinsong Su
ACL 2021
- Towards User-Driven Neural Machine Translation
Huan Lin, Liang Yao, Baosong Yang, Dayiheng Liu, Haibo Zhang, Weihua Luo, Degen Huang and Jinsong Su
ACL 2021
- Domain Transfer based Data Augmentation for Neural Query Translation
Liang Yao, Baosong Yang, Haibo Zhang, Boxing Chen and Weihua Luo
COLING 2020
- Self-Paced Learning for Neural Machine Translation
Yu Wan, Baosong Yang*, Derek Wong*, Yikai Zhou, Lidia Chao, Haibo Zhang and Boxing Chen
EMNLP 2020
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation
Yikai Zhou, Baosong Yang, Derek Wong, Yu Wan and Lidia Chao
ACL 2020
- Unsupervised Neural Dialect Translation with Commonality and Diversity Modeling
Yu Wan*, Baosong Yang*, Derek Wong, Lidia Chao, Haihua Du and Ben Ao
AAAI 2020
- Assessing the Ability of Self-Attention Networks to Learn Word Order
Baosong Yang, Longyue Wang, Derek Wong, Lidia Chao and Zhaopeng Tu
ACL 2019
- Convolutional Self-Attention Networks
Baosong Yang, Longyue Wang, Derek Wong, Lidia Chao and Zhaopeng Tu
NAACL 2019
- Information Aggregation for Multi-Head Attention with Routing-by-Agreement
Jian Li, Baosong Yang, Zi-Yi Dou, Xing Wang, Michael R. Lyu and Zhaopeng Tu
NAACL 2019
- Context-Aware Self-Attention Networks
Baosong Yang, Jian Li, Derek Wong, Lidia Chao, Xing Wang and Zhaopeng Tu
AAAI 2019
- Modeling Localness for Self-Attention Networks
Baosong Yang, Zhaopeng Tu, Derek Wong, Fandong Meng, Lidia Chao and Tong Zhang
EMNLP 2018
- Towards Bidirectional Hierarchical Representations for Attention-Based Neural Machine Translation
Baosong Yang, Derek Wong, Tong Xiao, Lidia Chao and Jingbo Zhu
EMNLP 2017
Professional Services
- Area Chair: EMNLP, AACL; PC Member/Reviewer: ACL, AAAI, NAACL, NLPCC, CCL, etc.