I am a 1st-year PhD student in the Department of Computer Science and Technology at Tsinghua University. I am supervised by Prof. Juanzi Li. I also visited Mila and did research with Prof. Jian Tang at the summer of 2019. You can find my CV here.
My research interests lie in deep learning methods on Natural Language Processing and Knowledge Graph. My research goal is to bridge machine learning models and symbolic human knowledge.
- [Aug. 2020] Got two papers accepted at EMNLP 2020. See you online:)
- [Sep. 2019] Released a pre-trained language model reading list with Zhengyan Zhang.
- [Aug. 2019] Got one paper accepted at EMNLP 2019. See you at Hong Kong.
- Program Committee Member: AAAI/IJCAI/COLING 2020, AAAI 2021
- Review Assistant: COLING/EMNLP 2018, IJCAI/SIGIR/ACL 2019
- Zhengyan Zhang, Xu Han, Hao Zhou, Pei Ke, Yuxian Gu, Deming Ye, Yujia Qin, Yusheng Su, Haozhe Ji, Jian Guan, Fanchao Qi, Xiaozhi Wang, Yanan Zheng, Guoyang Zeng, Huanqi Cao, Shengqi Chen, Daixuan Li, Zhenbo Sun, Zhiyuan Liu, Minlie Huang, Wentao Han, Jie Tang, Juanzi Li, Xiaoyan Zhu, Maosong Sun. CPM: A Large-scale Generative Chinese Pre-trained Language Model. [arxiv] [code] [homepage]
* indicates equal contribution, and see here for details.
- Yuan Yao, Haoxi Zhong, Zhengyan Zhang, Xu Han, Xiaozhi Wang, Chaojun Xiao, Guoyang Zeng, Zhiyuan Liu, Maosong Sun. Adversarial Language Games for Advanced Natural Language Intelligence. To appear at AAAI Conference on Artifical Intelligence (AAAI 2021). [arxiv]
- Xiaozhi Wang, Tianyu Gao, Zhaocheng Zhu, Zhengyan Zhang, Zhiyuan Liu, Juanzi Li, Jian Tang. KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation. To appear at Transactions of the Association for Computational Linguistics. [pdf] [dataset]
- Xiaozhi Wang*, Shengyu Jia*, Xu Han, Zhiyuan Liu, Juanzi Li, Peng Li, Jie Zhou. Neural Gibbs Sampling for Joint Event Argument Extraction. The 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2020). [pdf] [code]
- Xiaozhi Wang, Ziqi Wang, Xu Han, Wangyi Jiang, Rong Han, Zhiyuan Liu, Juanzi Li, Peng Li, Yankai Lin, Jie Zhou. MAVEN: A Massive General Domain Event Detection Dataset. The Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). [pdf] [code] [CodaLab] [leaderboard]
- Yuxian Gu, Zhengyan Zhang, Xiaozhi Wang, Zhiyuan Liu, Maosong Sun. Train No Evil: Selective Masking for Task-guided Pre-training. [pdf] [code]
Xiaozhi Wang*, Ziqi Wang*, Xu Han, Zhiyuan Liu, Juanzi Li, Peng Li, Maosong Sun, Jie Zhou, Xiang Ren. HMEAE: Hierarchical Modular Event Argument Extraction. The Conference on Empirical Methods in Natural Language Processing (EMNLP 2019). [pdf] [code] (oral) (short)
Xiaozhi Wang*, Xu Han*, Zhiyuan Liu, Maosong Sun, Peng Li. Adversarial Training for Weakly Supervised Event Detection. The 2019 Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT 2019). [pdf] [code] (oral)