About me
I am a fourth-year PhD student at Tsinghua University. I am fortunately advised by Prof. Juanzi Li and also work closely with Prof. Zhiyuan Liu. Currently, I am visiting BLENDER Lab@UIUC under the supervison of Prof. Heng Ji. Previously, I received my B.E. in Computer Science and Technology from Tsinghua University in 2020. In 2019, I visited Mila and worked with Prof. Jian Tang. You can find my CV here.
My research interest lies in natural language processing and knowledge engineering. The research directions I am fascinated in and working on are:
- Understanding Lanaguge Models (Mechanistic Interpretability, Probing, etc.)
- How to understand the working mechanisms of language models and how can the findings help us improve and steer language models.
- Projects: Skill Neuron, Intrinsic Task Subspace, Conceptual Knowledge Probing
- Event Understanding (Event Extraction, Event Relation Extraction, etc.)
- How to enable models understand complicated events and their interrelations like causalities.
- Datasets: MAVEN, MAVEN-ERE, MAVEN-Arg
- Toolkit: OmniEvent, Evaluation Pitfalls
News
- [May 2024] Got two papers accepted at ACL2024. See you in Bangkok!
- [Mar. 2024] Start visiting BLENDER Lab@UIUC.
- [Feb. 2024] Check out our new preprints on MAVEN-Arg Event Argument Dataset and Event-level Knowledge Editing.
- [Jan. 2024] The LLM world knowledge benchmark KoLA got accepted at ICLR. See you in Vienna!
- [Dec. 2023] The Robust Evaluation for Open IE paper was selected as outstanding paper of EMNLP!
- [Oct. 2022] Release a nice event extraction toolkit OmniEvent. Welcome to try it!
Highlighted Publications
Please refer to publications or my Google Scholar profile for the full list.
- Xiaozhi Wang, Hao Peng, Yong Guan, Kaisheng Zeng, Jianhui Chen, Lei Hou, Xu Han, Yankai Lin, Zhiyuan Liu, Ruobing Xie, Jie Zhou, Juanzi Li. MAVEN-Arg: Completing the puzzle of all-in-one event understanding dataset with event argument annotation. ACL 2024 [pdf] [code & data]
- Xiaozhi Wang*, Kaiyue Wen*, Zhengyan Zhang, Lei Hou, Zhiyuan Liu, Juanzi Li. Finding Skill Neurons in Pre-trained Transformer-based Language Models. EMNLP 2022 [pdf] [code]
- Xiaozhi Wang*, Yulin Chen*, Ning Ding, Hao Peng, Zimu Wang, Yankai Lin, Xu Han, Lei Hou, Juanzi Li, Zhiyuan Liu, Peng Li, Jie Zhou. MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal, and Subevent Relation Extraction. EMNLP 2022 [pdf] [code] [CodaLab]
- Hao Peng*, Xiaozhi Wang*, Shengding Hu, Hailong Jin, Lei Hou, Juanzi Li, Zhiyuan Liu, Qun Liu. COPEN: Probing Conceptual Knowledge in Pre-trained Language Models. EMNLP 2022 [pdf] [code] [CodaLab]
- Xiaozhi Wang, Tianyu Gao, Zhaocheng Zhu, Zhengyan Zhang, Zhiyuan Liu, Juanzi Li, Jian Tang. KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation. Transactions of the Association for Computational Linguistics (TACL), 2021. [pdf] [code] [dataset]
- Xiaozhi Wang, Ziqi Wang, Xu Han, Wangyi Jiang, Rong Han, Zhiyuan Liu, Juanzi Li, Peng Li, Yankai Lin, Jie Zhou. MAVEN: A Massive General Domain Event Detection Dataset. EMNLP 2020 [pdf] [code] [CodaLab] [leaderboard]
Professional Services
- Area Chair: ACL Rolling Review since Feb. 2024
- Program Committee Member/Reviewer (Conference): AAAI/IJCAI/COLING 2020, AAAI/ACL/EMNLP 2021, AAAI/COLING/SIGIR/CCKS/EMNLP 2022, AAAI/ACL/EMNLP/NeurIPS 2023, NeurIPS 2024, ACL Rolling Review.
- Reviewer (Journal): Neurocomputing, Complex & Intelligent Systems, AI Open, IEEE TASLP, Frontiers of Computer Science