I have been a PhD student in Computational Linguistics at Stony Brook University since Fall 2022. My research interests are in Natural Language Processing, Computational Linguistics, Formal Language Theory, Machine Learning, and Computational Social Science.

I work with linguistic corpora, both real and synthetic. When I work on real linguistic data, my motivation is to understand the pragmatic meanings of actual language use in both spoken and written data, and how to leverage state-of-the-art methods, such as Large Language Models (LLMs), to model that usage. When I work on synthetic linguistic data, my focus is on probing how, and how well, neural networks learn and generalize in light of formal language theory and computational learning theory.

I am looking for an ML/NLP internship (research or software engineering) for summer 2024.


I was born and raised in Fuqing, a small town in southeastern China. Prior to coming to Stony Brook, I completed a bachelor's degree in Chinese Language and Literature at Hunan University and a master's degree in Applied Linguistics at the University of Saskatchewan.

I am a proud self-taught and self-motivated programmer. I started learning programming in 2020 and have managed to make it relevant to, and then part of, my daily life. Looking back, I am glad to find that my experience with NLP aligns well with the three major phases of the field: rule-based (symbolic) methods, statistical machine learning, and deep learning. If you take a look at my papers since my master's thesis, this trajectory should be apparent.


Here is my Curriculum Vitae.



Deep Learning

Text Processing