Research Interests
Mechanistic Interpretability, Machine Learning Theory, and Selective Inference.
Publications
α-β order denotes alphabetical authorship ordering, and * denotes equal contribution.
- In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention.
Jianliang He, Xintian Pan, Siyu Chen, and Zhuoran Yang.
International Conference on Machine Learning (ICML), 2025. [arXiv][pdf] [code] [blog] - From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems.
Jianliang He*, Siyu Chen*, Fengzhuo Zhang, and Zhuoran Yang.
International Conference on Machine Learning (ICML), 2024. [arXiv][pdf] - Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation.
Jianliang He, Han Zhong, and Zhuoran Yang.
International Conference on Learning Representations (ICLR), 2024. [arXiv][pdf] - Harnessing the Collective Wisdom: Fusion Learning using Decision Sequences from Diverse Sources.
(α-β order) Trambak Banerjeea, Bowen Gang, and Jianliang He.
Biometrika, 2025+. [arXiv][pdf]
Preprints
- On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking.
Jianliang He, Leda Wang, Siyu Chen, and Zhuoran Yang. - Data-light Uncertainty Set Merging with Admissibility: Synthetics, Aggregation, and Test Inversion.
Shenghao Qin*, Jianliang He*, Qi Kuang*, Bowen Gang, and Yin Xia.
Submitted, arXiv:2410.12201, 2024. [arXiv][pdf]
Miscellaneous
- False Discovery Control in Multiple Testing: A Selective Overview of Theories and Methodologies.
[pdf]
Short Version as a Book Chapter in ICSA Book Series in Statistics: Big Data Analytics in Biostatistics and Bioinformatics.