I am an assistant professor in the Department of Computer Science at The University of Hong Kong and The HKU Musketeers Foundation Institute of Data Science. I am generally interested in machine learning, stochastic optimization, and graph learning, with a special focus on the theoretical/empirical understanding (or Physics) of deep learning (especially foundation models). I am also particularly interested in developing AI/ML methods for practical problems in other areas, such as signal processing, intelligent transportation, and math problems.

Previously, I obtained my Ph.D. in the Computer Science department at the University of California, Los Angeles (UCLA), supervised by Prof. Quanquan Gu. I obtained my master's degree in electrical engineering and my bachelor's degree in applied physics, both from the University of Science and Technology of China (USTC).

News

  • Multiple openings for PhD, Postdoc, and RA. Please drop me an email with your CV and transcript (optional) if you are interested in joining my research group. For interested PhD candidates in 2025, please submit your application at https://i.cs.hku.hk/~gradappl/index.html and inform me via email.
  • [2023-05] Our paper on the implicit bias of batch normalization has been accepted to COLT 2023.

  • [2023-03] Two manuscripts explaining the advantages of Mixup and Gradient Regularization in training neural networks are online.

  • [2023-03] I will serve as an Area Chair for NeurIPS 2023.

  • [2023-01] Our paper on the generalization separation between Adam and GD has been accepted by ICLR 2023.

  • [2022-09] Two papers accepted by NeurIPS 2022. The first paper studies the generalization of multi-pass SGD for over-parameterized least squares; the second paper demonstrates the power and limitations of pretraining-finetuning for linear regression with distribution shift.

  • [2022-08] Dr. Difan Zou joined HKU CS as an assistant professor.