Computational Inference: Computational methods for statistics and probabilistic machine learning.

Spring 2026. UT Austin, Department of Statistics and Data Sciences. TThu 9–10:30 am at GDC.

Syllabus.

Course information

This course is a Ph.D. level core class on statistical computing methods. Computation plays a central role in modern statistics and machine learning. The goal of this class is to provide (1) knowledge of practical computational inference techniques, such that students who are interested in applied statistics can use these tools to efficiently fit realistic models, (2) exposure to the frontier of modern statistical computing, such that students who are interested in method or theory development can understand how and why existing methods work, and likely enter the research in this area.

This course will cover essential topics to develop a broad working knowledge of modern computational statistics. The selection of topics is based on our view of what is central to this evolving discipline and what will be both intellectually stimulating and practically relevant.

  1. Basic optimization (Newton-Raphson, quasi-Newton, EM method, stochastic gradient descent, Bayesian optimization).
  2. Monte Carlo methods (rejection sampling, importance sampling, quasi-Monte Carlo).
  3. MCMC methods (Gibbs sampling, Metropolis-Hastings, Sequential Monte Carlo, Hamiltonian Monte Carlo, MALA, NUTS).
  4. Approximate inference (Laplace, variational inference, expectation propagation).
  5. Evaluation and diagnostics of computing (convergence tests, validation, simulation-based check, post-processing).
  6. Miscellaneous tricks (reparametrization, tempering, variance reduction in optimization and sampling, data-augmentation, control variates)
  7. Score-based methods (Stein divergence, path sampling, score matching).
  8. Likelihood-free computation and related topics (ABC, neural posterior estimation, normalizing flows, diffusion models).

Prerequisites

If you are a student outside of the Statistics Ph.D. program, instructor permission is required to take this class. This course is designed to be the advanced course for first-year statistics Ph.D. students. Many students will already have (a) hands-on experience in applied modeling, (b) graduate-level knowledge of mathematical statistics and probability, (c) basic coding skills (R and/or Python), and (d) working knowledge of Bayesian inference.

I will not teach much programming, but will focus on overarching ideas and techniques. For the course project, if you want practical sampling, I recommend using a high-level probabilistic programming language such as Stan, Jax, PyMC, Turing, or Pangolin; if you want to implement a new algorithm, you may consider Jax/Pytorch, or R + BridgeStan if a you are an R user.

Slides

Schedule

We hold two classes per week, labeled sequentially as 1a, 1b, 2a, and so on. I strongly encourage you to read the recommended papers listed prior to each class to maximize your understanding and engagement. Papers marked with ▹ cover advanced topics and are optional for reading.

Homeworks

What’s next

Despite the topics covered in this course, several areas are not included: (a) advanced optimization methods, (b) numerical techniques, (c) the mathematical theory of MCMC, (d) discrete-space sampling (e.g., ising models and spin glasses), (e) transdimensional sampling (e.g., reversible jump, pseudo-marginal methods), and (f) an in-depth exploration of modern generative modeling. Many of these topics are likely to be valuable if you are interested in pursuing research in this field.

We also do not have time to cover most language-specific and model-specific considerations. These nuances typically arise in practice when working on applied modeling projects.