Statistical analysis of neural data (GR8201)

Fall 2023

This is a Ph.D.-level topics course in statistical analysis of neural data. Students from statistics, neuroscience, and engineering are all welcome to attend. A link to the previous iteration of this course is here.

Time: F 1-2:30
Place: JLG L3-079
Professor: Liam Paninski; Office: Zoom. Email: liam at stat dot columbia dot edu. Hours by appointment.

Prerequisite: A good working knowledge of basic statistical concepts (likelihood, Bayes' rule, Poisson processes, Markov chains, Gaussian random vectors), including especially linear-algebraic concepts related to regression and principal components analysis, is necessary. No previous experience with neural data is required.
Evaluation: Final grades will be based on class participation and a student project. Additional informal exercises will be suggested, but not required. The project can involve either the implementation and justification of a novel analysis technique, or a standard analysis applied to a novel data set. Students can work in pairs or alone (if you work in pairs, of course, the project has to be twice as impressive). See this page for some links to available datasets; or talk to other students in the class, many of whom have collected their own datasets.
Course goals: We will introduce a number of advanced statistical techniques relevant in neuroscience. Each technique will be illustrated via application to problems in neuroscience. The focus will be on the analysis of single and multiple spike train and calcium imaging data, with a few applications to analyzing intracellular voltage and dendritic imaging data. Note that this class will not focus on MRI or EEG data. A brief list of statistical concepts and corresponding neuroscience applications is below.

Statistical concept / technique	Neuroscience application
Point processes; conditional intensity functions	Neural spike trains; photon-limited image data
Time-rescaling theorem for point processes	Fast simulation of network models; goodness-of-fit tests for spiking models
Bias, consistency, principal components	Spike-triggered averaging; spike-triggered covariance
Generalized linear models	Neural encoding models including spike-history effects; inferring network connectivity
Regularization; shrinkage estimation	Maximum a posteriori estimation of high-dimensional neural encoding models
Laplace approximation; Fisher information	Model-based decoding and information estimation; adaptive design of optimal stimuli
Mixture models; EM algorithm; Dirichlet processes	Spike-sorting / clustering
Optimization and convexity techniques	Spike-train decoding; ML estimation of encoding models
Markov chain Monte Carlo: Metropolis-Hastings and hit-and-run algorithms	Firing rate estimation and spike-train decoding
State-space models; sequential Monte Carlo / particle filtering	Decoding spike trains; optimal voltage smoothing
Fast high-dimensional Kalman filtering	Optimal smoothing of voltage and calcium signals on large dendritic trees
Markov processes; first-passage times; Fokker-Planck equation	Integrate-and-fire-based neural models
Hierarchical Bayesian models	Estimating multiple neural encoding models
Amortized inference	Spike sorting; stimulus decoding

For those new to neuroscience: While we will cover all the necessary background as we go, for those who want to explore the material in greater depth, there are a bunch of good computational neuroscience resources. The recent Neuromatch Academy is a good place to start. A very non-exhaustive list of useful books (each of which emphasize different topics, albeit with some overlap): Theoretical Neuroscience, by Dayan and Abbott; Spiking Neuron Models, by Gerstner et al; and Spikes: exploring the neural code, by Rieke et al. The first chapter of the Spikes book has been kindly made available online - this makes a nice overview of some of the questions we will address in this course. The full text of the Gerstner et al book is online. Another good online tutorial is available here.

A couple good older online courses in computational neuroscience: one directed by Raj Rao and Adrienne Fairhall, and another by Wulfram Gerstner.

For those new to statistics: The book by Kass et al is an excellent introduction to statistics, illustrated with a number of neural examples; Columbia e-link here. Also, here is an excellent online book on convex optimization. Finally, Cox and Gabbiani have written a nice Matlab-based book on Mathematics for Neuroscientists, available online here if your library has access. A lot of very useful background material, along with some more advanced ideas.

Schedule

Date	Topic	Reading	Notes
Sept 8, 15	Intro and overview	Paninski and Cunningham, `18; International Brain Lab, '17, '22, '23a, '23b	Slides here.
Sept 22	Optogenetic circuit mapping	Hu et al '09, Shababo et al '13, Hage et al '19, Triplett et al '22, Antin et al '23, Triplett et al '23	Guest lecture by Marcus Triplett. Slides here. Background slides on Gaussian processes by J. Cunningham here.
Sept 29, Oct 6	Behavioral video analysis	DeepLabCut, DeepGraphPose, MoSeq, PS-VAE, SLEAP, Lightning-Pose, DAART	Guest lecture by Matt Whiteway and Dan Biderman. Slides here.
Oct 13-27	Signal acquisition: spike sorting	Lewicki '98; Pachitariu et al '16; Lee et al '20; Steinmetz et al '21; Calabrese and Paninski '11, Boussard et al '21, Varol et al '21, Wang et al '19, Zanos et al '11	EM notes; Blei et al review on variational inference. Guest lecture by Julien Boussard and Charlie Windolf. Slides here.
Oct 27, Nov 3, 10	Signal acquisition: single-cell-resolution functional imaging	Overview: Pnevmatikakis and Paninski '18 Compression and denoising: Buchanan et al '18, Sun et al '19 Demixing: Pnevmatikakis et al '16; Zhou et al '18; Friedrich et al '17b; Lu et al '17; Giovanucci et al '17; Charles et al '19, Pasarkar et al '23; Saxena et al '20 Deconvolution: Deneux et al '16; Picardo et al '16; Friedrich et al '17a; Berens et al '18, Rupprecht et al '21 Wei and Zhou et al '19	HMM tutorial by Rabiner; HMM notes. Guest lecture by Amol Pasarkar. Slides here.
Nov 10	Presentations of project ideas	Just two minutes each
Nov 17, Dec 8	Poisson regression models; hierarchical models for sharing information across cells; expected log-likelihood	Kass et al '03, Paninski et al '07, Wallstrom et al '08, Batty et al '17, Cadena et al '17, Ramirez and Paninski, '14, Mena and Paninski '14, Soudry et al '15	Generalized linear model notes
Nov 24	No class (University holiday)		Happy thanksgiving!
Dec 1	Decoding methods	Gallego et al '20, Zhang et al '23, Azabou et al '23	Guest lecture by Yizi Zhang. Slides here.
Dec 15	Project presentations		E-mail me your report as a pdf by Dec 18.

Thanks to the NSF for support.