Ph.D. Candidate, Electrical and Computer Engineering
University of California, Santa Cruz
Advisor: Jason K. Eshraghian
I am on the 2026 job market and expect to graduate this year. My CV is here.
Previously, I worked on spiking neural networks, contributing to
snnTorch,
SpikingJelly, and building
SpikeGPT.
My research has since shifted to scalable and efficient sequence modeling architectures,
and how to scale them.
Find me on GitHub, Google Scholar, X (Twitter), and CV.
Email: ridger@ucsc.edu
I am interested in building scalable and efficient sequence modeling architectures as an alternative to standard Transformers. On the architecture side, I have joined the development of linear attention and recurrent models that achieve Transformer-level quality at a fraction of the cost:
What I care about most is touching scaling with my own hands. My personal scaling trajectory covers three orders of magnitude in compute:
For each of these runs, I watched every checkpoint from the very first to the last, witnessing a model go from random to intelligence. That is what I am really enjoying. The journey is the reward.
I was also fortunate to spend a year interning at Seed, where I was deeply involved in LLM pre-training research. Across looped language models, concept models, and data synthesis (MGA), I developed a hands-on understanding of how architectures, data, and training dynamics interact at scale.
Please refer to publications for the full list.
This website is adapted from Tianyu Gao's design, which is in turn adapted from Gregory Gunderson.