Research

Ask anyone (including an LLM) how a frontier AI model works, how they arrive at their predictions, and you'll be met with one of two answers: "I don't know" or "it's a statistical algorithm reproducing what it saw in its training data." If you're like me, neither of those answers actually says why the models work, just that they do. What's actually going on inside these models? If they're reproducing patterns in the data they saw during training, which ones? And just exactly how?

I generally want to understand how neural networks learn. I believe this understanding comes from descriptions of simple models (which I will explain in a future blogpost), more specifically on their optimization dynamics and how they create features.

As an early-stage PhD student, a lot of my work is still unpublished, I will try to update this page whenever I have a new preprint.

Kernel Regression 1

Rundown of "Predicting Kernel Regression Learning Curves from only Raw Data Statistics" - the HEA Dec 23, 2025

On real data, we know how kernel regression performs.

Open → arXiv ↗ GitHub ↗

Physics 1

Direct measure of DNA bending by quantum magnetic imaging of a nano-mechanical torque-balance Feb 02, 2026

Quantum mechanics can be used to study microscopic forces!

Open → arXiv ↗