Research

[March 2025]
Scalable Structured Metacognition with Test-Time RL Tree Policies
RL tree search over decoding policies for discrete diffusion models. done as a test-time scaling axis for gemini diffusion. [paper] | [code] † *
[January 2025]
Scaling Inference for Robotics Diffusion Models
Variance-aware inference scaling methods for robotics diffusion models. † *
[October 2024]
Trajectory-Search Diffusion Inference Scaling
Theoretical framework and proof-of-concept for trajectory-search inference scaling of stochastic diffusion models, expanding on deepmind's work for deterministic models. [paper] | [code] *
[January 2018]
Computer Vision for Brain Tumor Diagnosis
Diagnostics for brain tumors using medical images and computer vision. Generative models for dataset balancing. I learned a lot about building infra for running lots of experiments in parallel. †
† = sole contributor.
* = compute sponsored by senior DeepMind researchers (affiliations under NDA).