Curserve: A Small Research and Hackathon Project November 9, 2025Research, ProgrammingFor Cal Hacks 2025, a few friends and I built Curserve, a fast and scalable server-side engine for agentic coding, which ended …
BERT is just a Single Text Diffusion Step October 20, 2025Machine LearningThis article appeared on Hacker News. Link to the discussion here. A while back, Google DeepMind unveiled Gemini Diffusion, an …
Local SGD and DiLoCo Research Musings October 14, 2025Research, Machine LearningOriginally, I was doing my master’s thesis on this topic. I eventually changed my focus, but these are some of the notes I …
Running GPT-2 in WebGL with Classic GPGPU Programming May 24, 2025Machine Learning, ProgrammingThis article appeared on Hacker News. Link to the discussion here. A few weeks back, I implemented GPT-2 using WebGL and …
Mathematical Statistics February 21, 2024Class NotesMy notes over Mark Maxwell’s course, Introduction to Mathematical Statistics, and his textbook, Probability & Statistics with …
Common Probability Distributions February 8, 2024Class NotesAn overview of common discrete and continuous distributions found in probability and statistics, from Mark Maxwell’s textbook, …
How to Fix Hugo's iOS Code-Block Text-Size Rendering Issue February 4, 2024ProgrammingLately, I’ve been coming across many blogs that have weird font-size rendering issues for code blocks on iOS. Basically, in a …
Intro to Autograd Engines: Karpathy's Micrograd in Go November 11, 2023Machine Learning, ProgrammingFor a while, I wanted to build a complete autograd engine. What is an autograd engine, you might ask? To find the answer, we …
Where Rust Shines: Algebraic Types and Match Statements November 11, 2023ProgrammingRecently I was going through Thorsten Ball’s “Writing An Interpreter in Go”. In this book, you create a basic interpreted …
Favorite Books October 14, 2023MiscellaneousBelow are all the books I’ve read since middle school, roughly in order. Those highlighted in blue were those I particularly …