Published onMarch 28, 2025|Views: 121|24 min readApple Silicon Metal vs NVIDIA CUDAcudametalgpuparallel-programmingNotes on the Apple Silicon GPUs: Architecture, Memory Hierarchy, and the Metal Programming framework, and how it compares to NVIDIA CUDA.
Published onMarch 12, 2025|Views: 83|13 min read(Mis)adventures in running CUDA on Google Colab Free TiercudanvcccolabA recap of my day debugging issues with nvcc and nvcc4jupyter on Google Colab's free T4 GPUs, with brief notes on CUDA backward compatibility and compute capability
Published onFebruary 14, 2025|Views: 39|17 min readTensor Puzzles Walkthrough: Optimizations, Comparing Solutionstensorspytorchlinear-algebramachine-learningPart 2 of my Tensor Puzzles Walkthrough series: optimizing solutions to fit the puzzle constraints, and comparing notes with the author.
Published onFebruary 12, 2025|Views: 38|17 min readTensor Puzzles Walkthroughtensorspytorchlinear-algebramachine-learningMy solutions and notes to the tensor broadcasting puzzles created by Sasha Rush.
Published onJune 20, 2024|Views: 10|2 min readTo blog frequently is to blog brieflywritingBlogging frequently, in-depth, and on topics of interest. Choose two.