← portfolio · barisgunaydin.com

webgpu-fusion-max

Pushing fused WebGPU transformer kernels to max model size — int4, tiled FFN, Phi-3-mini 3.6B in Chrome

github →

HTML · 3 commits synced · last 2026-05-04