UncGPT-69 hybrid 🦉

47M-param Jamba-style hybrid (10 Mamba-2 + 2 MQA) + MoE (1+6 experts top-2). Trained ~64 min on 4×L40 from scratch, lm_loss 8.79→1.88. Running in your browser via ONNX Runtime Web + WebGPU.

⏳ initializing onnxruntime-web…
(model output will stream here)
0 tokens tok/s