05/03/2026
"The Universal Runtime Vision: Why We're Not Targeting Mobile (And What We're Building Instead)"
NeuroBrix has a long-term goal: become the standard runtime for neural network inference. Any model. Any GPU architecture. One engine.
Here's our Phase 3 vision for 2027 โ and one deliberately honest decision.
๐+ ๐๐๐ ๐๐ซ๐๐ก๐ข๐ญ๐๐๐ญ๐ฎ๐ซ๐๐ฌ. NVIDIA, AMD, Intel, Apple Silicon, ARM (Jetson, Snapdragon), and RISC-V. The Prism solver already abstracts hardware into YAML profiles โ adding a new GPU family means writing a hardware profile and validating dtype support. The ex*****on engine stays the same.
๐๐% ๐๐ซ๐ข๐ญ๐จ๐ง ๐ค๐๐ซ๐ง๐๐ฅ ๐๐จ๐ฏ๐๐ซ๐๐ ๐. Today, NeuroBrix uses PyTorch ATen as the default dispatch layer, with optional Triton kernels. By 2027, the vast majority of operations will have custom Triton implementations โ reducing PyTorch to a weight-loading utility, not a runtime dependency.
๐๐ซ๐๐ฉ๐ก ๐๐๐๐ฎ๐ ๐ ๐๐ซ. Set breakpoints inside the computation graph. Inspect intermediate tensors at any point. Step through ex*****on op-by-op. Today, we have NBX_TRACE_ZEROS, NBX_TRACE_NAN, and NBX_NAN_GUARD as environment variables for debugging. The graph debugger turns this into a proper interactive tool.
๐๐๐ ๐๐จ๐ซ ๐ข๐ง๐ญ๐๐ ๐ซ๐๐ญ๐ข๐จ๐ง๐ฌ. A stable Python API for embedding NeuroBrix in other applications: web services, batch pipelines, and orchestration platforms. Import, load, execute โ three calls.
๐๐๐+ ๐ฆ๐จ๐๐๐ฅ๐ฌ. Comprehensive coverage across diffusion, LLM, multimodal, audio, and video. Every model uses the same .nbx format, runtime, and CLI.
Now โ the honest part.
๐๐ ๐๐ซ๐ ๐ง๐จ๐ญ ๐ญ๐๐ซ๐ ๐๐ญ๐ข๐ง๐ ๐ฆ๐จ๐๐ข๐ฅ๐.
NeuroBrix is built on Python and Triton. These don't run on phones. We will not compile to WASM, ship an iOS framework, or pretend that mobile inference is around the corner for us.
Server-side and edge GPUs (Jetson, ARM servers) are real targets. Apple Silicon Macs are a real target. Phones and browsers are not.
If you need on-device mobile inference, Core ML, TensorFlow Lite, and ONNX Runtime Mobile are better tools. We'd rather point you to the right solution than ship a bad experience.
This is a deliberate technical decision. We believe doing fewer things exceptionally well is more valuable than doing everything poorly.
If Triton gains mobile support or WebGPU matures for real inference โ we'll revisit. But we don't chase hype.
Follow: github.com/NeuroBrix/neurobrix
Open source. Apache 2.0. pip install neurobrix
Universal AI Runtime โ Execute any model on any hardware - NeuroBrix/neurobrix