Software-Delivered AI
From research to code, accelerate open-source LLMs and bring operational simplicity to GenAI deployments.
Accelerated Inference With Sparsity
Retains >99% of the FP32 MPT model's accuracy on the GSM dataset
State-of-the-Art Model Optimization Research
In collaboration with the Institute of Science and Technology Austria, Neural Magic conducts LLM compression research and shares its findings with the open-source community, including the state-of-the-art Sparse Fine-Tuning technique.