Pruna helps developers shrink and speed up AI models for better performance. It offers tools like caching and quantization to make your models more efficient, with a focus on low overhead.
Accelerate and optimize your AI models with this Python framework.
Machine learning engineers and developers looking to improve AI model performance.
Not enough data yet. Star history will appear after a few days of tracking.