huggingface/optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy-to-use hardware optimization tools
Supports hardware-specific backends including ONNX Runtime, OpenVINO, TensorRT-LLM, AWS Neuron, and Intel Gaudi through modular installations, enabling optimized inference across diverse accelerators. Provides unified APIs for model export, quantization, and graph optimization while maintaining compatibility with PyTorch, so models can move from research to production without refactoring model code.
3,325 stars and 1,613,657 monthly downloads. Used by 29 other packages. Actively maintained with 4 commits in the last 30 days. Available on PyPI.
Stars
3,325
Forks
624
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 12, 2026
Monthly downloads
1,613,657
Commits (30d)
4
Dependencies
5
Reverse dependents
29
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/huggingface/optimum"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related models
openvinotoolkit/nncf
Neural Network Compression Framework for enhanced OpenVINO™ inference
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
huggingface/optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
RBLN-SW/optimum-rbln
⚡ A seamless integration of HuggingFace Transformers & Diffusers with RBLN SDK for efficient...
eole-nlp/eole
Open language modeling toolkit based on PyTorch