elvinagam/benchmarking_gpu_inference

Scripts from Neural network inference on Pytorch with tools like ONNX, TensorRT, nvFuser, TorchDynamo, Triton

/ 100

Experimental

No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 1 / 25

Maturity 9 / 25

Community 0 / 25

Stars

Forks

—

Language

Jupyter Notebook

License

Apache-2.0

Category

Last pushed

Nov 15, 2022

Commits (30d)

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/elvinagam/benchmarking_gpu_inference"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

huggingface/optimum

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers...

openvinotoolkit/nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

huggingface/optimum-intel

🤗 Optimum Intel: Accelerate inference with Intel optimization tools

RBLN-SW/optimum-rbln

⚡ A seamless integration of HuggingFace Transformers & Diffusers with RBLN SDK for efficient...