Tensor Computation Libraries ML Frameworks
C++ and multi-language libraries providing core tensor operations, linear algebra, and multidimensional array manipulation for machine learning. This category does NOT include end-to-end deep learning frameworks, neural network layers, or domain-specific applications; the focus is on low-level tensor primitives and algebraic optimizations.
115 tensor computation libraries are tracked. Five score above 70 (Verified tier). The highest-rated is apache/tvm at 76/100, with 13,183 stars. Three of the top 10 are actively maintained.
Get all 115 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=tensor-computation-libraries&limit=20"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
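The same request can be made from a script. The following is a minimal Python sketch using only the standard library and the query parameters from the curl command above. The helper names `build_url` and `fetch_projects` are hypothetical, and the response is assumed to be JSON; its schema is not documented here, so the sketch returns the decoded payload without assuming any field names. Whether raising `limit` to 115 returns the full list is also an assumption, not something this page confirms.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the dataset URL with the same query parameters as the curl command."""
    params = {
        "domain": "ml-frameworks",
        "subcategory": "tensor-computation-libraries",
        "limit": limit,
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch_projects(limit=20, timeout=30):
    """Fetch the endpoint and decode the body as JSON (assumed response format)."""
    with urlopen(build_url(limit), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one request, which counts against the daily quota. Inspecting the returned object before relying on any particular key is the safer approach, since the field names are not given on this page.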
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 | apache/tvm: Open Machine Learning Compiler Framework | | Verified |
| 2 | uxlfoundation/oneDNN: oneAPI Deep Neural Network Library (oneDNN) | | Verified |
| 3 | Tencent/ncnn: ncnn is a high-performance neural network inference framework optimized for... | | Verified |
| 4 | OpenMined/TenSEAL: A library for doing homomorphic encryption operations on tensors | | Verified |
| 5 | iree-org/iree-turbine: IREE's PyTorch Frontend, based on Torch Dynamo. | | Verified |
| 6 | VeriSilicon/TIM-VX: VeriSilicon Tensor Interface Module | | Established |
| 7 | apache/tvm-ffi: Open ABI and FFI for Machine Learning Systems | | Established |
| 8 | lisitsyn/tapkee: A flexible and efficient C++ template library for dimension reduction | | Established |
| 9 | ARM-software/armnn: Arm NN ML Software. | | Established |
| 10 | libxsmm/tpp-mlir: TPP experimentation on MLIR for linear algebra | | Established |
| 11 | koba-jon/pytorch_cpp: Deep Learning sample programs using PyTorch in C++ | | Established |
| 12 | NervanaSystems/neon: Intel® Nervana™ reference deep learning framework committed to best... | | Established |
| 13 | dmlc/dlpack: common in-memory tensor structure | | Established |
| 14 | XUANTIE-RV/csi-nn2: An optimized neural network operator library for chips base on Xuantie CPU. | | Emerging |
| 15 | MegEngine/MegEngine: MegEngine is a fast, scalable, easy-to-use deep learning framework with automatic differentiation support | | Emerging |
| 16 | keith2018/TinyTorch: A tiny deep learning training framework implemented from scratch in C++ that... | | Emerging |
| 17 | axonn-ai/axonn: Parallel framework for training and fine-tuning deep neural networks | | Emerging |
| 18 | mrphys/tensorflow-nufft: Fast, Native Non-Uniform Fast Fourier Transform for TensorFlow | | Emerging |
| 19 | apple/turicreate: Turi Create simplifies the development of custom machine learning models. | | Emerging |
| 20 | zkmkarlsruhe/ofxTensorFlow2: TensorFlow 2 AI/ML library wrapper for openFrameworks | | Emerging |
| 21 | OpenMachine-ai/tinyfive: TinyFive is a lightweight RISC-V emulator and assembler written in Python... | | Emerging |
| 22 | Neargye/hello_tf_c_api: Neural Network TensorFlow C API | | Emerging |
| 23 | NiuTrans/NiuTensor: NiuTensor is an open-source toolkit developed by a joint team from NLP Lab.... | | Emerging |
| 24 | alibaba/BladeDISC: BladeDISC is an end-to-end DynamIc Shape Compiler project for machine... | | Emerging |
| 25 | neo-ai/neo-ai-dlr: Neo-AI-DLR is a common runtime for machine learning models compiled by AWS... | | Emerging |
| 26 | tqchen/tinyflow: Tutorial code on how to build your own Deep Learning System in 2k Lines | | Emerging |
| 27 | wy1iu/LargeMargin_Softmax_Loss: Implementation for | | Emerging |
| 28 | hpi-xnor/BMXNet: (New version is out: https://github.com/hpi-xnor/BMXNet-v2) BMXNet: An... | | Emerging |
| 29 | PatWie/tensorflow-cmake: TensorFlow examples in C, C++, Go and Python without bazel but with cmake... | | Emerging |
| 30 | prabhuomkar/pytorch-cpp: C++ Implementation of PyTorch Tutorials for Everyone | | Emerging |
| 31 | apache/tvm-vta: Open, Modular, Deep Learning Accelerator | | Emerging |
| 32 | LISTENAI/thinker: a lightweight deep learning framework for CSK60XX serial products | | Emerging |
| 33 | TensoriumCore/Tensorium_lib: An early-stage HPC tensor and grid-kernel library for numerical relativity,... | | Emerging |
| 34 | Tiramisu-Compiler/tiramisu: A polyhedral compiler for expressing fast and portable data parallel algorithms | | Emerging |
| 35 | chochain/tensorForth: Forth does tensors, in CUDA. | | Emerging |
| 36 | apache/tvm-rfcs: A home for the final text of all TVM RFCs. | | Emerging |
| 37 | memoakten/ofxMSATensorFlow: C++ openframeworks addon and examples integrating Google's TensorFlow... | | Emerging |
| 38 | CEA-LIST/N2D2: N2D2 is an open source CAD framework for Deep Neural Network simulation and... | | Emerging |
| 39 | nguyenphuminh/catniff: Torch-like deep learning framework for Javascript | | Emerging |
| 40 | Infatoshi/batmobile: High-performance CUDA kernels for equivariant graph neural networks (MACE,... | | Emerging |
| 41 | AngusG/tensorflow-xnor-bnn: BinaryNets in TensorFlow with XNOR GEMM op | | Emerging |
| 42 | facebookresearch/TensorComprehensions: A domain specific language to express machine learning workloads. | | Emerging |
| 43 | vlang/vtl: The V Tensor Library | | Emerging |
| 44 | merrymercy/awesome-tensor-compilers: A list of awesome compiler projects and papers for tensor computation and... | | Emerging |
| 45 | jiazhihao/TASO: The Tensor Algebra SuperOptimizer for Deep Learning | | Emerging |
| 46 | ducha-aiki/LSUVinit: Reference caffe implementation of LSUV initialization | | Emerging |
| 47 | neilcz/libGDL: A cross-platform mobile CNN framework with GPU+CPU parallel computing (A mobile-side cross-platform gpu+cpu parallel...) | | Emerging |
| 48 | RubixML/Tensor: A library and extension that provides objects for scientific computing in PHP. | | Emerging |
| 49 | hpi-xnor/BMXNet-v2: BMXNet 2: An Open-Source Binary Neural Network Implementation Based on MXNet | | Emerging |
| 50 | leggedrobotics/tensorflow-cpp: Pre-built TensorFlow for C/C++ and CMake. | | Emerging |
| 51 | mgorshkov/np: ⚡ NumPy-style arrays in C++ \| CUDA GPU + AVX512 CPU \| Tikhonov Regularized... | | Emerging |
| 52 | Pigrecos/TensorFlow.Delphi: TensorFlow.Delphi (TF.Delphi) provides a Delphi(Pascal) Standard binding for... | | Emerging |
| 53 | tensil-ai/tensil: Open source machine learning accelerators | | Emerging |
| 54 | iree-org/iree-torch: Torch Frontend for IREE | | Emerging |
| 55 | google/minimalloc: A lightweight memory allocator for hardware-accelerated machine learning | | Emerging |
| 56 | skylook/tensorflow_cpp: C and C++ examples for tensorflow | | Emerging |
| 57 | nict-wisdom/rannc: RaNNC is an automatic parallelization middleware used to train very... | | Emerging |
| 58 | joennlae/halutmatmul: Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera... | | Emerging |
| 59 | solomontesema/nn-weight-extractor: Neural network weight extractor with batch normalization folding for... | | Experimental |
| 60 | marty1885/tiny-htm: A tiny yet flexable framework for (HTM) Hierarchical Temporal Memory | | Experimental |
| 61 | akielaries/openGPMP: Hardware Accelerated General Purpose Mathematics Package (very stale) | | Experimental |
| 62 | entropy-flux/Tannic: A C++ Tensor Library | | Experimental |
| 63 | lucasdelimanogueira/PyNorch: Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with... | | Experimental |
| 64 | explosion/thinc_gpu_ops: 🔮 GPU kernels for Thinc | | Experimental |
| 65 | jasmcaus/hazel: A Tensor Library written in C++. | | Experimental |
| 66 | Savernish/forgeNN: forgeNN is a in-development purpose‑built neural network framework combining... | | Experimental |
| 67 | EECS-583-Group-24/ML-LOOP: Using Machine Learning to Predict the Sequences of Optimization Passes in LLVM | | Experimental |
| 68 | edelsys/colibri: Communication and scheduling framework for MAVlink-based companion systems [... | | Experimental |
| 69 | vishnupsatish/microtensor: Deep learning library (training & inference) and GPT-2 in C++ from scratch. | | Experimental |
| 70 | karimknaebel/itar: tar file index for constant-time member access | | Experimental |
| 71 | Tamerkobba/Parallel-CNN: This project explores the parallelization of Convolutional Neural Networks... | | Experimental |
| 72 | Tensor-Array/Tensor-Array: A C++ machine learning framework/library. | | Experimental |
| 73 | marcosalvalaggio/kiwigrad: Mini deep learning framework written in Python C-API for Speed | | Experimental |
| 74 | SanftMonster/megengine_cpp_training_example: Examples for using MegEngine cpp API for model training, which support... | | Experimental |
| 75 | Gilberto-Galan/Mini-Framework-Deep-Learning-T-Tensor: 🚀 Mini deep learning framework developed in C++/CUDA with bindings for... | | Experimental |
| 76 | Amanda-Barbara/nvcaffe: study nvcaffe code | | Experimental |
| 77 | tensortask/ttp: Tensor Transport Protocol (TTP) | | Experimental |
| 78 | YconquestY/Needle: Imperative deep learning framework with customized GPU and CPU backend | | Experimental |
| 79 | Accumulated/Accelerating-CNN-on-GPU-using-CUDA-C: This repository is for implementing and accelerating CNN on GPU using NVIDIA... | | Experimental |
| 80 | EzraWolf/TinyMOA: RISC-V CPU with an integrated SRAM-based compute-in-memory (CIM) accelerator... | | Experimental |
| 81 | PedroHenriqueBatistaSilva/AdderNet: Neural networks with zero multiplications at inference. AdderNet + HDC for... | | Experimental |
| 82 | trevorpogue/algebraic-nnhw: Algebraic enhancements for GEMM & AI accelerators | | Experimental |
| 83 | Xeanos7913/Vensor: A simple C++ machine learning library, using Vulkan for GPU acceleration. | | Experimental |
| 84 | alarxx/Tensor-library: Analogue to Math and ML libraries like numpy or pytorch | | Experimental |
| 85 | Li-TianCheng/TinyDL: A deep learning framework based on the Eigen library (with CUDA acceleration support) | | Experimental |
| 86 | moritztng/cuda-ml: Machine Learning Framework with Automatic Differentiation and Cuda... | | Experimental |
| 87 | Sharveswar007/SSBLAST: First open-source FP8 linear solver for consumer NVIDIA GPUs — 2-3x faster... | | Experimental |
| 88 | Nathanael007/8bit-cpu: 🖥️ Design and simulate a custom 8-bit CPU with a 16-instruction ISA, 5-stage... | | Experimental |
| 89 | dbjwhs/inference-systems-lab: C++17 inference platform: belief propagation, Mamba SSM, MoE, neuro-symbolic... | | Experimental |
| 90 | rikhil-amonkar/pytorch-cpp-layernorm-accelerator: Implemented a custom LayerNorm forward and backward pass extension in C++... | | Experimental |
| 91 | ianmkim/snnpp: Spiking Neural Network implementation in pure C++ with minimal dependencies | | Experimental |
| 92 | Frobeniusnorm/Flint: Tensor math framework for machine learning | | Experimental |
| 93 | albertopirillo/NITRO-D: Novel framework for deep integer-only CNNs operating entirely in the... | | Experimental |
| 94 | l1nkr/DL-Compiler-Navigation: Machine Learning Compiler Road Map | | Experimental |
| 95 | rizkysaputradev/Syntralinepp-Compiler: SyntraLine++ is a compiler-based domain-specific language (DSL) for... | | Experimental |
| 96 | ColeAndrae/tensorgrad: A high-performance deep learning framework written in C++, engineered from... | | Experimental |
| 97 | muditbhargava66/llvm-dl-optimizer: LLVM-based Compiler Optimization for Deep Learning Kernels | | Experimental |
| 98 | Shengxiang-Lin/ParticleFlow-Lite: This is an official implementation of "Lightweight Deep Learning Framework... | | Experimental |
| 99 | xusworld/tars: Tars is a cool deep learning framework. | | Experimental |
| 100 | Naveen-Baburaj/Porting-PyTorch-to-LibTorch-Iris-Neural-Network-Implementation: Porting a PyTorch neural network for Iris classification from Python to... | | Experimental |
| 101 | Tanniefooted733/qemu-cpu-guide: 🔍 Explore QEMU CPU models for KVM guests to enhance performance in... | | Experimental |
| 102 | dhrvrc/Sparsity: Native Similarity Searching for Dense & Sparse Vectors. Final Year Project.... | | Experimental |
| 103 | tmm-dl/tmm: A hardware-agnostic, plugin-extensible CLI for training deep learning models... | | Experimental |
| 104 | hysmio/tensors-cpp: Just a small repo intended for me to recreate a minimally working example of... | | Experimental |
| 105 | StxGuy/TenaciousTensors: Fortran/C++ library for multidimensional algebra. | | Experimental |
| 106 | rizukirr/ctorch: Re-implement Pytorch in C | | Experimental |
| 107 | BlaiseMuhirwa/gladius: An easy-to-use dynamic computation graph library for running e2e ML training. | | Experimental |
| 108 | laluxx/tensors: single header machine learning library | | Experimental |
| 109 | saivishwak/DeepLearning_Compilers_Research: DeepLearning Compilers Research | | Experimental |
| 110 | DhruvSrikanth/CUDANN: A distributed implementation of a deep learning framework in CUDA. | | Experimental |
| 111 | pgmesa/synapx: A deep learning library powered by a C++ autograd engine, featuring a... | | Experimental |
| 112 | manili/ctorch: CTorch is a super lightweight C implementation of PyTorch, built for... | | Experimental |
| 113 | rexionmars/ctorch: A next library stb-style header-only library for Neural Networks | | Experimental |
| 114 | Rishav-hub/GNetTrainer-package: GNetTrainer is a Deep Learning web application for training and predicting... | | Experimental |
| 115 | Mobink980/Network-on-Chip: As we transition into the era characterized by many-core architectures and... | | Experimental |