Tensor Computation Libraries ML Frameworks

C++ and multi-language libraries providing core tensor operations, linear algebra, and multidimensional array manipulation for machine learning. Does NOT include end-to-end deep learning frameworks, neural network layers, or domain-specific applications—focus is on low-level tensor primitives and algebraic optimizations.

There are 115 tensor computation libraries frameworks tracked. 5 score above 70 (verified tier). The highest-rated is apache/tvm at 76/100 with 13,183 stars. 3 of the top 10 are actively maintained.

Get all 115 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=tensor-computation-libraries&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 apache/tvm

Open Machine Learning Compiler Framework

76
Verified
2 uxlfoundation/oneDNN

oneAPI Deep Neural Network Library (oneDNN)

76
Verified
3 Tencent/ncnn

ncnn is a high-performance neural network inference framework optimized for...

74
Verified
4 OpenMined/TenSEAL

A library for doing homomorphic encryption operations on tensors

73
Verified
5 iree-org/iree-turbine

IREE's PyTorch Frontend, based on Torch Dynamo.

71
Verified
6 VeriSilicon/TIM-VX

VeriSilicon Tensor Interface Module

55
Established
7 apache/tvm-ffi

Open ABI and FFI for Machine Learning Systems

54
Established
8 lisitsyn/tapkee

A flexible and efficient С++ template library for dimension reduction

54
Established
9 ARM-software/armnn

Arm NN ML Software.

54
Established
10 libxsmm/tpp-mlir

TPP experimentation on MLIR for linear algebra

53
Established
11 koba-jon/pytorch_cpp

Deep Learning sample programs using PyTorch in C++

51
Established
12 NervanaSystems/neon

Intel® Nervana™ reference deep learning framework committed to best...

51
Established
13 dmlc/dlpack

common in-memory tensor structure

51
Established
14 XUANTIE-RV/csi-nn2

An optimized neural network operator library for chips base on Xuantie CPU.

49
Emerging
15 MegEngine/MegEngine

MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架

47
Emerging
16 keith2018/TinyTorch

A tiny deep learning training framework implemented from scratch in C++ that...

47
Emerging
17 axonn-ai/axonn

Parallel framework for training and fine-tuning deep neural networks

46
Emerging
18 mrphys/tensorflow-nufft

Fast, Native Non-Uniform Fast Fourier Transform for TensorFlow

46
Emerging
19 apple/turicreate

Turi Create simplifies the development of custom machine learning models.

46
Emerging
20 zkmkarlsruhe/ofxTensorFlow2

TensorFlow 2 AI/ML library wrapper for openFrameworks

46
Emerging
21 OpenMachine-ai/tinyfive

TinyFive is a lightweight RISC-V emulator and assembler written in Python...

44
Emerging
22 Neargye/hello_tf_c_api

Neural Network TensorFlow C API

44
Emerging
23 NiuTrans/NiuTensor

NiuTensor is an open-source toolkit developed by a joint team from NLP Lab....

44
Emerging
24 alibaba/BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine...

43
Emerging
25 neo-ai/neo-ai-dlr

Neo-AI-DLR is a common runtime for machine learning models compiled by AWS...

43
Emerging
26 tqchen/tinyflow

Tutorial code on how to build your own Deep Learning System in 2k Lines

43
Emerging
27 wy1iu/LargeMargin_Softmax_Loss

Implementation for

43
Emerging
28 hpi-xnor/BMXNet

(New version is out: https://github.com/hpi-xnor/BMXNet-v2) BMXNet: An...

43
Emerging
29 PatWie/tensorflow-cmake

TensorFlow examples in C, C++, Go and Python without bazel but with cmake...

43
Emerging
30 prabhuomkar/pytorch-cpp

C++ Implementation of PyTorch Tutorials for Everyone

42
Emerging
31 apache/tvm-vta

Open, Modular, Deep Learning Accelerator

42
Emerging
32 LISTENAI/thinker

a lightweight deep learning framework for CSK60XX serial products

42
Emerging
33 TensoriumCore/Tensorium_lib

An early-stage HPC tensor and grid-kernel library for numerical relativity,...

41
Emerging
34 Tiramisu-Compiler/tiramisu

A polyhedral compiler for expressing fast and portable data parallel algorithms

41
Emerging
35 chochain/tensorForth

Forth does tensors, in CUDA.

41
Emerging
36 apache/tvm-rfcs

A home for the final text of all TVM RFCs.

41
Emerging
37 memoakten/ofxMSATensorFlow

C++ openframeworks addon and examples integrating Google's TensorFlow...

41
Emerging
38 CEA-LIST/N2D2

N2D2 is an open source CAD framework for Deep Neural Network simulation and...

40
Emerging
39 nguyenphuminh/catniff

Torch-like deep learning framework for Javascript

40
Emerging
40 Infatoshi/batmobile

High-performance CUDA kernels for equivariant graph neural networks (MACE,...

40
Emerging
41 AngusG/tensorflow-xnor-bnn

BinaryNets in TensorFlow with XNOR GEMM op

40
Emerging
42 facebookresearch/TensorComprehensions

A domain specific language to express machine learning workloads.

40
Emerging
43 vlang/vtl

The V Tensor Library

40
Emerging
44 merrymercy/awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and...

39
Emerging
45 jiazhihao/TASO

The Tensor Algebra SuperOptimizer for Deep Learning

39
Emerging
46 ducha-aiki/LSUVinit

Reference caffe implementation of LSUV initialization

39
Emerging
47 neilcz/libGDL

一个移动端跨平台的gpu+cpu并行计算的cnn框架(A mobile-side cross-platform gpu+cpu parallel...

39
Emerging
48 RubixML/Tensor

A library and extension that provides objects for scientific computing in PHP.

36
Emerging
49 hpi-xnor/BMXNet-v2

BMXNet 2: An Open-Source Binary Neural Network Implementation Based on MXNet

36
Emerging
50 leggedrobotics/tensorflow-cpp

Pre-built TensorFlow for C/C++ and CMake.

35
Emerging
51 mgorshkov/np

⚡ NumPy-style arrays in C++ | CUDA GPU + AVX512 CPU | Tikhonov Regularized...

34
Emerging
52 Pigrecos/TensorFlow.Delphi

TensorFlow.Delphi (TF.Delphi) provides a Delphi(Pascal) Standard binding for...

34
Emerging
53 tensil-ai/tensil

Open source machine learning accelerators

34
Emerging
54 iree-org/iree-torch

Torch Frontend for IREE

33
Emerging
55 google/minimalloc

A lightweight memory allocator for hardware-accelerated machine learning

33
Emerging
56 skylook/tensorflow_cpp

C and C++ examples for tensorflow

32
Emerging
57 nict-wisdom/rannc

RaNNC is an automatic parallelization middleware used to train very...

32
Emerging
58 joennlae/halutmatmul

Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera...

30
Emerging
59 solomontesema/nn-weight-extractor

Neural network weight extractor with batch normalization folding for...

29
Experimental
60 marty1885/tiny-htm

A tiny yet flexable framework for (HTM) Hierarchical Temporal Memory

28
Experimental
61 akielaries/openGPMP

Hardware Accelerated General Purpose Mathematics Package (very stale)

28
Experimental
62 entropy-flux/Tannic

A C++ Tensor Library

27
Experimental
63 lucasdelimanogueira/PyNorch

Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with...

27
Experimental
64 explosion/thinc_gpu_ops

🔮 GPU kernels for Thinc

27
Experimental
65 jasmcaus/hazel

A Tensor Library written in C++.

27
Experimental
66 Savernish/forgeNN

forgeNN is a in-development purpose‑built neural network framework combining...

27
Experimental
67 EECS-583-Group-24/ML-LOOP

Using Machine Learning to Predict the Sequences of Optimization Passes in LLVM

26
Experimental
68 edelsys/colibri

Communication and scheduling framework for MAVlink-based companion systems [...

26
Experimental
69 vishnupsatish/microtensor

Deep learning library (training & inference) and GPT-2 in C++ from scratch.

26
Experimental
70 karimknaebel/itar

tar file index for constant-time member access

26
Experimental
71 Tamerkobba/Parallel-CNN

This project explores the parallelization of Convolutional Neural Networks...

25
Experimental
72 Tensor-Array/Tensor-Array

A C++ machine learning framework/library.

25
Experimental
73 marcosalvalaggio/kiwigrad

Mini deep learning framework written in Python C-API for Speed

25
Experimental
74 SanftMonster/megengine_cpp_training_example

Examples for using MegEngine cpp API for model training, which support...

24
Experimental
75 Gilberto-Galan/Mini-Framework-Deep-Learning-T-Tensor

🚀 Mini-Framework de Deep Learning desarrollado en C++/CUDA con bindings de...

24
Experimental
76 Amanda-Barbara/nvcaffe

study nvcaffe code

24
Experimental
77 tensortask/ttp

Tensor Transport Protocol (TTP)

24
Experimental
78 YconquestY/Needle

Imperative deep learning framework with customized GPU and CPU backend

24
Experimental
79 Accumulated/Accelerating-CNN-on-GPU-using-CUDA-C

This repository is for implementing and accelerating CNN on GPU using NVIDIA...

24
Experimental
80 EzraWolf/TinyMOA

RISC-V CPU with an integrated SRAM-based compute-in-memory (CIM) accelerator...

23
Experimental
81 PedroHenriqueBatistaSilva/AdderNet

Neural networks with zero multiplications at inference. AdderNet + HDC for...

23
Experimental
82 trevorpogue/algebraic-nnhw

Algebraic enhancements for GEMM & AI accelerators

23
Experimental
83 Xeanos7913/Vensor

A simple C++ machine learning library, using Vulkan for GPU acceleration.

23
Experimental
84 alarxx/Tensor-library

Analogue to Math and ML libraries like numpy or pytorch

23
Experimental
85 Li-TianCheng/TinyDL

基于Eigen运算库的深度学习框架(支持CUDA加速)

22
Experimental
86 moritztng/cuda-ml

Machine Learning Framework with Automatic Differentiation and Cuda...

22
Experimental
87 Sharveswar007/SSBLAST

First open-source FP8 linear solver for consumer NVIDIA GPUs — 2-3x faster...

22
Experimental
88 Nathanael007/8bit-cpu

🖥️ Design and simulate a custom 8-bit CPU with a 16-instruction ISA, 5-stage...

22
Experimental
89 dbjwhs/inference-systems-lab

C++17 inference platform: belief propagation, Mamba SSM, MoE, neuro-symbolic...

22
Experimental
90 rikhil-amonkar/pytorch-cpp-layernorm-accelerator

Implemented a custom LayerNorm forward and backward pass extension in C++...

20
Experimental
91 ianmkim/snnpp

Spiking Neural Network implementation in pure C++ with minimal dependencies

20
Experimental
92 Frobeniusnorm/Flint

Tensor math framework for machine learning

19
Experimental
93 albertopirillo/NITRO-D

Novel framework for deep integer-only CNNs operating entirely in the...

18
Experimental
94 l1nkr/DL-Compiler-Navigation

Machine Learning Compiler Road Map

18
Experimental
95 rizkysaputradev/Syntralinepp-Compiler

SyntraLine++ is a compiler-based domain-specific language (DSL) for...

16
Experimental
96 ColeAndrae/tensorgrad

A high-performance deep learning framework written in C++, engineered from...

15
Experimental
97 muditbhargava66/llvm-dl-optimizer

LLVM-based Compiler Optimization for Deep Learning Kernels

15
Experimental
98 Shengxiang-Lin/ParticleFlow-Lite

This is an official implementation of "Lightweight Deep Learning Framework...

15
Experimental
99 xusworld/tars

Tars is a cool deep learning framework.

14
Experimental
100 Naveen-Baburaj/Porting-PyTorch-to-LibTorch-Iris-Neural-Network-Implementation

Porting a PyTorch neural network for Iris classification from Python to...

14
Experimental
101 Tanniefooted733/qemu-cpu-guide

🔍 Explore QEMU CPU models for KVM guests to enhance performance in...

14
Experimental
102 dhrvrc/Sparsity

Native Similarity Searching for Dense & Sparse Vectors. Final Year Project....

14
Experimental
103 tmm-dl/tmm

A hardware-agnostic, plugin-extensible CLI for training deep learning models...

14
Experimental
104 hysmio/tensors-cpp

Just a small repo intended for me to recreate a minimally working example of...

14
Experimental
105 StxGuy/TenaciousTensors

Fortran/C++ library for multidimensional algebra.

12
Experimental
106 rizukirr/ctorch

Re-implement Pytorch in C

12
Experimental
107 BlaiseMuhirwa/gladius

An easy-to-use dynamic computation graph library for running e2e ML training.

11
Experimental
108 laluxx/tensors

single header machine learning library

11
Experimental
109 saivishwak/DeepLearning_Compilers_Research

DeepLearning Compilers Research

11
Experimental
110 DhruvSrikanth/CUDANN

A distributed implementation of a deep learning framework in CUDA.

11
Experimental
111 pgmesa/synapx

A deep learning library powered by a C++ autograd engine, featuring a...

11
Experimental
112 manili/ctorch

CTorch is a super lightweight C implementation of PyTorch, built for...

11
Experimental
113 rexionmars/ctorch

A next library stb-style header-only library for Neural Networks

10
Experimental
114 Rishav-hub/GNetTrainer-package

GNetTrainer is a Deep Learning web application for training and predicting...

10
Experimental
115 Mobink980/Network-on-Chip

As we transition into the era characterized by many-core architectures and...

10
Experimental