kaushikcfd/feinsum
Database approach for generating near-roofline Einstein Summation kernels.
feinsum manages databases of code transformations for batched Einstein summations, improving memory efficiency by avoiding flushes of operands to DRAM. Rather than shipping pre-compiled kernels, it stores schedule-based transformations recorded from trivial baseline kernels. It integrates with the PyTorch and NumPy ecosystems for tensor operations while targeting high-performance compute backends.
Available on PyPI.
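To make the term "batched Einstein summation" concrete, here is a minimal sketch in plain NumPy. It does not use feinsum's own API. The array names, sizes, and index pattern are hypothetical, chosen only to show several summations with the same index structure sharing one operand. NumPy makes no promise about memory traffic, so this shows the shape of the computation, not the optimization itself.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: 4 elements, 5 degrees of freedom, 3 fields in the batch.
n_el, n_dof, n_fields = 4, 5, 3

# D is the operand shared by every summation in the batch.
D = rng.standard_normal((n_dof, n_dof))
fields = [rng.standard_normal((n_el, n_dof)) for _ in range(n_fields)]

# Unbatched form: one einsum call per field, each of which reads D again.
separate = [np.einsum("ij,ej->ei", D, u) for u in fields]

# Batched form: stack the fields and evaluate every summation in one call.
stacked = np.stack(fields)                       # shape (n_fields, n_el, n_dof)
batched = np.einsum("ij,bej->bei", D, stacked)   # shape (n_fields, n_el, n_dof)

# Both forms compute the same values.
matches = all(np.allclose(batched[b], separate[b]) for b in range(n_fields))
```

The batched form exposes the shared operand to a single kernel, which is the situation in which a code generator can try to keep that operand in fast memory across the whole batch.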
Stars: 3
Forks: 3
Language: Python
License: MIT
Category: (not listed)
Last pushed: Mar 27, 2026
Monthly downloads: 184
Commits (30d): 0
Dependencies: 13
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ai-coding/kaushikcfd/feinsum"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
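The same request can be sketched in Python using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, and the category/owner/repository path pattern is an assumption inferred from the single URL shown above. The response format is not described here, so the sketch returns the body as raw text without parsing it.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example; the pattern below it is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Perform the GET request and return the response body as text (unparsed)."""
    with urlopen(quality_url(category, owner, repo), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Building the URL makes no network call; fetch_quality() does.
url = quality_url("ai-coding", "kaushikcfd", "feinsum")
```

Calling `fetch_quality("ai-coding", "kaushikcfd", "feinsum")` would issue the request and count against the daily limit.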
Related tools
symforce-org/symforce
Fast symbolic computation, code generation, and nonlinear optimization for robotics
inducer/loopy
A code generator for array-based code on CPUs and GPUs
alphaville/optimization-engine
Nonconvex embedded optimization: code generation for fast real-time optimization + ROS support
inducer/islpy
Python wrapper for isl, an integer set library
loco-3d/crocoddyl
Crocoddyl is an optimal control library for robot control under contact sequence. Its solver is...