ML Experiment Tracking (ML Frameworks)

Tools for versioning, tracking, and managing machine learning experiments, including data versioning, model checkpoints, metrics logging, and experiment comparison. Does NOT include model serving, deployment infrastructure, hyperparameter optimization frameworks, or general MLOps pipeline orchestration.

This list tracks 102 ML experiment tracking frameworks. Two score above 70 (Verified tier). The highest-rated is treeverse/dvc at 85/100, with 15,443 stars and 2,111,672 monthly downloads. Three of the top 10 are actively maintained.

Get all 102 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=ml-experiment-tracking&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
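The same request can be made from Python. The following is a minimal sketch using only the standard library. The base URL and the `domain`, `subcategory`, and `limit` parameters are taken from the curl command above; the helper names `build_url` and `fetch` are hypothetical, and the shape of the JSON response is not documented here, so the sketch returns whatever the server sends without assuming field names. Whether a larger `limit` (for example 102) returns the full list is also an assumption.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl example; no API key is sent.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks",
              subcategory="ml-experiment-tracking",
              limit=20):
    """Build the query URL (hypothetical helper, not part of any SDK)."""
    query = urlencode({
        "domain": domain,
        "subcategory": subcategory,
        "limit": limit,
    })
    return f"{BASE_URL}?{query}"


def fetch(url, timeout=30):
    """GET the URL and parse the body as JSON.

    The response schema is unknown here, so the parsed value is
    returned as-is.
    """
    with urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch(build_url())` performs the network request; each call counts against the daily request quota mentioned above.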

# Framework Score Tier
1 treeverse/dvc

🦉 Data Versioning and ML Experiments

85
Verified
2 runpod/runpod-python

🐍 | Python library for RunPod API and serverless worker SDK.

79
Verified
3 uber/petastorm

Petastorm library enables single machine or distributed training and...

67
Established
4 carsdotcom/skelebot

Machine Learning Project Development Tool

65
Established
5 microsoft/vscode-jupyter

VS Code Jupyter extension

64
Established
6 operatorai/modelstore

🏬 modelstore is a Python library that allows you to version, export, and...

61
Established
7 deepchecks/deepchecks

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks...

61
Established
8 J535D165/recordlinkage

A powerful and modular toolkit for record linkage and duplicate detection in Python

60
Established
9 4paradigm/OpenMLDB

OpenMLDB is an open-source machine learning database that provides a feature...

60
Established
10 GokuMohandas/Made-With-ML

Learn how to develop, deploy and iterate on production-grade ML applications.

59
Established
11 mad-lab-fau/tpcp

Pipeline and Dataset helpers for complex algorithm evaluation.

58
Established
12 ml6team/fondant

Production-ready data processing made easy and shareable

57
Established
13 lazyscribe/lazyscribe

Lightweight, lazy model experiment logging

55
Established
14 floydhub/floyd-cli

Command line tool for FloydHub - the fastest way to build, train, and deploy...

55
Established
15 basetenlabs/truss-examples

Examples of models deployable with Truss

54
Established
16 nuhame/mlpug

MLPug is a library for training and evaluating Machine Learning (ML) models,...

54
Established
17 openml/OpenML

Open Machine Learning

53
Established
18 treeverse/dvclive

📈 Log and track ML metrics, parameters, models with Git and/or DVC

53
Established
19 deepnote/jupyterlab-deepnote

A .deepnote file viewer extension for JupyterLab

53
Established
20 firefly-cpp/succulent

A lightweight framework for collecting and processing data from HTTP POST requests

53
Established
21 IBM/sail

Library for streaming data and incremental learning algorithms.

50
Established
22 nidhaloff/igel

a delightful machine learning tool that allows you to train, test, and use...

50
Established
23 zincware/ZnTrack

Create, visualize, run & benchmark DVC pipelines in Python & Jupyter notebooks.

49
Emerging
24 ipums/hlink

Hierarchical record linkage at scale

49
Emerging
25 treeverse/vscode-dvc

Machine learning experiment tracking and data versioning with DVC extension...

48
Emerging
26 deepnote/vscode-deepnote

Deepnote extension for VSCode, Cursor and Windsurf

46
Emerging
27 eto-ai/rikai

Parquet-based ML data format optimized for working with unstructured data

45
Emerging
28 CubicZebra/informatics

Framework of fast implementation data processing and operating pipelines

45
Emerging
29 HoloClean/holoclean

A Machine Learning System for Data Enrichment.

44
Emerging
30 TangleML/tangle

Tangle is a web app that allows the users to build and run Machine Learning...

43
Emerging
31 regel/loudml

Loud ML is the first open-source AI solution for ICT and IoT automation

43
Emerging
32 aporia-ai/mlnotify

🔔 No need to keep checking your training - just one import line and you'll...

43
Emerging
33 ahkarami/Deep-Learning-in-Production

In this repository, I will share some useful notes and references about...

41
Emerging
34 amikos-tech/chromadb-data-pipes

ChromaDB Data Pipes 🖇️ - The easiest way to get data into and out of ChromaDB

41
Emerging
35 yottalabsai/YottaML

Python SDK and CLI for the YottaML cloud GPU platform. Manage pods,...

40
Emerging
36 iterative/features

A collection of development container 'features' for machine learning and...

39
Emerging
37 AbdoulayeSeydi/mlbuild

MLBuild enforces inference performance SLAs in CI, automatically blocking...

38
Emerging
38 pierpierpy/LightML

Experiment tracking that stays out of your way. One pip install, one .db...

37
Emerging
39 VashuTheGreat/ML-Learner

Full stack machine learning learning repository covering model training,...

37
Emerging
40 lightforever/mlcomp

Distributed DAG (Directed acyclic graph) framework for machine learning with UI

36
Emerging
41 TangleML/website

Tangle is a web app that allows the users to build and run Machine Learning...

35
Emerging
42 pailabteam/pailab

a package for versioning, automatization and analysis of machine learning development

35
Emerging
43 DigitalKin-ai/kin-kernel

This package is designed to enable developers to create Cells

35
Emerging
44 IBM/mlapp

MLApp is a Python library for building scalable data science solutions that...

34
Emerging
45 replicate/keepsake

Version control for machine learning

34
Emerging
46 Bread-Technologies/Bread-Dataset-Viewer

VS Code extension to easily view and handle large datasets. Look at...

34
Emerging
47 aidd-msca/registry-factory

An abstract implementation of the registry design pattern proposed in...

34
Emerging
48 paletteml/mlsync

Sync your ML data with your favorite productivity tools!

33
Emerging
49 treeverse/example-get-started-experiments

Get started DVC project

32
Emerging
50 modelhub-ai/modelhub

A collection of deep learning models with a unified API.

31
Emerging
51 oap-project/cloudtik

Cloud Scale Platform for Distributed Analytics and AI

30
Emerging
52 ModelChimp/modelchimp

Experiment tracking for machine and deep learning projects

30
Emerging
53 iterative/VSCode-DVC-Workshop

Workshop about DVC VSCode Extension

29
Experimental
54 modelhub-ai/modelhub-engine

Backend library, framework, and API for models in modelhub

29
Experimental
55 UETAILab/uetai

Custom ML tracking experiment and debugging tools.

29
Experimental
56 treeverse/chocolatey-dvc

Chocolatey package for dvc

28
Experimental
57 AICoE/experiment-tracking

Experiment Tracking for Machine Learning Jobs

28
Experimental
58 FanaticPythoner/AutoAi

AI automation library that allows automatic training for a large amount of...

28
Experimental
59 amakelov/mandala

A simple & elegant experiment tracking framework that integrates persistence...

28
Experimental
60 guildai/guildai-r

Track machine learning experiments

27
Experimental
61 radiantone/entangle

A lightweight (serverless) native python parallel processing framework based...

27
Experimental
62 finegrain-ai/trackio-tool

A tool to work with Trackio data files.

26
Experimental
63 npatta01/pytorch-serving-workshop

Slides and notebook for the workshop on serving bert models in production

26
Experimental
64 kubegems/modelx

A simple, High-Performance, Scalable ML/DL Models Repository based on OCI Artifacts

25
Experimental
65 JuliaAI/DearDiary.jl

A lightweight but powerful machine learning experiment tracking tool for Julia

25
Experimental
66 shcheklein/dvc-docker-example

An example of DVC pipeline with a Docker-wrapped command

23
Experimental
67 galafis/feature-store-architecture

Feature Store for ML: online (Redis) and offline (Parquet) storage, API...

23
Experimental
68 CompBio-Lab/MESSI-pipeline

MESSI - Multimodal Experiments with SyStematic Interrogation in nextflow

23
Experimental
69 liblaf/cherries

🍒 Sweet experiment tracking with Comet, DVC, and Git integration.

23
Experimental
70 MukundaKatta/StreamPipe

Composable async streaming pipeline framework for LLM responses and data processing

22
Experimental
71 anekanews777/tinytracker

🔬 Track your ML experiments effortlessly with TinyTracker - local,...

22
Experimental
72 homeofe/BMAS

BMAS: Research project for AI-assisted metric evaluation and experiment tracking

22
Experimental
73 durandtibo/minrecord

Minimalist library to record values in a ML workflow

22
Experimental
74 Sidgithub18/mlbuild

Enforce ML model performance in CI/CD by benchmarking inference, validating...

22
Experimental
75 srinagadurga3455/mlforge

End-to-end ML pipeline library - from data loading to production monitoring

22
Experimental
76 MukundaKatta/PipelineAI

CI/CD pipeline generator - auto-detect projects and generate GitHub Actions,...

22
Experimental
77 rogue-agent1/spinner

spinner - Terminal spinner/progress wrapper for long-running commands

22
Experimental
78 rhussain21/edge-ai-pipeline

Building a structured corpus of industrial automation knowledge for AI systems

22
Experimental
79 MukundaKatta/ConfigAI

Smart config file generator - auto-detect projects, generate...

22
Experimental
80 guilycst/lazy-dvc

A serverless-style LFS alternative that uses GitHub Org membership as...

22
Experimental
81 okenakt/protium

Interactive Python environment for VS Code with flexible execution in your workflow

21
Experimental
82 SymbioticLab/ModelKeeper

A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup

20
Experimental
83 Minoro/pgpyml

An in-database machine learning solution to run python models in Postgres

20
Experimental
84 knap-ai/knapsack

Fast, private data connectors for AI ⚡️🤖

20
Experimental
85 NoOPeEKS/DataNvim

A fully-featured batteries-included Neovim distribution for the world of...

20
Experimental
86 raymon-ai/raymon

The official http://raymon.ai data profiling and logging library.

20
Experimental
87 getindata/quickstart-ml-blueprints

Data science project development best practices and state of the art...

19
Experimental
88 getindata/quickstart-ml-starter

Kedro starters to quickly set up new projects according to QuickStart ML...

19
Experimental
89 mosh3eb/TrainKeeper

TrainKeeper is a minimal-decision, high-signal toolkit for building...

19
Experimental
90 MPX0222/BLS-APIs

A Modified Toolbox for Broad Learning System, with sklearn-like APIs and...

19
Experimental
91 rack2cloud/ai-cluster-failure-modes

Diagnostic frameworks for high-performance compute architectures, checkpoint...

19
Experimental
92 the-ai-merge/production-hub

Hands-on hub to learn techniques to optimize and serve AI models to...

18
Experimental
93 felixmccuaig/flowbase

A declarative ML platform for tabular data that eliminates infrastructure...

15
Experimental
94 Scontel/distributed-ml-ops

Scalable MLOps framework for distributed training and model deployment with...

14
Experimental
95 rogue-agent1/middleware-chain-py

Middleware pipeline pattern

14
Experimental
96 Kind-italianwoodbine415/warm-start

Speed up code sessions by automatically loading your project's git state,...

14
Experimental
97 Darekmi9/feature-store-v1

🚀 Build a local-first, cloud-extensible Feature Store to simplify MLOps,...

14
Experimental
98 KUNAL7231/ml-itg

🤖 Streamline machine learning workflows with ml-itg, a toolkit designed for...

14
Experimental
99 iterative/studio-support

❓ DVC Studio Issues, Question, and Discussions

12
Experimental
100 josephmachado/data-pipeline-generator

Data pipeline code generator for portfolio projects

11
Experimental
101 seunboy1/DVC_GITHUB_CODE

This contains the dvc files created from data versioning.

10
Experimental
102 nidhinradh/quickstartml

Quickstart ML is a tool to generate boilerplate code for your machine...

10
Experimental
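
The tier labels in the list appear to follow fixed score bands. The sketch below encodes cutoffs inferred only from the rows shown here (lowest Established score 50, highest Emerging 49, lowest Emerging 30, highest Experimental 29); they are not published thresholds. No listed project scores between 68 and 78, so the Verified boundary at 70 is an assumption based on the "above 70" wording, and the function name `tier_for` is hypothetical.

```python
def tier_for(score: int) -> str:
    """Map a 0-100 quality score to a tier label.

    Cutoffs are inferred from the listing, not taken from any
    documented scoring rule; treating exactly 70 as Verified is
    an assumption.
    """
    if score >= 70:
        return "Verified"
    if score >= 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"
```

For example, `tier_for(85)` gives "Verified" for treeverse/dvc, and `tier_for(49)` gives "Emerging" for zincware/ZnTrack, matching the rows above.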