Prompt Experimentation Platforms: Prompt Engineering Tools
Tools for systematic A/B testing, comparison, and evaluation of LLM prompts across multiple models and variants. Includes statistical analysis, cost/performance measurement, and playground environments for prompt optimization. Does NOT include prompt templates, prompt collections, general LLM evaluation frameworks, or prompt management without experimentation features.
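As one illustration of the statistical analysis this category refers to, the following is a minimal sketch of a two-sided permutation test comparing two prompt variants. It is not taken from any tool listed here. The function name `permutation_test`, and the idea that each run of a prompt yields one numeric quality score, are assumptions made for the example.

```python
import random
from statistics import mean


def permutation_test(scores_a, scores_b, n_permutations=10_000, seed=0):
    """Two-sided permutation test on the difference in mean score.

    scores_a / scores_b: hypothetical per-run quality scores for prompt
    variants A and B (how a run is scored is outside this sketch).
    Returns (observed_difference, estimated_p_value).
    """
    rng = random.Random(seed)  # fixed seed so repeated calls agree
    observed = mean(scores_a) - mean(scores_b)
    pooled = list(scores_a) + list(scores_b)
    n_a = len(scores_a)

    extreme = 0
    for _ in range(n_permutations):
        # Reassign scores to the two variants at random.
        rng.shuffle(pooled)
        diff = mean(pooled[:n_a]) - mean(pooled[n_a:])
        # Count shuffles at least as extreme as the observed difference
        # (small tolerance guards against floating-point noise).
        if abs(diff) >= abs(observed) - 1e-12:
            extreme += 1

    # Add-one smoothing keeps the estimate strictly above zero.
    p_value = (extreme + 1) / (n_permutations + 1)
    return observed, p_value
```

A call such as `permutation_test(scores_for_a, scores_for_b)` returns the mean difference and a p-value; a small p-value suggests the gap between the two variants is unlikely to come from run-to-run noise alone. A permutation test is used here because it makes no assumption about how the scores are distributed.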
29 prompt experimentation tools are tracked. The highest-rated is Supervertaler/Supervertaler-Workbench, at 45/100 with 26 stars.
Get all 29 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=prompt-engineering&subcategory=prompt-experimentation-platforms&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
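The same request can be made from Python. The sketch below uses only the standard library and the query parameters from the curl command above. The helper names `build_url` and `fetch_projects` are hypothetical, and because the response schema is not described here, the decoded JSON is returned unchanged. Whether the API accepts a `limit` above 20 is also not stated, so the default mirrors the curl example.

```python
import json
import urllib.parse
import urllib.request

# Endpoint copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the dataset URL with the same query parameters as the curl example."""
    params = {
        "domain": "prompt-engineering",
        "subcategory": "prompt-experimentation-platforms",
        "limit": limit,
    }
    return BASE_URL + "?" + urllib.parse.urlencode(params)


def fetch_projects(limit=20, timeout=30):
    """Send the keyless GET request and return the decoded JSON as-is.

    No assumptions are made about the structure of the response.
    """
    with urllib.request.urlopen(build_url(limit), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_projects()` performs one request, which counts against the daily quota mentioned above.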
| # | Tool | Description | Score | Tier |
|---|---|---|---|---|
| 1 | Supervertaler/Supervertaler-Workbench | Open-source, AI-enhanced CAT tool with multi-LLM support, translation... | 45/100 | Emerging |
| 2 | Mirascope/lilypad | Open-source versioning, tracing, and annotation tooling. | | Emerging |
| 3 | crjaensch/PromptoLab | A multi-platform app to serve as a prompts catalog, an LLM playground for... | | Emerging |
| 4 | parea-ai/parea-sdk-py | Python SDK for experimenting, testing, evaluating & monitoring LLM-powered... | | Emerging |
| 5 | dbhavery/promptlab | Prompt testing framework: pytest for LLM prompts. Define prompts as YAML,... | | Experimental |
| 6 | MukundaKatta/PromptLab | Prompt experimentation workspace: A/B testing prompt variants with... | | Experimental |
| 7 | jeong-se-hun/autotune-skill | Eval-first tuning skill for prompts, docs, skills, and code with guards,... | | Experimental |
| 8 | geeknees/sentinel_rb | SentinelRb is an LLM-driven prompt inspector designed to automatically... | | Experimental |
| 9 | tmam-dev/tmam-python-sdk | An open-source LLM engineering platform featuring observability, metrics,... | | Experimental |
| 10 | magifd2/log_analyzer | A Python-based CLI tool for analyzing large log files (JSONL) with Large... | | Experimental |
| 11 | Personaz1/prompt-qa-lab | Regression and evaluation toolkit for prompt and agent output quality | | Experimental |
| 12 | NeuroTinkerLab/synt-e-project | A Python tool to translate natural language requests into efficient,... | | Experimental |
| 13 | vesper-astrena/promptlab | Test and compare LLM prompts. Measure response time, tokens, and cost.... | | Experimental |
| 14 | fernandoxx73/department-of-truth | An experimental Python interface testing LLM constraint enforcement. It... | | Experimental |
| 15 | dakshjain-1616/promptfight | Minimal prompt A/B testing: run two prompts 30 times, get winner + p-value +... | | Experimental |
| 16 | albipuliga/PromptLab | Manage, test, and compare your prompts with different models. | | Experimental |
| 17 | akashjindal423/Promptlab | The open-source prompt engineering workbench. Analyse your LLM prompts... | | Experimental |
| 18 | rldyourmnd/local-llm-prompt-optimizer | Offline prompt A/B testing, scoring & auto-tuning for local LLMs | | Experimental |
| 19 | prompt-foundry/python-sdk | The prompt engineering, prompt management, and prompt evaluation tool for Python | | Experimental |
| 20 | artefactop/promptdev | A prompt evaluation framework that provides comprehensive testing for AI... | | Experimental |
| 21 | martinklepsch/llm-web-ui | A web UI for the `llm` command line tool | | Experimental |
| 22 | mangobanaani/semantic-ui | Minimal web interface for Large Language Models using Semantic Kernel | | Experimental |
| 23 | oruizramos/Blender-structured-knowledge-FAQ-retrieval | PromptLab is a Python experimental framework for systematic prompt... | | Experimental |
| 24 | theishanpathak/prompt-tester | Precision API analytics engine developed in Java 17 to track LLM usage... | | Experimental |
| 25 | EltonCN/toolpy | Python module made to facilitate the creation of tools using LLMs. | | Experimental |
| 26 | joncoded/keywords | keying in those words to understand them better (Next.js + Llama LLM + decap CMS) | | Experimental |
| 27 | prompt-foundry/java-sdk | The prompt engineering, prompt management, and prompt evaluation tool for Java. | | Experimental |
| 28 | Shawn91/promtrix | An intuitive GUI for evaluating and optimizing prompts and LLMs | | Experimental |
| 29 | prompt-foundry/ruby-sdk | The prompt engineering, prompt management, and prompt evaluation tool for Ruby. | | Experimental |