sharathStack/LLM-Prompt-Engineering-Evaluation-Toolkit

Benchmarks 5 prompt strategies (zero-shot, CoT, few-shot, role-based, structured output) against a weighted rubric. Produces JSONL annotations for LLM training. Python · NLP

/ 100

Experimental

No License No Package No Dependents

Maintenance 13 / 25

Adoption 0 / 25

Maturity 1 / 25

Community 0 / 25

How are scores calculated?

Stars

—

Forks

—

Language

Python

License

—

Category

output-parsing

Last pushed

Apr 03, 2026

Commits (30d)

GitHub

Output Parsing · 49 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering/sharathStack/LLM-Prompt-Engineering-Evaluation-Toolkit"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

Leonxlnx/agentic-ai-prompt-research

Research into how agentic AI coding assistants work — reconstructed prompt patterns, agent...

antonio0720/writing-intelligence

75 files. 76,327 words. The most advanced writing compiler ever open-sourced — now with a...

jefftriplett/files-to-claude-xml

Use XML tags for long context prompting using Claude's multi-document structure.

m727ichael/context-engineering

Information architecture for AI reasoning. PromptOS + HITL Context Engine. Copy, paste, use

madara88645/Compiler

A tool that compiles messy natural language prompts into a structured intermediate...

Explore Prompt Engineering Tools

All categories Trending Prompt Engineering directory Insights