Instruction Tuning Datasets LLM Tools
Datasets, papers, and resources specifically for instruction tuning and instruction-following in LLMs. Does NOT include general fine-tuning methods, evaluation benchmarks, or model inference tools.
There are 28 instruction tuning datasets tools tracked. 1 score above 50 (established tier). The highest-rated is MantisAI/sieves at 54/100 with 125 stars and 605 monthly downloads.
Get all 28 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=instruction-tuning-datasets&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
MantisAI/sieves
Plug-and-play document AI with zero-shot models. |
|
Established |
| 2 |
xiaoya-li/Instruction-Tuning-Survey
Project for the paper entitled `Instruction Tuning for Large Language... |
|
Emerging |
| 3 |
princeton-pli/STAT
Skill-Targeted Adaptive Training |
|
Experimental |
| 4 |
TencentARC-QQ/TagGPT
TagGPT: Large Language Models are Zero-shot Multimodal Taggers |
|
Experimental |
| 5 |
rafaelpierre/bullet
bullet: A Zero-Shot / Few-Shot Learning, LLM Based, text classification framework |
|
Experimental |
| 6 |
amazon-science/adaptive-in-context-learning
AdaICL: Which Examples to Annotate of In-Context Learning? Towards Effective... |
|
Experimental |
| 7 |
18907305772/Explore-Instruct
EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage... |
|
Experimental |
| 8 |
andrewzamai/SLIMER_IT
An Instruction-tuned LLM for zero-shot NER on Italian |
|
Experimental |
| 9 |
Shivanshu-Gupta/in-context-learning
Easy in-context learning experiemnts with variety of datasets, LLMs, and... |
|
Experimental |
| 10 |
LIN-SHANG/InstructERC
The offical realization of InstructERC |
|
Experimental |
| 11 |
Lichang-Chen/InstructZero
Official Implementation of InstructZero; the first framework to optimize bad... |
|
Experimental |
| 12 |
OpenGVLab/Instruct2Act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with... |
|
Experimental |
| 13 |
LGDiMaggio/few-shot-fault-diagnosis-multimodal-LLM
Few-shot bearing fault diagnosis using multimodal LLMs and prototypical networks |
|
Experimental |
| 14 |
HamedBabaei/author-profiling-pan2023
Symbol Team model for PAN@AP 2023 shared task on Profiling Cryptocurrency... |
|
Experimental |
| 15 |
raunak-agarwal/instruction-datasets
Datasets for Instruction Tuning of Large Language Models |
|
Experimental |
| 16 |
basicv8vc/chinese-instruction-datasets-for-llms
用于微调LLM的中文指令数据集 |
|
Experimental |
| 17 |
MK2112/conflicting-few-shots
experiments on how conflicting few-shot examples affect emotion... |
|
Experimental |
| 18 |
OpenDFM/HeadsUp
[ICML 2025] Codes for the paper "Heads up! Large Language Models Can Perform... |
|
Experimental |
| 19 |
snowood1/Zero-Shot-PLOVER
Leveraging Codebook Knowledge with NLI and ChatGPT for Zero-Shot Political... |
|
Experimental |
| 20 |
MiuLab/InstUPR
Source code of our paper "InstUPR: Instruction-based Unsupervised Passage... |
|
Experimental |
| 21 |
A-baoYang/instruction-finetune-datasets
Collect and maintain high quality instruction finetune datasets in different... |
|
Experimental |
| 22 |
andrewzamai/SLIMER
Show Less, Instruct More: Enriching Prompts with Definitions and Guidelines... |
|
Experimental |
| 23 |
Reason-Wang/notable-instruction-llm
The repo collects model and data projects for instruction following large... |
|
Experimental |
| 24 |
Showndarya/Few-Shot-ChatGPT
Zero-Shot and Few-shot learning method using ChatGPT on problem sets |
|
Experimental |
| 25 |
mukhal/icl-ensembling
[Me-FoMo ICLR 2023 - Oral] Exploring Demonstration Ensembling for In-context Learning |
|
Experimental |
| 26 |
Ghost---Shadow/InSQuaD
InSQuaD is a research framework for efficient in-context learning that... |
|
Experimental |
| 27 |
DeperiasKerre/qpInstruct
Instruction Dataset for QCL properties Extraction from Text |
|
Experimental |
| 28 |
davidandym/Multitask-Transfer-Instruction-Tuning
This is the official code repository for the ACL Findings Paper "Multi-Task... |
|
Experimental |