LLM Fine-Tuning Generative AI Tools

Projects for adapting and training language models on custom datasets using techniques like LoRA, QLoRA, and instruction tuning. Includes domain-specific fine-tuning, multimodal model adaptation, and continual learning. Does NOT include pre-built model APIs, inference frameworks, RAG systems, or prompt engineering tools.

There are 36 LLM fine-tuning tools tracked. The highest-rated is daekeun-ml/genai-ko-LLM at 33/100 with 26 stars.

Get all 36 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=generative-ai&subcategory=llm-fine-tuning&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
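
The same request can be sketched in Python with only the standard library. This is a minimal sketch, not an official client: the query parameters (`domain`, `subcategory`, `limit`) are copied from the curl command above, while the helper names `build_url` and `fetch_projects` are hypothetical. The shape of the JSON response is not documented here, so the example prints it rather than assuming any field names, and it does not assume the endpoint accepts a `limit` larger than the 20 shown above.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example; everything else is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="generative-ai", subcategory="llm-fine-tuning", limit=20):
    """Build the request URL from the query parameters used in the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch the URL and decode the body as JSON (response schema is not assumed)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    data = fetch_projects(build_url())
    # Print the start of the payload to inspect its structure before relying on it.
    print(json.dumps(data, indent=2)[:500])
```

Without a key, requests count against the 100/day allowance, so it is worth saving the response locally rather than re-fetching it on every run.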

# Tool Score Tier
1 daekeun-ml/genai-ko-LLM

This hands-on lab walks you through a step-by-step approach to efficiently...

33
Emerging
2 GURPREETKAURJETHRA/Llama-3-ORPO-Fine-Tuning

Llama 3 ORPO Fine Tuning on A100 in Colab Pro.

27
Experimental
3 ramalamadingdong/onnx-rubikpi

ONNX LLM runtime on RUBIK-Pi with Gemma 1B and Llama 3.2 1B

26
Experimental
4 keanteng/sesame-csm-elise

Fine-Tuning Sesame CSM With Elise. Enjoy the voice ( ̄︶ ̄)↗

25
Experimental
5 sukanyabag/Finetuning-Qwen2-7B-VQA-on-Radiology-Scans

This repository fine-tunes the Qwen2 7B VLM for performing...

23
Experimental
6 DianaDorobantu/legal-llm

Develop a Romanian legal domain Large Language Model (LLM) using pre-trained...

23
Experimental
7 PriyaDas258/llm-biomedical-finetuning-lab

Fine-tune TinyLlama, Phi-2, and Mistral on PubMedQA using LoRA/QLoRA —...

22
Experimental
8 ksm26/Quantization-Fundamentals-with-Hugging-Face

Learn linear quantization techniques using the Quanto library and...

22
Experimental
9 jwest33/lora_craft

An open-source web application for fine-tuning large language models using...

19
Experimental
10 Abu-Sameer-66/ChemLLM-Tox-OLMo

Fine-tuning OLMo-7B with QLoRA & DeepChem for Molecular Toxicity Prediction...

19
Experimental
11 just4give/llm-sagemaker-fargate-api

This repository contains two major projects that work together to deploy and...

17
Experimental
12 zufeshan12/fine-tuning-and-reinforcement-learning-on-llms

Supervised fine-tuning and RLAIF on DeepSeek-math-7b-base using LoRA...

16
Experimental
13 hari9618/Model-Fine-Tnuning_-Hugging-Fcace-

End-to-end Generative AI chatbot built with Hugging Face Transformers,...

15
Experimental
14 sjsayedkader/FineTuning-paris2024-olympics

End-to-end LLM fine-tuning: Paris 2024 Olympics Q&A using Databricks, AWS...

15
Experimental
15 simran-padam/FineTuningLlama

Fine-tuning Llama to create a versatile chatbot

15
Experimental
16 gwr3n/rhetor

Rhetor is an Integrated Modelling Environment that leverages pyopl, a Python...

14
Experimental
17 zengatso/orpo

🚀 Optimize preferences effectively with ORPO, a framework for monolithic...

14
Experimental
18 RenaudGaudron/llm-quantisation-performance-study

Code and data accompanying the article "The impact of quantising a small...

13
Experimental
19 krishnaura45/LMBattle

⚔️Battle between Chatbots 🛠️Finetuning LLMs

13
Experimental
20 bmaxdk/lightweight-fine-tuning-customer-support

PEFT Customer Support Chatbot

12
Experimental
21 monadicarts/mistral-7b-trainer

Mistral 7b v0.3 LLM Model Trainer

12
Experimental
22 Ajairajv/Detoxified-Summaries-with-FLAN-T5-PPO

Fine-tunes FLAN-T5 using Reinforcement Learning (PPO) and PEFT to generate...

11
Experimental
23 Akarsh1/Exploring-Unsloth-Library-for-Fine-Tuning

This is a sample notebook that can be used for exploring the fine-tuning of...

11
Experimental
24 Ajairajv/Fine-Tuning-a-Generative-AI-Model

Fine-tunes FLAN-T5 for dialogue summarization using full fine-tuning and...

11
Experimental
25 shreyas27092004/Generative-AI-Model-Fine-Tuning-Hugging-Face-Transformers

Fine-tuning a Generative AI model using Hugging Face Transformers. Includes...

11
Experimental
26 shreyas27092004/Generative-AI-Summarization-with-FLAN-T5

The primary focus of this lab is to explore dialogue summarization using the...

11
Experimental
27 ayushtiwari134/llm_fine_tuning

This model is fine-tuned to respond like Michael Gary Scott, Regional...

11
Experimental
28 MSWagner/qwen-lora-grpo-letter-counting

Fine-tuning Qwen2.5-3B-Instruct model with LoRA (Low-Rank Adaptation) and...

11
Experimental
29 prakhar175/fine-tuning-mistral-7b

Fine-tuned a quantized mistral-7b-inst model using QLoRA for YouTube comment

11
Experimental
30 KishanVadhiya/Tokenizer

A simple tokenizer visualizer for processing text data into tokens to...

11
Experimental
31 Xebec19/custom-tokeniser

Custom tokeniser to replicate how LLMs tokenize input and output

11
Experimental
32 ictorv/LangToken

A library that encodes and decodes each word for language tasks.

10
Experimental
33 mehrdadalmasi2020/microsoft_MiniLM_L12_H384_uncased

A library that leverages the pre-trained microsoft_MiniLM-L12-H384-uncased...

10
Experimental
34 rvats20/LLM-Classification-Finetuning

Welcome to the LLM Classification Finetuning repository! This project...

10
Experimental
35 s-araromi/My-SageMaker-IT-Domain-Expert-Project

This project demonstrates how to fine-tune a large language model (LLM) for...

10
Experimental
36 mehrdadalmasi2020/bert-fine-tuning-text-classifier-lux

A library that leverages pre-trained BERT models for multilingual text...

10
Experimental