LLM Fine-Tuning LLM Tools
Tools, frameworks, and techniques for fine-tuning Large Language Models using methods like LoRA, QLoRA, and instruction tuning on custom datasets. Does NOT include base model training, inference serving, or general LLM applications.
There are 169 llm fine-tuning tools tracked. 1 score above 70 (verified tier). The highest-rated is axolotl-ai-cloud/axolotl at 81/100 with 11,429 stars. 2 of the top 10 are actively maintained.
Get all 169 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=llm-fine-tuning&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions |
|
Verified |
| 2 |
google/paxml
Pax is a Jax-based machine learning framework for training large scale... |
|
Established |
| 3 |
JosefAlbers/PVM
Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon |
|
Emerging |
| 4 |
metriccoders/one-line-llm-tuner
This repository is the source code for fine tuning any LLM in just one line 🔥 |
|
Emerging |
| 5 |
Nano-Collective/nanotune
A simple, interactive CLI for fine-tuning small language models on Apple... |
|
Emerging |
| 6 |
MoHussein197/dgx-spark-finetune-llm
🔧 Fine-tune large language models efficiently on NVIDIA DGX Spark with LoRA... |
|
Emerging |
| 7 |
iamarunbrahma/finetuned-qlora-falcon7b-medical
Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset |
|
Emerging |
| 8 |
h2oai/h2o-wizardlm
Open-Source Implementation of WizardLM to turn documents into Q:A pairs for... |
|
Emerging |
| 9 |
SculptAI/GIMKit
Guided Infilling Modeling Toolkit |
|
Emerging |
| 10 |
WangRongsheng/Aurora
The official codes for "Aurora: Activating chinese chat capability for... |
|
Emerging |
| 11 |
unit-mesh/unit-minions
《AI 研发提效:自己动手训练 LoRA》,包含 Llama (Alpaca LoRA)模型、ChatGLM (ChatGLM Tuning)相关... |
|
Emerging |
| 12 |
readytensor/rt-llm-eng-cert-week3
Week 3 of LLM Engineering Certification: Learn to fine-tune large language... |
|
Emerging |
| 13 |
anakin87/qwen-scheduler-grpo
Train a Language Model with GRPO to create a schedule from a list of events... |
|
Emerging |
| 14 |
CrazyBoyM/phi3-Chinese
Phi3 中文后训练模型仓库 |
|
Emerging |
| 15 |
ThomasRochefortB/bettercallbloom
Let's finetune BLOOM-3B on Pile of Law - r/legal_advice |
|
Emerging |
| 16 |
ambideXtrous9/GRPO-and-SFT-Finetune-Qwen3-using-Unsloth-Reasoning-and-Non-Reasoning-Dataset
GRPO and SFT Finetune Qwen3 using Unsloth : Reasoning and Non-Reasoning Dataset |
|
Emerging |
| 17 |
WangRongsheng/MedQA-ChatGLM
🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调,我们的眼光不止于医疗问答 |
|
Emerging |
| 18 |
NgJaBach/dark-kit
Collect and share guidance + code snippets for running LM-related tasks. |
|
Experimental |
| 19 |
Breeze648/MedCoT-7B
本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth... |
|
Experimental |
| 20 |
prakash-aryan/qwen-arabic-project
This project fine-tunes the Qwen2-1.5B model for Arabic language tasks using... |
|
Experimental |
| 21 |
InternLM/Agent-FLAN
[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent... |
|
Experimental |
| 22 |
aws-samples/lambda-gen-ai-endpoint-blog
This repository guides you through the process of using transfer learning to... |
|
Experimental |
| 23 |
GURPREETKAURJETHRA/Phi-3-LLM-by-Microsoft
Phi-3 LLM by Microsoft |
|
Experimental |
| 24 |
HomoScriptor-Project/HomoScriptor
Fuel innovation and advance language models with HomoScriptor: A vibrant,... |
|
Experimental |
| 25 |
huawei-csl/AC-LoRA
Welcome to the official repository of AC-LORA: (Almost) Training-Free Access... |
|
Experimental |
| 26 |
DoubleVII/lithft
Pretrain, finetune any LLMs from huggingface on your own data. |
|
Experimental |
| 27 |
alaradirik/finetune-phi-2
Fine tune Phi 2 for persona grounded chat |
|
Experimental |
| 28 |
jianzhnie/LLMToolkit
LLMToolkit is a toolkit for NLP(Natural Language Processing) and LLM(Large... |
|
Experimental |
| 29 |
XavierSpycy/hands-on-lora
Explore practical fine-tuning of LLMs with Hands-on Lora. Dive into examples... |
|
Experimental |
| 30 |
hyintell/BLOOM-fine-tuning
Finetune BLOOM |
|
Experimental |
| 31 |
rabiaedayilmaz/speech2text-pipelines
Speech to text pipelines using both APIs and finetuned models on custom and... |
|
Experimental |
| 32 |
carbonz0/alpaca-chinese-dataset
alpaca中文指令微调数据集 |
|
Experimental |
| 33 |
Emart29/phi4-finance-finetuning
Fine-tuning Microsoft Phi-4 Mini 3.8B on SEC 10-K financial Q&A using QLoRA... |
|
Experimental |
| 34 |
graphcore/flan-t5
Notebook for Flan-T5 – an alternative to large language models like GPT-3 &... |
|
Experimental |
| 35 |
Victorletzelter/LoRA-MCL
Multiple Choice Learning of Low Rank Adapters for Language Modeling |
|
Experimental |
| 36 |
bupticybee/FastLoRAChat
Instruct-tune LLaMA on consumer hardware with shareGPT data |
|
Experimental |
| 37 |
kevintsai/Finetuning-Large-Language-Models
Jupyter notebooks for course Finetuning Large Language Models, taught by... |
|
Experimental |
| 38 |
niuwz/Mini-Chinese-Phi3
基于Phi3模型结构,使用常见的中文预料从零训练的小参数量LLM。包括了tokenizer训练、模型预训练、指令微调和直接偏好优化等流程。 |
|
Experimental |
| 39 |
l11x0m7/LMPresent
Including pre-trained language models for fine-tuning on other NLP tasks |
|
Experimental |
| 40 |
PardhuSreeRushiVarma20060119/OpenLoRA
"OpenLoRa" is designed to streamline and elevate the fine-tuning of large... |
|
Experimental |
| 41 |
Followb1ind1y/Medical-LLM-Fine-tuning
Fine-tunes LLaMA-3-8B on PubMedQA with QLoRA, optimized via DeepSpeed and... |
|
Experimental |
| 42 |
ambideXtrous9/Finetune-Qwen3-using-Unsloth
Finetune Qwen3 using Unsloth : Reasoning and Non-Reasoning Dataset |
|
Experimental |
| 43 |
daniau23/LoRAfrica_CPU
Deploying LoRAfrica on consumer CPU devices |
|
Experimental |
| 44 |
uncase-ai/UNCASE
Open-source framework for turning expert knowledge into PII-free synthetic... |
|
Experimental |
| 45 |
daniau23/LoRAfrica
LoRAfrica: Scaling LLM Fine Tuning for African History |
|
Experimental |
| 46 |
Atomic-man007/falcon-7b-lora-fine-tuning
falcon-7b-lora-fine-tuning |
|
Experimental |
| 47 |
zamfir70/transxlab
Training architect CLI — validate and design LLM fine-tuning runs before you... |
|
Experimental |
| 48 |
anmolg1997/Domain-Adaptive-LLM
Domain-specialized LLM fine-tuning — medical, legal, finance, code domains... |
|
Experimental |
| 49 |
evanatyourservice/llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers. |
|
Experimental |
| 50 |
TLILIFIRAS/Efficient-Fine-Tuning-of-Vision-Language-Models-with-LoRA-Quantization
This project demonstrates parameter-efficient fine-tuning of large... |
|
Experimental |
| 51 |
mags0ft/simple-sft
Build functionally complete, extremely high-quality SFT datasets for... |
|
Experimental |
| 52 |
SergiuDeveloper/yoro-finetuning
YORO (You-Only-Reason-Once) - a novel LLM architecture that runs the main... |
|
Experimental |
| 53 |
fkuhne/doctune
A fine-tuning pipeline for SLMs |
|
Experimental |
| 54 |
Shreyash-Gaur/Nyaya-LLM
An ablation study adapting 4B-parameter LLMs (Qwen-2.5, Gemma-3, Phi-4) to... |
|
Experimental |
| 55 |
arifme071/llm-finetuning-engineering-domain
Fine-tuned BERT (94.2% accuracy) + LoRA Mistral-7B on railroad AI domain... |
|
Experimental |
| 56 |
dvianna/LegalQA-bloomz-560m
Finetuning a small BLOOMZ model (bloomz-560m) on a small dataset and with... |
|
Experimental |
| 57 |
Abdur-azure/xlmtec
xlmtec is a powerful, modular, and interactive command-line tool for... |
|
Experimental |
| 58 |
AnnaValentinaHirsch/Web3CodeLLM
Finetuning Starcoder2 to assist the development of decentralised NEAR dApps |
|
Experimental |
| 59 |
heyisula/infosage-13b
LLM pretraining pipeline using the FineWeb-Edu Dataset |
|
Experimental |
| 60 |
krishnaplwl/Homework_Solver_LLM
A fine-tuned LLM to solve homework questions ranging from maths to science... |
|
Experimental |
| 61 |
gallen881/Physics_Master
Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer... |
|
Experimental |
| 62 |
Arlchoose-code/Indonesian-LLM-Finetune
Fine-tune your Indonesian LLM with LoRA — instruction tuning kit designed to... |
|
Experimental |
| 63 |
GURPREETKAURJETHRA/LLMs-Inference-and-Fine-Tuning
Estimate Memory Consumption of LLMs Inference and Fine Tuning |
|
Experimental |
| 64 |
tonyreina/trl
Transformer Reinforcement Learning for Health Generative AI |
|
Experimental |
| 65 |
xingmingxu/LiteSight
Efficient Chart Summarization with LoRA |
|
Experimental |
| 66 |
khadimhussain0/kllm
Fine-tune state-of-the-art LLMs with LoRA/QLoRA on consumer hardware. |
|
Experimental |
| 67 |
tomoeOOseven/gptoss120b-qlora-mathreasoning
KrackHack 3.0 submission — Domain: Gen AI | PS: Open Innovation — ... |
|
Experimental |
| 68 |
LittleLittleCloud/Torchsharp-phi
Torchsharp port of phi-series model |
|
Experimental |
| 69 |
sachink1729/Finetuning-Mistral-7B-Chat-Doctor-Huggingface-LoRA-PEFT
Finetuning Mistral-7B into a Medical Chat Doctor using Huggingface 🤗+ QLoRA + PEFT. |
|
Experimental |
| 70 |
Abeshith/FineTuning_LanguageModels
🎯 Fine-tune large language models and use them for text-related tasks. This... |
|
Experimental |
| 71 |
Eric-he-cn/Qwen3-QLoRA-News
This project enables the model to directly generate structured summaries... |
|
Experimental |
| 72 |
amjadmajid/llm_toaster
LLM Toaster enables you to train and fine-tune mini-GPTs. |
|
Experimental |
| 73 |
AkhileshMalthi/selftune
A self-service platform that enables users to fine-tune Large Language... |
|
Experimental |
| 74 |
NgJaBach/Language-Models-Utilities
Collect and share guidance + code snippets for running LM-related tasks. |
|
Experimental |
| 75 |
strickvl/isafpr_finetune
Finetuning an LLM for structured data extraction from press releases |
|
Experimental |
| 76 |
Lichang-Chen/AlpaGasus
A better Alpaca Model Trained with Less Data (only 9k instructions of the... |
|
Experimental |
| 77 |
inuwamobarak/Meta-Llama-3-8B
Experiments with the Meta-Llama-3-8B |
|
Experimental |
| 78 |
mattialoszach/LoRA-Agentic-Output-Format
Fine-tuning LLMs for structured agent-style outputs (e.g. JSON), built for... |
|
Experimental |
| 79 |
sovit-123/lm_sft
Various LMs/LLMs below 3B parameters (for now) trained using SFT (Supervised... |
|
Experimental |
| 80 |
garystafford/duke-fine-tuning-llama
DUKE (Document Understanding and Knowledge Extraction) along with... |
|
Experimental |
| 81 |
nikisetti01/MTL-LORA-for-PubMedQA-and-Riddle
🚀 Fine-tuning LLaMA 1B for a medical chatbot using LoRA and a custom... |
|
Experimental |
| 82 |
mbeps/llama3.1_fine-tuning_mult-it
Fine-tuning various Llama 3.1 family of models on the Mult-It dataset |
|
Experimental |
| 83 |
mbeps/qwen3_fine-tune_mult-it
Parameter Efficient Fine-Tuning of various Qwen3 family of models on the... |
|
Experimental |
| 84 |
Siesher/Qwen3_LoRA_pet
🐉 Fine-tuning Qwen3 with LoRA for custom tasks |
|
Experimental |
| 85 |
mbeps/magistral_mult-it_fine-tuning
Parameter Efficient Fine-Tuning of Magistral Small model on the Mult-It... |
|
Experimental |
| 86 |
EN10/BabyLlama
Train and run a small Llama 2 model from scratch on the TinyStories dataset. |
|
Experimental |
| 87 |
Nihal108-bi/Emotion-Aware-Conversational-AI-QLoRA-Fine-Tuned-7B-LLM-
Fine-tuned 7B LLM for empathetic emotional-support dialogue using QLoRA.... |
|
Experimental |
| 88 |
nv-legate/multimesh-jax
PjRt plugin and Python APIs for MPMD workflows in Jax |
|
Experimental |
| 89 |
royxlead/autollmforge-python
Fine-tune any large language model with intelligent QLoRA optimization |
|
Experimental |
| 90 |
renaldiangsar/Medical-LLM-Fine-Tuning
Fine-tuning Large Language Models (LLMs) for medical reasoning to enhances... |
|
Experimental |
| 91 |
robuno/Title-Generator-with-LLM-QLoRa
Fine-tuning LLMs with LoRA to generate titles from the given abstract,... |
|
Experimental |
| 92 |
apudasm10/region-aware-vlm-finetune
Pipeline for finetuning VLMs with region-aware inputs. Trains on custom... |
|
Experimental |
| 93 |
stperrakis/ULM-fit
This repository contains an implementation of the ULMfit (Universal Language... |
|
Experimental |
| 94 |
Yousefbadr0/GPT-Neo_Medical_Fine-Tuning_using_LoRA
Fine-tuning GPT-Neo-125M using LoRA on a medical QA dataset, achieving... |
|
Experimental |
| 95 |
NamrataThakur/Fine-tuning-LLMs-Strategies
Different Strategies to Fine-Tune a Large Language Model. We cover 4... |
|
Experimental |
| 96 |
faezeh-gholamrezaie/Fine-Tuning-Large-Language-Models-for-Sleep-Stage-Classification
Fine-tuning Large Language Models (LLMs) using QLoRA on EEG data for... |
|
Experimental |
| 97 |
r-kovalch/omnigec-models
Reproducible QLoRA recipes and configs that fine‑tune Aya‑Expanse‑8B and... |
|
Experimental |
| 98 |
Tommaso-Sgroi/VojoLe-LM
DL24-25 project. The goal is Fine-Tuning a LLM on Italian Dialect. |
|
Experimental |
| 99 |
SauravMaheshkar/nanollm
JAX LLM playground |
|
Experimental |
| 100 |
jmaczan/c-137
🦙 Llama 2 7B fine-tuned to revive Rick |
|
Experimental |
| 101 |
enoreese/mechanic-gpt
A fine-tuned LLM great at answering questions about car repairs and maintenance. |
|
Experimental |
| 102 |
Dhwani-Chande/Natural-Language-to-Bash-Translation-using-LLMs
Fine-tuned Llama-3.2-1B & Qwen2.5-Coder on 40K NL→Bash pairs. Includes... |
|
Experimental |
| 103 |
christinajoslin/faq-generation
CLiFF (Clustering & Language model integration for FAQ Formation) |
|
Experimental |
| 104 |
ShubhammS18/finetune-json-extractor
Fine-tuned Qwen2.5-7B on Fireworks AI for structured JSON extraction from... |
|
Experimental |
| 105 |
Witurpred64/LLM-FineTuning-Toolkit
A comprehensive toolkit for fine-tuning Large Language Models (LLMs) with... |
|
Experimental |
| 106 |
Isha1600/LLM-Finetuning
Fine-tuning Large Language Models (LLMs) using custom datasets for improved... |
|
Experimental |
| 107 |
Gyldenn/storywriter
Fine-tuning Mistral 7B with LoRA (QLoRA 4-bit) to generate Shakespearean... |
|
Experimental |
| 108 |
SaniyaBekova/kazakh-llm-finetuning
LLM fine-tuning for Kazakh fairy tale generation using QLoRA, SFT, DPO |
|
Experimental |
| 109 |
cre8vdj/cre8v-ai-finetune
Fine-tune Llama 2 / Mistral with LoRA & QLoRA using PEFT. Runs on free Colab... |
|
Experimental |
| 110 |
DNLab2024/BGP_LLaMA
BGP-LLaMA: Fine-tuning Open-Source LLM on BGP Routing Knowledge and Analysis |
|
Experimental |
| 111 |
AIdventures/flora
Fine-tuning LLMs with LoRA |
|
Experimental |
| 112 |
YanSte/NLP-LLM-Fine-tuning-DeepSpeed
Natural Language Processing (NLP) and Large Language Models (LLM) with... |
|
Experimental |
| 113 |
chaithanyasai18/LLMs-finetuning
This repository consists of python scripts for LLM finetuning (SFT, LoRA,... |
|
Experimental |
| 114 |
codershiyar/llama-google-colab-tutorial
Step-by-step tutorial on loading and using Llama 3.1 8B Instruct in Google... |
|
Experimental |
| 115 |
aakarsh31/qlora-llm-finetuning
QLoRA fine-tuning of Llama 3.2 3B on MedQA with full LoRA rank ablation... |
|
Experimental |
| 116 |
spatialft/spatialft.github.io
LoRA fine-tuning of LFM2.5-1.2B to improve spatial reasoning on StepGame —... |
|
Experimental |
| 117 |
Pects1949/LLM-Fine-tuning-Toolkit
A comprehensive toolkit for fine-tuning and deploying Large Language Models... |
|
Experimental |
| 118 |
ahmad-albasha/Frankenstein-LLM-Model-fine-tuning-code
Fine-tuning Mistral-7B-v0.1 on Mary Shelley's Frankenstein using LoRA/QLoRA... |
|
Experimental |
| 119 |
flaviengeoffray/loRa-reimplem
A practical reimplementation of the Low-Rank Adaptation (LoRA) paper for... |
|
Experimental |
| 120 |
igna-s/QLoRA-Experiments
A collection of SFT and distillation pipelines to train specialized medical... |
|
Experimental |
| 121 |
neoheartbeats/neoheartbeats-kernel
An architecture for LLMs' continual-learning and long-term memories |
|
Experimental |
| 122 |
fb3rasp/finetune-ingest
Ability to finetune LLMs and generate training data using provided documents... |
|
Experimental |
| 123 |
manufactai/finetuning-cookbook
A collection of practical examples and tutorials for fine-tuning large... |
|
Experimental |
| 124 |
sparkup/medical-llm-finetuning-alignment
Medical LLM fine-tuning and preference alignment using SFT and DPO, with... |
|
Experimental |
| 125 |
dineshsoudagar/llm-lab-from-scratch-to-fine-tuning
Comprehensive resources and scripts for training and fine-tuning Large... |
|
Experimental |
| 126 |
arunpshankar/VAI-FineTuning-LLMs
"Clean and comprehensive examples for fine-tuning LLMs supported by Vertex... |
|
Experimental |
| 127 |
HEMANGANI/Fine-Tuning-LLM-for-QA
Fine-Tuning Large Language Models for Question Answering |
|
Experimental |
| 128 |
Pavansomisetty21/A-Fine-Tuned-Model-for-Medical-Named-Entity-Recognition-using-Gemini-LLM
In this we finetuned the Gemini model with our own medical NER dataset and... |
|
Experimental |
| 129 |
Pavansomisetty21/Visual-Question-Answering-Pixtral_Vision_Finetuning_Unsloth
In this we finetune Pixtral-12B-2409 model using unsloth for visual Question... |
|
Experimental |
| 130 |
OutllierRejects/Intellihack_OutlierRejects_Task3
LLM Fine-tuning Challenge Enhancing Qwen 2.5 3B for AI Research QA |
|
Experimental |
| 131 |
sanskaryo/LLM-Finetuning-Projects
This repository contains various projects focused on fine-tuning Large... |
|
Experimental |
| 132 |
bshtmichielsen/expert_chat
Using a LoRA to make a LLM talk about a subject I like. |
|
Experimental |
| 133 |
erraji-jo/LLM-Finutune-based-on-customData
The project aims to showcase the process of fine-tuning LLMs on... |
|
Experimental |
| 134 |
Rishabh9559/medical-llama-3.2-3B-model
This is all about fine-tuning the Llama3.2-3B model on your medical textbook. |
|
Experimental |
| 135 |
giankev/Ancient-to-Modern-Italian-Automatic-Translation
Finetuning and evaluating LLMs on Ancient-to-Modern Italian translation task. |
|
Experimental |
| 136 |
clement-cvll/AIMO-Math-Finetuning
Fine tuning of a model for AIMO 2 math competition on Kaggle |
|
Experimental |
| 137 |
priyam-hub/LLM-Fine-Tuning-Pipeline
A comprehensive pipeline for Different Fine-Tuning Methods for Large... |
|
Experimental |
| 138 |
Gholamrezadar/finetuning_llm_on_letter_counting
Fine-tuning Gemma-3 4B on the letter-counting dataset |
|
Experimental |
| 139 |
atasoglu/turkish-llava-notebooks
A useful collection of notebooks for quantization, fine-tuning, and... |
|
Experimental |
| 140 |
jwliao1209/Taiwan-LLaMa-Instruction-Tuning
2023 NTU CSIE ADL Homework 3 |
|
Experimental |
| 141 |
c4dt/pitfalls_in_fine_tuning_llms
Jupyter notebooks for the LLM fine-tuning pitfalls hands-on workshop |
|
Experimental |
| 142 |
alvi75/MultiTask-QLoRA-NFAnalysis
Official implementation of "Parameter-Efficient Multi-Task Fine-Tuning in... |
|
Experimental |
| 143 |
BetikuOluwatobi/clinical-instruct-api
Fine-tuned GPT-2 (355M) language model for clinical reasoning tasks. |
|
Experimental |
| 144 |
1nilx2/Deep-Learning
LLM, VLLM Models |
|
Experimental |
| 145 |
nglguarino/code-completion
Fine-tuned 3 LLMs (Phi-2, Gemma, Llama2) on 100K+ instruction CodeInstruct... |
|
Experimental |
| 146 |
gamithasam/notion-qwen2.5-1.5B
Fine-tuning notebook for creating a Notion template generator using... |
|
Experimental |
| 147 |
AparnaRoy76/LLM-finetuning
A comprehensive toolkit for fine-tuning Large Language Models (LLMs) using... |
|
Experimental |
| 148 |
Rohanjain2312/medical-llm-finetuning
End-to-end LLM fine-tuning pipeline: Fine-tuned Mistral 7B on medical... |
|
Experimental |
| 149 |
Sahar-Sheikhi/CRM-Data-Automation-Llama-3.2-Finetuned-
A memory-efficient fine-tuning pipeline using Llama-3.2-3B and QLoRA to... |
|
Experimental |
| 150 |
EkBass/fin-eng-translations-set
Massive translation set between Finnish and English languages. |
|
Experimental |
| 151 |
jo-valer/machine-translation-ladin-fascian
Repository of our paper Nesciun Lengaz Lascià Endò: Machine Translation for... |
|
Experimental |
| 152 |
Pavansomisetty21/Vision_Finetuning_Unsloth_Radiography-Image-Captioning
In this we fine tune Llama-3.2-11B-Vision-Instruct model on... |
|
Experimental |
| 153 |
anujsahani01/PyLoomer
Python Code Completion bot |
|
Experimental |
| 154 |
zekaouinoureddine/BioMed-LLaMa-3
BioMed-LLaMa-3: Instruction-Efficient Fine-Tuning of Large Language Models... |
|
Experimental |
| 155 |
btboilerplate/Llama-2
Fine-tunes LLaMA-2 using QLoRA for instruction-style text generation,... |
|
Experimental |
| 156 |
avishek04/MedLam
A Medical Assistant based on Llama 3.1 |
|
Experimental |
| 157 |
Anonymous-user-00/FLoRIST
Official implementation of FLoRIST: efficient and accurate federated... |
|
Experimental |
| 158 |
myatthukyaw/ft-llm
Finetuning LLMs using Hugging Face |
|
Experimental |
| 159 |
fabiantoh98/finetune-llm
Fine-tuning LLMs with QLoRA on consumer GPUs — includes training,... |
|
Experimental |
| 160 |
123RohitVarshit/FINETUNED_DEEPSEEK-R1
Fine-tuning the DeepSeek-LLM to create a medical expert for advanced... |
|
Experimental |
| 161 |
jasonjiang8866/peft-fine-tuning-recipes-classification
A working recipes for sequential classification finetuning using peft |
|
Experimental |
| 162 |
YounesBensafia/Algeria-2-0-FineTuning-workshop
This repository contains resources and examples used in my workshop for... |
|
Experimental |
| 163 |
shadynasrat/RDMM
RDMM:Fine-Tuned LLM Models for On-Device Robotic Decision Making with... |
|
Experimental |
| 164 |
sahilfaizal01/Kaggle-Contest---Fine-tuning-Llama-3.1-LLM-
We used the Llama-3.1 8B (LLM) model to verify math problem solutions via... |
|
Experimental |
| 165 |
PrathamLearnsToCode/Fine-tuning-FLAN-T5-with-LoRA-WandB
Fine tune an LLM for summarization task using Low rank adaptation |
|
Experimental |
| 166 |
mfaizan-ai/NewsQA
News QA generation and fine tuning an LLM for QA generation (under development) |
|
Experimental |
| 167 |
slv-ai/Fine-Tune-LLMs-with-DPO
Fine-tuning Microsoft’s Phi-2 Machine Learning Model with DPO |
|
Experimental |
| 168 |
shizheng-rlfresh/llm-opt
Fine-tuning LLMs with LoRA and Hessian-free optimizers |
|
Experimental |
| 169 |
KayvanShah1/UniFAQ
Fine-Tuned LLM-Based FAQ Generation for University Admissions: A project... |
|
Experimental |