Thariya13/medical-llm-lora

🧠 Fine-tuning a medical reasoning LLM with LoRA 🚀 — Step-by-step project to train a compact LLM (DeepSeek-R1-Distill-Qwen-1.5B) on medical reasoning data using LoRA adapters. Includes dataset preprocessing, chat formatting, model training with TRL’s SFTTrainer, and inference for reasoning-rich answers 🩺💡.

/ 100

Experimental

No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 1 / 25

Maturity 1 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Category

lora-qlora-fine-tuning

Last pushed

Sep 30, 2025

Commits (30d)

GitHub

LoRA QLoRA Fine-tuning · 206 models

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Thariya13/medical-llm-lora"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama,...

huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5,...

linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

oumi-ai/oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Explore Transformer Models

All categories Trending Transformer directory Insights