Thariya13/medical-llm-lora
π§ Fine-tuning a medical reasoning LLM with LoRA π β Step-by-step project to train a compact LLM (DeepSeek-R1-Distill-Qwen-1.5B) on medical reasoning data using LoRA adapters. Includes dataset preprocessing, chat formatting, model training with TRLβs SFTTrainer, and inference for reasoning-rich answers π©Ίπ‘.
No commits in the last 6 months.
Stars
1
Forks
1
Language
Jupyter Notebook
License
—
Category
Last pushed
Sep 30, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Thariya13/medical-llm-lora"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
unslothai/unsloth
Fine-tuning & Reinforcement Learning for LLMs. π¦₯ Train OpenAI gpt-oss, DeepSeek, Qwen, Llama,...
huggingface/peft
π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5,...
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
oumi-ai/oumi
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!