Thariya13/medical-llm-lora

🧠 Fine-tuning a medical reasoning LLM with LoRA πŸš€ β€” Step-by-step project to train a compact LLM (DeepSeek-R1-Distill-Qwen-1.5B) on medical reasoning data using LoRA adapters. Includes dataset preprocessing, chat formatting, model training with TRL’s SFTTrainer, and inference for reasoning-rich answers πŸ©ΊπŸ’‘.

16
/ 100
Experimental

No commits in the last 6 months.

No License Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 1 / 25
Maturity 1 / 25
Community 12 / 25

How are scores calculated?

Stars

1

Forks

1

Language

Jupyter Notebook

License

Last pushed

Sep 30, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Thariya13/medical-llm-lora"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.