dakshjain-1616/gemma-3-12b-medical-sft
Fine-tunes google/gemma-3-12b-it with Unsloth SFT and LoRA (r=32, alpha=64) on 1000 synthetic medical reasoning samples — symptom triage, differential diagnosis, drug interaction checks. Response-Only Training with Gemma-3 chat format and
Stars
—
Forks
—
Language
Python
License
—
Category
Last pushed
Mar 28, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/dakshjain-1616/gemma-3-12b-medical-sft"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
limix-ldm-ai/LimiX
LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence...
XXO47OXX/layer-scan
Automated LLM layer duplication config scanner — find the optimal (i,j) for any model + task
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
google-research/plur
PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets...
thuml/LogME
Code release for "LogME: Practical Assessment of Pre-trained Models for Transfer Learning" (ICML...