MasterAI-EAM/Darwin

An open-source project dedicated to building a foundational large language model for natural science, mainly physics, chemistry, and materials science.

34 / 100 (Emerging)

Pretrains and fine-tunes LLaMA on 100,000+ instruction-following examples synthesized from 6M scientific papers and 16 FAIR datasets via a custom Scientific Instruction Generator (SIG). Employs QA and multi-task fine-tuning strategies that outperform specialized ML methods and GPT-3.5 on chemistry tasks, with demonstrated state-of-the-art results on the MatBench bandgap and metallic classification benchmarks. Integrates with LangChain for complex scientific workflows. Supports inference on GPUs with 10 GB or more of memory, and full fine-tuning via distributed training.

247 stars. No commits in the last 6 months.

Stale 6m · No Package · No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 9 / 25
Community 15 / 25
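
The four category scores above, each out of 25, add up to the 34 / 100 headline score. A minimal check in Python, assuming the overall score is a plain sum of the categories (the formula is not stated here, so treat that as an inference from the numbers):

```python
# Category scores as listed above, each out of a maximum of 25.
scores = {"Maintenance": 0, "Adoption": 10, "Maturity": 9, "Community": 15}

# Assumption: the overall score is the unweighted sum of the categories.
total = sum(scores.values())
maximum = 25 * len(scores)

print(f"{total} / {maximum}")  # prints "34 / 100"
```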


Stars: 247
Forks: 27
Language: Jupyter Notebook
License: not listed
Last pushed: Feb 26, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/MasterAI-EAM/Darwin"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
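
The same request can be made from Python with only the standard library. This is a minimal sketch, not the service's official client: the function names `quality_url` and `fetch_quality` are hypothetical, the category/owner/repo path shape is inferred from the single curl example above, and the JSON response format is an assumption.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and parse the body as JSON (assumed response format)."""
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Only builds the URL; call fetch_quality(...) to perform the request.
    print(quality_url("llm-tools", "MasterAI-EAM", "Darwin"))
```

Calling `fetch_quality("llm-tools", "MasterAI-EAM", "Darwin")` performs the actual request and counts against the daily limit. The response fields are not documented here, so inspect the returned object before relying on any particular key.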