MasterAI-EAM/Darwin
An open-source project dedicated to building a foundational large language model for natural science, mainly physics, chemistry, and materials science.
Pretrains and fine-tunes LLaMA on 100,000+ instruction-following examples synthesized from 6M scientific papers and 16 FAIR datasets using a custom Scientific Instruction Generator (SIG). Uses QA and multi-task fine-tuning strategies that outperform specialized ML methods and GPT-3.5 on chemistry tasks, with state-of-the-art results reported on the MatBench bandgap and metallic classification benchmarks. Integrates with LangChain for complex scientific workflows, supports inference on GPUs with 10GB+ of memory, and enables full fine-tuning through distributed training.
247 stars. No commits in the last 6 months.
Stars: 247
Forks: 27
Language: Jupyter Notebook
License: —
Category
Last pushed: Feb 26, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/MasterAI-EAM/Darwin"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
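The same request can be sketched in Python using only the standard library. This is a minimal sketch under stated assumptions: the path layout `/{category}/{owner}/{repo}` is inferred from the single URL shown above, the response is assumed to be JSON (the page does not document the response format), and the helper names `build_quality_url` and `fetch_quality` are hypothetical. No API key handling is shown because the page does not say how a key is passed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL.

    Assumption: the /{category}/{owner}/{repo} layout is inferred from the
    one URL on this page and may not hold for other endpoints.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Send the GET request and parse the body.

    Assumption: the body is JSON; json.loads raises an error if it is not.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("llm-tools", "MasterAI-EAM", "Darwin")` requests the same URL as the curl command. With the keyless limit of 100 requests/day, caching the result locally is a reasonable choice for repeated lookups.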
Higher-rated alternatives
maxischuh/TwinBooster
Package for TwinBooster. Enables fast and powerful zero-shot molecular property prediction.
theochem/ModelHamiltonian
Generate 1- and 2-electron integrals so that molecular quantum chemistry software can be used...
lamalab-org/chembench
How good are LLMs at chemistry?
pnnl/cactus
LLM Agent that leverages cheminformatics tools to provide informed responses.
jan-janssen/LangSim
Application of Large Language Models (LLM) for computational materials science - visit...