ise-uiuc/magicoder
[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct
OSS-Instruct generates low-bias instruction data by grounding LLM synthesis in open-source code references, addressing inherent dataset biases in LLM-generated training data. The approach creates diverse, realistic instructions with explicit code snippets rather than purely synthetic examples. Models are available across multiple architectures (Llama2, DeepSeek) and fine-tuning strategies, with Magicoder-S-DS-6.7B achieving 76.8% on HumanEval, outperforming GPT-3.5-turbo and Gemini Ultra.
2,086 stars. No commits in the last 6 months.
Stars
2,086
Forks
173
Language
Python
License
MIT
Category
Last pushed
Nov 01, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/ise-uiuc/magicoder"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
MMMU-Benchmark/MMMU
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal...
pat-jj/DeepRetrieval
[COLM’25] DeepRetrieval — 🔥 Training Search Agent by RLVR with Retrieval Outcome
lupantech/MathVista
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts
x66ccff/liveideabench
[𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea...
IAAR-Shanghai/xVerify
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations