ise-uiuc/magicoder

[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct

/ 100

Emerging

OSS-Instruct generates low-bias instruction data by grounding LLM synthesis in open-source code references, addressing inherent dataset biases in LLM-generated training data. The approach creates diverse, realistic instructions with explicit code snippets rather than purely synthetic examples. Models are available across multiple architectures (Llama2, DeepSeek) and fine-tuning strategies, with Magicoder-S-DS-6.7B achieving 76.8% on HumanEval, outperforming GPT-3.5-turbo and Gemini Ultra.

2,086 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

2,086

Forks

173

Language

Python

License

MIT

Higher-rated alternatives

MMMU-Benchmark/MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal...

pat-jj/DeepRetrieval

[COLM’25] DeepRetrieval — 🔥 Training Search Agent by RLVR with Retrieval Outcome

lupantech/MathVista

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

x66ccff/liveideabench

[𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea...

IAAR-Shanghai/xVerify

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Explore LLM Tools

All categories Trending LLM Tool directory Insights