VHellendoorn/Code-LMs
Guide to using pre-trained large language models of source code
Provides pre-trained PolyCoder models (160M–2.7B parameters), trained with GPT-NeoX on a 12-language code corpus and available via Hugging Face transformers or custom checkpoints. Supports code generation with configurable temperature sampling and beam search, includes evaluation harnesses for perplexity and HumanEval benchmarks, and offers Docker containerization with GPU support for inference and fine-tuning workflows.
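For the Hugging Face route, a minimal sketch of loading a checkpoint and sampling a completion, assuming the NinedayWang/PolyCoder-* Hub mirrors that the repository points to and a recent transformers release; the prompt and generation settings are illustrative only:

# A minimal sketch: load a PolyCoder checkpoint from the Hugging Face Hub
# and sample a completion. NinedayWang/PolyCoder-160M is the smallest of
# the published checkpoints; PolyCoder-0.4B and PolyCoder-2.7B follow the
# same pattern.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "NinedayWang/PolyCoder-160M"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "def binary_search(arr, target):"
inputs = tokenizer(prompt, return_tensors="pt")

# Temperature sampling; setting num_beams > 1 (and do_sample=False)
# would switch this to beam search instead.
outputs = model.generate(
    **inputs,
    max_new_tokens=64,
    do_sample=True,
    temperature=0.2,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))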
1,842 stars. No commits in the last 6 months.
Stars: 1,842
Forks: 265
Language: Python
License: MIT
Category: transformers
Last pushed: Jul 07, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/VHellendoorn/Code-LMs"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
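For scripted use, a minimal Python sketch of the same call with requests; the response is assumed to be JSON, and its field names aren't documented on this page, so the example just prints the raw payload:

# A minimal sketch: fetch this repo's quality data from the API endpoint
# shown above. No key is needed at the free tier; how a key would be
# passed (header vs. query parameter) isn't documented here, so this
# sticks to the keyless call.
import requests

url = "https://pt-edge.onrender.com/api/v1/quality/transformers/VHellendoorn/Code-LMs"
response = requests.get(url, timeout=10)
response.raise_for_status()
print(response.json())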
Higher-rated alternatives
Goekdeniz-Guelmez/mlx-lm-lora
Train Large Language Models on MLX.
uber-research/PPLM
Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.
jarobyte91/pytorch_beam_search
A lightweight implementation of Beam Search for sequence models in PyTorch.
SmallDoges/small-doge
Doge Family of Small Language Models
ssbuild/chatglm_finetuning
ChatGLM-6B fine-tuning and Alpaca fine-tuning.