VinAIResearch/PhoGPT
PhoGPT: Generative Pre-training for Vietnamese (2023)
Built on a 3.7B-parameter decoder-only architecture trained on 102B Vietnamese tokens with an 8K context length, PhoGPT supports inference via vLLM, Text Generation Inference, llama.cpp, and standard Transformers pipelines, with quantization available through bitsandbytes and GGUF formats. The chat variant adds instruction-following and conversation tuning on 70K prompt-response pairs plus 290K multi-turn dialogues. Full fine-tuning is supported through llm-foundry, or through alternative frameworks such as LLaMA-Factory and lit-gpt.
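As a concrete illustration of the standard Transformers path, here is a minimal loading sketch. It assumes the chat checkpoint is published on the Hugging Face Hub as vinai/PhoGPT-4B-Chat; the dtype, generation settings, and prompt template are illustrative assumptions, so check the repository's README for the exact values.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "vinai/PhoGPT-4B-Chat"  # assumed Hub ID for the chat variant
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # the 3.7B model fits on a single ~16 GB GPU in bf16
    trust_remote_code=True,      # the repo ships a custom model implementation
    device_map="auto",
)

# Illustrative instruction prompt; the actual template is defined in the README.
prompt = "### Câu hỏi: Viết bài văn về an toàn giao thông\n### Trả lời:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))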
798 stars. No commits in the last 6 months.
Stars: 798
Forks: 74
Language: Python
License: BSD-3-Clause
Category: llm-tools
Last pushed: Nov 12, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/VinAIResearch/PhoGPT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
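The same lookup from Python, for scripting against the endpoint shown above. Without a key it is subject to the 100 requests/day limit; the response schema is not documented here, so this just prints the raw JSON.

import requests

url = "https://pt-edge.onrender.com/api/v1/quality/llm-tools/VinAIResearch/PhoGPT"
resp = requests.get(url, timeout=10)
resp.raise_for_status()  # surface HTTP errors (e.g., rate limiting) early
print(resp.json())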
Higher-rated alternatives
Nixtla/nixtla
TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and...
akanyaani/gpt-2-tensorflow2.0
OpenAI GPT-2 pre-training and sequence prediction implementation in TensorFlow 2.0
andrewdalpino/NoPE-GPT
A GPT-style small language model (SLM) with no positional embeddings (NoPE).
teddykoker/image-gpt
PyTorch Implementation of OpenAI's Image GPT
LIYUESEN/druggpt
DrugGPT: A GPT-based Strategy for Designing Potential Ligands Targeting Specific Proteins