All Transformer Models

6,429 models ranked by quality score

Showing 501–600 of 6,429
# Model Score Tier
501 inclusionAI/asystem-awex

A high-performance RL training-inference weight synchronization framework,...

48
Emerging
502 olivkoch/nano-trm

An implementation of Tiny Recursive Models (TRM)

48
Emerging
503 NVIDIA-AI-IOT/nanoowl

A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.

48
Emerging
504 jianghoucheng/AlphaEdit

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models,...

48
Emerging
505 AlekseyKorshuk/optimum-transformers

Accelerated NLP pipelines for fast inference on CPU and GPU. Built with...

48
Emerging
506 ALucek/ppt2desc

Convert PowerPoint files into semantically rich text using vision language models

48
Emerging
507 SakanaAI/doc-to-lora

Hypernetworks that update LLMs to remember factual information

48
Emerging
508 rojagtap/transformer-abstractive-summarization

Abstractive Text Summarization using Transformer

48
Emerging
509 X-D-Lab/LangChain-ChatGLM-Webui

Automatic question answering over local knowledge bases, built on LangChain and LLMs such as the ChatGLM-6B series

48
Emerging
510 VHellendoorn/Code-LMs

Guide to using pre-trained large language models of source code

48
Emerging
511 Beomi/KoAlpaca

KoAlpaca: An open-source language model that understands Korean instructions...

48
Emerging
512 zyushun/Adam-mini

Code for Adam-mini: Use Fewer Learning Rates To Gain More...

48
Emerging
513 socialfoundations/folktexts

Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on...

48
Emerging
514 worldbank/REaLTabFormer

A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer...

48
Emerging
515 jmont-dev/ollama-hpp

Modern, Header-only C++ bindings for the Ollama API.

48
Emerging
516 livingbio/fuzzy-json

Fuzzy-JSON is a compact Python package with no dependencies, designed to...

48
Emerging
517 ymcui/Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca LLMs, with local CPU/GPU training and deployment

48
Emerging
518 FoundationVision/Infinity

[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for...

48
Emerging
519 Thinklab-SJTU/Crossformer

Official implementation of our ICLR 2023 paper "Crossformer: Transformer...

48
Emerging
520 datawhalechina/llms-from-scratch-cn

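Build a large language model from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch to understand how large models work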

48
Emerging
521 malteos/llm-datasets

A collection of datasets for language model pretraining including scripts...

48
Emerging
522 kyegomez/attn_res

A clean, single-file PyTorch implementation of Attention Residuals (Kimi...

48
Emerging
523 fboulnois/llama-cpp-docker

Run llama.cpp in a GPU accelerated Docker container

48
Emerging
524 graphdeeplearning/graphtransformer

Graph Transformer Architecture. Source code for "A Generalization of...

48
Emerging
525 kyegomez/HLT

Implementation of the transformer from the paper: "Real-World Humanoid...

48
Emerging
526 AndrewZhe/lawyer-llama

Chinese legal LLaMA (LLaMA for the Chinese legal domain)

48
Emerging
527 slwang-ustc/nano-vllm-v1

Nano vLLM with vLLM v1's request scheduling strategy and chunked prefill

48
Emerging
528 curiousily/Deploy-BERT-for-Sentiment-Analysis-with-FastAPI

Deploy BERT for Sentiment Analysis as REST API using FastAPI, Transformers...

48
Emerging
529 locuslab/wanda

A simple and effective LLM pruning approach.

47
Emerging
530 cztomsik/ava

All-in-one desktop app for running LLMs locally.

47
Emerging
531 DaoD/INTERS

This is the repository for our paper "INTERS: Unlocking the Power of Large...

47
Emerging
532 lorenzorovida/FHE-BERT-Tiny

Source code for the paper "Transformer-based Language Models and Homomorphic...

47
Emerging
533 ictnlp/LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction...

47
Emerging
534 geeks-of-data/knowledge-gpt

Extract knowledge from all information sources using gpt and other language...

47
Emerging
535 x-tabdeveloping/turftopic

Robust and fast topic models with sentence-transformers.

47
Emerging
536 xusenlinzy/api-for-open-llm

OpenAI-style API for open large language models; use LLMs just like ChatGPT!...

47
Emerging
537 back2matching/turboquant

First open-source TurboQuant KV cache compression for LLM inference. Drop-in...

47
Emerging
538 soulteary/docker-llama2-chat

Play LLaMA2 (official / Chinese version / INT4 / llama2.cpp) Together! ONLY 3 STEPS! (...

47
Emerging
539 dali92002/DocEnTR

DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022

47
Emerging
540 deepseek-ai/Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

47
Emerging
541 HHousen/TransformerSum

Models to perform neural summarization (extractive and abstractive) using...

47
Emerging
542 NVlabs/Eagle

Eagle: Frontier Vision-Language Models with Data-Centric Strategies

47
Emerging
543 haoliuhl/ringattention

Large Context Attention

47
Emerging
544 hiyouga/ChatGLM-Efficient-Tuning

Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT

47
Emerging
545 The-FinAI/PIXIU

This repository introduces PIXIU, an open-source resource featuring the...

47
Emerging
546 mim-solutions/bert_for_longer_texts

BERT classification model for processing texts longer than 512 tokens. Text...

47
Emerging
547 Cardinal-Operations/ORLM

ORLM: Training Large Language Models for Optimization Modeling

47
Emerging
548 dusty-nv/NanoLLM

Optimized local inference for LLMs with HuggingFace-like APIs for...

47
Emerging
549 The-Swarm-Corporation/MedGuard

MedGuard is a robust, production-grade Python library that ensures HIPAA...

47
Emerging
550 cedrickchee/awesome-transformer-nlp

A curated list of NLP resources focused on Transformer networks, attention...

47
Emerging
551 kayoyin/transformer-slt

Sign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop)

47
Emerging
552 sagorbrur/bangla-bert

Bangla-Bert is a pretrained BERT model for the Bengali language

47
Emerging
553 kyegomez/Lets-Verify-Step-by-Step

"Improving Mathematical Reasoning with Process Supervision" by OpenAI

47
Emerging
554 ycq091044/BIOT

BIOT - A framework for pretraining biosignals at scale. Large EEG pre-trained models.

47
Emerging
555 ymcui/Chinese-LLaMA-Alpaca-2

Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, plus 64K long-context models...

47
Emerging
556 xianglin226/Benchmarking-Single-Cell-Perturbation

Single-Cell (Perturbation) Model Library

47
Emerging
557 prrao87/tweet-stance-prediction

Applying NLP transfer learning techniques to predict Tweet stance toward a topic

47
Emerging
558 awslabs/mlm-scoring

Python library & examples for Masked Language Model Scoring (ACL 2020)

47
Emerging
559 Uminosachi/open-llm-webui

This repository contains a web application designed to execute relatively...

47
Emerging
560 conceptofmind/LaMDA-rlhf-pytorch

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding...

47
Emerging
561 Event-AHU/Medical_Image_Analysis

Foundation models based medical image analysis

47
Emerging
562 chuanyangjin/MMToM-QA

[🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind...

47
Emerging
563 obss/trapper

State-of-the-art NLP through transformer models in a modular design and...

47
Emerging
564 leaderj1001/BottleneckTransformers

Bottleneck Transformers for Visual Recognition

47
Emerging
565 Zefan-Cai/KVCache-Factory

Unified KV Cache Compression Methods for Auto-Regressive Models

47
Emerging
566 noahho/CAAFE

Semi-automatic feature engineering process using Language Models and your...

47
Emerging
567 dorarad/gansformer

Generative Adversarial Transformers

47
Emerging
568 raymin0223/mixture_of_recursions

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive...

47
Emerging
569 alpa-projects/alpa

Training and serving large-scale neural networks with auto parallelization.

47
Emerging
570 jackaduma/Recurrent-LLM

The open-source LLM implementation of paper: RecurrentGPT: Interactive...

47
Emerging
571 predibase/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

47
Emerging
572 haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V...

47
Emerging
573 horseee/LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language...

47
Emerging
574 jla524/fromthetensor

From the Tensor to Stable Diffusion, a rough outline for a 10 week course.

47
Emerging
575 jeya-maria-jose/TransWeather

PyTorch code for the paper TransWeather - CVPR 2022

47
Emerging
576 jobergum/browser-ml-inference

Edge Inference in Browser with Transformer NLP model

47
Emerging
577 vectorch-ai/ScaleLLM

A high-performance inference system for large language models, designed for...

47
Emerging
578 hybridgroup/yzma

Go with your own intelligence - Go applications that directly integrate...

47
Emerging
579 ARM-software/keyword-transformer

Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769

47
Emerging
580 ssbuild/chatglm_finetuning

chatglm 6b finetuning and alpaca finetuning

47
Emerging
581 jiwidi/Behavior-Sequence-Transformer-Pytorch

This is a PyTorch implementation of the BST model from Alibaba...

47
Emerging
582 EleutherAI/gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the...

47
Emerging
583 VinAIResearch/PhoBERT

PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)

47
Emerging
584 The-AI-Summer/self-attention-cv

Implementation of various self-attention mechanisms focused on computer...

47
Emerging
585 vinjn/llm-metahuman

An open solution for AI-powered photorealistic digital humans.

47
Emerging
586 monologg/KoBERT-Transformers

KoBERT on 🤗 Huggingface Transformers 🤗 (with Bug Fixed)

47
Emerging
587 codewithdark-git/Building-LLMs-from-scratch

This repository guides you through the process of building a GPT-style Large...

47
Emerging
588 MiniMax-AI/MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention...

46
Emerging
589 NVlabs/RLP

[ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a...

46
Emerging
590 ariannamethod/molequla

molequla.ai. live ecology of GPT organisms

46
Emerging
591 DUTIR-BioNLP/Taiyi-LLM

Taiyi 2, Biomedical LLM, A Bilingual (Chinese and English) Fine-Tuned Large...

46
Emerging
592 livepeer/ai-runner

Inference runtime for running different batch and real-time AI pipelines.

46
Emerging
593 MegEngine/InferLLM

A lightweight LLM inference framework

46
Emerging
594 vfeofanov/mantis

Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time...

46
Emerging
595 hila-chefer/Transformer-MM-Explainability

[ICCV 2021 Oral] Official PyTorch implementation for Generic...

46
Emerging
596 IDEA-CCNL/Fengshenbang-LM

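Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, intended as infrastructure for Chinese AIGC and cognitive intelligence.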

46
Emerging
597 zjunlp/KnowLM

An Open-sourced Knowledgeable Large Language Model Framework.

46
Emerging
598 georgian-io/LLM-Finetuning-Toolkit

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

46
Emerging
599 microsoft/GODEL

Large-scale pretrained models for goal-directed dialog

46
Emerging
600 datawhalechina/base-llm

A full-stack algorithms tutorial from NLP to LLMs; read online at https://datawhalechina.github.io/base-llm/

46
Emerging