All Transformer Models

6,429 models ranked by quality score · Page 16 of 65

Showing 1501–1600 of 6,429
# Model Score Tier
1501 AlexanderVNikitin/kernel-language-entropy

Code for Fine-grained Uncertainty Quantification for LLMs from Semantic...

37
Emerging
1502 rbitr/llm.f90

LLM inference in Fortran

37
Emerging
1503 zjohn77/lightning-mlflow-hf

Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow

37
Emerging
1504 xiangking/prompt_uie_torch

Medical named entity recognition using PaddleNLP's open-source extractive UIE (torch implementation)

37
Emerging
1505 ksm26/Finetuning-Large-Language-Models

Unlock the potential of finetuning Large Language Models (LLMs). Learn from...

37
Emerging
1506 HomebrewML/HomebrewNLP-torch

A case study of efficient training of large language models using commodity hardware.

37
Emerging
1507 litus-ai/classy

classy is a simple-to-use library for building high-performance Machine...

37
Emerging
1508 mantasu/cs224n

Solutions for CS224n (2022)

37
Emerging
1509 lliai/D2MoE

D^2-MoE: Delta Decompression for MoE-based LLMs Compression

37
Emerging
1510 alexeykarnachev/full_stack_transformer

PyTorch library for end-to-end transformer model training, inference, and serving

37
Emerging
1511 openshieldai/openshield

OpenShield is a new generation security layer for AI models

37
Emerging
1512 mohyunho/NAS_transformer

Evolutionary Neural Architecture Search on Transformers for RUL Prediction

37
Emerging
1513 GithubX-F/DynaMO-RL

Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization...

37
Emerging
1514 GT-RIPL/robo-vln

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics...

37
Emerging
1515 taishi-i/nagisa_bert

A BERT model for nagisa

37
Emerging
1516 poteminr/instruct-ner

Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models...

37
Emerging
1517 canjiali/PARADE

Code and data to facilitate BERT/ELECTRA for document ranking. Details refer...

37
Emerging
1518 user1342/Tomato

LLM steganography with minimum-entropy coupling - Hiding encrypted messages...

37
Emerging
1519 all-things-vits/code-samples

Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and...

37
Emerging
1520 RLado/STB-VMM

STB-VMM: Swin Transformer Based Video Motion Magnification (official repository)

37
Emerging
1521 jhcho99/CoFormer

[CVPR'22] Official PyTorch Implementation of "Collaborative Transformers for...

37
Emerging
1522 mu-cai/matryoshka-mm

Matryoshka Multimodal Models

37
Emerging
1523 rkansal47/MPGAN

The message passing GAN https://arxiv.org/abs/2106.11535 and generative...

37
Emerging
1524 Shanghai-Digital-Brain-Laboratory/BDM-DB1

A large-scale multi-modal pre-trained model

37
Emerging
1525 princeton-nlp/LLMBar

[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following

37
Emerging
1526 zd11024/NaviLLM

[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for...

37
Emerging
1527 microsoft/AdaMix

This is the implementation of the paper AdaMix: Mixture-of-Adaptations for...

37
Emerging
1528 eqimp/hogwild_llm

Official PyTorch implementation for Hogwild! Inference: Parallel LLM...

37
Emerging
1529 zjunlp/Mol-Instructions

[ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset...

37
Emerging
1530 CTCycle/ADSMOD-Adsorption-Modeling

Streamline adsorption modeling by automatically fitting theoretical...

37
Emerging
1531 bodeby/torchstack

🫧 probability-level model ensembling for transformers

37
Emerging
1532 DebarshiChanda/Amazon-ML-Challenge2021

Scripts and Approach for Amazon ML Challenge

37
Emerging
1533 desaixie/zeroverse

Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction...

37
Emerging
1534 HKUNLP/icl-ceil

[ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.

37
Emerging
1535 K-H-Ismail/torchortho

[ICLR 2026] Polynomial, trigonometric, and tropical activations

37
Emerging
1536 joslefaure/HERMES

[ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes...

37
Emerging
1537 ImKeTT/AdaVAE

[Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling...

37
Emerging
1538 babycommando/machinascript-for-robots

Build LLM-powered robots in your garage with MachinaScript For Robots!

37
Emerging
1539 locuslab/massive-activations

Code accompanying the paper "Massive Activations in Large Language Models"

37
Emerging
1540 eduard23144/locoformer

🤖 Explore LocoFormer, a Transformer-XL model that enhances robot locomotion...

37
Emerging
1541 ariya/gamal

Research tool leveraging LLM for answers

37
Emerging
1542 lukechilds/humanscript

A truly natural scripting language

37
Emerging
1543 SALT-NLP/LLaVAR

Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for...

37
Emerging
1544 promptslab/LLMtuner

Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)

37
Emerging
1545 horus-ai-labs/DistillFlow

Library for model distillation

37
Emerging
1546 juyongjiang/CodeUp

CodeUp: A Multilingual Code Generation Llama-X Model with...

37
Emerging
1547 extreme-bert/extreme-bert

ExtremeBERT is a toolkit that accelerates the pretraining of customized...

37
Emerging
1548 sandesha21/Stock-Market-News-Sentiment-Analysis-and-Summarization

NLP pipeline for classifying sentiment in financial news and generating...

37
Emerging
1549 OSU-NLP-Group/AmpleGCG

AmpleGCG: Learning a Universal and Transferable Generator of Adversarial...

37
Emerging
1550 yangjianxin1/Firefly

Firefly:...

37
Emerging
1551 viddexa/moderators

One package to moderate them all

37
Emerging
1552 FuxiaoLiu/LRV-Instruction

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust...

37
Emerging
1553 volverjs/ai

Hugging Face Transformers.js wrapper for on-device AI with web-workers

37
Emerging
1554 iiis-ai/cumulative-reasoning

[TMLR] Cumulative Reasoning With Large Language Models...

37
Emerging
1555 CVI-SZU/Linly

Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pretraining/instruction-tuning datasets

37
Emerging
1556 iMoonLab/LLM4Hypergraph

The source code of ICLR 2025 "Beyond Graphs: Can Large Language Models...

37
Emerging
1557 tommyip/mamba2-minimal

Minimal Mamba-2 implementation in PyTorch

37
Emerging
1558 ziegler-ingo/cleavage_benchmark

[BIBM 2025] Code and resources for the paper "Enhancing Multi-Epitope...

37
Emerging
1559 hyintell/awesome-refreshing-llms

EMNLP'23 survey: a curation of awesome papers and resources on refreshing...

37
Emerging
1560 SimeonHristov99/DL_25-26

Practice sessions for the course "Introduction to deep learning" in the...

37
Emerging
1561 huggingface/large_language_model_training_playbook

An open collection of implementation tips, tricks and resources for training...

37
Emerging
1562 GyanPrakashkushwaha/DataScience

EVERYTHING YOU NEED FOR DATA SCIENCE.

37
Emerging
1563 softengg-manoj/dreamer4

🌟 Implement Dreamer 4 for training agents within scalable world models,...

37
Emerging
1564 NVlabs/RocketKV

[ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage...

36
Emerging
1565 ziplab/HVT

[ICCV 2021] Official implementation of "Scalable Vision Transformers with...

36
Emerging
1566 oValach/RailSafeNet

Repository of the paper: RailSafeNet: Visual Scene Understanding for Tram Safety

36
Emerging
1567 FSoft-AI4Code/CodeCapybara

Open-source Self-Instruction Tuning Code LLM

36
Emerging
1568 jaketae/alibi

PyTorch implementation of Train Short, Test Long: Attention with Linear...

36
Emerging
1569 AaronFeng753/Ollama-Model-Dumper

Export and Backup Ollama models into GGUF and ModelFile

36
Emerging
1570 asigalov61/Perceiver-Music-Transformer

SOTA Google's Perceiver-AR Music Transformer Implementation and Model

36
Emerging
1571 Alsace08/Chain-of-Embedding

[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding...

36
Emerging
1572 kaistAI/Janus

[NeurIPS 2024] Train LLMs with diverse system messages reflecting...

36
Emerging
1573 hao-ai-lab/DistCA

Efficient Long-context Language Model Training by Core Attention Disaggregation

36
Emerging
1574 kyegomez/DifferentialTransformer

An open source community implementation of the model from "DIFFERENTIAL...

36
Emerging
1575 DFKI-NLP/thermostat

Collection of NLP model explanations and accompanying analysis tools

36
Emerging
1576 pleisto/yuren-baichuan-7b

An open-source multimodal large language model based on baichuan-7b

36
Emerging
1577 warner-benjamin/commented-transformers

Highly commented implementations of Transformers in PyTorch

36
Emerging
1578 sinanuozdemir/oreilly-bert-nlp

This repository contains code for the O'Reilly Live Online Training for BERT

36
Emerging
1579 lifeadventurer/sentify

Leveraging Sentiment Analysis on News for Stock Market Insights

36
Emerging
1580 AIFrameResearch/SPO

Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL...

36
Emerging
1581 general-preference/general-preference-model

[ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for...

36
Emerging
1582 harryjdavies/HeartGPT

Interpretable Pre-Trained Transformers for Heart Time-Series Data

36
Emerging
1583 qingsongedu/Awesome-TimeSeries-SpatioTemporal-LM-LLM

A professional list on Large (Language) Models and Foundation Models (LLM,...

36
Emerging
1584 weiserlab/TinyLLM

Bringing Language Models to the Most Resource Constrained Devices

36
Emerging
1585 styfeng/DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

36
Emerging
1586 zhilizju/Awesome-instruction-tuning

A curated list of awesome instruction tuning datasets, models, papers and...

36
Emerging
1587 DAMO-NLP-SG/LLM-Zoo

LLM Zoo collects information on various open- and closed-source LLMs

36
Emerging
1588 aryan-jadon/Regression-Loss-Functions-in-Time-Series-Forecasting-Tensorflow

This repository contains the implementation of paper Temporal Fusion...

36
Emerging
1589 dravenk/ollama-zig

Ollama Zig library

36
Emerging
1590 epfl-dlab/llm-latent-language

Repo accompanying our paper "Do Llamas Work in English? On the Latent...

36
Emerging
1591 mrdbourke/mac-ml-speed-test

A few quick scripts focused on testing TensorFlow/PyTorch/Llama 2 on macOS.

36
Emerging
1592 chanind/linear-relational

Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs)...

36
Emerging
1593 csiro-robotics/HOTFormerLoc

[IEEE/CVF CVPR 2025] Hierarchical Octree Transformer for Versatile Lidar...

36
Emerging
1594 mala-lab/SEMPO

[NeurIPS 2025] Official implementation of "SEMPO: Lightweight Foundation...

36
Emerging
1595 ahans30/goldfish-loss

[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs

36
Emerging
1596 mytechnotalent/RE-GPT

Inspired by Andrej Karpathy’s "Let’s Build GPT", this project guides you...

36
Emerging
1597 tlc4418/llm_optimization

A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.

36
Emerging
1598 lechmazur/writing

This benchmark tests how well LLMs incorporate a set of 10 mandatory story...

36
Emerging
1599 yinboc/trans-inr

Transformers as Meta-Learners for Implicit Neural Representations, in ECCV 2022

36
Emerging
1600 chaitjo/gated-graph-transformers

Transformers are Graph Neural Networks!

36
Emerging