All Transformer Models

6,429 models ranked by quality score · Page 12 of 65

Showing 1101–1200 of 6,429
# Model Score Tier
1101 Esmail-ibraheem/Axon

AI research lab🔬: implementations of AI papers and theoretical research:...

41
Emerging
1102 ariannamethod/arianna.c

Arianna is a Digital Persona. Embodied cognition as is.

41
Emerging
1103 Multi-Agent-LLMs/mallm

Framework: Multi-Agent LLMs For Conversational Task-Solving (MALLM)

41
Emerging
1104 declare-lab/instruct-eval

This repository contains code to quantitatively evaluate instruction-tuned...

41
Emerging
1105 willxxy/ECG-Bench

A Unified Framework for Benchmarking Generative Electrocardiogram-Language...

41
Emerging
1106 bigscience-workshop/xmtf

Crosslingual Generalization through Multitask Finetuning

41
Emerging
1107 cloudguruab/modsysML

Human reinforcement learning (RLHF) framework for AI models. Evaluate and...

41
Emerging
1108 yang-ai-lab/SleepLM

SleepLM: Natural-Language Intelligence for Human Sleep

41
Emerging
1109 ariannamethod/ariannamethod.ai

Arianna Method Programming Language

41
Emerging
1110 mlvlab/Flipped-VQA

Large Language Models are Temporal and Causal Reasoners for Video Question...

41
Emerging
1111 XunhaoLai/native-sparse-attention-triton

Efficient triton implementation of Native Sparse Attention.

41
Emerging
1112 HHousen/DocSum

A tool to automatically summarize documents abstractively using the BART or...

41
Emerging
1113 ictnlp/LLaVA-Mini

LLaVA-Mini is a unified large multimodal model (LMM) that can support the...

41
Emerging
1114 pjlab-sys4nlp/llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual...

41
Emerging
1115 zai-org/GLM-Edge

GLM Series Edge Models

41
Emerging
1116 liuqidong07/MOELoRA-peft

[SIGIR'24] The official implementation code of MOELoRA.

41
Emerging
1117 Beomi/InfiniTransformer

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No...

41
Emerging
1118 punica-ai/punica

Serving multiple LoRA finetuned LLM as one

41
Emerging
1119 SqueezeAILab/SqueezeLLM

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

41
Emerging
1120 VITA-MLLM/Freeze-Omni

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with...

41
Emerging
1121 AdrianBZG/llama-multimodal-vqa

Multimodal Instruction Tuning for Llama 3

41
Emerging
1122 fahadshamshad/awesome-transformers-in-medical-imaging

A collection of resources on applications of Transformers in Medical Imaging.

41
Emerging
1123 Breeze648/Transformer-from-Scratch

This repository is positioned as AI paper reproduction / implementing the Transformer from scratch. ...

41
Emerging
1124 HarderThenHarder/transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification,...

41
Emerging
1125 monologg/KoCharELECTRA

Character-level Korean ELECTRA Model (syllable-level Korean ELECTRA)

40
Emerging
1126 kyegomez/SimplifiedTransformers

SimplifiedTransformer simplifies transformer block without affecting...

40
Emerging
1127 lin-tan/clm

For our ICSE23 paper "Impact of Code Language Models on Automated Program...

40
Emerging
1128 NVIDIA/Star-Attention

Efficient LLM Inference over Long Sequences

40
Emerging
1129 thevasudevgupta/bigbird

Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers

40
Emerging
1130 zyds/transformers-code

A hands-on, step-by-step Huggingface Transformers course; videos are updated in sync on Bilibili and YouTube

40
Emerging
1131 ziplab/LIT

[AAAI 2022] This is the official PyTorch implementation of "Less is More:...

40
Emerging
1132 luuyin/OWL

Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity...

40
Emerging
1133 TrustedLLM/LLMDet

LLMDet is a text detection tool that can identify which generated sources...

40
Emerging
1134 JulesBelveze/bert-squeeze

🛠️ Tools for Transformers compression using PyTorch Lightning ⚡

40
Emerging
1135 git-cloner/llama-lora-fine-tuning

llama fine-tuning with lora

40
Emerging
1136 Hsu1023/DuQuant

[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation...

40
Emerging
1137 amitkedia007/Financial-Fraud-Detection-Using-LLMs

The aim of this dissertation is to assess the effectiveness of LLMs such as ...

40
Emerging
1138 ECNU-ICALK/EduChat

An open-source educational chat model from ICALK, East China Normal...

40
Emerging
1139 HHousen/speaker-change-detection

Speaker change detection using SincNet and an LSTM/Transformer

40
Emerging
1140 boheumd/MA-LMM

(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term...

40
Emerging
1141 molbal/llm-text-completion-finetune

Guide on text completion large language model fine-tuning, including example...

40
Emerging
1142 PediaMedAI/AggPose

[IJCAI 2022] Official PyTorch implementation of AggPose: Deep Aggregation...

40
Emerging
1143 RedHatResearch/conext24-NetConfEval

Benchmark for evaluating LLMs in network configuration problems.

40
Emerging
1144 ChristophReich1996/Swin-Transformer-V2

PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up...

40
Emerging
1145 architkaila/Fine-Tuning-LLMs-for-Medical-Entity-Extraction

Exploring the potential of fine-tuning Large Language Models (LLMs) like...

40
Emerging
1146 rishikksh20/CrossViT-pytorch

Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer...

40
Emerging
1147 ukairia777/pytorch-nlp-tutorial

A Deep Learning NLP repository that uses PyTorch to cover everything from text preprocessing to RAG, agents, and LLM fine-tuning.

40
Emerging
1148 jha-lab/acceltran

[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers

40
Emerging
1149 rasbt/blog-finetuning-llama-adapters

Supplementary material for "Understanding Parameter-Efficient Finetuning of...

40
Emerging
1150 johnmai-dev/NotebookMLX

📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)

40
Emerging
1151 xNul/code-llama-for-vscode

Use Code Llama with Visual Studio Code and the Continue extension. A local...

40
Emerging
1152 OpenSparseLLMs/LLaMA-MoE-v2

🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of...

40
Emerging
1153 0x7o/RETRO-transformer

Easy-to-use Retrieval-Enhanced Transformer implementation

40
Emerging
1154 cdli-gh/Semi-Supervised-NMT-for-Sumerian-English

Exploring the Limits of Low-Resource Neural Machine Translation

40
Emerging
1155 kingabzpro/using-llama3-locally

Running llama3 using Ollama-Python, Curl, LangChain, Chroma, and User interface.

40
Emerging
1156 infocusp/llm_seminar_series

Material for the series of seminars on Large Language Models

40
Emerging
1157 zackshen/gguf

a GGUF file parser

40
Emerging
1158 cooelf/AwesomeMRC

IJCAI 2021 Tutorial & code for Retrospective Reader for Machine Reading...

40
Emerging
1159 rednote-hilab/dots.llm1

The official repository of the dots.llm1 base and instruct models proposed...

40
Emerging
1160 google-deepmind/gemma_penzai

A JAX Research Toolkit for Visualizing, Manipulating, and Understanding...

40
Emerging
1161 saqib1707/gpt2-from-scratch

PyTorch Implementation of GPT-2

40
Emerging
1162 huggingface/datablations

Scaling Data-Constrained Language Models

40
Emerging
1163 VITA-Group/Q-GaLore

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank...

40
Emerging
1164 vicgalle/zero-shot-reward-models

ZYN: Zero-Shot Reward Models with Yes-No Questions

40
Emerging
1165 trrahul/llama2.cs

Inference Llama 2 in one file of pure C#

40
Emerging
1166 souzatharsis/tamingLLMs

Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software

40
Emerging
1167 hkust-nlp/deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

40
Emerging
1168 aniketmaurya/llm-inference

Large Language Model (LLM) Inference API and Chatbot

40
Emerging
1169 ai4co/parco

[NeurIPS 2025] PARCO: Parallel AutoRegressive Combinatorial Optimization

40
Emerging
1170 Traffic-Alpha/iLLM-TSC

This repository contains the code for the paper "iLLM-TSC: Integration...

40
Emerging
1171 HqWu-HITCS/Awesome-Chinese-LLM

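A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.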

40
Emerging
1172 teticio/llama-squad

Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a...

40
Emerging
1173 Fsoft-AIC/Grasp-Anything

Dataset and Code for ICRA 2024 paper "Grasp-Anything: Large-scale Grasp...

40
Emerging
1174 Omid-Nejati/BEFUnet

A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation

40
Emerging
1175 SamsungSAILMontreal/ghn3

Code for "Can We Scale Transformers to Predict Parameters of Diverse...

40
Emerging
1176 Bindwell/PLAPT

Codebase and CLI for PLAPT: A state-of-the-art protein-ligand binding...

40
Emerging
1177 OpenBMB/InfiniteBench

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K...

40
Emerging
1178 aJupyter/ThinkLLM

ThinkLLM: 🚀 Lightweight, efficient implementations of large language model algorithms

40
Emerging
1179 l294265421/alpaca-rlhf

Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback)...

40
Emerging
1180 IAAR-Shanghai/Grimoire

Grimoire is All You Need for Enhancing Large Language Models

40
Emerging
1181 prismformore/Multi-Task-Transformer

Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting...

40
Emerging
1182 bigcode-project/selfcodealign

[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation

40
Emerging
1183 vmicheli/delta-iris

Efficient World Models with Context-Aware Tokenization. ICML 2024

40
Emerging
1184 harishdeivanayagam/rowfill

Open-source spreadsheets platform for deep research and document processing

40
Emerging
1185 takashiishida/paper2slides

Transform any arXiv papers into slides using LLMs

40
Emerging
1186 hongyehu/Machine_Learning_Quantum_State_Tomography

An unofficial PyTorch implementation of using generative models to do...

40
Emerging
1187 DmitryNekrasov/ai-code-completion-idea-plugin

Implementation of IntelliJ IDEA code completion plugin using a local LLM.

40
Emerging
1188 cgbur/llama2.zig

Inference Llama 2 in one file of pure Zig

40
Emerging
1189 asigalov61/Allegro-Music-Transformer

Full-attention multi-instrumental music transformer featuring asymmetrical...

40
Emerging
1190 alexrozanski/LlamaChat

Chat with your favourite LLaMA models in a native macOS app

40
Emerging
1191 yifanzhang-pro/HLA

Official Project Page for HLA: Higher-order Linear Attention...

40
Emerging
1192 samestrin/llm-pdf-ocr-api

A Python-based REST API for PDF OCR using AI models with PyTorch and...

40
Emerging
1193 hans00/react-native-transformers-example

Example of transformers.js on React Native

40
Emerging
1194 elapse-annals/laravel-plus

Based on Laravel transformation and expansion, more convenient for practical...

40
Emerging
1195 Sachithx/EntroPE

This includes the codebase for EntroPE (Entropy-Guided Dynamic Patch Encoder...

40
Emerging
1196 tlkh/t2t-tuner

Convenient Text-to-Text Training for Transformers

40
Emerging
1197 Traffic-Alpha/LLM-Assisted-Light

This repository contains the code for the paper "LLM-Assisted Light:...

40
Emerging
1198 clabrugere/scratch-llm

Implements an LLM similar to Meta's Llama 2 from the ground up in PyTorch,...

40
Emerging
1199 amithkoujalgi/ollama-pdf-bot

A bot that accepts PDF docs and lets you ask questions on it.

40
Emerging
1200 WENGSYX/LMTuner

LMTuner: Make the LLM Better for Everyone

40
Emerging