QData/C-Tran
General Multi-label Image Classification with Transformers
Implements a transformer-based architecture with label-specific attention mechanisms and multi-head label attention (LMT) to model inter-label dependencies for multi-label classification tasks. Built on PyTorch with support for standard benchmarks (COCO, VOC2007), featuring configurable transformer layers and gradient accumulation for efficient training on large-scale datasets.
280 stars. No commits in the last 6 months.
Stars
280
Forks
44
Language
Python
License
MIT
Category
Last pushed
Nov 02, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/QData/C-Tran"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related models
jesus3476/Fire-Detection-Siglip2
Fire-Detection-Siglip2 is an image classification vision-language encoder model fine-tuned from...
pagraf/Seabed-Net
Quick start guide for Seabed-Net
moharamfatema/graduation-project
Video vision transformers for hierarchical anomaly detection in video scenes.
apollosoldier/Advanced-Classifier
The Advanced Classification Model is a deep learning-based approach for image classification...
PRITHIVSAKTHIUR/Fire-Detection-Siglip2
Fire-Detection-Siglip2 is an image classification vision-language encoder model fine-tuned from...