invictus717/MetaTransformer

Meta-Transformer for Unified Multimodal Learning

44
/ 100
Emerging

Implements a shared-encoder architecture with modality-agnostic "Data-to-Sequence" tokenization that unifies 12 diverse data types (text, images, point clouds, audio, video, medical/hyperspectral/infrared imagery, graphs, tabular, time-series, IMU) into a single transformer backbone. Supports unpaired multimodal training and downstream task-specific heads for classification, detection, and segmentation, with pretrained weights available on LAION-2B and compatible with Hugging Face and OpenXLab ecosystems.

1,654 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

1,654

Forks

117

Language

Python

License

Apache-2.0

Last pushed

Dec 05, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/invictus717/MetaTransformer"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.