rese1f/aurora

[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

36
/ 100
Emerging

Employs token merging to reduce visual token count before LLM processing, enabling efficient video captioning while maintaining performance across inference and training. Includes the Video Detailed Caption (VDC) benchmark and integrates with lmms-eval for standardized evaluation, with planned support for HuggingFace transformers and SGLang deployment. Provides both XTuner and HuggingFace format weights, supporting continued training and inference through multiple framework backends.

139 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 8 / 25

How are scores calculated?

Stars

139

Forks

6

Language

Python

License

Apache-2.0

Last pushed

Jun 04, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/rese1f/aurora"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.