SALMONN and video-SALMONN-2

These are ecosystem siblings where SALMONN is a foundational multi-modal LLM framework and video-SALMONN-2 is a specialized extension that applies the same architecture specifically to audio-visual video understanding tasks.

SALMONN
54
Established
video-SALMONN-2
50
Established
Maintenance 10/25
Adoption 10/25
Maturity 16/25
Community 18/25
Maintenance 10/25
Adoption 10/25
Maturity 15/25
Community 15/25
Stars: 1,392
Forks: 112
Downloads:
Commits (30d): 0
Language:
License: Apache-2.0
Stars: 167
Forks: 19
Downloads:
Commits (30d): 0
Language: Python
License: Apache-2.0
No Package No Dependents
No Package No Dependents

About SALMONN

bytedance/SALMONN

SALMONN family: A suite of advanced multi-modal LLMs

About video-SALMONN-2

bytedance/video-SALMONN-2

video-SALMONN 2 is a powerful audio-visual large language model (LLM) that generates high-quality audio-visual video captions, which is developed by the Department of Electronic Engineering at Tsinghua University and ByteDance.

Scores updated daily from GitHub, PyPI, and npm data. How scores work