SALMONN and video-SALMONN-2
These are ecosystem siblings where SALMONN is a foundational multi-modal LLM framework and video-SALMONN-2 is a specialized extension that applies the same architecture specifically to audio-visual video understanding tasks.
Maintenance
10/25
Adoption
10/25
Maturity
16/25
Community
18/25
Maintenance
10/25
Adoption
10/25
Maturity
15/25
Community
15/25
Stars: 1,392
Forks: 112
Downloads: —
Commits (30d): 0
Language: —
License: Apache-2.0
Stars: 167
Forks: 19
Downloads: —
Commits (30d): 0
Language: Python
License: Apache-2.0
No Package
No Dependents
No Package
No Dependents
About SALMONN
bytedance/SALMONN
SALMONN family: A suite of advanced multi-modal LLMs
About video-SALMONN-2
bytedance/video-SALMONN-2
video-SALMONN 2 is a powerful audio-visual large language model (LLM) that generates high-quality audio-visual video captions, which is developed by the Department of Electronic Engineering at Tsinghua University and ByteDance.
Scores updated daily from GitHub, PyPI, and npm data. How scores work