QwenLM/Qwen-VL

The official repository for Qwen-VL (通义千问-VL), the chat and pretrained large vision-language models proposed by Alibaba Cloud.

45 / 100

Emerging

Supports ultra-high resolution images up to millions of pixels with extreme aspect ratios, enabling detailed text recognition and document analysis. Built on a vision-language architecture optimized for reasoning tasks, the model family includes quantized Int4 variants and API-accessible Plus/Max versions. Integrates with HuggingFace, ModelScope, and Alibaba's DashScope API, with fine-tuning support via full-parameter, LoRA, and Q-LoRA methods.
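As a rough illustration of the HuggingFace integration mentioned above, the sketch below loads the chat checkpoint through `transformers` and asks one question about an image. It assumes the `Qwen/Qwen-VL-Chat` model ID and the `from_list_format` and `chat` helpers that the checkpoint ships as remote code, so check them against the current upstream README. The names `build_query_items` and `ask_qwen_vl`, and the image path in the usage note, are hypothetical and exist only for this example.

```python
def build_query_items(image_path, question):
    """Build the image-plus-text item list passed to the tokenizer helper.

    Hypothetical helper for this sketch: one dict for the image (local path
    or URL) followed by one dict for the text prompt.
    """
    return [{"image": image_path}, {"text": question}]


def ask_qwen_vl(image_path, question, model_id="Qwen/Qwen-VL-Chat"):
    """Load the chat model and return its answer to a single question.

    Assumption: the checkpoint's remote code provides
    tokenizer.from_list_format() and model.chat(); both come from the
    model repository, not from the transformers library itself.
    """
    # Imported lazily so build_query_items() works without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # trust_remote_code=True runs code downloaded with the checkpoint.
    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        device_map="auto",  # needs the accelerate package installed
        trust_remote_code=True,
    ).eval()

    query = tokenizer.from_list_format(build_query_items(image_path, question))
    response, _history = model.chat(tokenizer, query=query, history=None)
    return response
```

Calling `ask_qwen_vl("demo.jpeg", "What is in this image?")` downloads the model weights on first use, so it needs network access and a suitably large GPU or plenty of patience on CPU.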

6,569 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

Stars: 6,569

Forks: 491

Language: Python

License: —

Higher-rated alternatives

QwenLM/Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

LLM-Red-Team/qwen-free-api

🚀...

willbnu/Qwen-3.5-16G-Vram-Local

Configs, launchers, benchmarks, and tooling for running Qwen3.5 GGUF models locally with...

yassa9/qwen600

Static suckless single batch CUDA-only qwen3-0.6B mini inference engine

QwenLM/qwen.cpp

C++ implementation of Qwen-LM
