di37/multiclass-image-classification-using-multimodal-llms

A comprehensive comparison of multimodal models - llama3.2-vision, minicpm-v, llava-llama3, llava, llava13:b and closed source models for animal classification tasks. This project evaluates various models' performance in classifying 10 different animal species, ranging from common to rare animals.

/ 100

Experimental

No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 1 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Category

llm-thesis-research

Last pushed

Dec 10, 2024

Commits (30d)

GitHub

LLM Thesis Research · 46 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/di37/multiclass-image-classification-using-multimodal-llms"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

muharamdani/notebooklm-categorizer

NotebookLM Project Categorizer

aimclub/Edulytica

The purpose of the study is to automate the analysis of scientific and educational documents in...

cssmagic/Awesome-AI

收集分享 AI 大型语言模型 (LLM)、AI 辅助编程、AI 绘画等领域的常用资料，探索生成式人工智能的应用与开发。

IndoNLP/indonlg

The first-ever vast natural language generation benchmark for Indonesian, Sundanese, and...

BodduSriPavan-111/diemsim

A Python Library Implementing Dimension Insensitive Euclidean Metric (DIEM)

Explore LLM Tools

All categories Trending LLM Tool directory Insights