manunair1990/Phi3-Vision-huggingface
This repository contains Python code for performing vision tasks using the Microsoft Phi-3 Vision model and the Hugging Face library. It demonstrates generating textual responses based on image content, showcasing the integration of advanced vision-language models for tasks such as image analysis and descriptive text generation.
No commits in the last 6 months.
Stars
2
Forks
—
Language
Python
License
—
Category
Last pushed
Jun 06, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/manunair1990/Phi3-Vision-huggingface"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
NVIDIA-NeMo/NeMo
A scalable generative AI framework built for researchers and developers working on Large...
alexiglad/EBT
PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning
vlm-run/vlmrun-hub
A hub for various industry-specific schemas to be used with VLMs.
HyperGAI/HPT
HPT - Open Multimodal LLMs from HyperGAI
yash9439/Falcon-Local-AI-Model
Explore this GitHub repository housing 3 versions of Falcon code for text generation. Each...