krik8235/rag-agent-vision-model
An image-to-text agent using NLP and Llama 3.2 11B Vision Model. The agent will analyze the image file, extract keywords, group them semantically, and craft concise sentences demonstrating correct usage.
No commits in the last 6 months.
Stars
2
Forks
1
Language
Python
License
MIT
Category
Last pushed
Nov 30, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/krik8235/rag-agent-vision-model"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
LearningCircuit/local-deep-research
Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini). Supports...
NVIDIA-AI-Blueprints/rag
This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented...
Denis2054/RAG-Driven-Generative-AI
This repository provides programs to build Retrieval Augmented Generation (RAG) code for...
0verL1nk/PaperSage
📚 AI-powered research reading workbench. Project-based paper Q&A with Hybrid RAG, multi-agent...
RapidFireAI/rapidfireai
RapidFire AI: Rapid AI Customization from RAG to Fine-Tuning