Linda5823/Magic-Point-to-Read-V3
🪄 Magic Point-to-Read: An interactive AI reading assistant that uses Google Gemini (Vision OCR & TTS) to turn any image into clickable, audible learning material. It works like an interactive point-to-read pen built on Gemini, supporting image recognition, translation, and spoken read-aloud.
Stars
—
Forks
—
Language
TypeScript
License
MIT
Category
Last pushed
Feb 10, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/Linda5823/Magic-Point-to-Read-V3"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
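The same keyless request can be sketched in TypeScript. This is a minimal sketch, not an official client: the helper names `buildQualityUrl` and `fetchQuality` are hypothetical, the response is assumed to be JSON (its fields are not documented on this page, so it is returned as `unknown`), and a runtime with a global `fetch` (for example Node 18+ or a browser) is assumed.

```typescript
// Base URL copied from the curl example on this page.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality/agents";

// Hypothetical helper: builds the endpoint URL for an owner/repo pair.
function buildQualityUrl(owner: string, repo: string): string {
  return `${API_BASE}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Hypothetical helper: fetches the record without an API key.
// Assumption: the endpoint returns JSON. The shape is unknown here,
// so the caller is left to inspect or validate the result.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const response = await fetch(buildQualityUrl(owner, repo));
  if (!response.ok) {
    // Covers rate-limit and not-found responses alike.
    throw new Error(`Request failed with status ${response.status}`);
  }
  return response.json();
}
```

Calling `fetchQuality("Linda5823", "Magic-Point-to-Read-V3")` would issue the same request as the curl command. How a key is supplied for the higher limit is not stated on this page, so the sketch leaves it out.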
Higher-rated alternatives
GetStream/Vision-Agents
Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses...
video-db/videodb-capture-quickstart
Give your agents real time desktop perception. Stream screen, microphone, and system audio for...
TheSethRose/AI-File-Organizer-Agent
Uses an AI agent (powered by Google Gemini via the Agno framework) to intelligently propose and...
Karmacoke/chargen
AI-powered character generator built with React. Create detailed TRPG/Novel characters, NPC...
grctest/g3n-fastapi-webcam-docker
Utilizing multiple Gemma 3n agents to analyze webcam footage