workforyou786/Large-Language-Model-Research-Paper
Multimodal AI — systems that can understand and generate information across text, images, and sometimes audio/video. LLMs (Large Language Models), Computer Vision (CV), and Natural Language Processing (NLP) is through Multimodal AI
Stars
—
Forks
—
Language
—
License
MIT
Category
Last pushed
Feb 01, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/workforyou786/Large-Language-Model-Research-Paper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
gabeur/mmt
Multi-Modal Transformer for Video Retrieval
JerryYLi/valhalla-nmt
Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"
MichiganNLP/Scalable-VLM-Probing
Probe Vision-Language Models
benywon/LALM
code and resource for ACL2021 paper 'Multi-Lingual Question Generation with Language Agnostic...
thunlp/cost-optimal-gqa
The code for the paper "Cost-Optimal Grouped-Query Attention for Long-Context Modeling"