jack-tol/usda-food-data-pipeline
Code for the USDA Branded Food Dataset pipeline and the USDA Food Assistant. This project consolidates USDA FoodData Central data into a structured dataset, along with an interactive tool that allows for conversational exploration of food items, nutrients, and ingredients.
The pipeline automates ingestion and transformation of 34 USDA FoodData Central CSV files into a normalized, ML-ready dataset. The Food Assistant uses semantic search via Pinecone vector indexing with multilingual-e5-large embeddings to enable conversational queries, combining retrieval with language generation to answer nutrition and ingredient questions. The cleaned dataset is published on HuggingFace Datasets with a live demo available on HuggingFace Spaces.
No commits in the last 6 months.
Stars
7
Forks
1
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Nov 07, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/jack-tol/usda-food-data-pipeline"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Azure-Samples/Cosmic-Food-RAG-app
A chat-based recommendation application that revolutionizes the culinary experience.
Decade-qiu/CookHero
CookHero是一个基于 LLM + RAG + Agent + 多模态的智能饮食与烹饪管理平台,支持智能菜谱查询、个性化饮食计划、AI 饮食记录、营养分析、Web 搜索增强,以及可扩展的...
FutureUnreal/What-to-eat-today
🍽️基于图RAG技术的AI美食推荐助手 - Datawhale all-in-rag教程实战案例,集成Neo4j图数据库、Milvus向量检索与智能对话系统
WalidAlsafadi/Recipa-RAG-Assistant
Recipa AI is a full-stack Retrieval-Augmented Generation project that turns a cookbook PDF into...
Fat1512/NutriTrack
NutriTrack là hệ thống hỗ trợ theo dõi dinh dưỡng và lối sống, giúp người dùng quản lý chế độ ăn...