samchengcs/IKEA-Dataset

A dataset for multimodal machine translation

21
/ 100
Experimental

This dataset helps e-commerce professionals, localization specialists, and product managers improve multilingual communication. It provides product descriptions in English-French and English-German pairs, alongside product images, sourced from IKEA and Under Armour. This allows users to train and evaluate systems that translate product information more accurately by understanding both text and visuals.

No commits in the last 6 months.

Use this if you need a specialized dataset to train or test machine translation systems that leverage both text and images for product descriptions.

Not ideal if you need general-purpose text translation, data for domains outside of e-commerce products, or require a very large dataset for a single language pair.

e-commerce product-localization multilingual-content catalog-management international-marketing
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

13

Forks

Language

License

MIT

Last pushed

Dec 06, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/samchengcs/IKEA-Dataset"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.