veezbo/akkadian_english_corpus

Cleaned Akkadian English Corpus for LLMs

20
/ 100
Experimental

This project provides a meticulously cleaned and pre-processed collection of Akkadian texts translated into English. It takes raw, expert-translated Akkadian-English materials, removes inconsistencies and irrelevant notes, and enriches them with clear translation details. The output is a highly usable dataset designed for researchers and computational linguists working with ancient languages.

No commits in the last 6 months.

Use this if you are a researcher or computational linguist needing a high-quality, pre-processed dataset of Akkadian-to-English translations for training or analysis.

Not ideal if you need a dataset for a different ancient language or if you require the original, uncleaned text without any modifications.

ancient-languages assyriology computational-linguistics historical-research text-analysis
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

8

Forks

Language

Jupyter Notebook

License

MIT

Last pushed

Oct 10, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/veezbo/akkadian_english_corpus"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.