arashdn/dtt
DTT: A Deep Learning Framework to Transform Tabular Data for Joinability by Leveraging Large Language Models
This project helps data professionals clean and standardize messy tabular data so it can be accurately combined, or 'joined,' with other datasets. It takes a source table and a target table, applies a deep learning model to transform the source data's format, and outputs a revised source table ready for joining. It's designed for data analysts, data scientists, or database administrators who deal with inconsistent data formats.
No commits in the last 6 months.
Use this if you frequently encounter situations where you need to join multiple tables but their text-based columns (like names, addresses, or product descriptions) don't match perfectly due to variations in spelling, formatting, or abbreviations.
Not ideal if your data cleaning needs are limited to numerical data transformations or simple string operations that can be handled with standard scripting or spreadsheet functions.
Stars
10
Forks
—
Language
Python
License
—
Category
Last pushed
Jun 13, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/arashdn/dtt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
wx-chevalier/NLP-Notes
人工智能与深度学习实战 - 自然语言处理篇
TingFree/NLPer-Arsenal
收录NLP竞赛策略实现、各任务baseline、相关竞赛经验贴(当前赛事、往期赛事、训练赛)、NLP会议时间、常用自媒体、GPU推荐等,持续更新中
wx-chevalier/DeepLearning-Notes
人工智能与深度学习实战 - 深度学习篇
hscspring/All4NLP
All For NLP, especially Chinese.
km1994/NLP-Interview-Notes
该仓库主要记录 NLP 算法工程师相关的面试题