BothBosu/Synthetic-Data-for-Scam-Detection-Leveraging-LLMs-to-Train-Deep-Learning-Models
This repository contains the source code and synthetic datasets used in the research on scam detection using deep learning models trained on data generated by Large Language Models (LLMs).
No commits in the last 6 months.
Stars
6
Forks
1
Language
Jupyter Notebook
License
MIT
Category
Last pushed
May 20, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/BothBosu/Synthetic-Data-for-Scam-Detection-Leveraging-LLMs-to-Train-Deep-Learning-Models"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
VikParuchuri/textbook_quality
Generate textbook-quality synthetic LLM pretraining data
dmanuel64/codablellm
A framework for creating and curating high-quality code datasets tailored for large language models
BhabhaAI/dataformer
Solving data for LLMs - Create quality synthetic datasets!
iiis-ai/TemplateMath
[ICLR 2025 DATA-FM] Training and Evaluating Language Models with Template-based Data Generation...
MichiganNLP/depression_synthetic_data
Can LMs generate useful synthetic data for the mental health domain?