BothBosu/Synthetic-Data-for-Scam-Detection-Leveraging-LLMs-to-Train-Deep-Learning-Models

This repository contains the source code and synthetic datasets used in the research on scam detection using deep learning models trained on data generated by Large Language Models (LLMs).

/ 100

Emerging

No commits in the last 6 months.

Stale 6m · No Package · No Dependents

Maintenance 2 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 10 / 25

Language: Jupyter Notebook

License: MIT

Category: synthetic-data-generation

Last pushed: May 20, 2025

Synthetic Data Generation · 13 models

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/BothBosu/Synthetic-Data-for-Scam-Detection-Leveraging-LLMs-to-Train-Deep-Learning-Models"

Open to everyone: 100 requests/day with no key. A free key raises the limit to 1,000/day.
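For scripted use, the same request can be sketched in Python with only the standard library. This is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the URL pattern is inferred from the single curl example above, and the response is returned as raw text because the page does not document its format.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example above. Treating it as a reusable
# prefix for other repositories is an assumption, not a documented contract.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def build_quality_url(owner: str, repo: str) -> str:
    """Build the quality-endpoint URL for an owner/repo pair (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters cannot alter the path.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the endpoint and return the response body as text (hypothetical helper).

    The response format is not documented on this page, so the body is
    returned undecoded beyond its character set rather than parsed.
    """
    with urlopen(build_quality_url(owner, repo), timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)
```

Calling `fetch_quality("BothBosu", "Synthetic-Data-for-Scam-Detection-Leveraging-LLMs-to-Train-Deep-Learning-Models")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, caching the result locally is worth considering for repeated runs.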

Higher-rated alternatives

VikParuchuri/textbook_quality

Generate textbook-quality synthetic LLM pretraining data

dmanuel64/codablellm

A framework for creating and curating high-quality code datasets tailored for large language models

BhabhaAI/dataformer

Solving data for LLMs - Create quality synthetic datasets!

iiis-ai/TemplateMath

[ICLR 2025 DATA-FM] Training and Evaluating Language Models with Template-based Data Generation...

MichiganNLP/depression_synthetic_data

Can LMs generate useful synthetic data for the mental health domain?
