akanimax/T2F

T2F: text to face generation using Deep Learning

49
/ 100
Emerging

Combines LSTM text encoding with progressive GAN training to synthesize realistic facial images from natural language descriptions, using the Face2Text dataset of 400 images with captions. The architecture feeds LSTM-encoded text embeddings through a conditioning augmentation layer into ProGAN's generator, while employing layer-by-fade-in training for stable multi-resolution generation. Implemented in PyTorch with modular components for data processing, network definitions, and configurable hyperparameters for progressive depth expansion from 64×64 to higher resolutions.

547 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 23 / 25

How are scores calculated?

Stars

547

Forks

95

Language

Python

License

MIT

Category

gan-based-t2i

Last pushed

May 14, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/akanimax/T2F"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.