akanimax/T2F
T2F: text to face generation using Deep Learning
Combines LSTM text encoding with progressive GAN training to synthesize realistic facial images from natural language descriptions, using the Face2Text dataset of 400 images with captions. The architecture feeds LSTM-encoded text embeddings through a conditioning augmentation layer into ProGAN's generator, while employing layer-by-fade-in training for stable multi-resolution generation. Implemented in PyTorch with modular components for data processing, network definitions, and configurable hyperparameters for progressive depth expansion from 64×64 to higher resolutions.
547 stars. No commits in the last 6 months.
Stars
547
Forks
95
Language
Python
License
MIT
Category
Last pushed
May 14, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/akanimax/T2F"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Yutong-Zhou-cv/Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
tobran/DF-GAN
[CVPR2022 oral] A Simple and Effective Baseline for Text-to-Image Synthesis
aelnouby/Text-to-Image-Synthesis
Pytorch implementation of Generative Adversarial Text-to-Image Synthesis paper
Baiyuetribe/paper2gui
Convert AI papers to GUI,Make it easy and convenient for everyone to use artificial intelligence...
Chen-Yang-Liu/Text2Earth
[IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a...