Rishikesh-Jadhav/Multi-Modal-Image-Generation-using-Grounding-DINO-SAM-and-Stable-Diffusionv2
A project that combines Grounding-DINO with Meta AI's Segment Anything Model (SAM) and Stable Diffusion for prompt-driven image manipulation. The plan is to integrate these techniques and deploy the model on Hugging Face with a Gradio interface that lets users detect and segment regions in images and then inpaint them.
No commits in the last 6 months.
Stars: 5
Forks: 1
Language: Jupyter Notebook
License: not specified
Category:
Last pushed: Oct 31, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Rishikesh-Jadhav/Multi-Modal-Image-Generation-using-Grounding-DINO-SAM-and-Stable-Diffusionv2"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
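The same request can be made from Python. The following is a minimal sketch using only the standard library, not an official client: the function names `quality_url` and `fetch_quality` are hypothetical, reading the `ml-frameworks` path segment as a category is an assumption, and the response is assumed to be JSON because the page does not document its format.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    Treating the first path segment as a "category" is an assumption
    based on the example URL, not documented behavior.
    """
    parts = (category, owner, repo)
    # Percent-encode each segment so unusual characters cannot break the path.
    return BASE_URL + "/" + "/".join(quote(part, safe="") for part in parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumes the body is JSON; adjust the parsing if the API returns
    something else.
    """
    with urlopen(quality_url(category, owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to perform the request.
    print(
        quality_url(
            "ml-frameworks",
            "Rishikesh-Jadhav",
            "Multi-Modal-Image-Generation-using-Grounding-DINO-SAM-and-Stable-Diffusionv2",
        )
    )
```

Calling `fetch_quality` with the same three arguments performs the network request and counts against the daily request limit.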
Higher-rated alternatives
sp-nitech/diffsptk
A differentiable version of SPTK
trigeorgis/mdm
A TensorFlow implementation of the Mnemonic Descent Method.
clovaai/fewshot-font-generation
The unified repository for few-shot font generation methods. This repository includes FUNIT...
clovaai/mxfont
Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font...
Michedev/DDPMs-Pytorch
Implementation of various DDPM papers to understand how they work