Rishikesh-Jadhav/Multi-Modal-Image-Generation-using-Grounding-DINO-SAM-and-Stable-Diffusionv2

A project to combine Grounding-DINO with Meta AI's Segment Anything Model (SAM) and Stable Diffusion for image manipulation using prompts. The plan is to integrate these techniques and deploy the model on Hugging Face with a Gradio interface for users to detect, segment regions and inpaint them in images.

/ 100

Experimental

No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 1 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Category

diffusion-model-frameworks

Last pushed

Oct 31, 2024

Commits (30d)

GitHub

Diffusion Model Frameworks · 78 frameworks

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Rishikesh-Jadhav/Multi-Modal-Image-Generation-using-Grounding-DINO-SAM-and-Stable-Diffusionv2"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

sp-nitech/diffsptk

A differentiable version of SPTK

trigeorgis/mdm

A TensorFlow implementation of the Mnemonic Descent Method.

clovaai/fewshot-font-generation

The unified repository for few-shot font generation methods. This repository includes FUNIT...

clovaai/mxfont

Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font...

Michedev/DDPMs-Pytorch

Implementation of various DDPM papers to understand how they work

Explore ML Frameworks

All categories Trending ML Framework directory Insights