sigmoid.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A social space for people researching, working with, or just interested in AI!

Server stats:

723
active users

Fahim Farook

"Guiding Text-to-Image Diffusion Model Towards Grounded Generation. (arXiv:2301.05221v1 [cs.CV])" — Enhance a pre-trained text-to-image diffusion model to simultaneously generate images and segmentation masks for the corresponding visual entities described in the text prompt.

Paper: arxiv.org/abs/2301.05221
Code: No code in linked repo (yet)

<<Find this useful? Please boost so that others can benefit too 🙂>>