Pages that link to "File:A demonstration of the pretraining tasks, including visual grounding, grounded captioning, image-text matching, image captioning, VQA, object detection, image infilling as well as text infilling.png"
The following pages link to File:A demonstration of the pretraining tasks, including visual grounding, grounded captioning, image-text matching, image captioning, VQA, object detection, image infilling as well as text infilling.png:
Displayed 2 items.
- Stable Diffusion: Image to Prompts (file link) (← links)
- Course:CPSC522/Stable Diffusion: Image to Prompts (file link) (← links)