Yau-Meng Wong

CS @ Columbia University

CLIP2CLAP: Joint Embedding Space Alignment for Image to Audio Generation


