Connecting Multi-modal Contrastive Representations

Zehan Wang1, Yang Zhao2, Xize Chen1, Haifeng Huang1, Jiageng Liu1, Li Tang1, Linjun Li1, Yongqi Wang1, Aoxiong Yin1, Ziang Zhang1, Zhou Zhao1,3

1Zhejiang University 2ByteDance 3Shanghai AI Laboratory
