Connecting Multi-modal Contrastive Representations

Zehan Wang1, Yang Zhao2, Xize Chen1, Haifeng Huang1, Jiageng Liu1, Li Tang1, Linjun Li1, Yongqi Wang1, Aoxiong Yin1, Ziang Zhang1, Zhou Zhao1,3,

1Zhejiang University 2ByteDance 3Shanghai AI Laboratory

[paper][github]
Select a Task
Select an Audio
(click audio)
Bell
Sewing Machine
Racing Car
Cartoon Truck
Tractor
Air Blower
Cello
Dog
Scratch
Cat
Bird
Popcorn
Saxophone
Chorus
Explosion
Cello
Puppy
Bird
Cartoon Sheep
Bird
Electronic Organ
See the Results