WikiMuTe: A web-sourced dataset of semantic descriptions for music audio

Benno Weck, Holger Kirchhoff, Peter Grosche, Xavier Serra

PDTW150K: A Dataset for Patent Drawing Retrieval

Chan-Ming Hsu, Tse-Hung Lin, Yu-Hsien Chen, Chih-Yi Chiu

Interactive Question Answering for Multimodal Lifelog Retrieval

Ly-Duyen Tran, Liting Zhou, Binh Nguyen, Cathal Gurrin

Event Recognition in Laparoscopic Gynecology Videos with Hybrid Transformer

Sahar Nasirihaghighi, Negin Ghamsarian, Heinrich Husslein, Klaus Schoeffmann

GreenScreen: A Multimodal Dataset for Detecting Corporate Greenwashing in the Wild

Ujjwal Sharma, Stevan Rudinac, Joris Demmers, Willemijn van Dolen, Marcel Worring