Correlation-based deep learning for multimedia semantic concept detection Conference

Ha, HY, Yang, Y, Pouyanfar, S et al. (2015). Correlation-based deep learning for multimedia semantic concept detection . EURO-PAR 2011 PARALLEL PROCESSING, PT 1, 9419 473-487. 10.1007/978-3-319-26187-4_43

cited authors

  • Ha, HY; Yang, Y; Pouyanfar, S; Tian, H; Chen, SC


  • Nowadays, concept detection from multimedia data is considered as an emerging topic due to its applicability to various applications in both academia and industry. However, there are some inevitable challenges including the high volume and variety of multimedia data as well as its skewed distribution. To cope with these challenges, in this paper, a novel framework is proposed to integrate two correlation-based methods, Feature-Correlation Maximum Spanning Tree (FC-MST) and Negative-based Sampling (NS), with a well-known deep learning algorithm called Convolutional Neural Network (CNN). First, FC-MST is introduced to select the most relevant low-level features, which are extracted from multiple modalities, and to decide the input layer dimension of the CNN. Second, NS is adopted to improve the batch sampling in the CNN. Using NUS-WIDE image data set as a web-based application, the experimental results demonstrate the effectiveness of the proposed framework for semantic concept detection, comparing to other well-known classifiers.

publication date

  • January 1, 2015

published in

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 13

start page

  • 473

end page

  • 487


  • 9419