Large Scale Cross-media Data Retrieval based on Hadoop

Authors

  • Wenchen Cheng Beijing University of Posts and Telecommunications image/svg+xml
  • Jiang Qian Beijing University of Posts and Telecommunications image/svg+xml
  • Zhicheng Zhao Beijing University of Posts and Telecommunications image/svg+xml
  • Fei Su Beijing University of Posts and Telecommunications image/svg+xml

DOI:

https://doi.org/10.4108/eai.19-8-2015.2260108

Keywords:

cross-media, image retrieval, hadoop, mapreudce

Abstract

With the rapid development of the Internet and speedy increase of the data size, there are more and more data intensive applications which often involve hundreds of megabytes of data. It is important and necessary to obtain the retrieval results from cross-media data quickly and accurately. Large scale cross-media data retrieval based on Hadoop is proposed to speed up the retrieval in this paper. We divide cross-media feature extraction and cross-media retrieval into paralleled pipeline, and implement with the combination of the HDFS, HBase and MapReduce framework. To verify the performance of the proposed method, comparisons with stand-alone mode on different sizes of the image dataset are conducted, and the experimental results demonstrate the good performances of proposed method, which sharply decreases time-consuming, and meanwhile keeps the same query precision.

Downloads

Published

08-09-2015

How to Cite

[1]
W. Cheng, J. Qian, Z. Zhao, and F. Su, “Large Scale Cross-media Data Retrieval based on Hadoop”, EAI Endorsed Trans Cloud Sys, vol. 1, no. 2, p. e5, Sep. 2015.

Funding data