• Keine Ergebnisse gefunden

Distance-based Multimedia Indexing

N/A
N/A
Protected

Academic year: 2022

Aktie "Distance-based Multimedia Indexing"

Copied!
4
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Distance-based Multimedia Indexing

Christian Beecks Merih Seran Uysal Thomas Seidl Data Management and Data Exploration Group

RWTH Aachen University Germany

{beecks, uysal, seidl}@cs.rwth-aachen.de

Abstract:This tutorial aims at providing a unified and comprehensive overview of the state-of-the-art approaches to distance-based multimedia indexing.

1 Introduction

Concomitant with the explosive growth of the digital universe [GCM+08], an immensely increasing amount of multimedia data is generated, processed, and finally stored in very large multimedia databases. The rapid expansion of the internet and the extensive spread of mobile devices allow users to generate and share multimedia data everywhere and at any time. As a result, multimedia databases tend to grow continuously without any restriction and are thus no longer manually manageable by humans. Automatic approaches that allow for effective and efficient information access to massive multimedia databases become immensely important.

Multimedia retrieval approaches[LSDJ06] are one class of information access approaches that allow to manage and access multimedia databases with respect to the users’ informa- tion needs. These approaches deal with the representation, storage, organization of, and access to information items [BYRN11]. In fact, they can be thought of approaches al- lowing users tosearch,browse,explore, andanalyzemultimedia databases by means of similarity relations among multimedia objects.

One promising and widespread approach to define similarity between multimedia objects consists in automatically extracting inherent properties of multimedia objects and com- paring them with each other. For this purpose, the content-based properties of multime- dia objects are modeled by feature representations which are comparable by means of distance-based similarity measures. This class of similarity measures follows a rigorous mathematical interpretation [She57] and allows domain experts and database experts to ad- dress the issues of effectiveness and efficiency simultaneously and independently. In fact, it has become mandatory for current distance-based similarity measures to be indexable in order to facilitate large-scale applicability.

265

(2)

2 Tutorial Outline

This tutorial aims at providing a unified and comprehensive overview of the state-of-the- art approaches to distance-based multimedia indexing. We intend to cover a broad target audience starting from beginners to experts in the domain of multimedia databases.

The tutorial is structured into four parts as shown below:

• Object Representation – Feature Extraction – Feature Representations – Algebraic Properties

– Clustering-based Computation

• Fundamental Similarity Models – Similarity Measures – Dissimilarity Measures

• Efficient Query Processing – Similarity Queries – Lower-Bounding

• Indexing

– Spatial Indexing

– High-dimensional Indexing – Metric and Ptolemaic Indexing

3 About The Presenters

Christian Beecksis a postdoctoral researcher in the data management and data explo- ration group at RWTH Aachen University, Germany. His research interests include effi- cient content-based multimedia retrieval and exploration, adaptive distance-based similar- ity measures such as the Earth Mover’s Distance [RTG00], Signature Quadratic Form Dis- tance [BUS10], and Signature Matching Distance [BKS13], as well as metric and Ptole- maic indexing.

Merih Seran Uysalis a researcher in the data management and data exploration group at RWTH Aachen University, Germany. Her research interests include similarity search in multimedia databases and efficient query processing based on adaptive distance-based similarity measures.

266

(3)

Thomas Seidlis a professor for computer science and head of the data management and data exploration group at RWTH Aachen University, Germany. His research interests in- clude data mining and database technology for multimedia and spatio-temporal databases in engineering, communication and life science applications. Prof. Seidl received his Diplom (MSc) in 1992 from TU Muenchen and his PhD (1997) and venia legendi (2001) from LMU Muenchen.

4 Acknowledgments

This work is funded by DFG grant SE 1039/7-1. It is partly based on the work of Beecks [Bee13].

References

[Bee13] Christian Beecks. Distance-based similarity models for content-based multimedia re- trieval. PhD thesis, RWTH Aachen University, 2013.

[BKS13] Christian Beecks, Steffen Kirchhoff, and Thomas Seidl. Signature Matching Distance for Content-based Image Retrieval. InProceedings of the ACM International Confer- ence on Multimedia Retrieval, pages 41–48, 2013.

[BUS10] Christian Beecks, Merih Seran Uysal, and Thomas Seidl. Signature Quadratic Form Distance. InProceedings of the ACM International Conference on Image and Video Retrieval, pages 438–445, 2010.

[BYRN11] R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. Addison Wesley Professional, 2011.

[GCM+08] John F. Gantz, Christopher Chute, Alex Manfrediz, Stephen Minton, David Reinsel, Wolfgang Schlichting, and Anna Toncheva. The Diverse and Exploding Digital Uni- verse.IDC White Paper, 2, 2008.

[LSDJ06] Michael S. Lew, Nicu Sebe, Chabane Djeraba, and Ramesh Jain. Content-based mul- timedia information retrieval: State of the art and challenges. ACM Transactions on Multimedia Computing, Communications, and Applications, 2(1):1–19, 2006.

[RTG00] Yossi Rubner, Carlo Tomasi, and Leonidas J. Guibas. The Earth Mover’s Distance as a Metric for Image Retrieval.International Journal of Computer Vision, 40(2):99–121, 2000.

[She57] Roger N Shepard. Stimulus and response generalization: A stochastic model relating generalization to distance in psychological space.Psychometrika, 22:325–345, 1957.

267

(4)

Referenzen

ÄHNLICHE DOKUMENTE

Due to the fact that the project is accessing and processing distinct media in distinct languages, there is a need for a novel type of merging tool in order to combine the

That is exactly what this thesis pursued: By using together all the proposed techniques (e.g., using entropy impurity to select the best feature vectors to perform a simi- larity

( Große Anteile der im Netz verfügbaren Informationen sind kein Text und können nicht in einemA. Voll text index

recommendation of the RWTH Aachen CIO Advisory Board has decided to implement the following measures: establishing a holistic consultancy and support offer, a specification of

Starting from an original database of 60-dimensional feature vectors, we reduce them to different lower dimensional represen- tations using PCA (principle components analysis,

Based on the definition of Multimedia Object and Multimedia Data Unit (MDU), this paper focuses on the descriptive algorithm of generalization and aggregation, which are two kinds

Data management in Collaborative Research Centres (SFB) Service project INF – Information infrastructure.  Increased awareness for the responsible handling of research data in

• Weitere Gründe: „Empfehlung von Dritten“ (57 %), „Platzierung der RWTH Aachen in einem Hochschulranking“ (54 %), deutlich vor den Gründen „Nähe zum Heimatort“ (44 %)