• Keine Ergebnisse gefunden

Task 1: Searching with Lucene (practical)

N/A
N/A
Protected

Academic year: 2022

Aktie "Task 1: Searching with Lucene (practical)"

Copied!
2
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Multimedia Retrieval –HS 2019

Task 1: Searching with Lucene (practical)

In this exercise, we use Lucene and its fuzzy retrieval model to search for music files. The web site of the course contains a list of file names, but you can also use your own music library.

Exercise 3: Text and Web Retrieval Deadline: 5.11.2019

Exercise-3-1

▪ Download Lucene from Apache. Choose the programming language that fits you the best.

▪ Write a program to read the MP3 file names, create the index, and search for the titles that match your query. You can also use RAMDirectory for a fast implementation (but you need to build the index every time again)

▪ Extend the basic search with an implementation of the "Did you mean?"

function that Google provides. If the query contains spelling mistakes (or is seldom), automatically search with the closest matches of the terms used.

▪ Hint: Consider using the SpellChecker of Lucene

(2)

Multimedia Retrieval –HS 2019

Task 2: Hubs, Authorities, SALSA und PageRank (theoretical)

The following sub-graph of the Internet is given:

In this task, we order the nodes by their hub, authority, and PageRank values Exercise 3: Text and Web Retrieval Deadline: 5.11.2019

Exercise-3-2

a) We have defined matrices 𝐌 and 𝐀 for the iterations. In this sub task we use the original HITS algorithm:

Compute the matrices for the example graph.

b) Write a small program (e.g., with MATLAB, but also works with Excel) that evaluates the fix-point iteration to obtain all results.

c) For the example graph, determine the best hubs, authorities, and the documents with high PageRanks.

d) Apply the SALSA algorithm to the example graph. Does the order change compared to the original HITS algorithm?

2

5

6 10

7

4

11

8 3

1

9

12

𝒓(𝑡+1) =1 − 𝛼

𝑁 ∙ 𝟏 + 𝛼 ∙ 𝐌 ∙ 𝒓 𝑡(𝑡+1) = 𝐀 ∙ 𝑎 𝑡

𝑎 𝑡+1 = 𝐀∙ ℎ(𝑡)

Referenzen

ÄHNLICHE DOKUMENTE

Although the programming style required by object algebras (and retroactive interface implementations) is simi- lar to the use of object-oriented encodings of type classes, the

Although a mathematical insider of such methods would be able to produce lower bounds on the GHZ- type entanglement in some generic cases the problem for the practitioner has

Even if the political and military leadership in Pakistan is willing to give FATA and select settled districts of KP in a platter to the Taliban, Mullah Fazlullah is unlikely to

The selections were applied to events simulated by Monte Carlo for MSSM Higgs bosons of different masses and the efficiencies are observed to have a similar shape across categories

The selections were applied to events simulated by Monte Carlo for MSSM Higgs bosons of different masses and the efficiencies are observed to have a similar shape across categories

• At the same program point, typically different addresses are accessed ... • Storing at an unknown address destroys all information

devices or control units, be sure you have the records showing how everything should be reconnected. Follow the instructions in step 3 when disconnecting double baluns.

= Digit Symbol subtest of the German version of the Wechsler Adult Intelligence Scale; Digit Span = Digit Span subtest of the German version of the Wechsler Adult Intelligence