• Keine Ergebnisse gefunden

Algorithm engineering for big data

N/A
N/A
Protected

Academic year: 2022

Aktie "Algorithm engineering for big data"

Copied!
1
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Algorithm Engineering for Big Data

Peter Sanders sanders@kit.edu

Abstract:Perhaps the most fundamental challenge implied by advanced applications of big data sets is how to perform the vast amount of required computations sufficiently efficiently. Efficient algorithms are at the heart of this question. But how can we obtain innovative algorithmic solutions for demanding application problems with exploding input sizes using complex modern hardware and advanced algorithmic techniques?

This tutorial gives examples how the methodology of algorithm engineering can be applied here. Examples include sorting, main memory based data bases, communicati- on efficient algorithms, particle tracking at CERN LHC, 4D image processing, parallel graph algorithms, and full text indexing. Compared to a previous tutorial in Koblenz 2013 with the same title, this tutorial talks less about methodology and more about ac- tual algorithms and applications. For further reading refer to [San13] and, for selected individual results to [DS03, KS07, SSP07, MS08, San09, RSS10, SS12, DS13].

Literatur

[DS03] R. Dementiev und P. Sanders. Asynchronous Parallel Disk Sorting. In15th ACM Sympo- sium on Parallelism in Algorithms and Architectures, Seiten 138–148, San Diego, 2003.

[DS13] Jonathan Dees und Peter Sanders. Efficient Many-Core Query Execution in Main Memory Column-Stores. In29th IEEE Conference on Data Engineering, 2013.

[KS07] F. Kulla und P. Sanders. Scalable Parallel Suffix Array Construction.Parallel Computing, 33:605–612, 2007. Special issue on Euro PVM/MPI 2006, distinguished paper.

[MS08] K. Mehlhorn und P. Sanders. Algorithms and Data Structures — The Basic Toolbox.

Springer, 2008.

[RSS10] M. Rahn, P. Sanders und J. Singler. Scalable Distributed-Memory External Sorting. In 26th IEEE International Conference on Data Engineering, Seiten 685–688, 2010.

[San09] P. Sanders. Algorithm Engineering – An Attempt at a Definition. InEfficient Algorithms, Jgg. 5760 ofLNCS, Seiten 321–340. Springer, 2009.

[San13] Peter Sanders. Engineering Algorithms for Large Data Sets. In39th Conf. on Current Trends in Theory and Practice of Computer Science (SOFSEM), Jgg. 7741 ofLNCS, Seiten 29–32. Springer, 2013. invited talk.

[SS12] Peter Sanders und Christian Schulz. Distributed Evolutionary Graph Partitioning. In ALENEX 2012, Seiten 16–29. SIAM, 2012.

[SSP07] J. Singler, P. Sanders und F. Putze. MCSTL: The Multi-core Standard Template Library.

In13th Euro-Par, Jgg. 4641 ofLNCS, Seiten 682–694. Springer, 2007.

57

Referenzen

ÄHNLICHE DOKUMENTE

The hybrid linear solver PSPIKE is one instance of these new classes of solvers [162] which need solutions of efficient parallel graph algorithms for the bipartite graph matching,

Some of the important patterns suitable for execution on the Device are:.. • Map: The Map [MRR12] pattern can be seen as a fully unrolled loop, where each iteration of the loop

Comparing the performance of ICC (with α ∗ = 0.475) and GMC with respect to intra-cluster conductance suggests that ICC is much superior to GMC. Actually, the values obtained by ICC

The resulting representation(s) can be used to construct a canonical labeling for G as in the approach of Corollary 4. 2 , where it is also described how disconnected graphs can

Viele Probleme lassen sich elegant auf Graphen abbilden.. Materialwissenschaften,

A possible way to overcome this impediment is to design data reduction algorithms which only look at a small part of the data (which fits into the main memory) at any given moment

The  Vienna  University  of  Technology  is  among  the  most  successful  technical  universities  in  Europe,  and  is  Austria’s  largest 

In order to attain the desired error probability of 10 − 6 , we simply start by looking at what can be achieved via straight forward use of the following codes: repetition