• Keine Ergebnisse gefunden

Exploring and Visualizing the History of InfoVis

N/A
N/A
Protected

Academic year: 2022

Aktie "Exploring and Visualizing the History of InfoVis"

Copied!
2
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Exploring and Visualizing the History of InfoVis

Daniel A. Keim, Helmut Barro, Christian Panse, J¨orn Schneidewind, Mike Sips University of Konstanz, Germany

{keim,barro,panse,schneide,sips}@inf.uni-konstanz.de

1 Motivation

The exploration and visualization of large information spaces is a challenging task. The provided contest data set for example contains more than 1000 authors of about 600 papers. The basic idea for an effective data exploration is to include the human in the data exploration process and com- bine the flexibility, creativity, and general knowledge of the human with the enormous storage capacity and the compu- tational power of today’s computers. The key concept of our visualization approach is to visualize the whole dataset to provide a first overview and to provide rapid, incremental, and reversible analysis actions. Our techniques follow the well-known Information Seeking Mantra: overviews first, zoom and filter, and details on demand. All visualizations, actions, and details are tightly coupled using the well-known linking and brushing concepts.

2 Data Cleaning

When processing and visualizing large data sets, data clean- ing as part of data pre-processing is a very important step, since it directly influences the quality of the visualization.

Since there where some inconsistencies in the contest data set, like ambiguous authors or different formats and spellings for the conference names, some data cleaning was necessary.

Therefore we wrote some shell scripts, based on regular ex- pressions, to correct these inconsistencies. Additionally we corrected the spelling of some author names manually. An- other problem was, that for several attributes no values were recorded. An example are the keyword attributes, were for a lot of publications the keywords were missing. For these publications we extracted some keywords from their title.

For other missing attributes we set their value to not defined and handled it as special cases in the visualization step.

3 Overall Concept

To find solutions for the 3 contest tasks, we applied different visualization approaches. One technique we used, is adapted from a popular visualization tool for movie data sets, called

Acknowledgement: This work was partially funded by the Information Society Technologies programme of the European Commission, Future and Emerging Technologies under the IST-2001-33058 PANDA project (2001- 2004).

FilmFinder [Ahlberg and Shneiderman 1994]. We applied a similar technique to the contest database and called our technique PaperFinder. The basic idea is to use a two di- mensional layout, e.g. the euclidian plane and use the time attribute for the x-axis ordering and the dependent attribute for the y-axis ordering. Applied to the contest data set, the idea is to group the 600 papers and their authors together in a first overview. The papers are displayed in groups belong- ing to the publication year. Each paper is represented by a colored icon (rectangle). The color represents the thematic theme of each paper. Our results show that this technique is very useful to get insight in the development of research topics over time as well as over the number and research ar- eas of each authors publications. To show the connection and collaborations of authors we used a graph drawing ap- proach. If we assume, that the authors are the nodes of the graph and edges represent paper collaborations, the goal is to find strong connected components (SCC) in order to iden- tify groups of authors, which published most papers together.

Then we employed a spring embedder to find an appropriate graph layout for the computed SCC’s. To visualize all co- authors for a single author over time, we used a technique which is based on Interrings [Yang et al. 2002],

4 Overview of the 10 years of InfoVis

To show the overview over InfoVis we displayed the number of publications belonging to each conference over time, us- ing the PaperFinder. Additionally we visualized an ranking of authors, depending on their number of publications over the years. The colors represent the research topics, so that it is easy to see how the research areas developed. To show the publications for a single author and his co-authors, we ap- plied the Interring technique and the PaperFinder technique.

4.1 TASK 1: Static Overview of 10 years of In- fovis

To get an overview on the last 10 years of InfoVis, we used the PaperFinder. To see the development of topics over the years, we placed the time attribute on the x-axis and the number of co-authors on the y-axis. The color represents the paper categories as shown in the legend. We definied 5 categories to which we assigned all publications, depending on their keywords. The categories are Information Visual- ization, HCI, Data Analysis, Computer Graphics and Graph First publ. in: IEEE Symposium on Information Visualization 2004 (InfoVis04) Contest Poster (2nd place award), Austin, Texas, USA, May/jun, 2004

Konstanzer Online-Publikations-System (KOPS) URL: http://www.ub.uni-konstanz.de/kops/volltexte/2008/6957/

URN: http://nbn-resolving.de/urn:nbn:de:bsz:352-opus-69572

(2)

Drawing. Figure 2 shows the development of these cate- gories and the number of publications per conference over the last years.

Figure 1: Top 30 authors, based on number of publications

4.2 TASK 2: Characterize the research areas and their evolution

Figure 2 shows how the research areas developed over time.

For example, it turns out that the number of papers submitted under HCI topic decreased while the number of Visualiza- tion papers increased. It is easy to see, that there were more and more publications from other research topics than visu- alization, published at the InfoVis Conference (more yellow, green boxes). For last years InfoVis, most keywords were missing in the database (white rectangles).

Figure 2: Complete Overview of publications per confer- ence, Conferences are ranked by number of publications, color represents research topic

4.3 TASK 3: The people in InfoVis

To visualize the information about the people contained in the contest data set, we used the PaperFinder and the Inter- rings to visualize the 30 authors with most publications. As shown in Figure 1, B. Shneiderman has the highest num- ber of publications (more then 18), followed by S. K. Card (12), J. D. Mackinlay (10) and D. A. Keim (10). The col- ors show the categories to which the publications belong. To show the publications of a single author we used the interring technique. The examples in Figure 3 show the co-authors of Daniel A. Keim (left) and George Robertson (right). Each co-author is represented by a different color. As shown in the right figure, G. Robertson wrote most of his papers to- gether with S. K.Card and J. D. Mackinlay. In Figure 4 the co-authors of publications from G. Robertson and their re- search topics are visualized using PaperFinder.

Figure 3: Interring showing co-authors of D. A. Keim (left) and G. Robertson (right) over the years

Figure 4: Co-authors of G. Robertson and their publication topics over the years

References

AHLBERG, C.,ANDSHNEIDERMAN, B. 1994. Visual information seeking using the filmfinder. In ACM CHI’94 Conference Companion, 433–434.

YANG, J., WARD, O. M.,ANDRUNDENSTEINER, E. A. 2002. Interring:

An interactive tool for visually navigating and manipulating hierarchi- cal structures. In Proceedings of the IEEE Symposium on Information Visualization, 77.

Referenzen

ÄHNLICHE DOKUMENTE

The issue of whether a new entrant into mobile markets, such as Hutchinson 3G, is likely to enjoy significant market power in setting termination rates then reduces to assessing

University Press of Maryland. Huang Dongfu 黃冬富. "Cong Shengzhan kan guangfu yihou Taiwan jiaocaihua zhi fazhan 從 省展看光復以後台灣膠彩畫之發展 " [The

Show that, as a consequence of the boundary condition, the energies are quantized and give an expression for the

There exists an algorithm to solve Problem (ClassSet) for definite orders with factored discriminant over a fixed field F which runs in probabilistic polynomial time in the

For instance, it would be extremely interest- ing to observe, with a History of Ideas approach, why in the art and literature of the 15th Century, especially in Nordic countries,

In the Southern Ocean the target organism is krill (Euphausia superba), its fluctuations in biomass standing stocks in relation to ocean circulation and sea ice dynamics,

To study the sediment structures, a special survey system is installed on board of Polarstern, the sub-bottom profiler Parasound, which transmits special sonar signals to the

As for the conductivity sensor, the result of calibration shows that a set of coefficient for the conversion from the frequency to the conductivity decided at the time of the