Guiding the exploration of scatter plot data using motif-based interest measures

Lin Shao, Timo Schleicher, Michael Behrisch, Tobias Schreck, Ivan Sipiran, Daniel A. Keim

Research output: Contribution to journalArticlepeer-review

21 Scopus citations

Abstract

Finding interesting patterns in large scatter plot spaces is a challenging problem and becomes even more difficult with increasing number of dimensions. Previous approaches for exploring large scatter plot spaces like e.g., the well-known Scagnostics approach, mainly focus on ranking scatter plots based on their global properties. However, often local patterns contribute significantly to the interestingness of a scatter plot. We are proposing a novel approach for the automatic determination of interesting views in scatter plot spaces based on analysis of local scatter plot segments. Specifically, we automatically classify similar local scatter plot segments, which we call scatter plot motifs. Inspired by the well-known tf×idf-approach from information retrieval, we compute local and global quality measures based on frequency properties of the local motifs. We show how we can use these to filter, rank and compare scatter plots and their incorporated motifs. We demonstrate the usefulness of our approach with synthetic and real-world data sets and showcase our data exploration tools that visualize the distribution of local scatter plot motifs in relation to a large overall scatter plot space.
Original languageSpanish
Pages (from-to)1-12
Number of pages12
JournalJournal of Visual Languages and Computing
Volume36
StatePublished - 1 Oct 2016

Cite this