Visual analytics

Visual analytics is an outgrowth of the fields of information visualization and scientific visualization that focuses on analytical reasoning facilitated by interactive visual interfaces.

Overview
Visual analytics is "the science of analytical reasoning facilitated by interactive visual interfaces." It can attack certain problems whose size, complexity, and need for closely coupled human and machine analysis may make them otherwise intractable. Visual analytics advances science and technology developments in analytical reasoning, interaction, data transformations and representations for computation and visualization, analytic reporting, and technology transition. As a research agenda, visual analytics brings together several scientific and technical communities from computer science, information visualization, cognitive and perceptual sciences, interactive design, graphic design, and social sciences.

Visual analytics integrates new computational and theory-based tools with innovative interactive techniques and visual representations to enable human-information discourse. The design of the tools and techniques is based on cognitive, design, and perceptual principles. This science of analytical reasoning provides the reasoning framework upon which one can build both strategic and tactical visual analytics technologies for threat analysis, prevention, and response. Analytical reasoning is central to the analyst’s task of applying human judgments to reach conclusions from a combination of evidence and assumptions.

Visual analytics has some overlapping goals and techniques with information visualization and scientific visualization. There is currently no clear consensus on the boundaries between these fields, but broadly speaking the three areas can be distinguished as follows:


 * Scientific visualization deals with data that has a natural geometric structure (e.g., MRI data, wind flows).
 * Information visualization handles abstract data structures such as trees or graphs.
 * Visual analytics is especially concerned with coupling interactive visual representations with underlying analytical processes (e.g., statistical procedures, data mining techniques) such that high-level, complex activities can be effectively performed (e.g., sense making, reasoning, decision making).

Visual analytics seeks to marry techniques from information visualization with techniques from computational transformation and analysis of data. Information visualization forms part of the direct interface between user and machine, amplifying human cognitive capabilities in six basic ways:


 * 1) by increasing cognitive resources, such as by using a visual resource to expand human working memory,
 * 2) by reducing search, such as by representing a large amount of data in a small space,
 * 3) by enhancing the recognition of patterns, such as when information is organized in space by its time relationships,
 * 4) by supporting the easy perceptual inference of relationships that are otherwise more difficult to induce,
 * 5) by perceptual monitoring of a large number of potential events, and
 * 6) by providing a manipulable medium that, unlike static diagrams, enables the exploration of a space of parameter values

These capabilities of information visualization, combined with computational data analysis, can be applied to analytic reasoning to support the sense-making process.

Scope
Visual analytics is a multidisciplinary field that includes the following focus areas:


 * Analytical reasoning techniques that enable users to obtain deep insights that directly support assessment, planning, and decision making
 * Data representations and transformations that convert all types of conflicting and dynamic data in ways that support visualization and analysis
 * Techniques to support production, presentation, and dissemination of the results of an analysis to communicate information in the appropriate context to a variety of audiences.
 * Visual representations and interaction techniques that take advantage of the human eye’s broad bandwidth pathway into the mind to allow users to see, explore, and understand large amounts of information at once.

Analytical reasoning techniques
Analytical reasoning techniques are the method by which users obtain deep insights that directly support situation assessment, planning, and decision making. Visual analytics must facilitate high-quality human judgment with a limited investment of the analysts’ time. Visual analytics tools must enable diverse analytical tasks such as:


 * Understanding past and present situations quickly, as well as the trends and events that have produced current conditions
 * Identifying possible alternative futures and their warning signs
 * Monitoring current events for emergence of warning signs as well as unexpected events
 * Determining indicators of the intent of an action or an individual
 * Supporting the decision maker in times of crisis.

These tasks will be conducted through a combination of individual and collaborative analysis, often under extreme time pressure. Visual analytics must enable hypothesis-based and scenario-based analytical techniques, providing support for the analyst to reason based on the available evidence.

Data representations
Data representations are structured forms suitable for computer-based transformations. These structures must exist in the original data or be derivable from the data themselves. They must retain the information and knowledge content and the related context within the original data to the greatest degree possible. The structures of underlying data representations are generally neither accessible nor intuitive to the user of the visual analytics tool. They are frequently more complex in nature than the original data and are not necessarily smaller in size than the original data. The structures of the data representations may contain hundreds or thousands of dimensions and be unintelligible to a person, but they must be transformable into lower-dimensional representations for visualization and analysis.

Theories of visualization
Theories of visualization include:
 * Jacques Bertin's Semiology of Graphics (1967)
 * Nelson Goodman's Languages of Art (1977)
 * Jock D. Mackinlay's Automated design of optimal visualization (APT) (1986)
 * Leland Wilkinson's Grammar of Graphics (1998)

Visual representations
Visual representations translate data into a visible form that highlights important features, including commonalities and anomalies. These visual representations make it easy for users to perceive salient aspects of their data quickly. Augmenting the cognitive reasoning process with perceptual reasoning through visual representations permits the analytical reasoning process to become faster and more focused.

Process
The input for the data sets used in the visual analytics process are heterogeneous data sources (i.e., the internet, newspapers, books, scientific experiments, expert systems). From these rich sources, the data sets S = S1, ..., Sm are chosen, whereas each Si, i ∈ (1, ..., m) consists of attributes Ai1, ..., Aik. The goal or output of the process is insight I. Insight is either directly obtained from the set of created visualizations V or through confirmation of hypotheses H as the results of automated analysis methods. This formalization of the visual analytics process is illustrated in the following figure. Arrows represent the transitions from one set to another one.

More formally the visual analytics process is a transformation F: S → I, whereas F is a concatenation of functions f ∈ {DW, VX, HY, UZ} defined as follows:

DW describes the basic data pre-processing functionality with DW : S → S and W ∈ {T, C, SL, I} including data transformation functions DT, data cleaning functions DC, data selection functions DSL and data integration functions DI that are needed to make analysis functions applicable to the data set.

VW, W ∈ {S, H} symbolizes the visualization functions, which are either functions visualizing data VS : S → V or functions visualizing hypotheses VH : H → V.

HY, Y ∈ {S, V} represents the hypotheses generation process. We distinguish between functions that generate hypotheses from data HS : S → H and functions that generate hypotheses from visualizations HV : V → H.

Moreover, user interactions UZ, Z ∈ {V, H, CV, CH} are an integral part of the visual analytics process. User interactions can either effect only visualizations UV : V → V (i.e., selecting or zooming), or can effect only hypotheses UH : H → H by generating a new hypotheses from given ones. Furthermore, insight can be concluded from visualizations UCV : V → I or from hypotheses UCH : H → I.

The typical data pre-processing applying data cleaning, data integration and data transformation functions is defined as DP = DT(DI(DC(S1, ..., Sn))). After the pre-processing step either automated analysis methods HS = {fs1, ..., fsq} (i.e., statistics, data mining, etc.) or visualization methods VS : S → V, VS = {fv1, ..., fvs} are applied to the data, in order to reveal patterns as shown in the figure above.

In general the following paradigm is used to process the data:

Analyse First – Show the Important – Zoom, Filter and Analyse Further – Details on Demand

Related subjects

 * Cartography
 * Computational visualistics
 * Critical thinking
 * Decision-making
 * Google Analytics
 * Interaction design
 * Interactive visual analysis
 * Interactivity
 * Social network analysis software
 * Software visualization
 * Starlight Information Visualization System
 * Text analytics
 * Traffic analysis
 * Visual reasoning

Related scientists

 * Cecilia R. Aragon
 * Robert E. Horn
 * Daniel A. Keim
 * Theresa-Marie Rhyne
 * Lawrence J. Rosenblum
 * Ben Shneiderman
 * John Stasko
 * Jim Thomas

Related software

 * imc FAMOS (1987), graphical data analysis