Nothing — not the careful logic of mathematics, not statistical models and theories, not the awesome arithmetic power of modern computers — nothing can substitute here for the flexibility of the informed human mind... Accordingly, both [analysis] approaches and techniques need to be structured so as to facilitate human involvement and intervention. – John W. Tukey & Martin B. Wilk, Data Analysis & Statistics, 1966
The mission of the Interactive Data Lab is to enhance people's ability to understand and communicate data through the design of new interactive systems for data visualization and analysis. We study the perceptual, cognitive and social factors affecting data analysis in order to improve the efficiency and scale at which expert analysts work, and to lower barriers for non-experts.
Motivating questions include: How might we enable users to transform and integrate data with minimal programming? How can we support expressive and effective visualization designs? Can we build systems to query and visualize massive data sets at interactive rates? How might we enable domain experts to guide machine learning methods to produce better models?
Advances in computing and statistics provide new opportunities for data-driven discovery. However, breakthroughs in science and industry ultimately lie with the ability of empowered investigators to pursue questions, uncover domain-specific patterns, identify errors, and assess model outputs. Though voiced nearly 50 years ago, the sentiments of Tukey & Wilk ring true today: to facilitate effective human involvement at all stages of data analysis remains a grand challenge.
The UW Interactive Data Lab began its life as the Stanford Visualization Group, founded in the late 1990s by Prof. Pat Hanrahan. Early Stanford projects included the Polaris system, now commercialized as Tableau Software.
In 2009, Prof. Jeffrey Heer joined the Stanford faculty and assumed leadership of the group. Notable projects include popular visualization systems such as D3.js and Protovis (led by Ph.D. student Mike Bostock), new text visualization and language translation techniques (with the Stanford NLP Group), and interactive data transformation tools such as Data Wrangler (led by Ph.D. student Sean Kandel, with a cast of characters including Prof. Joe Hellerstein at UC Berkeley). Data Wrangler and related work are now being commercialized by our start-up company Trifacta.
In 2013, Prof. Heer and team moved to the University of Washington to become the UW Interactive Data Lab. The new name is meant to better reflect our focus on supporting the full life-cycle of data science work. In 2015, Prof. Jessica Hullman from the UW iSchool joined the group.
Our ongoing projects include new languages, theoretical models, exploratory analysis tools, and design tools for interactive visualization, perceptual experiments to assess how visualizations work, and visual analysis systems for domains ranging from large-scale text analysis to population genomics.