UW Interactive Data Lab
IDL logo

Paths Explored, Paths Omitted, Paths Obscured: Decision Points & Selective Reporting in End-to-End Data Analysis

Yang Liu, Tim Althoff, Jeffrey Heer. ACM Human Factors in Computing Systems (CHI), 2020
Figure for Paths Explored, Paths Omitted, Paths Obscured: Decision Points & Selective Reporting in End-to-End Data Analysis
Analytic Decision Graph representing a controlled experiment to investigate the impact of web design on reading performance. At several steps, the analyst revised her analytic decisions based on end results and reviewer feedback, for instance merging two levels of an IV because effect sizes were similar. While she examined model specification options thoroughly, she appeared to place less emphasis on inference decisions such as choosing which significance test to use.
Materials
Abstract
Drawing reliable inferences from data involves many, sometimes arbitrary, decisions across phases of data collection, wrangling, and modeling. As different choices can lead to diverging conclusions, understanding how researchers make analytic decisions is important for supporting robust and replicable analysis. In this study, we pore over nine published research studies and conduct semi-structured interviews with their authors. We observe that researchers often base their decisions on methodological or theoretical concerns, but subject to constraints arising from the data, expertise, or perceived interpretability. We confirm that researchers may experiment with choices in search of desirable results, but also identify other reasons why researchers explore alternatives yet omit findings. In concert with our interviews, we also contribute visualizations for communicating decision processes throughout an analysis. Based on our results, we identify design opportunities for strengthening end-to-end analysis, for instance via tracking and meta-analysis of multiple decision paths.
BibTeX
@inproceedings{2020-analysis-decision-points,
  title = {Paths Explored, Paths Omitted, Paths Obscured: Decision Points \& Selective Reporting in End-to-End Data Analysis},
  author = {Liu, Yang AND Althoff, Tim AND Heer, Jeffrey},
  booktitle = {ACM Human Factors in Computing Systems (CHI)},
  year = {2020},
  url = {https://uwdata.github.io/papers/analysis-decision-points},
  doi = {10.1145/3313831.3376533}
}