CausalPrism: A Visual Analytics Approach for Subgroup-based Causal Heterogeneity Exploration
Abstract
In causal inference, estimating Heterogeneous Treatment Effects (HTEs) from observational data is critical for understanding how different subgroups respond to treatments, with broad applications such as precision medicine and targeted advertising. However, existing work on HTE, subgroup discovery, and causal visualization is insufficient to address two challenges: first, the sheer number of potential subgroups and the necessity to balance multiple objectives (\eghigh effects and low variances) pose a considerable analytical challenge. Second, effective subgroup analysis has to follow the analysis goal specified by users and provide causal results with verification. To this end, we propose a visual analytics approach for subgroup-based causal heterogeneity exploration. Specifically, we first formulate causal subgroup discovery as a constrained multi-objective optimization problem and adopt a heuristic genetic algorithm to learn the Pareto front of optimal subgroups described by interpretable rules. Combining with this model, we develop a prototype system, \sysname, that incorporates tabular visualization, multi-attribute rankings, and uncertainty plots to support users in interactively exploring and sorting subgroups and explaining treatment effects. Quantitative experiments validate that the proposed model can efficiently mine causal subgroups that outperform state-of-the-art HTE and subgroup discovery methods, and case studies and expert interviews demonstrate the effectiveness and usability of the system. Code is available at OSF.
keywords:
Causal inference, data heterogeneity, subgroup discovery, optimization, interpretability, visual analytics1671
\vgtccategoryResearch
\vgtcpapertypeAnalytics & Decisions
\authorfooter
J. Zhou, W. Zhang, X. Liu, J. Zhang, M. Zhu and W. Chen are with the State Key Lab of CAD&CG, Zhejiang University.
E-mail: {zhoujiehui, 22151190, liu_xingyu, 3200105799, minfeng_zhu, chenvis}@zju.edu.cn.
X. Wang is with TMCC, CS, Nankai University.
E-mail: [email protected].
KK. Wong is with Hong Kong University of Science and Technology and Georgia Institute of Technology.
E-mail: [email protected].
\teaser
\sysnamehelps identify, explore, rank, and interprete causal subgroups in observational data. (A) The Causal Subgroup View includes a tailored tabular visualization of subgroup descriptions, a subgroup editing window, and a ranking visualization of multiple evaluation metrics to support subgroup overview, modification, and ranking. (B) The Covariate Projection View reduces units with high-dimensional covariates to two dimensions, allowing users to analyze similarities between subgroups and assisting in merging and splitting subgroups. (C) The Treatment Effect Validation View consists of propensity score histograms, individual treatment effect dot plots, and detailed information of matched units, which helps users interpret effect strength and uncertainty, thereby increasing trust.