Publications | Jiehui Zhou (周杰辉)

2024

CURLS: Causal Rule Learning for Subgroups with Significant Treatment Effect

Jiehui Zhou, Linxiao Yang, Xinyu Liu, and 3 more authors

ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Abs Bib HTML PDF Code Slides

In causal inference, estimating heterogeneous treatment effects (HTE) is critical for identifying how different subgroups respond to interventions, with broad applications in fields such as precision medicine and personalized advertising. Although HTE estimation methods aim to improve accuracy, how to provide explicit subgroup descriptions remains unclear, hindering data interpretation and strategic intervention management. In this paper, we propose CURLS, a novel rule learning method leveraging HTE, which can effectively describe subgroups with significant treatment effects. Specifically, we frame causal rule learning as a discrete optimization problem, finely balancing treatment effect with variance and considering the rule interpretability. We design an iterative procedure based on the minorize-maximization algorithm and solve a submodular lower bound as an approximation for the original. Quantitative experiments and qualitative case studies verify that compared with state-of-the-art methods, CURLS can find subgroups where the estimated and true effects are 16.1% and 13.8% higher and the variance is 12.0% smaller, while maintaining similar or better estimation accuracy and rule interpretability. Code is available at https://osf.io/zwp2k/.
@article{zhou2024curls, title = {CURLS: Causal Rule Learning for Subgroups with Significant Treatment Effect}, author = {Zhou, Jiehui and Yang, Linxiao and Liu, Xinyu and Gu, Xinyue and Sun, Liang and Chen, Wei}, journal = {ACM SIGKDD Conference on Knowledge Discovery and Data Mining}, year = {2024}, publisher = {ACM}, }
CausalPrism: A Visual Analytics Approach for Subgroup-based Causal Heterogeneity Exploration

Jiehui Zhou, Xumeng Wang, Kam-Kwai Wong, and 5 more authors

arxiv, 2024

Abs Bib HTML PDF Video

In causal inference, estimating Heterogeneous Treatment Effects (HTEs) from observational data is critical for understanding how different subgroups respond to treatments, with broad applications such as precision medicine and targeted advertising. However, existing work on HTE, subgroup discovery, and causal visualization is insufficient to address two challenges: first, the sheer number of potential subgroups and the necessity to balance multiple objectives (e.g., high effects and low variances) pose a considerable analytical challenge. Second, effective subgroup analysis has to follow the analysis goal specified by users and provide causal results with verification. To this end, we propose a visual analytics approach for subgroup-based causal heterogeneity exploration. Specifically, we first formulate causal subgroup discovery as a constrained multi-objective optimization problem and adopt a heuristic genetic algorithm to learn the Pareto front of optimal subgroups described by interpretable rules. Combining with this model, we develop a prototype system, CausalPrism, that incorporates tabular visualization, multi-attribute rankings, and uncertainty plots to support users in interactively exploring and sorting subgroups and explaining treatment effects. Quantitative experiments validate that the proposed model can efficiently mine causal subgroups that outperform state-of-the-art HTE and subgroup discovery methods, and case studies and expert interviews demonstrate the effectiveness and usability of the system. Code is available at https://osf.io/jaqmf/?view_only=ac9575209945476b955bf829c85196e9.
@article{zhou2024causalprism, title = {CausalPrism: A Visual Analytics Approach for Subgroup-based Causal Heterogeneity Exploration}, author = {Zhou, Jiehui and Wang, Xumeng and Wong, Kam-Kwai and Zhang, Wei and Liu, Xinyu and Zhang, Juntian and Zhu, Minfeng and Chen, Wei}, year = {2024}, }
AVA: An automated and AI-driven intelligent visual analytics framework

Jiazhe Wang, Xi Li, Chenlu Li, and 11 more authors

Visual Informatics, 2024

Abs Bib HTML PDF

With the incredible growth of the scale and complexity of datasets, creating proper visualizations for users becomes more and more challenging in large datasets. Though several visualization recommendation systems have been proposed, so far, the lack of practical engineering inputs is still a major concern regarding the usage of visualization recommendations in the industry. In this paper, we proposed AVA, an open-sourced web-based framework for Automated Visual Analytics. AVA contains both empiric-driven and insight-driven visualization recommendation methods to meet the demands of creating aesthetic visualizations and understanding expressible insights respectively. The code is available at https://github.com/antvis/AVA.
@article{zhou2024ava, title = {AVA: An automated and AI-driven intelligent visual analytics framework}, author = {Wang, Jiazhe and Li, Xi and Li, Chenlu and Peng, Di and Wang, Zeyu and Gu, Yuhui and Lai, Xingui and Zhang, Haifeng and Xu, Xinyue and Dong, Xiaoqing and Lin, Zhifeng and Zhou, Jiehui and Liu, Xingyu and Chen, Wei}, journal = {Visual Informatics}, year = {2024}, }

2023

FraudAuditor: A Visual Analytics Approach for Collusive Fraud in Health Insurance

Jiehui Zhou, Xumeng Wang, Jie Wang, and 7 more authors

IEEE Transactions on Visualization and Computer Graphics, 2023

Abs Bib HTML PDF Video Slides

Collusive fraud, in which multiple fraudsters collude to defraud health insurance funds, threatens the operation of the healthcare system. However, existing statistical and machine learning-based methods have limited ability to detect fraud in the scenario of health insurance due to the high similarity of fraudulent behaviors to normal medical visits and the lack of labeled data. To ensure the accuracy of the detection results, expert knowledge needs to be integrated with the fraud detection process. By working closely with health insurance audit experts, we propose FraudAuditor, a three-stage visual analytics approach to collusive fraud detection in health insurance. Specifically, we first allow users to interactively construct a co-visit network to holistically model the visit relationships of different patients. Second, an improved community detection algorithm that considers the strength of fraud likelihood is designed to detect suspicious fraudulent groups. Finally, through our visual interface, users can compare, investigate, and verify suspicious patient behavior with tailored visualizations that support different time scales. We conducted case studies in a real-world healthcare scenario, i.e., to help locate the actual fraud group and exclude the false positive group. The results and expert feedback proved the effectiveness and usability of the approach.
@article{zhou2023fraudauditor, title = {FraudAuditor: A Visual Analytics Approach for Collusive Fraud in Health Insurance}, author = {Zhou, Jiehui and Wang, Xumeng and Wang, Jie and Ye, Hui and Wang, Huanliang and Zhou, Zihan and Han, Dongming and Ying, Haochao and Wu, Jian and Chen, Wei}, journal = {IEEE Transactions on Visualization and Computer Graphics}, year = {2023}, publisher = {IEEE}, }

2022

DPVisCreator: Incorporating Pattern Constraints to Privacy-preserving Visualizations via Differential Privacy

Jiehui Zhou, Xumeng Wang, Jason K Wong, and 8 more authors

IEEE Transactions on Visualization and Computer Graphics, 2022

Abs Bib HTML PDF Video Code

Data privacy is an essential issue in publishing data visualizations. However, it is challenging to represent multiple data patterns in privacy-preserving visualizations. The prior approaches target specific chart types or perform an anonymization model uniformly without considering the importance of data patterns in visualizations. In this paper, we propose a visual analytics approach that facilitates data custodians to generate multiple private charts while maintaining user-preferred patterns. To this end, we introduce pattern constraints to model users’ preferences over data patterns in the dataset and incorporate them into the proposed Bayesian network-based Differential Privacy (DP) model PriVis . A prototype system, DPVisCreator , is developed to assist data custodians in implementing our approach. The effectiveness of our approach is demonstrated with quantitative evaluation of pattern utility under the different levels of privacy protection, case studies, and semi-structured expert interviews.
@article{zhou2022dpviscreator, title = {DPVisCreator: Incorporating Pattern Constraints to Privacy-preserving Visualizations via Differential Privacy}, author = {Zhou, Jiehui and Wang, Xumeng and Wong, Jason K and Wang, Huanliang and Wang, Zhongwei and Yang, Xiaoyu and Yan, Xiaoran and Feng, Haozhe and Qu, Huamin and Ying, Haochao and Chen, Wei}, journal = {IEEE Transactions on Visualization and Computer Graphics}, volume = {29}, number = {1}, pages = {809--819}, year = {2022}, publisher = {IEEE}, }

2021

MedicareVis: a Joint Visual Analytics Approach for Anti-Fraud in Medical Insurance

Jiehui Zhou, Rongchen Zhu, Wei Zhang, and 4 more authors

Journal of CAD & CG, 2021

Bib PDF

@article{18981,
  title = {MedicareVis: a Joint Visual Analytics Approach for Anti-Fraud in Medical Insurance},
  author = {Zhou, Jiehui and Zhu, Rongchen and Zhang, Wei and Lu, Junhua and Ying, Haochao and Wu, Jian and Chen, Wei},
  journal = {Journal of CAD & CG},
  volume = {33},
  number = {18981},
  pages = {1311},
  year = {2021},
  issn = {1003-9775},
  keywords = {可视分析, 医疗保险, 欺诈检测},
  url = {https://www.jcad.cn/article/doi/10.3724/SP.J.1089.2021.18981}
}