Research Artifacts and Software

Major Released Software (Code repository)
  • Appatch: A neural-symbolic interprocedural vulnerability patching tool
  • PyRTFuzz: A language runtime fuzzer for Python
  • PolyFuzz: A greybox fuzzer for multi-language software
  • PolyFax: A toolkit for characterizing multi-language software
  • PolyCruise: A cross-language dynamic information flow analysis for multilingual software
  • FlowDist: A scalable dynamic information flow analyzer for distributed systems
  • Seads: A self-tuning cost-effective dynamic dependence analysis of distributed systems
  • DistFax: A toolkit for measuring IPC and quality of distributed systems
  • Dads: A dynamic program slicer for distributed systems
  • PCA: A static interprocedural data flow analyzer and memory leak detector for C programs
  • DistTaint: A dynamic taint analyzer for distributed systems
  • EvoTaint: An incremental static taint analyzer for Android apps
  • D2ABS: A framework for dynamic dependence analysis of distributed software
  • HPC-cruise: A scalable whole-system data-flow analysis framework for HPC software
  • DroidCat: A robust dynamic Android malware detector and classifier
  • ICC-Inspect: A dynamic Android app understanding tool
  • DroidFax: A toolkit for systematic characterization of Android apps
  • DistIA: A simple impact analyzer for distributed systems
  • DiaPro: A dynamic impact analysis framework with customizable cost-effectiveness
  • Diver: Hybrid program dependence approximation and trace-based online impact prediction
  • MDG: A method-level static dependence abstraction
  • TracerJD: A fine-grained Java program tracer and dynamic dependence analyzer
  • DEAM: A toolkit for comprehensive study of predictive accuracy of dynamic impact analysis
  • SensA: Quantitative semantic dependence approximation and prioritized impact analysis

Artifact Index

    Artifact links point to validated GitHub repositories, preferring the baltsers mirror when one is available. Public counters refresh in the browser when GitHub/Zenodo APIs are reachable. GitHub traffic views are not publicly exposed, so repository views remain N/A unless an alternative artifact host exposes views.

    andro-compat-tools | Available Functional | Bitbucket | N/A01010N/A
    • Haipeng Cai, Ziyi Zhang, Li LI, Xiaoqin Fu. "A Large-Scale Study of Application Incompatibilities in Android." ACM ISSTA, 2019. doi:10.1145/3293882.3330564
    • Ziyi Zhang, Haipeng Cai. "A Look Into Developer Intentions for App Compatibility in Android." MobileSoft, 2019. doi:10.1109/MOBILESoft.2019.00016
    Andro-Longitudinal-Study | N/A | Bitbucket | N/A01010N/A
    • Haipeng Cai, Barbara Ryder. "A Longitudinal Study of Application Structure and Behaviors in Android." IEEE TSE, 2020. doi:10.1109/TSE.2020.2975176
    AndroCompatStudy | N/A | Figshare | N/A01010N/A
    • Jiawei Guo, Xiaoqin Fu, Li LI, Tao Zhang, Mattia Fazzini, Haipeng Cai. "Characterizing Installation- and Run-Time Compatibility Issues in Android Benign Apps and Malware." ACM TOSEM, 2025. doi:10.1145/3725810
    AndroCT | N/A | Zenodo | N/A01010N/A
    • Wen Li, Xiaoqin Fu, Haipeng Cai. "AndroCT: Ten Years of App Call Traces in Android." MSR, 2021. doi:10.1109/MSR52588.2021.00076
    Appatch | Available Functional | Zenodo | 147310281010N/A
    • Yu Nong, Haoran Yang, Long Cheng, Hongxin Hu, Haipeng Cai. "Appatch: Automated Adaptive Prompting Large Language Models for Real-World Software Vulnerability Patching." USENIX Security, 2025.
    ARPDroid | N/A | Bitbucket | N/A01010N/A
    • Malinda Dilhara, Haipeng Cai, John Jenkins. "Automated Detection and Repair of Incompatible Uses of Runtime Permissions in Android Apps." MobileSoft, 2018. doi:10.1145/3197231.3197255
    cross-lang-bug-study | Available Functional | Docker Hub, Zenodo, anonymous.4open.science | 1819921010N/A
    • Haoran Yang, Haipeng Cai. "Dissecting Real-World Cross-Language Bugs." ACM FSE, 2025. doi:10.1145/3715777
    cross-lang-vul-proneness | Available Functional Reusable | Figshare, Figshare | N/A01010N/A
    • Wen Li, Li LI, Haipeng Cai. "On the Vulnerability Proneness of Multilingual Code." ACM FSE, 2022. doi:10.1145/3540250.3549173
    D2Abs | N/A | Bitbucket | N/A010300
    • Haipeng Cai, Xiaoqin Fu. "D2ABS: A Framework for Dynamic Dependence Analysis of Distributed Programs." IEEE TSE, 2021. doi:10.1109/TSE.2021.3124795
    Dads-tool | N/A | Bitbucket | N/A01010N/A
    • Xiaoqin Fu, Haipeng Cai, Li LI. "Dads: Dynamic Slicing Continuously-Running Distributed Programs with Budget Constraints." ACM FSE, 2020. doi:10.1145/3368089.3417920
    Deam | N/A | Project page | N/A01010N/A
    • Haipeng Cai, Raul Santelices. "A Comprehensive Study of the Predictive Accuracy of Dynamic Change-Impact Analysis." JSS, 2015. doi:10.1016/j.jss.2015.02.018
    • Haipeng Cai, Raul Santelices, Tianyu Xu. "Estimating the Accuracy of Dynamic Change-Impact Analysis using Sensitivity Analysis." SERE, 2014. doi:10.1109/SERE.2014.18
    DistFax | N/A | Bitbucket | N/A010300
    • Xiaoqin Fu, Boxiang Lin, Haipeng Cai. "DistFax: A Toolkit for Measuring Interprocess Communications and Quality of Distributed Systems." IEEE/ACM ICSE, 2022. doi:10.1145/3510454.3516859
    DistIA | N/A | Project page | N/A01010N/A
    • Haipeng Cai, Douglas Thain. "DistIA: A Cost-Effective Dynamic Impact Analysis for Distributed Programs." ASE, 2016. doi:10.1145/2970276.2970352
    DistMeasure | N/A | Figshare | N/A01010N/A
    • Xiaoqin Fu, Asif Zaman, Haipeng Cai. "DistMeasure: A Framework for Run-Time Characterization and Quality Assessment of Distributed Software via Interprocess Communications." ACM TOSEM, 2024. doi:10.1145/3708476
    • Xiaoqin Fu, Haipeng Cai. "Measuring Interprocess Communications in Distributed Systems." ICPC, 2019. doi:10.1109/ICPC.2019.00051
    DistODD | N/A | Bitbucket | N/A010300
    • Chandan Dhal, Xiaoqin Fu, Haipeng Cai. "A Control-Theoretic Approach to Auto-Tuning Dynamic Analysis for Distributed Services." IEEE/ACM ICSE, 2023. doi:10.1109/ICSE-Companion58688.2023.00092
    DistTaint | N/A | Bitbucket | N/A01010N/A
    • Xiaoqin Fu, Haipeng Cai. "Scaling Application-Level Dynamic Taint Analysis to Enterprise-Scale Distributed Systems." IEEE/ACM ICSE, 2020. doi:10.1145/3377812.3390910
    • Xiaoqin Fu, Haipeng Cai. "A Dynamic Taint Analyzer for Distributed Systems." ACM FSE, 2019. doi:10.1145/3338906.3341179
    Diver | N/A | Project page | N/A01010N/A
    • Haipeng Cai. "Hybrid Program Dependence Approximation for Effective Dynamic Impact Prediction." IEEE TSE, 2018. doi:10.1109/TSE.2017.2692783
    • Haipeng Cai, Raul Santelices, Douglas Thain. "DiaPro: Unifying Dynamic Impact Analyses for Improved and Variable Cost-Effectiveness." ACM TOSEM, 2016. doi:10.1145/2894751
    • Haipeng Cai, Raul Santelices. "A Framework for Cost-effective Dependence-based Dynamic Impact Analysis." SANER, 2015. doi:10.1109/SANER.2015.7081833
    • Haipeng Cai, Raul Santelices. "Diver: Precise Dynamic Impact Analysis Using Dependence-based Trace Pruning." ASE, 2014. doi:10.1145/2642937.2642950
    DroidCat | N/A | Project page | N/A01010N/A
    • Haipeng Cai, Na Meng, Barbara Ryder, Danfeng Yao. "DroidCat: Effective Android Malware Detection and Categorization via App-Level Profiling." IEEE TIFS, 2019. doi:10.1109/TIFS.2018.2879302
    DroidFax | N/A | Project page | N/A01010N/A
    • Haipeng Cai, Barbara Ryder. "Artifacts for Dynamic Analysis of Android Apps." ICSME, 2017. doi:10.1109/ICSME.2017.36
    • Haipeng Cai, Barbara Ryder. "DroidFax: A Toolkit for Systematic Characterization of Android Applications." ICSME, 2017. doi:10.1109/ICSME.2017.35
    • Haipeng Cai, Barbara Ryder. "Understanding Android Application Programming and Security: A Dynamic Study." ICSME, 2017. doi:10.1109/ICSME.2017.31
    DroidSpan | N/A | Bitbucket | N/A01010N/A
    • Haipeng Cai. "Assessing and Improving Malware Detection Sustainability through App Evolution Studies." ACM TOSEM, 2020. doi:10.1145/3371924
    • Xiaoqin Fu, Haipeng Cai. "On the Deterioration of Learning-Based Malware Detectors for Android." IEEE/ACM ICSE, 2019. doi:10.1109/ICSE-Companion.2019.00110
    • Haipeng Cai, John Jenkins. "Towards Sustainable Android Malware Detection." IEEE/ACM ICSE, 2018. doi:10.1145/3183440.3195004
    DTI-display | N/A | sourceforge.net, sourceforge.net | N/A01010N/A
    • Jian Chen, Haipeng Cai, Alexander P. Auchus, David H. Laidlaw. "Effects of Stereo and Screen Size on the Legibility of Three-dimensional Streamtube Visualizations." IEEE TVCG, 2012. doi:10.1109/TVCG.2012.216
    dyn-crosslang-study | N/A | Bitbucket | N/A01010N/A
    • Haoran Yang, Wen Li, Haipeng Cai. "Language-Agnostic Dynamic Analysis of Multilingual Code: Promises, Pitfalls, and Prospects." ACM FSE, 2022. doi:10.1145/3540250.3560880
    EvoTaint | N/A | Bitbucket | N/A010300
    • Jiawei Guo, Haipeng Cai. "EvoTaint: Incremental Static Taint Analysis of Evolving Android Apps." ACM TOSEM, 2025. doi:10.1145/3743132
    FlowDist | N/A | Bitbucket | N/A010300
    • Xiaoqin Fu, Haipeng Cai. "FlowDist: Multi-Staged Refinement-Based Dynamic Information Flow Analysis for Distributed Software Systems." USENIX Security, 2021.
    FreeLens | N/A | Bitbucket, Figshare | N/A01010N/A
    • Jiawei Guo, Yu Nong, Zhiqiang Lin, Haipeng Cai. "Code Speaks Louder: Exploring Security and Privacy Relevant Regional Variations in Mobile Applications." IEEE S&P, 2025. doi:10.1109/SP61157.2025.00225
    GDist | N/A | Figshare | N/A01010N/A
    • Xiaoqin Fu, Haipeng Cai. "Two-Level Adaptation for Budget-Constrained Continuous Dynamic Dependence Analysis." ACM FSE, 2026. doi:10.1145/3808167
    ICC-Inspect | N/A | Bitbucket | N/A01010N/A
    • John Jenkins, Haipeng Cai. "ICC-Inspect: Supporting Runtime Inspection of Android Inter-Component Communications." MobileSoft, 2018. doi:10.1145/3197231.3197233
    ICC-Visualizer | N/A | Bitbucket | N/A01010N/A
    • John Jenkins, Haipeng Cai. "Dissecting Android Inter-Component Communications via Interactive Visual Explorations." ICSME, 2017. doi:10.1109/ICSME.2017.74
    InShape | N/A | sourceforge.net, sourceforge.net | N/A01010N/A
    • Haipeng Cai, Jian Chen, Alexander P. Auchus, David H. Laidlaw. "InShape: In-Situ Shape-Based Interactive Multiple-View Exploration of Diffusion MRI Visualizations." ISVC, 2012. doi:10.1007/978-3-642-33191-6_70
    Iterative-taint-analysis | N/A | Bitbucket | N/A010300
    • Haipeng Cai, John Jenkins. "Leveraging Historical Versions of Android Apps for Efficient and Precise Taint Analysis." MSR, 2018. doi:10.1145/3196398.3196433
    MDG | N/A | Project page | N/A01010N/A
    • Haipeng Cai, Raul Santelices. "Method-Level Program Dependence Abstraction and Its Application to Impact Analysis." JSS, 2016. doi:10.1016/j.jss.2016.09.048
    • Haipeng Cai, Raul Santelices. "Abstracting Program Dependencies using the Method Dependence Graph." QRS, 2015. doi:10.1109/QRS.2015.18
    mem-vul-detector-study | N/A | Bitbucket | N/A01030N/A
    • Yu Nong, Haipeng Cai, Pengfei Ye, Li LI, Feng Chen. "Evaluating and Comparing Memory Error Vulnerability Detectors." IST, 2021. doi:10.1016/j.infsof.2021.106614
    multi-lang-github-study | N/A | Bitbucket | N/A010200
    • Wen Li, Austin Marino, Haoran Yang, Na Meng, Li LI, Haipeng Cai. "How are Multilingual Systems Constructed: Characterizing Language Use and Selection in Open-Source Multilingual Software." ACM TOSEM, 2023. doi:10.1145/3631967
    • Wen Li, Na Meng, Li LI, Haipeng Cai. "Understanding Language Selection in Multi-Language Software Projects on GitHub." IEEE/ACM ICSE, 2021. doi:10.1109/ICSE-Companion52605.2021.00119
    multi-lang-SO-study | Available Functional Reusable | Bitbucket, Figshare, Zenodo, Zenodo | 1419139210300
    • Haoran Yang, Yu Nong, Shaowei Wang, Haipeng Cai. "Multi-Language Software Development: Issues, Challenges, and Solutions." IEEE TSE, 2024. doi:10.1109/TSE.2024.3358258
    • Haoran Yang, Weile Lian, Shaowei Wang, Haipeng Cai. "Demystifying Issues, Challenges, and Solutions for Multilingual Software Development." IEEE/ACM ICSE, 2023. doi:10.1109/ICSE48619.2023.00157
    NeuralCodeEditing-Study | Available Functional | Figshare, Zenodo | 10814681010N/A
    • Yu Nong, Yuzhe Ou, Michael Pradel, Feng Chen, Haipeng Cai. "Generating Realistic Vulnerabilities via Neural Code Editing: An Empirical Study." ACM FSE, 2022. doi:10.1145/3540250.3549128
    OpenScienceStudy | N/A | Figshare | N/A01010N/A
    • Yu Nong, Rainy Sharma, Wahab Hamou-Lhadj, Xiapu Luo, Haipeng Cai. "Open Science in Software Engineering: A Study on Deep Learning-Based Vulnerability Detection." IEEE TSE, 2022. doi:10.1109/TSE.2022.3207149
    Parallel-Rendering | N/A | Google Sites, sourceforge.net | N/A01010N/A
    • Haipeng Cai. "Parallel Rendering for Legible Illustrative Visualizations of Dense Geometries on Commodity CPUs." IJIG, 2016. doi:10.1142/S0219467816500029
    PCA-tool | N/A | GitHub | N/A0217210
    • Wen Li, Haipeng Cai, Yulei Sui, David Manz. "PCA: Memory Leak Detection using Partial Call-Path Analysis." ACM FSE, 2020. doi:10.1145/3368089.3417923
    PolyCruise | Available Functional | Bitbucket, Bitbucket, GitHub | N/A0315502
    • Wen Li, Jiang Ming, Xiapu Luo, Haipeng Cai. "PolyCruise: A Cross-Language Dynamic Information Flow Analysis." USENIX Security, 2022.
    PolyFax | N/A | Bitbucket, GitHub | N/A020400
    • Wen Li, Li LI, Haipeng Cai. "PolyFax: A Toolkit for Characterizing Multi-Language Software." ACM FSE, 2022. doi:10.1145/3540250.3558925
    PolyFuzz | Available | Bitbucket, GitHub | N/A0283610
    • Wen Li, Jinyang Ruan, Guangbei Yi, Long Cheng, Xiapu Luo, Haipeng Cai. "PolyFuzz: Holistic Greybox Fuzzing of Multi-Language Systems." USENIX Security, 2023.
    PyRTFuzz | Available Functional Reproduced | Bitbucket, DOI, GitHub, GitHub | N/A0212900
    • Wen Li, Haipeng Cai. "A Practical Fuzzer for the Python Runtime System." ACM FSE, 2026. doi:10.1145/3803437.3806417
    • Wen Li, Haoran Yang, Xiapu Luo, Long Cheng, Haipeng Cai. "PyRTFuzz: Detecting Bugs in Python Runtimes via Two-Level Collaborative Fuzzing." ACM CCS, 2023. doi:10.1145/3576915.3623166
    Seads | N/A | Bitbucket | N/A01010N/A
    • Xiaoqin Fu, Haipeng Cai, Wen Li, Li LI. "Seads: Scalable and Cost-Effective Dynamic Dependence Analysis of Distributed Systems via Reinforcement Learning." ACM TOSEM, 2020. doi:10.1145/3379345
    SenSA | N/A | Project page | N/A01010N/A
    • Haipeng Cai, Raul Santelices, Siyuan Jiang. "Prioritizing Change Impacts via Semantic Dependence Quantification." IEEE TR, 2016. doi:10.1109/TR.2015.2481000
    • Haipeng Cai, Siyuan Jiang, Raul Santelices, Yingjie Zhang, Yiji Zhang. "SensA: Sensitivity Analysis for Quantitative Change-impact Prediction." SCAM, 2014. doi:10.1109/SCAM.2014.25
    TracerJD | N/A | Project page | N/A01010N/A
    • Haipeng Cai, Raul Santelices. "TracerJD: Generic Trace-based Dynamic Dependence Analysis with Fine-grained Logging." SANER, 2015. doi:10.1109/SANER.2015.7081862
    VerLog | Available Functional Reusable | Docker Hub, Zenodo | 1011241010N/A
    • Jiawei Guo, Haoran Yang, Haipeng Cai. "VerLog: Enhancing Release Note Generation for Android Apps using Large Language Models." ACM ISSTA, 2025. doi:10.1145/3728961
    VGX | Available Reusable | Figshare, Zenodo | 89314731010N/A
    • Yu Nong, Richard Fang, Guangbei Yi, Kunsong Zhao, Xiapu Luo, Feng Chen, Haipeng Cai. "VGX: Large-Scale Sample Generation for Boosting Learning-Based Software Vulnerability Analyses." IEEE/ACM ICSE, 2024. doi:10.1145/3597503.3639116
    VICS-LLM-VulGen | Available Functional | Figshare, Zenodo | 1341010N/A
    • Guangbei Yi, Yu Nong, Minzhang Li, Haipeng Cai. "Exploring and Improving Real-World Vulnerability Data Generation via Prompting Large Language Models." IEEE/ACM ICSE, 2026. doi:10.1145/3744916.3773176
    VinJ | N/A | Figshare | N/A01010N/A
    • Yu Nong, Haoran Yang, Feng Chen, Haipeng Cai. "VinJ: An Automated Tool for Large-Scale Software Vulnerability Data Generation." ACM FSE, 2024. doi:10.1145/3663529.3663800
    visual-semantics | N/A | Bitbucket | N/A01010N/A
    • Haipeng Cai, Shiv Raj Pant, Wen Li. "Towards Learning Visual Semantics." ACM FSE, 2020. doi:10.1145/3368089.3417040
    VulGen | Available Functional Reusable | Figshare, Zenodo | 176930251010N/A
    • Yu Nong, Yuzhe Ou, Michael Pradel, Feng Chen, Haipeng Cai. "VulGen: Realistic Vulnerable Sample Generation via Pattern Mining and Deep Learning." IEEE/ACM ICSE, 2023. doi:10.1109/ICSE48619.2023.00211
    xLoc | N/A | Figshare | N/A01010N/A
    • Haoran Yang, Yu Nong, Tao Zhang, Xiapu Luo, Haipeng Cai. "Learning to Detect and Localize Multilingual Bugs." ACM FSE, 2024. doi:10.1145/3660804
    Zifazah | N/A | Google Sites, sourceforge.net | N/A01010N/A
    • Jian Chen, Haipeng Cai, Alexander P. Auchus, David H. Laidlaw. "Gryphon: A Little Domain-Specific Programming Language for Diffusion MRI Visualizations." Handbook of Human Centric Visualization, 2014. doi:10.1007/978-1-4614-7485-2_2
    All artifacts | Total | All alternative artifacts | 69308506151179122 | All associated papers