Reproducibility Code for

On Computing and the Complexity of Computing Higher-Order U-Statistics, Exactly
Authors: Xingyu Chen, Ruiqi Zhang, Lin Liu
Link: https://arxiv.org/abs/2508.12627

Overview

This repository provides code for reproducing the experimental results in our paper "On Computing and the Complexity of Computing Higher-Order U-Statistics, Exactly".

Our implementation is based on the Python package u-stat, available on PyPI:

U-Statistics (Python): https://github.com/zrq1706/U-Statistics-python

We also provide a corresponding R interface:

U-Statistics (R interface): https://github.com/cxy0714/U-Statistics-R

Environment Setup

To install all dependencies on a SLURM-based CPU cluster, please refer to sh/env_update.slurm. To install on a single GPU machine, please refer to sh/gpu_setup.sh.

Section 4.1: Higher-Order Influence Functions (HOIF)

All scripts and results for this section are in experiments/hoif/.

This section focuses on computing the main component of the HOIF estimators.
For a complete implementation, please refer to our R package:
https://github.com/cxy0714/HOIF

Table 1

Main script: run.py
Execution script: run_hoif.slurm (on CPU cluster); manual execution on single GPU machine.
Output results: benchmark_20260327_155952.json, benchmark_20260328_195135.json
Table generation: table.py → benchmark_20260327_155952_summary.txt, benchmark_20260328_195135_summary.txt

Table 4

Main scripts: run_count_complexity.py, run_count_u2v.py
Execution script: run_hoif_count.slurm
Output results: count_complexity_20260328_150321.json, count_u2v_20260328_153058.json, count_hoif_57008502.out

Section 4.2: Motif Counts

All scripts and results for this section are in experiments/motif_count/.

Tables 2 and 7

Dependencies: Peregrine and igraph. Peregrine installation instructions are available at the link above; igraph is installed via pip in sh/env_update.slurm.

Main script: run.py
Execution scripts: run_motif_fast.slurm (Peregrine and our method only), run_motif_igraph.slurm (igraph only)
Output results: benchmark_size3_fast_20260328_143440.json, benchmark_size3_igraph_20260328_143404.json, benchmark_size4_fast_20260328_173329.json, benchmark_size4_igraph_20260328_155723.json
Table generation: run_table_cpu.py → table_size3.txt, table_size4.txt

Table 8

Dependencies: cuGraph. Installation instructions are available at the link above.

Main script: run_cugraph.py
Execution: Manual execution on single GPU machine.
Output results: GPU_triangle_benchmark_20251115_162807.json → triangle_comparison_table.txt

Section 4.3: Distance Covariance (dCov)

All scripts and results for this section are in experiments/dcov/.

Table 3

Our U-Statistics Implementation

Main script: run.py (execution) and kernel.py (data processing)
Execution script: run_dcov_ustat.slurm
Output results: dcov_results_numpy_20250817_043111.json, dcov_results_torch_20250817_041908.json

Shao's MATLAB Implementation

All scripts and results are in data/stock_market/, also see their github repository.

Complete U-statistics: only_dcov.m
Randomized U-statistics: randmized.m
Output results: result/

Notes

CPU-based experiments were run on the π 2.0 and Siyuan-1 clusters supported by the Center for High Performance Computing at Shanghai Jiao Tong University. GPU-based experiments were run on a single GPU machine.
Please adjust paths and resource configurations as needed for your system.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
experiments		experiments
sh		sh
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reproducibility Code for

Overview

Environment Setup

Section 4.1: Higher-Order Influence Functions (HOIF)

Table 1

Table 4

Section 4.2: Motif Counts

Tables 2 and 7

Table 8

Section 4.3: Distance Covariance (dCov)

Table 3

Our U-Statistics Implementation

Shao's MATLAB Implementation

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Reproducibility Code for

Overview

Environment Setup

Section 4.1: Higher-Order Influence Functions (HOIF)

Table 1

Table 4

Section 4.2: Motif Counts

Tables 2 and 7

Table 8

Section 4.3: Distance Covariance (dCov)

Table 3

Our U-Statistics Implementation

Shao's MATLAB Implementation

Notes

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages