Interpretable Network Representation Learning with Principal Component Analysis

Wilson, James D.; Lee, Jihui

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2106

Statistics > Machine Learning

Title: Interpretable Network Representation Learning with Principal Component Analysis

Authors: James D. Wilson, Jihui Lee

(Submitted on 27 Jun 2021)

Abstract: We consider the problem of interpretable network representation learning for samples of network-valued data. We propose the Principal Component Analysis for Networks (PCAN) algorithm to identify statistically meaningful low-dimensional representations of a network sample via subgraph count statistics. The PCAN procedure provides an interpretable framework for which one can readily visualize, explore, and formulate predictive models for network samples. We furthermore introduce a fast sampling-based algorithm, sPCAN, which is significantly more computationally efficient than its counterpart, but still enjoys advantages of interpretability. We investigate the relationship between these two methods and analyze their large-sample properties under the common regime where the sample of networks is a collection of kernel-based random graphs. We show that under this regime, the embeddings of the sPCAN method enjoy a central limit theorem and moreover that the population level embeddings of PCAN and sPCAN are equivalent. We assess PCAN's ability to visualize, cluster, and classify observations in network samples arising in nature, including functional connectivity network samples and dynamic networks describing the political co-voting habits of the U.S. Senate. Our analyses reveal that our proposed algorithm provides informative and discriminatory features describing the networks in each sample. The PCAN and sPCAN methods build on the current literature of network representation learning and set the stage for a new line of research in interpretable learning on network-valued data. Publicly available software for the PCAN and sPCAN methods are available at this https URL

Comments:	33 pages. Submitted and currently under review
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
Cite as:	arXiv:2106.14238 [stat.ML]
	(or arXiv:2106.14238v1 [stat.ML] for this version)

Submission history

From: James Wilson [view email]
[v1] Sun, 27 Jun 2021 13:52:49 GMT (4543kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2106.14238

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Statistics > Machine Learning

Title: Interpretable Network Representation Learning with Principal Component Analysis

Submission history