
Computational Complexity

New submissions

[ total of 7 entries: 1-7 ]

New submissions for Fri, 31 Mar 23

[1]  arXiv:2303.17045 [pdf, ps, other]
Title: Training Neural Networks is NP-Hard in Fixed Dimension
Subjects: Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)

We study the parameterized complexity of training two-layer neural networks with respect to the dimension of the input data and the number of hidden neurons, considering ReLU and linear threshold activation functions. Although the computational complexity of these problems has been studied numerous times in recent years, several questions are still open. We answer questions by Arora et al. [ICLR '18] and Khalife and Basu [IPCO '22] by showing that both problems are NP-hard for two dimensions, which excludes any polynomial-time algorithm for constant dimension. We also answer a question by Froese et al. [JAIR '22] by proving W[1]-hardness for four ReLUs (or two linear threshold neurons) with zero training error. Finally, in the ReLU case, we show fixed-parameter tractability for the combined parameter number of dimensions and number of ReLUs if the network is assumed to compute a convex map. Our results settle the complexity status regarding these parameters almost completely.

[2]  arXiv:2303.17506 [pdf, other]
Title: Sum-of-Squares Lower Bounds for Densest $k$-Subgraph
Subjects: Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS)

Given a graph and an integer $k$, Densest $k$-Subgraph is the algorithmic task of finding the subgraph on $k$ vertices with the maximum number of edges. This is a fundamental problem that has been subject to intense study for decades, with applications spanning a wide variety of fields. The state-of-the-art algorithm is an $O(n^{1/4 + \epsilon})$-factor approximation (for any $\epsilon > 0$) due to Bhaskara et al. [STOC '10]. Moreover, the so-called log-density framework predicts that this is optimal, i.e. it is impossible for an efficient algorithm to achieve an $O(n^{1/4 - \epsilon})$-factor approximation. In the average case, Densest $k$-Subgraph is a prototypical noisy inference task which is conjectured to exhibit a statistical-computational gap.
In this work, we provide the strongest evidence yet of hardness for Densest $k$-Subgraph by showing matching lower bounds against the powerful Sum-of-Squares (SoS) algorithm, a meta-algorithm based on convex programming that achieves state-of-the-art algorithmic guarantees for many optimization and inference problems. For $k \leq n^{\frac{1}{2}}$, we obtain a degree $n^{\delta}$ SoS lower bound for the hard regime as predicted by the log-density framework.
To show this, we utilize the modern framework for proving SoS lower bounds on average-case problems pioneered by Barak et al. [FOCS '16]. A key issue is that small denser-than-average subgraphs in the input will greatly affect the value of the candidate pseudoexpectation operator around the subgraph. To handle this challenge, we devise a novel matrix factorization scheme based on the positive minimum vertex separator. We then prove an intersection tradeoff lemma to show that the error terms when using this separator are indeed small.
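
For readers unfamiliar with the problem statement above, the following is a minimal sketch of Densest $k$-Subgraph solved by exhaustive search. It is not the approximation algorithm of Bhaskara et al. and not the Sum-of-Squares relaxation discussed in the abstract; it only illustrates the objective (the number of edges induced by a $k$-vertex subset) and runs in time exponential in $k$. The function name `densest_k_subgraph` and the example graph are assumptions made purely for illustration.

```python
from itertools import combinations


def densest_k_subgraph(n, edges, k):
    """Brute-force search over all k-vertex subsets of an n-vertex graph.

    Vertices are assumed to be labelled 0..n-1 and edges are undirected pairs.
    Returns one subset with the maximum number of induced edges and that count.
    """
    edge_set = {frozenset(e) for e in edges}
    best_subset, best_count = None, -1
    for subset in combinations(range(n), k):
        # Count the edges with both endpoints inside the candidate subset.
        count = sum(
            1 for pair in combinations(subset, 2) if frozenset(pair) in edge_set
        )
        if count > best_count:
            best_subset, best_count = subset, count
    return best_subset, best_count


# Hypothetical example: a triangle on {0, 1, 2} with a path 2-3-4 attached.
edges = [(0, 1), (1, 2), (0, 2), (2, 3), (3, 4)]
subset, count = densest_k_subgraph(5, edges, 3)
print(subset, count)
```

On this small graph the triangle is the only 3-vertex subset that induces three edges. The search visits all $\binom{n}{k}$ subsets, which is why approximation algorithms and hardness results such as those in the abstract matter for large inputs.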

Cross-lists for Fri, 31 Mar 23

[3]  arXiv:2303.17028 (cross-list from cs.CG) [pdf, other]
Title: On the complexity of embedding in graph products
Subjects: Computational Geometry (cs.CG); Computational Complexity (cs.CC)

Graph embedding, especially as a subgraph of a grid, is an old topic in VLSI design and graph drawing. In this paper, we investigate related questions concerning the complexity of embedding a graph $G$ in a host graph that is the strong product of a path $P$ with a graph $H$ that satisfies some properties, such as having small treewidth, pathwidth or tree depth. We show that this is NP-hard, even under numerous restrictions on both $G$ and $H$. In particular, computing the row pathwidth and the row treedepth is NP-hard even for a tree of small pathwidth, while computing the row treewidth is NP-hard even for series-parallel graphs.

[4]  arXiv:2303.17578 (cross-list from cs.LG) [pdf, ps, other]
Title: Online Learning and Disambiguations of Partial Concept Classes
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC)

In a recent article, Alon, Hanneke, Holzman, and Moran (FOCS '21) introduced a unifying framework to study the learnability of classes of partial concepts. One of the central questions studied in their work is whether the learnability of a partial concept class is always inherited from the learnability of some "extension" of it to a total concept class.
They showed this is not the case for PAC learning but left the problem open for the stronger notion of online learnability.
We resolve this problem by constructing a class of partial concepts that is online learnable, but no extension of it to a class of total concepts is online learnable (or even PAC learnable).

Replacements for Fri, 31 Mar 23

[5]  arXiv:2209.13063 (replaced) [pdf, other]
Title: Quantum-Inspired Perfect Matching under Vertex-Color Constraints
Comments: 13 pages excluding appendix and reference. 4 figures
Subjects: Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS); Mathematical Physics (math-ph); Combinatorics (math.CO); Quantum Physics (quant-ph)

[6]  arXiv:2104.08015 (replaced) [pdf, other]
Title: On the Complexity of SHAP-Score-Based Explanations: Tractability via Knowledge Compilation and Non-Approximability Results
Comments: Up to the formatting, this is the exact content of the paper in Journal of Machine Learning Research (JMLR)
Subjects: Artificial Intelligence (cs.AI); Computational Complexity (cs.CC)

[7]  arXiv:2303.16208 (replaced) [pdf, ps, other]
Title: Lifting uniform learners via distributional decomposition
Comments: To appear in STOC 2023
Subjects: Machine Learning (stat.ML); Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)