Machine Learning
New submissions
New submissions for Thu, 13 May 21
 [1] arXiv:2105.05489 [pdf, other]

Title: Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Computation (stat.CO)
We propose a Multiscale Invertible Generative Network (MsIGN) and associated training algorithm that leverages multiscale structure to solve high-dimensional Bayesian inference. To address the curse of dimensionality, MsIGN exploits the low-dimensional nature of the posterior, and generates samples from coarse to fine scale (low to high dimension) by iteratively upsampling and refining samples. MsIGN is trained in a multi-stage manner to minimize the Jeffreys divergence, which avoids mode dropping in high-dimensional cases. On two high-dimensional Bayesian inverse problems, we show superior performance of MsIGN over previous approaches in posterior approximation and multiple mode capture. On the natural image synthesis task, MsIGN achieves superior performance in bits-per-dimension over baseline models and yields great interpretability of its neurons in intermediate layers.
 [2] arXiv:2105.05648 [pdf, other]

Title: Look-Ahead Screening Rules for the Lasso
Authors: Johan Larsson
Comments: EYSM 2021 short paper; 6 pages, 2 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
The lasso is a popular method to induce shrinkage and sparsity in the solution vector (coefficients) of regression problems, particularly when there are many predictors relative to the number of observations. Solving the lasso in this high-dimensional setting can, however, be computationally demanding. Fortunately, this demand can be alleviated via the use of screening rules that discard predictors prior to fitting the model, leading to a reduced problem to be solved. In this paper, we present a new screening strategy: look-ahead screening. Our method uses safe screening rules to find a range of penalty values for which a given predictor cannot enter the model, thereby screening predictors along the remainder of the path. In experiments we show that these look-ahead screening rules improve the performance of existing screening strategies.
 [3] arXiv:2105.05842 [pdf, other]

Title: Kernel Thinning
Comments: 55 pages, 4 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
We introduce kernel thinning, a simple algorithm for generating better-than-Monte-Carlo approximations to distributions $\mathbb{P}$ on $\mathbb{R}^d$. Given $n$ input points, a suitable reproducing kernel $\mathbf{k}$, and $\mathcal{O}(n^2)$ time, kernel thinning returns $\sqrt{n}$ points with comparable integration error for every function in the associated reproducing kernel Hilbert space. With high probability, the maximum discrepancy in integration error is $\mathcal{O}_d(n^{-\frac{1}{2}}\sqrt{\log n})$ for compactly supported $\mathbb{P}$ and $\mathcal{O}_d(n^{-\frac{1}{2}} \sqrt{(\log n)^{d+1}\log\log n})$ for sub-exponential $\mathbb{P}$. In contrast, an equal-sized i.i.d. sample from $\mathbb{P}$ suffers $\Omega(n^{-\frac14})$ integration error. Our sub-exponential guarantees resemble the classical quasi-Monte Carlo error rates for uniform $\mathbb{P}$ on $[0,1]^d$ but apply to general distributions on $\mathbb{R}^d$ and a wide range of common kernels. We use our results to derive explicit non-asymptotic maximum mean discrepancy bounds for Gaussian, Mat\'ern, and B-spline kernels and present two vignettes illustrating the practical benefits of kernel thinning over i.i.d. sampling and standard Markov chain Monte Carlo thinning.
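As a rough illustration of the i.i.d. baseline that the abstract compares against, the sketch below estimates the maximum mean discrepancy (MMD) between n input points and a uniformly drawn subsample of sqrt(n) points under a Gaussian kernel. It is not the kernel thinning algorithm itself; the function names, the bandwidth, and the toy Gaussian data are all assumptions made for illustration.

```python
import math
import random


def gaussian_kernel(x, y, bandwidth=1.0):
    """Gaussian (RBF) kernel between two points given as sequences of floats."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mean_kernel(xs, ys, bandwidth=1.0):
    """Average kernel value over all pairs (x, y) with x in xs and y in ys."""
    total = sum(gaussian_kernel(x, y, bandwidth) for x in xs for y in ys)
    return total / (len(xs) * len(ys))


def mmd(xs, ys, bandwidth=1.0):
    """MMD between the empirical distributions of xs and ys (V-statistic form)."""
    squared = (
        mean_kernel(xs, xs, bandwidth)
        + mean_kernel(ys, ys, bandwidth)
        - 2.0 * mean_kernel(xs, ys, bandwidth)
    )
    # Guard against tiny negative values caused by floating-point rounding.
    return math.sqrt(max(squared, 0.0))


# Hypothetical toy data: n standard Gaussian points in two dimensions.
rng = random.Random(0)
n = 256
points = [(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) for _ in range(n)]

# i.i.d.-style baseline: keep sqrt(n) points chosen uniformly at random.
subsample = rng.sample(points, math.isqrt(n))

print("subsample size:", len(subsample))
print("MMD(full sample, random subsample):", mmd(points, subsample))
```

A thinning method with the guarantees described in the abstract would aim to return a subset of the same size with a smaller MMD than this random subsample typically achieves.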
Cross-lists for Thu, 13 May 21
 [4] arXiv:2105.05328 (cross-list from cs.LG) [pdf, other]

Title: Comparing interpretability and explainability for feature selection
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
A common approach for feature selection is to examine the variable importance scores for a machine learning model, as a way to understand which features are the most relevant for making predictions. Given the significance of feature selection, it is crucial for the calculated importance scores to reflect reality. Falsely overestimating the importance of irrelevant features can lead to false discoveries, while underestimating importance of relevant features may lead us to discard important features, resulting in poor model performance. Additionally, black-box models like XGBoost provide state-of-the-art predictive performance, but cannot be easily understood by humans, and thus we rely on variable importance scores or methods for explainability like SHAP to offer insight into their behavior.
In this paper, we investigate the performance of variable importance as a feature selection method across various black-box and interpretable machine learning methods. We compare the ability of CART, Optimal Trees, XGBoost and SHAP to correctly identify the relevant subset of variables across a number of experiments. The results show that regardless of whether we use the native variable importance method or SHAP, XGBoost fails to clearly distinguish between relevant and irrelevant features. On the other hand, the interpretable methods are able to correctly and efficiently identify irrelevant features, and thus offer significantly better performance for feature selection.
 [5] arXiv:2105.05347 (cross-list from cs.LG) [pdf, other]

Title: Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Scaling issues are mundane yet irritating for practitioners of reinforcement learning. Error scales vary across domains, tasks, and stages of learning; sometimes by many orders of magnitude. This can be detrimental to learning speed and stability, create interference between learning tasks, and necessitate substantial tuning. We revisit this topic for agents based on temporal-difference learning, sketch out some desiderata and investigate scenarios where simple fixes fall short. The mechanism we propose requires neither tuning, clipping, nor adaptation. We validate its effectiveness and robustness on the suite of Atari games. Our scaling method turns out to be particularly helpful at mitigating interference, when training a shared neural network on multiple targets that differ in reward scale or discounting.
 [6] arXiv:2105.05360 (cross-list from physics.ao-ph) [pdf, other]

Title: Real-time Ionospheric Imaging of S4 Scintillation from Limited Data with Parallel Kalman Filters and Smoothness
Authors: Alexandra Koulouri
Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (stat.ML)
In this paper, we propose a Bayesian framework to create two dimensional ionospheric images of high spatiotemporal resolution to monitor ionospheric irregularities as measured by the S4 index. Here, we recast the standard Bayesian recursive filtering for a linear Gaussian state-space model, also referred to as the Kalman filter, first by augmenting the (pierce point) observation model with connectivity information stemming from the insight and assumptions/standard modeling about the spatial distribution of the scintillation activity on the ionospheric shell at 350 km altitude. Thus, we are able to handle the limited spatiotemporal observations. Then, by introducing a set of Kalman filters running in parallel, we mitigate the uncertainty related to a tuning parameter of the proposed augmented model. The output images are a weighted average of the state estimates of the individual filters. We demonstrate our approach by rendering two dimensional real-time ionospheric images of S4 amplitude scintillation at 350 km over South America with temporal resolution of one minute. Furthermore, we employ extra S4 data that was not used in producing these ionospheric images, to check and verify the ability of our images to predict this extra data in particular ionospheric pierce points. Our results show that in areas with a network of ground receivers with a relatively good coverage (e.g. within a couple of kilometers distance) the produced images can provide reliable real-time results. Our proposed algorithmic framework can be readily used to visualize real-time ionospheric images taking as inputs the available scintillation data provided from freely available web-servers.
 [7] arXiv:2105.05373 (cross-list from math.ST) [pdf, other]

Title: Estimation of population size based on capture recapture designs and evaluation of the estimation reliability
Authors: Yue You, Mark van der Laan, Philip Collender, Qu Cheng, Alan Hubbard, Nicholas P Jewell, Zhiyue Tom Hu, Robin Mejia, Justin Remais
Subjects: Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
We propose a modern method to estimate population size based on capture-recapture designs of K samples. The observed data is formulated as a sample of n i.i.d. K-dimensional vectors of binary indicators, where the k-th component of each vector indicates the subject being caught by the k-th sample, such that only subjects with nonzero capture vectors are observed. The target quantity is the unconditional probability of the vector being nonzero across both observed and unobserved subjects. We cover models assuming a single constraint (identification assumption) on the K-dimensional distribution such that the target quantity is identified and the statistical model is unrestricted. We present solutions for linear and non-linear constraints commonly assumed to identify capture-recapture models, including no K-way interaction in linear and log-linear models, independence or conditional independence. We demonstrate that the choice of constraint has a dramatic impact on the value of the estimand, showing that it is crucial that the constraint is known to hold by design. For the commonly assumed constraint of no K-way interaction in a log-linear model, the statistical target parameter is only defined when each of the $2^K - 1$ observable capture patterns is present, and therefore suffers from the curse of dimensionality. We propose a targeted MLE based on undersmoothed lasso model to smooth across the cells while targeting the fit towards the single valued target parameter of interest. For each identification assumption, we provide simulated inference and confidence intervals to assess the performance of the estimator under correct and incorrect identifying assumptions. We apply the proposed method, alongside existing estimators, to estimate prevalence of a parasitic infection using multi-source surveillance data from a region in southwestern China, under the four identification assumptions.
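The counting argument in the abstract (every one of the 2^K - 1 nonzero capture patterns must appear in the data) can be made concrete with a small sketch. The helper names and the toy records below are hypothetical and only illustrate the combinatorics; they are not the authors' estimator.

```python
from itertools import product


def observable_patterns(k):
    """All nonzero binary capture vectors of length k; there are 2**k - 1."""
    return [p for p in product((0, 1), repeat=k) if any(p)]


def missing_patterns(records, k):
    """Observable capture patterns that never occur in the observed records."""
    seen = set(map(tuple, records))
    return [p for p in observable_patterns(k) if p not in seen]


# Hypothetical toy data for K = 3 samples: one capture vector per subject.
records = [(1, 0, 0), (0, 1, 0), (1, 1, 0), (0, 0, 1), (1, 0, 1)]

print(len(observable_patterns(3)))   # 7 observable patterns for K = 3
print(missing_patterns(records, 3))  # patterns absent from the toy data
```

Because the number of patterns doubles with each additional sample, even moderately large K makes empty cells likely, which is the curse of dimensionality the abstract refers to.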
 [8] arXiv:2105.05400 (cross-list from cs.LG) [pdf, ps, other]

Title: Homogeneous vector bundles and $G$-equivariant convolutional neural networks
Authors: Jimmy Aronsson
Comments: 23 pages
Subjects: Machine Learning (cs.LG); Representation Theory (math.RT); Machine Learning (stat.ML)
$G$-equivariant convolutional neural networks (GCNNs) are geometric deep learning models for data defined on a homogeneous $G$-space $\mathcal{M}$. GCNNs are designed to respect the global symmetry in $\mathcal{M}$, thereby facilitating learning. In this paper, we analyze GCNNs on homogeneous spaces $\mathcal{M} = G/K$ in the case of unimodular Lie groups $G$ and compact subgroups $K \leq G$. We demonstrate that homogeneous vector bundles are the natural setting for GCNNs. We also use reproducing kernel Hilbert spaces to obtain a precise criterion for expressing $G$-equivariant layers as convolutional layers. This criterion is then rephrased as a bandwidth criterion, leading to even stronger results for some groups.
 [9] arXiv:2105.05449 (cross-list from cs.LG) [pdf, ps, other]

Title: An efficient projection neural network for $\ell_1$-regularized logistic regression
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
$\ell_1$ regularization has been used for logistic regression to circumvent overfitting and to use the estimated sparse coefficients for feature selection. However, the challenge of such a regularization is that the $\ell_1$ norm is not differentiable, making the standard algorithms for convex optimization not applicable to this problem. This paper presents a simple projection neural network for $\ell_1$-regularized logistic regression. In contrast to many available solvers in the literature, the proposed neural network does not require any extra auxiliary variable nor any smooth approximation, and its complexity is almost identical to that of the gradient descent for logistic regression without $\ell_1$ regularization, thanks to the projection operator. We also investigate the convergence of the proposed neural network by using the Lyapunov theory and show that it converges to a solution of the problem with any arbitrary initial value. The proposed neural solution significantly outperforms state-of-the-art methods with respect to the execution time and is competitive in terms of accuracy and AUROC.
 [10] arXiv:2105.05555 (cross-list from cs.LG) [pdf, ps, other]

Title: Robust Learning of Fixed-Structure Bayesian Networks in Nearly-Linear Time
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Statistics Theory (math.ST); Machine Learning (stat.ML)
We study the problem of learning Bayesian networks where an $\epsilon$-fraction of the samples are adversarially corrupted. We focus on the fully-observable case where the underlying graph structure is known. In this work, we present the first nearly-linear time algorithm for this problem with a dimension-independent error guarantee. Previous robust algorithms with comparable error guarantees are slower by at least a factor of $(d/\epsilon)$, where $d$ is the number of variables in the Bayesian network and $\epsilon$ is the fraction of corrupted samples.
Our algorithm and analysis are considerably simpler than those in previous work. We achieve this by establishing a direct connection between robust learning of Bayesian networks and robust mean estimation. As a subroutine in our algorithm, we develop a robust mean estimation algorithm whose runtime is nearly-linear in the number of nonzeros in the input samples, which may be of independent interest.
 [11] arXiv:2105.05622 (cross-list from cs.LG) [pdf, other]

Title: On risk-based active learning for structural health monitoring
Comments: 28 pages. 23 figures. Under review, preprint submitted to Mechanical Systems and Signal Processing
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
A primary motivation for the development and implementation of structural health monitoring (SHM) systems is the prospect of gaining the ability to make informed decisions regarding the operation and maintenance of structures and infrastructure. Unfortunately, descriptive labels for measured data corresponding to health-state information for the structure of interest are seldom available prior to the implementation of a monitoring system. This issue limits the applicability of the traditional supervised and unsupervised approaches to machine learning in the development of statistical classifiers for decision-supporting SHM systems.
The current paper presents a risk-based formulation of active learning, in which the querying of class-label information is guided by the expected value of said information for each incipient data point. When applied to structural health monitoring, the querying of class labels can be mapped onto the inspection of a structure of interest in order to determine its health state. In the current paper, the risk-based active learning process is explained and visualised via a representative numerical example and subsequently applied to the Z24 Bridge benchmark. The results of the case studies indicate that a decision-maker's performance can be improved via the risk-based active learning of a statistical classifier, such that the decision process itself is taken into account.
 [12] arXiv:2105.05650 (cross-list from cond-mat.stat-mech) [pdf, other]

Title: Unbiased Monte Carlo Cluster Updates with Autoregressive Neural Networks
Comments: 8 pages, 5 figures
Subjects: Statistical Mechanics (cond-mat.stat-mech); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG); Machine Learning (stat.ML)
Efficient sampling of complex high-dimensional probability densities is a central task in computational science. Machine Learning techniques based on autoregressive neural networks have been recently shown to provide good approximations of probability distributions of interest in physics. In this work, we propose a systematic way to remove the intrinsic bias associated with these variational approximations, combining it with Markov-chain Monte Carlo in an automatic scheme to efficiently generate cluster updates, which is particularly useful for models for which no efficient cluster update scheme is known. Our approach is based on symmetry-enforced cluster updates building on the neural-network representation of conditional probabilities. We demonstrate that such finite-cluster updates are crucial to circumvent ergodicity problems associated with global neural updates. We test our method for first- and second-order phase transitions in classical spin systems, proving in particular its viability for critical systems, or in the presence of metastable states.
 [13] arXiv:2105.05721 (cross-list from quant-ph) [pdf, other]

Title: Causal networks and freedom of choice in Bell's theorem
Authors: Rafael Chaves, George Moreno, Emanuele Polino, Davide Poderini, Iris Agresti, Alessia Suprano, Mariana R. Barros, Gonzalo Carvacho, Elie Wolfe, Askery Canabarro, Robert W. Spekkens, Fabio Sciarrino
Comments: 17 pages, 10 figures
Subjects: Quantum Physics (quant-ph); Machine Learning (stat.ML)
Bell's theorem is typically understood as the proof that quantum theory is incompatible with local hidden variable models. More generally, we can see the violation of a Bell inequality as witnessing the impossibility of explaining quantum correlations with classical causal models. The violation of a Bell inequality, however, does not exclude classical models where some level of measurement dependence is allowed, that is, the choice made by observers can be correlated with the source generating the systems to be measured. Here we show that the level of measurement dependence can be quantitatively upper bounded if we arrange the Bell test within a network. Furthermore, we also prove that these results can be adapted in order to derive nonlinear Bell inequalities for a large class of causal networks and to identify quantumly realizable correlations which violate them.
 [14] arXiv:2105.05728 (cross-list from cs.LG) [pdf, other]

Title: Early prediction of respiratory failure in the intensive care unit
Authors: Matthias Hüser, Martin Faltys, Xinrui Lyu, Chris Barber, Stephanie L. Hyland, Tobias M. Merz, Gunnar Rätsch
Comments: 14 pages, 5 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
The development of respiratory failure is common among patients in intensive care units (ICU). Large data quantities from ICU patient monitoring systems make timely and comprehensive analysis by clinicians difficult but are ideal for automatic processing by machine learning algorithms. Early prediction of respiratory system failure could alert clinicians to patients at risk of respiratory failure and allow for early patient reassessment and treatment adjustment. We propose an early warning system that predicts moderate/severe respiratory failure up to 8 hours in advance. Our system was trained on HiRID-II, a dataset containing more than 60,000 admissions to a tertiary care ICU. An alarm is typically triggered several hours before the beginning of respiratory failure. Our system outperforms a clinical baseline mimicking traditional clinical decision-making based on pulse-oximetric oxygen saturation and the fraction of inspired oxygen. To provide model introspection and diagnostics, we developed an easy-to-use web browser-based system to explore model input data and predictions visually.
 [15] arXiv:2105.05736 (cross-list from cs.LG) [pdf, other]

Title: Disentangling Sampling and Labeling Bias for Learning in Large-Output Spaces
Authors: Ankit Singh Rawat, Aditya Krishna Menon, Wittawat Jitkrittum, Sadeep Jayasumana, Felix X. Yu, Sashank Reddi, Sanjiv Kumar
Comments: To appear in ICML 2021
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Negative sampling schemes enable efficient training given a large number of classes, by offering a means to approximate a computationally expensive loss function that takes all labels into account. In this paper, we present a new connection between these schemes and loss modification techniques for countering label imbalance. We show that different negative sampling schemes implicitly trade-off performance on dominant versus rare labels. Further, we provide a unified means to explicitly tackle both sampling bias, arising from working with a subset of all labels, and labeling bias, which is inherent to the data due to label imbalance. We empirically verify our findings on long-tail classification and retrieval benchmarks.
 [16] arXiv:2105.05757 (cross-list from cs.LG) [pdf, other]

Title: Exploring the Similarity of Representations in Model-Agnostic Meta-Learning
Comments: Learning to Learn workshop at ICLR 2021
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
In past years model-agnostic meta-learning (MAML) has been one of the most promising approaches in meta-learning. It can be applied to different kinds of problems, e.g., reinforcement learning, but also shows good results on few-shot learning tasks. Despite its tremendous success in these tasks, it is still not fully understood why it works so well. Recent work proposes that MAML reuses features rather than rapidly learning. In this paper, we want to inspire a deeper understanding of this question by analyzing MAML's representation. We apply representation similarity analysis (RSA), a well-established method in neuroscience, to the few-shot learning instantiation of MAML. Although some part of our analysis supports their general results that feature reuse is predominant, we also reveal arguments against their conclusion. The similarity-increase of layers closer to the input layers arises from the learning task itself and not from the model. In addition, the representations after inner gradient steps make a broader change to the representation than the changes during meta-training.
 [17] arXiv:2105.05782 (cross-list from cs.DS) [pdf, other]

Title: How to Design Robust Algorithms using Noisy Comparison Oracle
Comments: PVLDB 2021
Subjects: Data Structures and Algorithms (cs.DS); Databases (cs.DB); Machine Learning (stat.ML)
Metric based comparison operations such as finding maximum, nearest and farthest neighbor are fundamental to studying various clustering techniques such as $k$-center clustering and agglomerative hierarchical clustering. These techniques crucially rely on accurate estimation of pairwise distance between records. However, computing exact features of the records, and their pairwise distances is often challenging, and sometimes not possible. We circumvent this challenge by leveraging weak supervision in the form of a comparison oracle that compares the relative distance between the queried points such as `Is point u closer to v or w closer to x?'.
However, it is possible that some queries are easier to answer than others using a comparison oracle. We capture this by introducing two different noise models called adversarial and probabilistic noise. In this paper, we study various problems that include finding maximum, nearest/farthest neighbor search under these noise models. Building upon the techniques we develop for these comparison operations, we give robust algorithms for $k$-center clustering and agglomerative hierarchical clustering. We prove that our algorithms achieve good approximation guarantees with a high probability and analyze their query complexity. We evaluate the effectiveness and efficiency of our techniques empirically on various real-world datasets.
Replacements for Thu, 13 May 21
 [18] arXiv:2011.09349 (replaced) [pdf, ps, other]

Title: Bias-Variance Tradeoff and Overlearning in Dynamic Decision Problems
Comments: 22 pages, 4 Tables
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
 [19] arXiv:2104.13101 (replaced) [pdf, other]

Title: Initializing LSTM internal states via manifold learning
Authors: Felix P. Kemeth, Tom Bertalan, Nikolaos Evangelou, Tianqi Cui, Saurabh Malani, Ioannis G. Kevrekidis
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Pattern Formation and Solitons (nlin.PS)
 [20] arXiv:2105.03584 (replaced) [pdf, other]

Title: Adaptive Latent Space Tuning for Non-Stationary Distributions
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Accelerator Physics (physics.acc-ph)
 [21] arXiv:2105.04979 (replaced) [pdf, other]

Title: Surrogate assisted active subspace and active subspace assisted surrogate - A new paradigm for high dimensional structural reliability analysis
Comments: 19 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
 [22] arXiv:1907.12439 (replaced) [pdf, other]

Title: Hindsight Trust Region Policy Optimization
Comments: Accepted by IJCAI 2021
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
 [23] arXiv:2002.03129 (replaced) [pdf, other]

Title: GLSearch: Maximum Common Subgraph Detection via Learning to Search
Comments: Accepted by ICML 2021
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
 [24] arXiv:2006.03463 (replaced) [pdf, other]

Title: Sponge Examples: Energy-Latency Attacks on Neural Networks
Comments: Accepted at 6th IEEE European Symposium on Security and Privacy (EuroS&P)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
 [25] arXiv:2006.14026 (replaced) [pdf, other]

Title: Subpopulation Data Poisoning Attacks
Comments: May12 update: add sever + backdoor defenses, comparison to witches' brew attack, better comparison to related work, transferability of representations for cmatch
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
 [26] arXiv:2007.14120 (replaced) [pdf, other]

Title: Reachable Sets of Classifiers and Regression Models: (Non-)Robustness Analysis and Robust Training
Comments: Published as a journal paper at ECML PKDD 2021
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
 [27] arXiv:2009.04013 (replaced) [pdf, other]

Title: Attribute Privacy: Framework and Mechanisms
Subjects: Cryptography and Security (cs.CR); Computers and Society (cs.CY); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
 [28] arXiv:2010.04683 (replaced) [pdf, other]

Title: Smooth Variational Graph Embeddings for Efficient Neural Architecture Search
Comments: 8 pages, 3 figures, 5 tables. Camera-Ready Version for IJCNN 2021
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
 [29] arXiv:2010.14706 (replaced) [pdf, other]

Title: Data-driven prediction of multistable systems from sparse measurements
Subjects: Dynamical Systems (math.DS); Optimization and Control (math.OC); Pattern Formation and Solitons (nlin.PS); Machine Learning (stat.ML)
 [30] arXiv:2011.00467 (replaced) [pdf, other]

Title: Differentially Private Bayesian Inference for Generalized Linear Models
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
 [31] arXiv:2012.15085 (replaced) [pdf, other]

Title: Is Pessimism Provably Efficient for Offline RL?
Comments: 60 pages, 3 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Statistics Theory (math.ST); Machine Learning (stat.ML)
 [32] arXiv:2104.14543 (replaced) [pdf, other]

Title: Optimal training of variational quantum algorithms without barren plateaus
Comments: 13 pages, 14 figures
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG); Machine Learning (stat.ML)
 [33] arXiv:2105.03810 (replaced) [pdf, other]

Title: The Local Approach to Causal Inference under Network Interference
Subjects: Econometrics (econ.EM); Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
 [34] arXiv:2105.05233 (replaced) [pdf, other]

Title: Diffusion Models Beat GANs on Image Synthesis
Comments: Updated proof in Appendix G and added more results in Table 5
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)