Machine Learning

Authors and titles for stat.ML in May 2021

[ total of 416 entries: 1-416 ]
[1]  arXiv:2105.00026 [pdf, other]
Title: Data Augmentation in High Dimensional Low Sample Size Setting Using a Geometry-Based Variational Autoencoder
Comments: accepted to IEEE transactions on pattern analysis and machine intelligence (TPAMI)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[2]  arXiv:2105.00211 [pdf, other]
Title: Autoregressive Hidden Markov Models with partial knowledge on latent space applied to aero-engines prognostics
Journal-ref: European Conference of the PHM Society 2016, selected for extended version in IJPHM
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Applications (stat.AP); Methodology (stat.ME)
[3]  arXiv:2105.00233 [pdf, ps, other]
Title: Matrix completion based on Gaussian parameterized belief propagation
Comments: 21 pages, 7 figures
Subjects: Machine Learning (stat.ML); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG)
[4]  arXiv:2105.00262 [pdf, other]
Title: One-pass Stochastic Gradient Descent in Overparametrized Two-layer Neural Networks
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[5]  arXiv:2105.00351 [pdf, other]
Title: Lattice Paths for Persistent Diagrams
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[6]  arXiv:2105.00455 [pdf, other]
Title: Synthesized Difference in Differences
Comments: Accepted to ACM BCB 2021
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[7]  arXiv:2105.00581 [pdf, other]
Title: Robust Sample Weighting to Facilitate Individualized Treatment Rule Learning for a Target Population
Comments: Biometrika, in press
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[8]  arXiv:2105.01029 [pdf, other]
Title: Initialization and Regularization of Factorized Neural Layers
Comments: ICLR 2021 camera-ready, amended due to error pointed out in arXiv:2209.13569v1 (amendment shown in blue)
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[9]  arXiv:2105.01136 [pdf, other]
Title: Learning Good State and Action Representations via Tensor Decomposition
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[10]  arXiv:2105.01441 [pdf, other]
Title: Distributive Justice and Fairness Metrics in Automated Decision-making: How Much Overlap Is There?
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[11]  arXiv:2105.01463 [pdf, other]
Title: On the Sample Complexity of Rank Regression from Pairwise Comparisons
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[12]  arXiv:2105.01536 [pdf, other]
Title: Abstraction-Guided Truncations for Stationary Distributions of Markov Population Models
Comments: arXiv admin note: text overlap with arXiv:2010.10096
Subjects: Machine Learning (stat.ML); Systems and Control (eess.SY); Quantitative Methods (q-bio.QM)
[13]  arXiv:2105.01637 [pdf, other]
Title: Implicit differentiation for fast hyperparameter selection in non-smooth convex learning
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[14]  arXiv:2105.01650 [pdf, other]
Title: Stochastic gradient descent with noise of machine learning type. Part I: Discrete time analysis
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[15]  arXiv:2105.01783 [pdf, other]
Title: Nonparametric Trace Regression in High Dimensions via Sign Series Representation
Comments: 66 pages, 10 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
[16]  arXiv:2105.02337 [pdf, ps, other]
Title: Non-asymptotic analysis and inference for an outlyingness induced winsorized mean
Authors: Yijun Zuo
Comments: 16 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[17]  arXiv:2105.02344 [pdf, other]
Title: Policy Learning with Adaptively Collected Data
Comments: Improved the upper bound; added simulations
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Econometrics (econ.EM)
[18]  arXiv:2105.02487 [pdf, other]
Title: High-dimensional Functional Graphical Model Structure Learning via Neighborhood Selection Approach
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[19]  arXiv:2105.02522 [pdf, other]
Title: Neural graphical modelling in continuous-time: consistency guarantees and algorithms
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Dynamical Systems (math.DS)
[20]  arXiv:2105.02569 [pdf, ps, other]
Title: Machine Collaboration
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Econometrics (econ.EM)
[21]  arXiv:2105.02816 [pdf, ps, other]
Title: Semidefinite Programming for Community Detection with Side Information
Comments: 15 pages
Subjects: Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[22]  arXiv:2105.02831 [pdf, other]
Title: The layer-wise L1 Loss Landscape of Neural Nets is more complex around local minima
Authors: Peter Hinz
Comments: 4 pages, 5 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[23]  arXiv:2105.03153 [pdf, other]
Title: Pairwise Fairness for Ordinal Regression
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[24]  arXiv:2105.03173 [pdf, other]
Title: Use of High Dimensional Modeling for automatic variables selection: the best path algorithm
Authors: Luigi Riso
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[25]  arXiv:2105.03248 [pdf, ps, other]
Title: Parameter Priors for Directed Acyclic Graphical Models and the Characterization of Several Probability Distributions
Comments: This version has improved pointers to the literature. arXiv admin note: substantial text overlap with arXiv:1301.6697
Journal-ref: The Annals of Statistics, 30: 1412-1440, 2002
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[26]  arXiv:2105.03308 [pdf, other]
Title: Geometric convergence of elliptical slice sampling
Comments: 13 pages, 2 figures, Accepted in the Proceedings of the 38th International Conference on Machine Learning
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[27]  arXiv:2105.03361 [pdf, other]
Title: What Kinds of Functions do Deep Neural Networks Learn? Insights from Variational Spline Theory
Journal-ref: SIAM Journal on Mathematics of Data Science, vol. 4, no. 2, pp. 464-489, 2022
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[28]  arXiv:2105.03425 [pdf, other]
Title: Kernel Two-Sample Tests for Manifold Data
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[29]  arXiv:2105.03584 [pdf, other]
Title: Adaptive Latent Space Tuning for Non-Stationary Distributions
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Accelerator Physics (physics.acc-ph)
[30]  arXiv:2105.03863 [pdf, other]
Title: Towards Theoretical Understandings of Robust Markov Decision Processes: Sample Complexity and Asymptotics
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[31]  arXiv:2105.04001 [pdf, other]
Title: Bayesian Kernelised Test of (In)dependence with Mixed-type Variables
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[32]  arXiv:2105.04046 [pdf, other]
Title: A likelihood approach to nonparametric estimation of a singular distribution using deep generative models
Comments: 42 pages, 13 figures, 1 table
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[33]  arXiv:2105.04087 [pdf, other]
Title: Latency Analysis of Consortium Blockchained Federated Learning
Subjects: Machine Learning (stat.ML); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[34]  arXiv:2105.04211 [pdf, other]
Title: SigGPDE: Scaling Sparse Gaussian Processes on Sequential Data
Comments: Published at ICML 2021
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[35]  arXiv:2105.04242 [pdf, other]
Title: T-EMDE: Sketching-based global similarity for cross-modal retrieval
Comments: 10 pages, 5 figures, 4 tables, 1 code snippet
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[36]  arXiv:2105.04290 [pdf, other]
Title: Meta-Cal: Well-controlled Post-hoc Calibration by Ranking
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[37]  arXiv:2105.04379 [pdf, other]
Title: Gradient-based Bayesian Experimental Design for Implicit Models using Mutual Information Lower Bounds
Comments: Under review
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO); Methodology (stat.ME)
[38]  arXiv:2105.04404 [pdf, other]
Title: Topological Uncertainty: Monitoring trained neural networks through persistence of activation graphs
Journal-ref: 2021 International Joint Conference on Artificial Intelligence, Aug 2021, Montréal, Canada
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[39]  arXiv:2105.04448 [pdf, other]
Title: Scaffolding Simulations with Deep Learning for High-dimensional Deconvolution
Comments: 6 pages, 1 figure, 1 table
Journal-ref: ICLR simDL workshop 2021 (https://simdl.github.io/files/12.pdf)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); High Energy Physics - Experiment (hep-ex); High Energy Physics - Phenomenology (hep-ph); Data Analysis, Statistics and Probability (physics.data-an)
[40]  arXiv:2105.04504 [pdf, other]
Title: Deep Neural Networks as Point Estimates for Deep Gaussian Processes
Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[41]  arXiv:2105.04646 [pdf, other]
Title: Deeply-Debiased Off-Policy Interval Estimation
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[42]  arXiv:2105.04816 [pdf, other]
Title: Spectral risk-based learning using unbounded losses
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[43]  arXiv:2105.04854 [pdf, other]
Title: Improving Molecular Graph Neural Network Explainability with Orthonormalization and Induced Sparsity
Comments: Fixed typo and reference
Journal-ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:4203-4213, 2021
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[44]  arXiv:2105.04920 [pdf, other]
Title: More Powerful Conditional Selective Inference for Generalized Lasso by Parametric Programming
Journal-ref: Journal of Machine Learning Research 23.300 (2022): 1-37
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[45]  arXiv:2105.04979 [pdf, other]
Title: Surrogate assisted active subspace and active subspace assisted surrogate -- A new paradigm for high dimensional structural reliability analysis
Comments: 19 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[46]  arXiv:2105.05031 [pdf, other]
Title: Gradient flow encoding with distance optimization adaptive step size
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Applications (stat.AP); Computation (stat.CO)
[47]  arXiv:2105.05115 [pdf, ps, other]
Title: Analysis of One-Hidden-Layer Neural Networks via the Resolvent Method
Comments: Final version, NeurIPS 2021. 22 pages, 4 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR)
[48]  arXiv:2105.05146 [pdf, other]
Title: A Twin Neural Model for Uplift
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[49]  arXiv:2105.05489 [pdf, other]
Title: Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Computation (stat.CO)
[50]  arXiv:2105.05648 [pdf, other]
Title: Look-Ahead Screening Rules for the Lasso
Authors: Johan Larsson
Comments: EYSM 2021 short paper; 6 pages, 2 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[51]  arXiv:2105.05842 [pdf, other]
Title: Kernel Thinning
Comments: Accepted for presentation as an extended abstract at the Conference on Learning Theory (COLT) 2021
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Computation (stat.CO); Methodology (stat.ME)
[52]  arXiv:2105.05953 [pdf, other]
Title: Efficient Algorithms for Estimating the Parameters of Mixed Linear Regression Models
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC); Methodology (stat.ME)
[53]  arXiv:2105.06031 [pdf, other]
Title: Joint Community Detection and Rotational Synchronization via Semidefinite Programming
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[54]  arXiv:2105.06558 [pdf, ps, other]
Title: Bias, Fairness, and Accountability with AI and ML Algorithms
Comments: 18 pages, 5 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[55]  arXiv:2105.06868 [pdf, ps, other]
Title: Priors in Bayesian Deep Learning: A Review
Authors: Vincent Fortuin
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[56]  arXiv:2105.06903 [pdf, other]
Title: Posterior Regularization on Bayesian Hierarchical Mixture Clustering
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[57]  arXiv:2105.06907 [pdf, ps, other]
Title: Adapting deep generative approaches for getting synthetic data with realistic marginal distributions
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[58]  arXiv:2105.06964 [pdf, other]
Title: BNNpriors: A library for Bayesian neural network inference with different prior distributions
Comments: Accepted for publication at Software Impacts
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[59]  arXiv:2105.07283 [pdf, other]
Title: Calibrating sufficiently
Authors: Dirk Tasche
Comments: 27 pages, 2 figures, appendix
Journal-ref: Statistics 55(6), 1356-1386, 2021
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[60]  arXiv:2105.07385 [pdf, other]
Title: Statistical Mechanical Analysis of Catastrophic Forgetting in Continual Learning with Teacher and Student Networks
Comments: 22 pages, 4 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[61]  arXiv:2105.07446 [pdf, ps, other]
Title: Sobolev Norm Learning Rates for Conditional Mean Embeddings
Comments: Appears in AISTATS 2022
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[62]  arXiv:2105.07536 [pdf, other]
Title: Theoretical Foundations of t-SNE for Visualizing High-Dimensional Clustered Data
Authors: T. Tony Cai, Rong Ma
Comments: Accepted by Journal of Machine Learning Research
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[63]  arXiv:2105.07610 [pdf, other]
Title: Cross-Cluster Weighted Forests
Comments: 19 pages, 6 figures, 1 table
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[64]  arXiv:2105.07634 [pdf, other]
Title: Improving Graph Neural Networks with Simple Architecture Design
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[65]  arXiv:2105.07671 [src]
Title: Classifying variety of customer's online engagement for churn prediction with mixed-penalty logistic regression
Comments: This version is not sufficiently exhaustive; a wrong version of validation results has been released (using a wrong part of a dataset for validation)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Econometrics (econ.EM)
[66]  arXiv:2105.08348 [pdf, other]
Title: On Convex Clustering Solutions
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[67]  arXiv:2105.08532 [pdf, other]
Title: Robust Learning in Heterogeneous Contexts
Comments: Paper under SPL review
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[68]  arXiv:2105.08620 [pdf, other]
Title: Adversarial Examples Detection with Bayesian Neural Network
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[69]  arXiv:2105.08678 [pdf, other]
Title: Nonparametric Modeling of Higher-Order Interactions via Hypergraphons
Comments: To appear in Journal of Machine Learning Research
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Statistics Theory (math.ST)
[70]  arXiv:2105.08717 [pdf, other]
Title: Optimal radial basis for density-based atomic representations
Journal-ref: The Journal of Chemical Physics 155(10), 104106 (2021)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Chemical Physics (physics.chem-ph)
[71]  arXiv:2105.08866 [pdf, other]
Title: Localization, Convexity, and Star Aggregation
Authors: Suhas Vijaykumar
Comments: NeurIPS 2021
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[72]  arXiv:2105.08875 [pdf, ps, other]
Title: Statistical Optimality and Computational Efficiency of Nyström Kernel PCA
Comments: 26 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[73]  arXiv:2105.09107 [pdf, ps, other]
Title: Mill.jl and JsonGrinder.jl: automated differentiable feature extraction for learning from raw JSON data
Comments: 5 pages, 2 figures, 1 table, submitted to section on open-source software of Journal of Machine Learning Research
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Mathematical Software (cs.MS)
[74]  arXiv:2105.09261 [pdf, other]
Title: From parcel to continental scale -- A first European crop type map based on Sentinel-1 and LUCAS Copernicus in-situ observations
Comments: 19 pages, 11 Figures, 5 Tables (without appendix)
Journal-ref: Remote Sensing of Environment, Volume 266, 1 December 2021, 112708
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Applications (stat.AP)
[75]  arXiv:2105.09536 [pdf, ps, other]
Title: On the $α$-lazy version of Markov chains in estimation and testing problems
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[76]  arXiv:2105.09618 [pdf, other]
Title: Nonlinear Hawkes Process with Gaussian Process Self Effects
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP)
[77]  arXiv:2105.09670 [pdf, other]
Title: Ensemble machine learning approach for screening of coronary heart disease based on echocardiography and risk factors
Comments: 30 pages, 5 figures, 5 tables
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[78]  arXiv:2105.09788 [pdf, ps, other]
Title: Distributed Adaptive Nearest Neighbor Classifier: Algorithm and Theory
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[79]  arXiv:2105.09872 [pdf, other]
Title: EiGLasso for Scalable Sparse Kronecker-Sum Inverse Covariance Estimation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[80]  arXiv:2105.09917 [pdf, ps, other]
Title: Neural networks with superexpressive activations and integer weights
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[81]  arXiv:2105.09994 [pdf, other]
Title: Kernel Stein Discrepancy Descent
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[82]  arXiv:2105.10315 [pdf, ps, other]
Title: Online Statistical Inference for Parameters Estimation with Linear-Equality Constraints
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[83]  arXiv:2105.10347 [pdf, ps, other]
Title: Quantifying the mini-batching error in Bayesian inference for Adaptive Langevin dynamics
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[84]  arXiv:2105.10360 [pdf, other]
Title: Multi-source Learning via Completion of Block-wise Overlapping Noisy Matrices
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Applications (stat.AP); Methodology (stat.ME)
[85]  arXiv:2105.10590 [pdf, other]
Title: Parallelizing Contextual Bandits
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Biomolecules (q-bio.BM); Quantitative Methods (q-bio.QM)
[86]  arXiv:2105.10832 [pdf, other]
Title: Spectral Pruning for Recurrent Neural Networks
Comments: 26 pages, 2 figures
Journal-ref: AISTATS 2022
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[87]  arXiv:2105.10867 [pdf, other]
Title: EXoN: EXplainable encoder Network
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[88]  arXiv:2105.10915 [pdf, other]
Title: GOALS: Gradient-Only Approximations for Line Searches Towards Robust and Consistent Training of Deep Neural Networks
Comments: 26 pages, 8 figures and 5 tables
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[89]  arXiv:2105.11135 [pdf, other]
Title: Robust learning with anytime-guaranteed feedback
Journal-ref: Proceedings of the AAAI Conference on Artificial Intelligence, 36(6):6918-6925, 2022
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[90]  arXiv:2105.11425 [pdf, other]
Title: Uncertainty quantification for distributed regression
Authors: Valeriy Avanesov
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[91]  arXiv:2105.11522 [pdf, other]
Title: Unbiased Estimation of the Gradient of the Log-Likelihood for a Class of Continuous-Time State-Space Models
Comments: 24 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[92]  arXiv:2105.11535 [pdf, other]
Title: Scalable Cross Validation Losses for Gaussian Process Models
Comments: 20 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[93]  arXiv:2105.11724 [pdf, other]
Title: SHAFF: Fast and consistent SHApley eFfect estimates via random Forests
Authors: Clément Bénard (LPSM (UMR_8001)), Gérard Biau (LPSM (UMR_8001)), Sébastien da Veiga, Erwan Scornet (CMAP)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[94]  arXiv:2105.11802 [pdf, other]
Title: Bias-Robust Bayesian Optimization via Dueling Bandits
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[95]  arXiv:2105.11818 [pdf, other]
Title: SGD with Coordinate Sampling: Theory and Practice
Comments: Journal of Machine Learning Research 2022
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[96]  arXiv:2105.12033 [pdf, other]
Title: TNet: A Model-Constrained Tikhonov Network Approach for Inverse Problems
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[97]  arXiv:2105.12257 [pdf, other]
Title: Rank-one matrix estimation: analytic time evolution of gradient descent dynamics
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR)
[98]  arXiv:2105.12271 [pdf, other]
Title: SG-PALM: a Fast Physically Interpretable Tensor Graphical Model
Authors: Yu Wang, Alfred Hero
Comments: Accepted in ICML 2021
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Applications (stat.AP); Computation (stat.CO)
[99]  arXiv:2105.12290 [pdf, other]
Title: Block Dense Weighted Networks with Augmented Degree Correction
Comments: 43 pages, 19 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[100]  arXiv:2105.12478 [pdf, ps, other]
Title: The "given data" paradigm undermines both cultures
Authors: Tyler McCormick
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[101]  arXiv:2105.12866 [pdf, ps, other]
Title: Augmented KRnet for density estimation and approximation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[102]  arXiv:2105.12894 [pdf, other]
Title: MAGI-X: Manifold-Constrained Gaussian Process Inference for Unknown System Dynamics
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO); Methodology (stat.ME)
[103]  arXiv:2105.12941 [pdf, other]
Title: CrystalCandle: A User-Facing Model Explainer for Narrative Explanations
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[104]  arXiv:2105.13011 [pdf, other]
Title: Neural Network Training Using $\ell_1$-Regularization and Bi-fidelity Data
Comments: 28 pages, 14 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[105]  arXiv:2105.13099 [pdf, other]
Title: On the Universality of Graph Neural Networks on Large Random Graphs
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[106]  arXiv:2105.13420 [pdf, other]
Title: Model Selection for Production System via Automated Online Experiments
Comments: NeurIPS 2020
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[107]  arXiv:2105.13440 [pdf, other]
Title: Non-negative matrix factorization algorithms greatly improve topic model fits
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[108]  arXiv:2105.13727 [pdf, other]
Title: Slow Momentum with Fast Reversion: A Trading Strategy Using Deep Learning and Changepoint Detection
Comments: minor changes made to methodology to match implementation
Journal-ref: The Journal of Financial Data Science Winter 2022, jfds.2021.1.081
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Trading and Market Microstructure (q-fin.TR)
[109]  arXiv:2105.13831 [pdf, other]
Title: Implicit Regularization in Matrix Sensing via Mirror Descent
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[110]  arXiv:2105.13850 [pdf, other]
Title: pRSL: Interpretable Multi-label Stacking by Learning Probabilistic Rules
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[111]  arXiv:2105.13922 [pdf, other]
Title: Discretization Drift in Two-Player Games
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[112]  arXiv:2105.14035 [pdf, other]
Title: DeepMoM: Robust Deep Learning With Median-of-Means
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[113]  arXiv:2105.14267 [pdf, other]
Title: Information Directed Sampling for Sparse Linear Bandits
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[114]  arXiv:2105.14301 [pdf, other]
Title: A Theory of Neural Tangent Kernel Alignment and Its Influence on Training
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[115]  arXiv:2105.14328 [pdf, other]
Title: Transfer Learning under High-dimensional Generalized Linear Models
Authors: Ye Tian, Yang Feng
Comments: 94 pages, 11 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[116]  arXiv:2105.14368 [pdf, other]
Title: Fit without fear: remarkable mathematical phenomena of deep learning through the prism of interpolation
Authors: Mikhail Belkin
Comments: A version of this paper will appear in Acta Numerica
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[117]  arXiv:2105.14524 [pdf, other]
Title: Parameter Estimation for the SEIR Model Using Recurrent Nets
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[118]  arXiv:2105.14574 [pdf, other]
Title: Scalable Marked Point Processes for Exchangeable and Non-Exchangeable Event Sequences
Comments: accepted at AISTATS-2022
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[119]  arXiv:2105.14586 [pdf, ps, other]
Title: Kolmogorov-Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits
Comments: 9 pages, 6 figures, 2 tables, 2 algorithms. Accepted at IEEE Transactions on Artificial Intelligence
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[120]  arXiv:2105.14594 [pdf, other]
Title: Sparse Uncertainty Representation in Deep Learning with Inducing Weights
Comments: NeurIPS 2021 camera ready
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[121]  arXiv:2105.14742 [pdf, other]
Title: Active Learning of Continuous-time Bayesian Networks through Interventions
Comments: Accepted at ICML2021
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[122]  arXiv:2105.14866 [pdf, other]
Title: Variational Autoencoders: A Harmonic Perspective
Comments: 18 pages including Appendix, 7 Figures
Journal-ref: AISTATS 2022
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Signal Processing (eess.SP)
[123]  arXiv:2105.14989 [pdf, other]
Title: Representation Learning Beyond Linear Prediction Functions
Comments: 1 Figure
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[124]  arXiv:2105.15004 [pdf, other]
Title: Generalization Error Rates in Kernel Regression: The Crossover from the Noiseless to Noisy Regime
Comments: 22 pages, 10 figures, 2 tables
Journal-ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) vol 34 p10131--10143. J. Stat. Mech. (2022) 114004
Subjects: Machine Learning (stat.ML); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG)
[125]  arXiv:2105.15197 [pdf, ps, other]
Title: A Simple and General Debiased Machine Learning Theorem with Finite Sample Guarantees
Comments: Biometrika 2022
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Econometrics (econ.EM); Statistics Theory (math.ST)
[126]  arXiv:2105.00393 (cross-list from math.ST) [pdf, other]
Title: Directional FDR Control for Sub-Gaussian Sparse GLMs
Comments: 37 pages
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
[127]  arXiv:2105.00488 (cross-list from stat.CO) [pdf, other]
Title: Bayesian structure learning and sampling of Bayesian networks with the R package BiDAG
Subjects: Computation (stat.CO); Machine Learning (stat.ML)
[128]  arXiv:2105.00773 (cross-list from stat.AP) [pdf, other]
Title: Approximate Bayesian Computation for an Explicit-Duration Hidden Markov Model of COVID-19 Hospital Trajectories
Comments: To appear in the Proceedings of the Machine Learning for Healthcare (MLHC) conference, 2021. 20 pages, 7 figures and 1 table. 26 additional pages of supplementary material
Subjects: Applications (stat.AP); Machine Learning (cs.LG); Machine Learning (stat.ML)
[129]  arXiv:2105.00987 (cross-list from stat.ME) [pdf, other]
Title: Spectral clustering under degree heterogeneity: a case for the random walk Laplacian
Comments: 22 pages, 10 figures
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[130]  arXiv:2105.01187 (cross-list from stat.ME) [pdf, ps, other]
Title: Proximal Learning for Individualized Treatment Regimes Under Unmeasured Confounding
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Machine Learning (stat.ML)
[131]  arXiv:2105.01264 (cross-list from math.ST) [pdf, other]
Title: Surrogate Assisted Semi-supervised Inference for High Dimensional Risk Prediction
Subjects: Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
[132]  arXiv:2105.01874 (cross-list from math.ST) [pdf, other]
Title: On the Optimality of Nuclear-norm-based Matrix Completion for Problems with Smooth Non-linear Structure
Comments: 47 pages, 1 figure
Subjects: Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
[133]  arXiv:2105.02083 (cross-list from math.ST) [pdf, other]
Title: AdaBoost and robust one-bit compressed sensing
Comments: 40 pages, 4 figures, code available at this https URL, extended results to features that satisfy weak-moment and anti-concentration assumption
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (stat.ML)
[134]  arXiv:2105.02180 (cross-list from math.ST) [pdf, other]
Title: A unifying tutorial on Approximate Message Passing
Comments: 99 pages, 2 figures
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (stat.ML)
[135]  arXiv:2105.02675 (cross-list from stat.ME) [pdf, other]
Title: Granger Causality: A Review and Recent Advances
Comments: 40 pages, 12 figures
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Machine Learning (stat.ML)
[136]  arXiv:2105.03067 (cross-list from stat.ME) [pdf, other]
Title: The $s$-value: evaluating stability with respect to distributional shifts
Comments: 43 pages, 9 figures
Subjects: Methodology (stat.ME); Statistics Theory (math.ST); Machine Learning (stat.ML)
[137]  arXiv:2105.03396 (cross-list from stat.ME) [pdf, other]
Title: Double-matched matrix decomposition for multi-view data
Comments: Accepted to Journal of Computational and Graphical Statistics
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[138]  arXiv:2105.04399 (cross-list from stat.ME) [pdf, other]
Title: On projection methods for functional time series forecasting
Authors: Antonio Elías (1), Raúl Jiménez (2), Hanlin Shang (3) ((1) OASYS group, Department of Applied Mathematics, Universidad de Málaga, Málaga, Spain, (2) Department of Statistics, Universidad Carlos III de Madrid, Madrid, Spain, (3) Department of Actuarial Studies and Business Analytics, Macquarie University, Sydney, Australia)
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[139]  arXiv:2105.04656 (cross-list from stat.ME) [pdf, other]
Title: Distribution-free calibration guarantees for histogram binning without sample splitting
Comments: Appears at ICML 2021
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Machine Learning (stat.ML)
[140]  arXiv:2105.04852 (cross-list from math.ST) [pdf, other]
Title: Estimation and Quantization of Expected Persistence Diagrams
Authors: Vincent Divol (DATASHAPE, LMO), Théo Lacombe (DATASHAPE)
Journal-ref: International Conference on Machine Learning, Jul 2021, Virtual Conference, France
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
[141]  arXiv:2105.05373 (cross-list from math.ST) [pdf, other]
Title: Estimation of population size based on capture recapture designs and evaluation of the estimation reliability
Subjects: Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
[142]  arXiv:2105.06347 (cross-list from math.ST) [pdf, other]
Title: Identity testing of reversible Markov chains
Comments: To appear in AISTATS'22
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Machine Learning (stat.ML)
[143]  arXiv:2105.06559 (cross-list from stat.AP) [pdf, other]
Title: Extending Models Via Gradient Boosting: An Application to Mendelian Models
Comments: 46 pages, 4 figures
Subjects: Applications (stat.AP); Methodology (stat.ME); Machine Learning (stat.ML)
[144]  arXiv:2105.06600 (cross-list from stat.ME) [pdf, other]
Title: Learning Gaussian Graphical Models with Latent Confounders
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[145]  arXiv:2105.08013 (cross-list from stat.AP) [pdf, other]
Title: What makes you unique?
Subjects: Applications (stat.AP); Machine Learning (cs.LG); Machine Learning (stat.ML)
[146]  arXiv:2105.08304 (cross-list from math.ST) [pdf, other]
Title: Parametrization invariant interpretation of priors and posteriors
Authors: Jesus Cerquides
Subjects: Statistics Theory (math.ST); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[147]  arXiv:2105.08747 (cross-list from stat.ME) [pdf, other]
Title: Conformal Prediction using Conditional Histograms
Comments: 12 pages, 4 figures. Supplement: 15 pages, 3 figures, 1 table
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[148]  arXiv:2105.09254 (cross-list from math.ST) [pdf, other]
Title: Multiply Robust Causal Mediation Analysis with Continuous Treatments
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Econometrics (econ.EM); Machine Learning (stat.ML)
[149]  arXiv:2105.09695 (cross-list from stat.ME) [pdf, other]
Title: Hierarchical Non-Stationary Temporal Gaussian Processes With $L^1$-Regularization
Comments: 20 pages. Submitted to Statistics and Computing
Subjects: Methodology (stat.ME); Computation (stat.CO); Machine Learning (stat.ML)
[150]  arXiv:2105.10017 (cross-list from stat.ME) [pdf, other]
Title: Segmentation of high dimensional means over multi-dimensional change points and connections to regression trees
Authors: Abhishek Kaul
Comments: All implementations carried out in R (code available upon request)
Subjects: Methodology (stat.ME); Statistics Theory (math.ST); Machine Learning (stat.ML)
[151]  arXiv:2105.10392 (cross-list from stat.CO) [pdf, ps, other]
Title: Computational Efficient Approximations of the Concordance Probability in a Big Data Setting
Comments: 40 pages, 3 figures
Subjects: Computation (stat.CO); Machine Learning (stat.ML)
[152]  arXiv:2105.10470 (cross-list from stat.ME) [pdf, other]
Title: Geometric variational inference
Comments: 42 pages, 18 figures, accepted by Entropy
Subjects: Methodology (stat.ME); Instrumentation and Methods for Astrophysics (astro-ph.IM); Machine Learning (stat.ML)
[153]  arXiv:2105.10838 (cross-list from stat.ME) [pdf, other]
Title: Hypothesis Testing for Equality of Latent Positions in Random Graphs
Authors: Xinjie Du, Minh Tang
Comments: 51 pages, 5 figures
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[154]  arXiv:2105.11357 (cross-list from stat.ME) [pdf, other]
Title: Entropy-based adaptive design for contour finding and estimating reliability
Comments: 28 pages, 11 figures
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[155]  arXiv:2105.11886 (cross-list from stat.AP) [pdf, other]
Title: Conformal Anomaly Detection on Spatio-Temporal Observations with Missing Data
Authors: Chen Xu, Yao Xie
Comments: Submitted to ICML 2021 Workshop: Distribution-free Uncertainty Quantification
Subjects: Applications (stat.AP); Methodology (stat.ME); Machine Learning (stat.ML)
[156]  arXiv:2105.12081 (cross-list from stat.ME) [pdf, ps, other]
Title: Group selection and shrinkage: Structured sparsity for semiparametric additive models
Comments: To appear in the Journal of Computational and Graphical Statistics
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
[157]  arXiv:2105.12286 (cross-list from stat.ME) [pdf, other]
Title: An algorithm-based multiple detection influence measure for high dimensional regression using expectile
Comments: 38 pages, 11 figures
Subjects: Methodology (stat.ME); Applications (stat.AP); Computation (stat.CO); Machine Learning (stat.ML)
[158]  arXiv:2105.12778 (cross-list from math.ST) [pdf, ps, other]
Title: Statistical Depth Meets Machine Learning: Kernel Mean Embeddings and Depth in Functional Data Analysis
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
[159]  arXiv:2105.12978 (cross-list from math.ST) [pdf, other]
Title: A Non-asymptotic Approach to Best-Arm Identification for Gaussian Bandits
Authors: Antoine Barrier (UMPA-ENSL, LMO), Aurélien Garivier (UMPA-ENSL), Tomáš Kocák
Journal-ref: 25th International Conference on Artificial Intelligence and Statistics (AISTATS) 2022, Mar 2022, Valencia, Spain
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
[160]  arXiv:2105.13059 (cross-list from stat.CO) [pdf, other]
Title: Efficient and Generalizable Tuning Strategies for Stochastic Gradient MCMC
Subjects: Computation (stat.CO); Methodology (stat.ME); Machine Learning (stat.ML)
[161]  arXiv:2105.13302 (cross-list from math.ST) [pdf, other]
Title: Characterizing the SLOPE Trade-off: A Variational Perspective and the Donoho-Tanner Limit
Journal-ref: Annals of Statistics 2022
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP); Machine Learning (stat.ML)
[162]  arXiv:2105.13483 (cross-list from stat.AP) [pdf, other]
Title: Causal, Bayesian, & Non-parametric Modeling of the SARS-CoV-2 Viral Load Distribution vs. Patient's Age
Journal-ref: PLoS ONE 17(10): e0275011 (2022)
Subjects: Applications (stat.AP); Methodology (stat.ME); Machine Learning (stat.ML)
[163]  arXiv:2105.13504 (cross-list from math.ST) [pdf, other]
Title: Lattice partition recovery with dyadic CART
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Machine Learning (stat.ML)
[164]  arXiv:2105.14957 (cross-list from stat.ME) [pdf, ps, other]
Title: Conformal Uncertainty Sets for Robust Optimization
Comments: 19 pages, 7 figures, submitted to COPA 2021, accepted
Subjects: Methodology (stat.ME); Optimization and Control (math.OC); Machine Learning (stat.ML)
[165]  arXiv:2105.15081 (cross-list from math.ST) [pdf, ps, other]
Title: Optimal Spectral Recovery of a Planted Vector in a Subspace
Comments: 54 pages
Subjects: Statistics Theory (math.ST); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[166]  arXiv:2105.00191 (cross-list from cs.LG) [pdf, other]
Title: Stochastic Mutual Information Gradient Estimation for Dimensionality Reduction Networks
Comments: Accepted for publication at Elsevier - Information Sciences
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[167]  arXiv:2105.00244 (cross-list from math.OC) [pdf, ps, other]
Title: l1-Norm Minimization with Regula Falsi Type Root Finding Methods
Comments: l1-norm minimization, nonconvex models, Regula-Falsi, root-finding
Subjects: Optimization and Control (math.OC); Computation (stat.CO); Machine Learning (stat.ML)
[168]  arXiv:2105.00277 (cross-list from cs.LG) [pdf, other]
Title: Multi-view Clustering via Deep Matrix Factorization and Partition Alignment
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[169]  arXiv:2105.00303 (cross-list from cs.LG) [pdf, other]
Title: RATT: Leveraging Unlabeled Data to Guarantee Generalization
Comments: ICML 2021 (Long Talk)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[170]  arXiv:2105.00400 (cross-list from physics.comp-ph) [pdf, other]
Title: Model discovery in the sparse sampling regime
Subjects: Computational Physics (physics.comp-ph); Machine Learning (stat.ML)
[171]  arXiv:2105.00470 (cross-list from cs.LG) [pdf, other]
Title: On Feature Decorrelation in Self-Supervised Learning
Comments: ICCV 2021 Oral. The first two authors contributed equally
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[172]  arXiv:2105.00507 (cross-list from cs.LG) [pdf, other]
Title: Universal scaling laws in the gradient descent training of neural networks
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Optimization and Control (math.OC); Machine Learning (stat.ML)
[173]  arXiv:2105.00545 (cross-list from econ.TH) [pdf, other]
Title: High Dimensional Decision Making, Upper and Lower Bounds
Journal-ref: Economics Letters, 2021, Elsevier
Subjects: Theoretical Economics (econ.TH); Machine Learning (stat.ML)
[174]  arXiv:2105.00619 (cross-list from cs.LG) [pdf, other]
Title: OpTorch: Optimized deep learning architectures for resource limited environments
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Machine Learning (stat.ML)
[175]  arXiv:2105.00728 (cross-list from cs.CV) [pdf, ps, other]
Title: Spectral Machine Learning for Pancreatic Mass Imaging Classification
Comments: 17 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[176]  arXiv:2105.00887 (cross-list from math.PR) [pdf, other]
Title: Mixing Time Guarantees for Unadjusted Hamiltonian Monte Carlo
Comments: 43 pages
Journal-ref: Bernoulli, Volume 29, Issue 1, pages 75-104 (February 2023)
Subjects: Probability (math.PR); Numerical Analysis (math.NA); Computation (stat.CO); Machine Learning (stat.ML)
[177]  arXiv:2105.00894 (cross-list from cs.LG) [pdf, other]
Title: How Bayesian Should Bayesian Optimisation Be?
Comments: To appear in the Proceedings of Genetic and Evolutionary Computation Conference Companion (GECCO 2021), ACM. 10 pages (main paper) + 26 pages (supplement)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[178]  arXiv:2105.00997 (cross-list from cs.SI) [pdf, other]
Title: Recovering Barabási-Albert Parameters of Graphs through Disentanglement
Comments: Accepted at the 9th International Conference on Learning Representations (ICLR 2021), Workshop on Geometrical and Topological Representation Learning
Subjects: Social and Information Networks (cs.SI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[179]  arXiv:2105.01015 (cross-list from cs.LG) [pdf, other]
Title: Bag of Baselines for Multi-objective Joint Neural Architecture Search and Hyperparameter Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[180]  arXiv:2105.01108 (cross-list from cs.IT) [src]
Title: Consistent Density Estimation Under Discrete Mixture Models
Comments: Reason for withdrawal: There is an issue with the proof of Theorem 1
Subjects: Information Theory (cs.IT); Statistics Theory (math.ST); Machine Learning (stat.ML)
[181]  arXiv:2105.01228 (cross-list from math.NA) [pdf, ps, other]
Title: A Priori Generalization Error Analysis of Two-Layer Neural Networks for Solving High Dimensional Schrödinger Eigenvalue Problems
Subjects: Numerical Analysis (math.NA); Mathematical Physics (math-ph); Analysis of PDEs (math.AP); Probability (math.PR); Machine Learning (stat.ML)
[182]  arXiv:2105.01346 (cross-list from cs.AI) [pdf, other]
Title: Implicit Regularization in Deep Tensor Factorization
Authors: Paolo Milanesi (QARMA), Hachem Kadri (LIS, QARMA, AMU SCI), Stéphane Ayache (QARMA), Thierry Artières (QARMA)
Journal-ref: International Joint Conference on Neural Networks (IJCNN), Jul 2021, Online, China
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[183]  arXiv:2105.01420 (cross-list from cs.LG) [pdf, ps, other]
Title: Training Quantized Neural Networks to Global Optimality via Semidefinite Programming
Comments: v2: Minor edits in the text. The results are unchanged
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[184]  arXiv:2105.01426 (cross-list from econ.GN) [pdf, other]
Title: Business analytics meets artificial intelligence: Assessing the demand effects of discounts on Swiss train tickets
Subjects: General Economics (econ.GN); Machine Learning (stat.ML)
[185]  arXiv:2105.01550 (cross-list from cs.LG) [pdf, ps, other]
Title: A Finer Calibration Analysis for Adversarial Robustness
Comments: arXiv admin note: text overlap with arXiv:2104.09658
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[186]  arXiv:2105.01593 (cross-list from cs.LG) [pdf, ps, other]
Title: Regret Bounds for Stochastic Shortest Path Problems with Linear Function Approximation
Comments: This version removes most assumptions of the prior one
Journal-ref: International Conference on Machine Learning (ICML) 2022
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[187]  arXiv:2105.01636 (cross-list from cs.LG) [pdf, other]
Title: Learning 3D Granular Flow Simulations
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[188]  arXiv:2105.01706 (cross-list from cs.LG) [pdf, other]
Title: Sampling From the Wasserstein Barycenter
Authors: Chiheb Daaloul (1), Thibaut Le Gouic (2), Jacques Liandrat (1), Magali Tournus (1) ((1) Aix-Marseille Univ., CNRS, I2M, UMR7373, Centrale Marseille, Marseille, France, (2) Massachusetts Institute of Technology, Department of Mathematics, USA)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[189]  arXiv:2105.01850 (cross-list from cs.LG) [pdf, other]
Title: Preference learning along multiple criteria: A game-theoretic perspective
Comments: 47 pages; published as a conference paper at NeurIPS 2020
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[190]  arXiv:2105.01867 (cross-list from cs.LG) [pdf, other]
Title: A Theoretical-Empirical Approach to Estimating Sample Complexity of DNNs
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[191]  arXiv:2105.02062 (cross-list from cs.LG) [pdf, ps, other]
Title: Understanding Short-Range Memory Effects in Deep Neural Networks
Comments: 15 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[192]  arXiv:2105.02221 (cross-list from cs.LG) [pdf, other]
Title: How Fine-Tuning Allows for Effective Meta-Learning
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[193]  arXiv:2105.02259 (cross-list from cs.IT) [pdf, other]
Title: Information Limits for Detecting a Subhypergraph
Subjects: Information Theory (cs.IT); Statistics Theory (math.ST); Machine Learning (stat.ML)
[194]  arXiv:2105.02375 (cross-list from cs.LG) [pdf, other]
Title: A Geometric Analysis of Neural Collapse with Unconstrained Features
Comments: 42 pages, 8 figures, 1 table; the first two authors contributed to this work equally
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Optimization and Control (math.OC); Machine Learning (stat.ML)
[195]  arXiv:2105.02470 (cross-list from cs.LG) [pdf, other]
Title: Generalized Multimodal ELBO
Comments: ICLR 2021
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[196]  arXiv:2105.02551 (cross-list from cs.LG) [pdf, other]
Title: Structured Ensembles: an Approach to Reduce the Memory Footprint of Ensemble Methods
Comments: Article accepted at Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[197]  arXiv:2105.02597 (cross-list from physics.data-an) [pdf, other]
Title: Extreme Learning Machine for the Characterization of Anomalous Diffusion from Single Trajectories
Authors: Carlo Manzo
Comments: 16 pages, 6 figures
Subjects: Data Analysis, Statistics and Probability (physics.data-an); Biological Physics (physics.bio-ph); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
[198]  arXiv:2105.02702 (cross-list from cs.SD) [pdf, other]
Title: MIMII DUE: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection with Domain Shifts due to Changes in Operational and Environmental Conditions
Comments: Accepted to IEEE WASPAA 2021
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[199]  arXiv:2105.02716 (cross-list from cs.LG) [pdf, other]
Title: Noether's Learning Dynamics: Role of Symmetry Breaking in Neural Networks
Journal-ref: NeurIPS (Advances in Neural Information Processing Systems), 2021
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Neurons and Cognition (q-bio.NC); Machine Learning (stat.ML)
[200]  arXiv:2105.02725 (cross-list from cs.LG) [pdf, other]
Title: CrossWalk: Fairness-enhanced Node Representation Learning
Comments: Association for the Advancement of Artificial Intelligence (AAAI) 2022
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[201]  arXiv:2105.02761 (cross-list from cs.LG) [pdf, other]
Title: Neural Algorithmic Reasoning
Comments: Accepted as an Opinion paper in Patterns. 7 pages, 1 figure
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS); Optimization and Control (math.OC); Machine Learning (stat.ML)
[202]  arXiv:2105.02796 (cross-list from cs.LG) [pdf, other]
Title: Practical and Rigorous Uncertainty Bounds for Gaussian Process Regression
Comments: Contains supplementary material and corrections to the original version
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Machine Learning (stat.ML)
[203]  arXiv:2105.02845 (cross-list from math.PR) [pdf, ps, other]
Title: A Unifying and Canonical Description of Measure-Preserving Diffusions
Subjects: Probability (math.PR); Differential Geometry (math.DG); Machine Learning (stat.ML)
[204]  arXiv:2105.02873 (cross-list from cs.LG) [pdf, ps, other]
Title: Contextual Bandits with Sparse Data in Web setting
Comments: 4 pages, 3 tables, review paper, scoping study
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[205]  arXiv:2105.02936 (cross-list from cs.LG) [pdf, other]
Title: Exact Acceleration of K-Means++ and K-Means$\|$
Authors: Edward Raff
Comments: to appear in the 30th International Joint Conference on Artificial Intelligence (IJCAI-21)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Mathematical Software (cs.MS); Machine Learning (stat.ML)
[206]  arXiv:2105.03058 (cross-list from cs.LG) [pdf, other]
Title: Error-Robust Multi-View Clustering: Progress, Challenges and Opportunities
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[207]  arXiv:2105.03109 (cross-list from cs.LG) [pdf, other]
Title: Laplace Matching for fast Approximate Inference in Latent Gaussian Models
Comments: Added experiments and clarifications; Currently under review at JMLR
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[208]  arXiv:2105.03172 (cross-list from cs.LG) [pdf, other]
Title: Reward prediction for representation learning and reward shaping
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[209]  arXiv:2105.03310 (cross-list from cs.LG) [pdf, other]
Title: Context-Based Soft Actor Critic for Environments with Non-stationary Dynamics
Comments: 12 pages, 11 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[210]  arXiv:2105.03397 (cross-list from eess.SY) [pdf, other]
Title: Learning-enhanced robust controller synthesis with rigorous statistical and control-theoretic guarantees
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Machine Learning (stat.ML)
[211]  arXiv:2105.03418 (cross-list from hep-lat) [pdf, other]
Title: Deep Learning Hamiltonian Monte Carlo
Comments: 8 pages, 7 figures, Published as a workshop paper at ICLR 2021 SimDL Workshop
Subjects: High Energy Physics - Lattice (hep-lat); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG); Machine Learning (stat.ML)
[212]  arXiv:2105.03491 (cross-list from cs.LG) [pdf, other]
Title: Uniform Convergence, Adversarial Spheres and a Simple Remedy
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[213]  arXiv:2105.03527 (cross-list from math.OC) [pdf, other]
Title: Scalable Projection-Free Optimization
Authors: Mingrui Zhang
Comments: dissertation
Subjects: Optimization and Control (math.OC); Machine Learning (stat.ML)
[214]  arXiv:2105.03594 (cross-list from cs.LG) [pdf, ps, other]
Title: Learning stochastic decision trees
Comments: To appear in ICALP 2021
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[215]  arXiv:2105.03603 (cross-list from cs.IT) [pdf, ps, other]
Title: Learning to Detect an Odd Restless Markov Arm with a Trembling Hand
Comments: 49 pages. A shorter version of this manuscript has been accepted for presentation at the 2021 IEEE International Symposium on Information Theory. This manuscript contains the proofs of all the main results
Subjects: Information Theory (cs.IT); Machine Learning (cs.LG); Machine Learning (stat.ML)
[216]  arXiv:2105.03616 (cross-list from cs.LG) [pdf, other]
Title: Interpretable Mixture Density Estimation by use of Differentiable Tree-module
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[217]  arXiv:2105.03678 (cross-list from eess.SP) [pdf, other]
Title: Nearly Minimax-Optimal Rates for Noisy Sparse Phase Retrieval via Early-Stopped Mirror Descent
Comments: arXiv admin note: text overlap with arXiv:2010.10168
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Machine Learning (stat.ML)
[218]  arXiv:2105.03692 (cross-list from cs.LG) [pdf, other]
Title: Incompatibility Clustering as a Defense Against Backdoor Poisoning Attacks
Comments: ICLR 2023. Code is available at this https URL
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
[219]  arXiv:2105.03705 (cross-list from cs.LG) [pdf, other]
Title: Understanding Neural Networks with Logarithm Determinant Entropy Estimator
Comments: 15 pages, 22 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[220]  arXiv:2105.03714 (cross-list from cs.LG) [pdf, other]
Title: Consistency of Constrained Spectral Clustering under Graph Induced Fair Planted Partitions
Comments: Accepted at NeurIPS 2022
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Machine Learning (stat.ML)
[221]  arXiv:2105.03746 (cross-list from cs.LG) [pdf, other]
Title: Contrastive Attraction and Contrastive Repulsion for Representation Learning
Journal-ref: Transactions on Machine Learning Research, 2023. ISSN 2835-8856
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[222]  arXiv:2105.03800 (cross-list from cs.LG) [pdf, other]
Title: Fine-Grained $ε$-Margin Closed-Form Stabilization of Parametric Hawkes Processes
Authors: Rafael Lima
Comments: Presented as a RobustML workshop paper at ICLR 2021
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[223]  arXiv:2105.03810 (cross-list from econ.EM) [pdf, other]
Title: The Local Approach to Causal Inference under Network Interference
Subjects: Econometrics (econ.EM); Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
[224]  arXiv:2105.03855 (cross-list from cs.LG) [pdf, ps, other]
Title: GMOTE: Gaussian based minority oversampling technique for imbalanced classification adapting tail probability of outliers
Comments: 20 pages, 6 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[225]  arXiv:2105.03875 (cross-list from cs.LG) [pdf, ps, other]
Title: Bounding Information Leakage in Machine Learning
Comments: Published in Elsevier Neurocomputing
Journal-ref: Neurocomputing, 2023, ISSN 0925-2312
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
[226]  arXiv:2105.03879 (cross-list from cs.LG) [pdf, other]
Title: Directional Convergence Analysis under Spherically Symmetric Distribution
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[227]  arXiv:2105.03962 (cross-list from cs.LG) [pdf, other]
Title: Stochastic Multi-Armed Bandits with Control Variates
Comments: Accepted to NeurIPS 2021
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[228]  arXiv:2105.04026 (cross-list from cs.LG) [pdf, other]
Title: The Modern Mathematics of Deep Learning
Comments: A version of this review paper appears as a chapter in the book "Mathematical Aspects of Deep Learning" by Cambridge University Press
Journal-ref: Mathematical Aspects of Deep Learning, pp. 1-111. Cambridge University Press, 2022
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[229]  arXiv:2105.04051 (cross-list from cs.LG) [pdf, other]
Title: Aggregating From Multiple Target-Shifted Sources
Journal-ref: ICML 2021
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[230]  arXiv:2105.04062 (cross-list from cs.SI) [pdf, other]
Title: Approximate Fréchet Mean for Data Sets of Sparse Graphs
Comments: 28 pages
Subjects: Social and Information Networks (cs.SI); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
[231]  arXiv:2105.04093 (cross-list from cs.CV) [pdf, ps, other]
Title: Elastic Weight Consolidation (EWC): Nuts and Bolts
Authors: Abhishek Aich
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[232]  arXiv:2105.04100 (cross-list from cs.LG) [pdf, other]
Title: Z-GCNETs: Time Zigzags at Graph Convolutional Networks for Time Series Forecasting
Comments: Accepted at the International Conference on Machine Learning (ICML) 2021
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[233]  arXiv:2105.04130 (cross-list from cond-mat.stat-mech) [pdf, other]
Title: Boltzmann machines as two-dimensional tensor networks
Comments: 12 pages, 11 figures
Journal-ref: Phys. Rev. B 104, 075154 (2021)
Subjects: Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG); Computational Physics (physics.comp-ph); Quantum Physics (quant-ph); Machine Learning (stat.ML)
[234]  arXiv:2105.04143 (cross-list from cs.CV) [pdf, other]
Title: Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[235]  arXiv:2105.04240 (cross-list from cs.LG) [pdf, other]
Title: A rigorous introduction to linear models
Authors: Jun Lu
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[236]  arXiv:2105.04332 (cross-list from cs.LG) [pdf, other]
Title: Bayesian Optimistic Optimisation with Exponentially Decaying Regret
Comments: To appear at ICML 2021 (21 pages)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[237]  arXiv:2105.04373 (cross-list from cs.LG) [pdf, other]
Title: Combinatorial Multi-armed Bandits for Resource Allocation
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[238]  arXiv:2105.04471 (cross-list from cs.LG) [pdf, other]
Title: Natural Posterior Network: Deep Bayesian Uncertainty for Exponential Family Distributions
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[239]  arXiv:2105.04522 (cross-list from cs.LG) [pdf, other]
Title: Generalized Jensen-Shannon Divergence Loss for Learning with Noisy Labels
Comments: Neural Information Processing Systems (NeurIPS 2021)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[240]  arXiv:2105.04550 (cross-list from cs.LG) [pdf, other]
Title: Optimization of Graph Neural Networks: Implicit Acceleration by Skip Connections and More Depth
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC); Machine Learning (stat.ML)
[241]  arXiv:2105.04554 (cross-list from cs.CE) [pdf, other]
Title: Local approximate Gaussian process regression for data-driven constitutive laws: Development and comparison with neural networks
Comments: 22 pages, 15 figures
Subjects: Computational Engineering, Finance, and Science (cs.CE); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[242]  arXiv:2105.04683 (cross-list from cs.LG) [pdf, other]
Title: Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks
Journal-ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[243]  arXiv:2105.04770 (cross-list from cs.IT) [pdf, other]
Title: Exact Recovery in the General Hypergraph Stochastic Block Model
Comments: Accepted by IEEE Transactions on Information Theory
Subjects: Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP); Statistics Theory (math.ST); Machine Learning (stat.ML)
[244]  arXiv:2105.04857 (cross-list from cs.LG) [pdf, other]
Title: Leveraging Sparse Linear Layers for Debuggable Deep Networks
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[245]  arXiv:2105.04876 (cross-list from cs.CL) [pdf, other]
Title: Benchmarking down-scaled (not so large) pre-trained language models
Comments: 14 pages, 5 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[246]  arXiv:2105.04999 (cross-list from math.NA) [pdf, other]
Title: Learning Runge-Kutta Integration Schemes for ODE Simulation and Identification
Subjects: Numerical Analysis (math.NA); Machine Learning (stat.ML)
[247]  arXiv:2105.05001 (cross-list from cs.LG) [pdf, other]
Title: FL-NTK: A Neural Tangent Kernel-based Framework for Federated Learning Convergence Analysis
Comments: ICML 2021
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[248]  arXiv:2105.05026 (cross-list from cs.LG) [pdf, other]
Title: Rethinking and Reweighting the Univariate Losses for Multi-Label Ranking: Consistency and Generalization
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[249]  arXiv:2105.05181 (cross-list from cs.LG) [pdf, ps, other]
Title: Factoring Multidimensional Data to Create a Sophisticated Bayes Classifier
Authors: Anthony LaTorre
Subjects: Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
[250]  arXiv:2105.05228 (cross-list from cs.LG) [pdf, ps, other]
Title: Global Convergence of Three-layer Neural Networks in the Mean Field Regime
Comments: Appears in ICLR 2021. This is the conference version of arXiv:2001.11443 (which contains treatment of the multilayer neural nets and their global convergence)
Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Statistics Theory (math.ST); Machine Learning (stat.ML)
[251]  arXiv:2105.05231 (cross-list from cs.IT) [pdf, ps, other]
Title: Soft BIBD and Product Gradient Codes
Comments: New results in Section III-A and Section IV, references added. Presented in part at the IEEE International Symposium on Topics in Coding (ISTC) 2021 and in part at the Information-Theoretic Methods for Rigorous, Responsible, and Reliable Machine Learning (ITR3) Workshop at the International Conference on Machine Learning (ICML) 2021
Subjects: Information Theory (cs.IT); Machine Learning (stat.ML)
[252]  arXiv:2105.05233 (cross-list from cs.LG) [pdf, other]
Title: Diffusion Models Beat GANs on Image Synthesis
Comments: Added compute requirements, ImageNet 256$\times$256 upsampling FID and samples, DDIM guided sampler, fixed typos
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[253]  arXiv:2105.05328 (cross-list from cs.LG) [pdf, other]
Title: Comparing interpretability and explainability for feature selection
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[254]  arXiv:2105.05347 (cross-list from cs.LG) [pdf, other]
Title: Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[255]  arXiv:2105.05360 (cross-list from physics.ao-ph) [pdf, other]
Title: Real-time Ionospheric Imaging of S4 Scintillation from Limited Data with Parallel Kalman Filters and Smoothness
Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (stat.ML)
[256]  arXiv:2105.05400 (cross-list from cs.LG) [pdf, ps, other]
Title: Homogeneous vector bundles and $G$-equivariant convolutional neural networks
Authors: Jimmy Aronsson
Comments: 23 pages
Subjects: Machine Learning (cs.LG); Representation Theory (math.RT); Machine Learning (stat.ML)
[257]  arXiv:2105.05449 (cross-list from cs.LG) [pdf, ps, other]
Title: An efficient projection neural network for $\ell_1$-regularized logistic regression
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[258]  arXiv:2105.05555 (cross-list from cs.LG) [pdf, ps, other]
Title: Robust Learning of Fixed-Structure Bayesian Networks in Nearly-Linear Time
Authors: Yu Cheng, Honghao Lin
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Statistics Theory (math.ST); Machine Learning (stat.ML)
[259]  arXiv:2105.05622 (cross-list from cs.LG) [pdf, other]
Title: On risk-based active learning for structural health monitoring
Comments: 30 pages. 23 figures. Published in Mechanical Systems and Signal Processing
Journal-ref: Mechanical Systems and Signal Processing, Volume 167, Part B, 15 March 2022, 108569
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[260]  arXiv:2105.05650 (cross-list from cond-mat.stat-mech) [pdf, other]
Title: Unbiased Monte Carlo Cluster Updates with Autoregressive Neural Networks
Comments: 12 pages, 9 figures
Journal-ref: Phys. Rev. Research 3, L042024 (2021)
Subjects: Statistical Mechanics (cond-mat.stat-mech); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG); Machine Learning (stat.ML)
[261]  arXiv:2105.05721 (cross-list from quant-ph) [pdf, other]
Title: Causal Networks and Freedom of Choice in Bell's Theorem
Comments: 18 pages, 10 figures. Updated to match published version
Journal-ref: PRX Quantum 2 (2021) 040323
Subjects: Quantum Physics (quant-ph); Machine Learning (stat.ML)
[262]  arXiv:2105.05728 (cross-list from cs.LG) [pdf, other]
Title: Early prediction of respiratory failure in the intensive care unit
Comments: 14 pages, 5 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[263]  arXiv:2105.05736 (cross-list from cs.LG) [pdf, other]
Title: Disentangling Sampling and Labeling Bias for Learning in Large-Output Spaces
Comments: To appear in ICML 2021
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[264]  arXiv:2105.05757 (cross-list from cs.LG) [pdf, other]
Title: Exploring the Similarity of Representations in Model-Agnostic Meta-Learning
Comments: Learning to Learn workshop at ICLR 2021
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[265]  arXiv:2105.05782 (cross-list from cs.DS) [pdf, other]
Title: How to Design Robust Algorithms using Noisy Comparison Oracle
Comments: PVLDB 2021
Subjects: Data Structures and Algorithms (cs.DS); Databases (cs.DB); Machine Learning (stat.ML)
[266]  arXiv:2105.05947 (cross-list from math.OC) [pdf, other]
Title: A new perspective on low-rank optimization
Comments: Major revision submitted to Mathematical Programming
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[267]  arXiv:2105.05989 (cross-list from cs.IT) [pdf, ps, other]
Title: Optimal transport with some directed distances
Authors: Wolfgang Stummer
Comments: 9 pages
Journal-ref: In: F. Nielsen and F. Barbaresco (Eds.): Geometric Science of Information GSI 2021, LNCS 12829, pp. 829-840, 2021
Subjects: Information Theory (cs.IT); Probability (math.PR); Machine Learning (stat.ML)
[268]  arXiv:2105.06018 (cross-list from cs.LG) [pdf, ps, other]
Title: Robust Dynamic Multi-Modal Data Fusion: A Model Uncertainty Perspective
Authors: Bin Liu
Comments: This paper has been accepted by IEEE Signal Processing Letters for publication
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[269]  arXiv:2105.06029 (cross-list from cs.LG) [pdf, other]
Title: Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[270]  arXiv:2105.06060 (cross-list from cs.LG) [pdf, other]
Title: House Price Prediction using Satellite Imagery
Comments: Stanford CS230 Deep Learning, Winter 2018
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[271]  arXiv:2105.06241 (cross-list from cs.LG) [pdf, ps, other]
Title: Likelihoods and Parameter Priors for Bayesian Networks
Comments: This version has improved pointers to the literature
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[272]  arXiv:2105.06251 (cross-list from cs.LG) [pdf, other]
Title: Learning Weakly Convex Sets in Metric Spaces
Comments: completely revised version, currently under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[273]  arXiv:2105.06337 (cross-list from cs.LG) [pdf, other]
Title: Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[274]  arXiv:2105.06371 (cross-list from cs.LG) [pdf, other]
Title: Provably Convergent Algorithms for Solving Inverse Problems Using Generative Models
Comments: arXiv admin note: text overlap with arXiv:1810.03587, arXiv:1802.08406
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[275]  arXiv:2105.06499 (cross-list from cs.LG) [pdf, other]
Title: Improved Algorithms for Agnostic Pool-based Active Classification
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[276]  arXiv:2105.06587 (cross-list from cs.LG) [pdf, other]
Title: Empirical Evaluation of Biased Methods for Alpha Divergence Minimization
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[277]  arXiv:2105.06643 (cross-list from cs.LG) [pdf, other]
Title: Monash Time Series Forecasting Archive
Comments: 33 pages, 3 figures, 15 tables
Journal-ref: Neural Information Processing Systems Track on Datasets and Benchmarks (2021) - forthcoming
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[278]  arXiv:2105.06715 (cross-list from cs.LG) [pdf, other]
Title: Maximizing Mutual Information Across Feature and Topology Views for Learning Graph Representations
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[279]  arXiv:2105.06742 (cross-list from cs.CR) [pdf, other]
Title: Cybersecurity Anomaly Detection in Adversarial Environments
Comments: Presented at AAAI FSS-21: Artificial Intelligence in Government and Public Sector, Washington, DC, USA
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[280]  arXiv:2105.06960 (cross-list from cs.LG) [pdf, ps, other]
Title: Thompson Sampling for Gaussian Entropic Risk Bandits
Comments: arXiv admin note: text overlap with arXiv:2011.08046
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[281]  arXiv:2105.06987 (cross-list from cs.LG) [pdf, other]
Title: Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[282]  arXiv:2105.07025 (cross-list from math.AT) [pdf, other]
Title: Minimal Cycle Representatives in Persistent Homology using Linear Programming: an Empirical Study with User's Guide
Subjects: Algebraic Topology (math.AT); Computational Geometry (cs.CG); Machine Learning (stat.ML)
[283]  arXiv:2105.07168 (cross-list from cs.LG) [pdf, other]
Title: Cohort Shapley value for algorithmic fairness
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Econometrics (econ.EM); Machine Learning (stat.ML)
[284]  arXiv:2105.07222 (cross-list from cs.LG) [pdf, other]
Title: On the Distributional Properties of Adaptive Gradients
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[285]  arXiv:2105.07320 (cross-list from cs.DC) [pdf, other]
Title: LocalNewton: Reducing Communication Bottleneck for Distributed Learning
Comments: To be published in Uncertainty in Artificial Intelligence (UAI) 2021
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[286]  arXiv:2105.07338 (cross-list from cs.LG) [pdf, ps, other]
Title: CCMN: A General Framework for Learning with Class-Conditional Multi-Label Noise
Comments: 18 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[287]  arXiv:2105.07416 (cross-list from q-bio.NC) [pdf, other]
Title: Bayesian reconstruction of memories stored in neural networks from their connectivity
Comments: Code available at this https URL
Journal-ref: PLOS Computational Biology 19(1): e1010813 2023
Subjects: Neurons and Cognition (q-bio.NC); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (stat.ML)
[288]  arXiv:2105.07593 (cross-list from cs.CV) [pdf, other]
Title: Differentiable SLAM-net: Learning Particle SLAM for Visual Navigation
Comments: CVPR 2021, extended results
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO); Machine Learning (stat.ML)
[289]  arXiv:2105.07636 (cross-list from cs.LG) [pdf, other]
Title: DOC3-Deep One Class Classification using Contradictions
Comments: Deep Learning, Anomaly Detection, Visual Inspection, Learning from Contradictions, Disjoint Auxiliary, Outlier Exposure, MVTec-AD
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[290]  arXiv:2105.07729 (cross-list from cs.LG) [pdf, other]
Title: Data Assimilation Predictive GAN (DA-PredGAN): applied to determine the spread of COVID-19
Journal-ref: Journal of Scientific Computing, 94(1), p.25. 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[291]  arXiv:2105.07743 (cross-list from cs.LG) [pdf, other]
Title: Universal Regular Conditional Distributions
Comments: Regular Conditional Distributions, Geometric Deep Learning, Computational Optimal Transport, Measure-Valued Neural Networks, Universal Approximation, Transformers
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Metric Geometry (math.MG); Probability (math.PR); Machine Learning (stat.ML)
[292]  arXiv:2105.07829 (cross-list from cs.DC) [pdf, other]
Title: Compressed Communication for Distributed Training: Adaptive Methods and System
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[293]  arXiv:2105.07882 (cross-list from cs.AI) [pdf, other]
Title: Efficient and accurate group testing via Belief Propagation: an empirical study
Subjects: Artificial Intelligence (cs.AI); Discrete Mathematics (cs.DM); Information Theory (cs.IT); Machine Learning (stat.ML)
[294]  arXiv:2105.07900 (cross-list from math.NA) [pdf, other]
Title: Sparse solutions of the kernel herding algorithm by improved gradient approximation
Subjects: Numerical Analysis (math.NA); Computation (stat.CO); Machine Learning (stat.ML)
[295]  arXiv:2105.07911 (cross-list from cs.CL) [pdf, other]
Title: SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising
Comments: 9 pages
Subjects: Computation and Language (cs.CL); Machine Learning (stat.ML)
[296]  arXiv:2105.07957 (cross-list from cs.NE) [pdf, other]
Title: Evolutionary Training and Abstraction Yields Algorithmic Generalization of Neural Computers
Comments: Nature Machine Intelligence
Journal-ref: Nature Machine Intelligence, Vol. 2, December 2020, 753-763
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[297]  arXiv:2105.08005 (cross-list from cs.LG) [pdf, ps, other]
Title: Learning a Latent Simplex in Input-Sparsity Time
Comments: ICLR 2021
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[298]  arXiv:2105.08024 (cross-list from cs.LG) [pdf, other]
Title: Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Optimization and Control (math.OC); Statistics Theory (math.ST); Machine Learning (stat.ML)
[299]  arXiv:2105.08164 (cross-list from cs.LG) [pdf, other]
Title: Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
Comments: 16 pages, 7 figures, ICML 2021
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[300]  arXiv:2105.08195 (cross-list from cs.LG) [pdf, other]
Title: Parallel Bayesian Optimization of Multiple Noisy Objectives with Expected Hypervolume Improvement
Comments: To appear in Advances in Neural Information Processing Systems 34, 2021. 40 pages. Code is available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[301]  arXiv:2105.08232 (cross-list from math.OC) [pdf, other]
Title: Sharp Restricted Isometry Property Bounds for Low-rank Matrix Recovery Problems with Corrupted Measurements
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[302]  arXiv:2105.08233 (cross-list from cs.LG) [pdf, ps, other]
Title: Oneshot Differentially Private Top-k Selection
Comments: Accepted to ICML 2021
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
[303]  arXiv:2105.08285 (cross-list from cs.DS) [pdf, ps, other]
Title: Sublinear Least-Squares Value Iteration via Locality Sensitive Hashing
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[304]  arXiv:2105.08306 (cross-list from cs.LG) [pdf, other]
Title: Sample Efficient Linear Meta-Learning by Alternating Minimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[305]  arXiv:2105.08310 (cross-list from cs.MA) [pdf, other]
Title: BBE: Simulating the Microstructural Dynamics of an In-Play Betting Exchange via Agent-Based Modelling
Authors: Dave Cliff
Comments: 47 pages, 9 figures, 120 references
Subjects: Multiagent Systems (cs.MA); Computational Engineering, Finance, and Science (cs.CE); Computational Finance (q-fin.CP); Trading and Market Microstructure (q-fin.TR); Machine Learning (stat.ML)
[306]  arXiv:2105.08399 (cross-list from cs.LG) [pdf, other]
Title: Relative Positional Encoding for Transformers with Linear Complexity
Comments: ICML 2021 (long talk) camera-ready. 24 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[307]  arXiv:2105.08675 (cross-list from cs.LG) [pdf, ps, other]
Title: The Computational Complexity of ReLU Network Training Parameterized by Data Dimensionality
Journal-ref: Journal of Artificial Intelligence Research 74 (2022): 1775-1790
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[308]  arXiv:2105.08869 (cross-list from cs.LG) [pdf, other]
Title: Incentivized Bandit Learning with Self-Reinforcing User Preferences
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[309]  arXiv:2105.08966 (cross-list from cs.LG) [pdf, other]
Title: Latent Gaussian Model Boosting
Authors: Fabio Sigrist
Comments: arXiv admin note: text overlap with arXiv:2004.02653
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation (stat.CO); Methodology (stat.ME); Machine Learning (stat.ML)
[310]  arXiv:2105.09016 (cross-list from cs.LG) [pdf, other]
Title: E(n) Equivariant Normalizing Flows
Comments: Accepted at Neural Information Processing Systems (NeurIPS 2021)
Subjects: Machine Learning (cs.LG); Chemical Physics (physics.chem-ph); Machine Learning (stat.ML)
[311]  arXiv:2105.09095 (cross-list from cs.LG) [pdf, other]
Title: Aleatoric uncertainty for Errors-in-Variables models in deep regression
Comments: 9 pages
Journal-ref: Neural Processing Letters (2022): 1-20
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[312]  arXiv:2105.09240 (cross-list from cs.LG) [pdf, other]
Title: Boosting Variational Inference With Locally Adaptive Step-Sizes
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[313]  arXiv:2105.09433 (cross-list from cs.LG) [pdf, ps, other]
Title: L1 Regression with Lewis Weights Subsampling
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[314]  arXiv:2105.09557 (cross-list from cs.LG) [pdf, other]
Title: Power-law escape rate of SGD
Comments: 17+8 pages
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (stat.ML)
[315]  arXiv:2105.09579 (cross-list from cs.LG) [pdf, other]
Title: Aggregate Learning for Mixed Frequency Data
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[316]  arXiv:2105.09580 (cross-list from cs.LG) [pdf, other]
Title: Negational Symmetry of Quantum Neural Networks for Binary Pattern Classification
Comments: Accepted by Pattern Recognition
Subjects: Machine Learning (cs.LG); Quantum Physics (quant-ph); Machine Learning (stat.ML)
[317]  arXiv:2105.09679 (cross-list from cond-mat.dis-nn) [pdf, ps, other]
Title: Improved Neuronal Ensemble Inference with Generative Model and MCMC
Comments: 23 pages, 8 figures, partially overlapped with arXiv:1911.06509
Journal-ref: J. Stat. Mech. (2021) 063501
Subjects: Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC); Machine Learning (stat.ML)
[318]  arXiv:2105.09801 (cross-list from cs.LG) [pdf, other]
Title: Monte Carlo Filtering Objectives: A New Family of Variational Objectives to Learn Generative Model and Neural Adaptive Proposal for Time Series
Comments: A complete version of manuscript accepted by IJCAI-21
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[319]  arXiv:2105.09980 (cross-list from cs.LG) [pdf, other]
Title: Data-driven discovery of interpretable causal relations for deep learning material laws with uncertainty propagation
Comments: 43 pages, 27 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[320]  arXiv:2105.09985 (cross-list from cs.LG) [pdf, other]
Title: Measuring Model Fairness under Noisy Covariates: A Theoretical Perspective
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[321]  arXiv:2105.10090 (cross-list from cs.LG) [pdf, other]
Title: Escaping Saddle Points with Compressed SGD
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[322]  arXiv:2105.10148 (cross-list from cs.LG) [pdf, other]
Title: On Instrumental Variable Regression for Deep Offline Policy Evaluation
Comments: Accepted by Journal of Machine Learning Research in 11/2022
Journal-ref: Journal of Machine Learning Research 23 (2022) 1-41
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[323]  arXiv:2105.10190 (cross-list from cs.LG) [pdf, other]
Title: AngularGrad: A New Optimization Technique for Angular Convergence of Convolutional Neural Networks
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[324]  arXiv:2105.10305 (cross-list from cs.LG) [pdf, other]
Title: Correlated Input-Dependent Label Noise in Large-Scale Image Classification
Comments: Accepted as Oral at CVPR 2021
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[325]  arXiv:2105.10439 (cross-list from eess.SP) [pdf, other]
Title: Covariance-Free Sparse Bayesian Learning
Comments: 13 pages
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Machine Learning (stat.ML)
[326]  arXiv:2105.10446 (cross-list from cs.LG) [pdf, other]
Title: ReduNet: A White-box Deep Network from the Principle of Maximizing Rate Reduction
Comments: This paper integrates previous two manuscripts: arXiv:2006.08558 and arXiv:2010.14765, with significantly improved organization, presentation, and new results; V2 polishes writing and adds citation; V3 polishes writing, adds citation and experiments
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (stat.ML)
[327]  arXiv:2105.10635 (cross-list from cs.LG) [pdf, other]
Title: Two-stage Training for Learning from Label Proportions
Comments: 10 pages, 4 figures, 5 tables, accepted by IJCAI 2021
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[328]  arXiv:2105.10721 (cross-list from cs.LG) [pdf, other]
Title: From Finite to Countable-Armed Bandits
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[329]  arXiv:2105.10948 (cross-list from cs.LG) [pdf, other]
Title: Regularization Can Help Mitigate Poisoning Attacks... with the Right Hyperparameters
Comments: Published at ICLR 2021 Workshop on Security and Safety in Machine Learning Systems. arXiv admin note: text overlap with arXiv:2003.00040
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
[330]  arXiv:2105.11004 (cross-list from cs.DS) [pdf, other]
Title: Estimating leverage scores via rank revealing methods and randomization
Authors: Aleksandros Sobczyk (1), Efstratios Gallopoulos (2) ((1) IBM Research Europe, Zurich, Switzerland (2) Computer Engineering and Informatics Department, University of Patras, Greece)
Comments: To appear in SIAM Journal on Matrix Analysis and Applications
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Numerical Analysis (math.NA); Computation (stat.CO); Machine Learning (stat.ML)
[331]  arXiv:2105.11025 (cross-list from cs.LG) [pdf, other]
Title: Compressing Heavy-Tailed Weight Matrices for Non-Vacuous Generalization Bounds
Authors: John Y. Shin
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[332]  arXiv:2105.11045 (cross-list from cs.LG) [pdf, other]
Title: Learning Green's Functions of Linear Reaction-Diffusion Equations with Application to Fast Numerical Solver
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[333]  arXiv:2105.11053 (cross-list from q-fin.CP) [pdf, other]
Title: Arbitrage-free neural-SDE market models
Subjects: Computational Finance (q-fin.CP); Probability (math.PR); Risk Management (q-fin.RM); Statistical Finance (q-fin.ST); Machine Learning (stat.ML)
[334]  arXiv:2105.11066 (cross-list from cs.LG) [pdf, other]
Title: Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Optimization and Control (math.OC); Machine Learning (stat.ML)
[335]  arXiv:2105.11069 (cross-list from cs.LG) [pdf, other]
Title: InfoFair: Information-Theoretic Intersectional Fairness
Comments: IEEE Big Data 2022
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[336]  arXiv:2105.11447 (cross-list from cs.CL) [pdf, other]
Title: True Few-Shot Learning with Language Models
Comments: Code at this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[337]  arXiv:2105.11558 (cross-list from cs.LG) [pdf, other]
Title: Near-optimal Offline and Streaming Algorithms for Learning Non-Linear Dynamical Systems
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[338]  arXiv:2105.11815 (cross-list from math.NA) [pdf, ps, other]
Title: Hashing embeddings of optimal dimension, with applications to linear least squares
Subjects: Numerical Analysis (math.NA); Optimization and Control (math.OC); Machine Learning (stat.ML)
[339]  arXiv:2105.11839 (cross-list from cs.LG) [pdf, other]
Title: DiBS: Differentiable Bayesian Structure Learning
Comments: NeurIPS 2021; updated run time results
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[340]  arXiv:2105.11964 (cross-list from eess.SP) [pdf, other]
Title: Model Mismatch Trade-offs in LMMSE Estimation
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Machine Learning (stat.ML)
[341]  arXiv:2105.11982 (cross-list from cs.AI) [pdf, other]
Title: Quantifying Uncertainty in Deep Spatiotemporal Forecasting
Comments: arXiv admin note: text overlap with arXiv:2102.06684
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP); Machine Learning (stat.ML)
[342]  arXiv:2105.12005 (cross-list from cs.LG) [pdf, ps, other]
Title: Hierarchical Subspace Learning for Dimensionality Reduction to Improve Classification Accuracy in Large Data Sets
Comments: 6 pages with 3 tables
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[343]  arXiv:2105.12022 (cross-list from math.OC) [pdf, other]
Title: Principal Component Hierarchy for Sparse Quadratic Programs
Journal-ref: ICML 2021
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[344]  arXiv:2105.12062 (cross-list from math.OC) [pdf, other]
Title: Practical Schemes for Finding Near-Stationary Points of Convex Finite-Sums
Comments: 29 pages, 4 figures
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[345]  arXiv:2105.12089 (cross-list from cs.LG) [pdf, ps, other]
Title: Investigating Manifold Neighborhood size for Nonlinear Analysis of LIBS Amino Acid Spectra
Comments: In ISCA 24th International Conference on Software Engineering and Data Engineering (SEDE 2015)
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[346]  arXiv:2105.12092 (cross-list from cs.AI) [pdf, other]
Title: Trajectory Modeling via Random Utility Inverse Reinforcement Learning
Comments: 31 pages; expanded version, with the addition of proofs not present in the first version
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[347]  arXiv:2105.12152 (cross-list from cs.LG) [pdf, other]
Title: Density estimation on low-dimensional manifolds: an inflation-deflation approach
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[348]  arXiv:2105.12237 (cross-list from cs.LG) [pdf, other]
Title: Practical Convex Formulation of Robust One-hidden-layer Neural Network Training
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Machine Learning (stat.ML)
[349]  arXiv:2105.12245 (cross-list from cs.LG) [pdf, other]
Title: Scaling Properties of Deep Residual Networks
Comments: Published at ICML 2021
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[350]  arXiv:2105.12247 (cross-list from cs.LG) [pdf, other]
Title: GraphVICRegHSIC: Towards improved self-supervised representation learning for graphs with a hybrid loss function
Authors: Sayan Nag
Comments: Paper Accepted in the Weakly Supervised Representation Learning Workshop, IJCAI 2021 (IJCAI2021-WSRL)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[351]  arXiv:2105.12342 (cross-list from math.OC) [pdf, ps, other]
Title: A data-driven approach to beating SAA out-of-sample
Comments: 25 pages, 2 page bibliography, 2 Figures, 12 page Appendix
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Econometrics (econ.EM); Systems and Control (eess.SY); Machine Learning (stat.ML)
[352]  arXiv:2105.12356 (cross-list from cs.LG) [pdf, other]
Title: The Graph Cut Kernel for Ranked Data
Journal-ref: Transactions on Machine Learning Research (2022)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[353]  arXiv:2105.12639 (cross-list from cs.LG) [pdf, other]
Title: Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Robustness
Comments: ICML 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[354]  arXiv:2105.12769 (cross-list from cs.LG) [pdf, ps, other]
Title: Clustered Federated Learning via Generalized Total Variation Minimization
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[355]  arXiv:2105.12806 (cross-list from cs.LG) [pdf, ps, other]
Title: A Universal Law of Robustness via Isoperimetry
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[356]  arXiv:2105.12837 (cross-list from cs.LG) [pdf, other]
Title: Fooling Partial Dependence via Data Poisoning
Comments: Accepted at ECML PKDD 2022
Journal-ref: Machine Learning and Knowledge Discovery in Databases, vol. 3, pp. 121-136, 2022
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[357]  arXiv:2105.12898 (cross-list from cs.AI) [pdf, other]
Title: Stochastic Intervention for Causal Effect Estimation
Comments: Accepted in IJCNN 21
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[358]  arXiv:2105.12909 (cross-list from cs.LG) [pdf, other]
Title: Deconditional Downscaling with Gaussian Processes
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[359]  arXiv:2105.12916 (cross-list from cs.LG) [pdf, other]
Title: Robust learning from corrupted EEG with dynamic spatial filtering
Comments: 42 pages, 9 figures
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Neurons and Cognition (q-bio.NC); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
[360]  arXiv:2105.12937 (cross-list from cs.IR) [pdf, other]
Title: Towards a Better Understanding of Linear Models for Recommendation
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[361]  arXiv:2105.13010 (cross-list from cs.LG) [pdf, other]
Title: An error analysis of generative adversarial networks for learning distributions
Journal-ref: Journal of Machine Learning Research, 23(116):1-43, 2022
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[362]  arXiv:2105.13052 (cross-list from math.NA) [pdf, other]
Title: A generalization of the randomized singular value decomposition
Comments: Accepted at ICLR 2022
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG); Machine Learning (stat.ML)
[363]  arXiv:2105.13093 (cross-list from cs.LG) [pdf, other]
Title: Towards Understanding Knowledge Distillation
Comments: ICML'19. Post-edited to add related work. arXiv admin note: text overlap with arXiv:2003.13438 by other authors
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[364]  arXiv:2105.13189 (cross-list from math.NA) [pdf, ps, other]
Title: Sparse recovery based on the generalized error function
Authors: Zhiyong Zhou
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
[365]  arXiv:2105.13245 (cross-list from cs.LG) [pdf, other]
Title: Bayesian Optimisation for Constrained Problems
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[366]  arXiv:2105.13251 (cross-list from cs.LG) [pdf, ps, other]
Title: An Impossibility Theorem for Node Embedding
Subjects: Machine Learning (cs.LG); Discrete Mathematics (cs.DM); Machine Learning (stat.ML)
[367]  arXiv:2105.13283 (cross-list from cs.LG) [pdf, other]
Title: Deep Ensembles from a Bayesian Perspective
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[368]  arXiv:2105.13493 (cross-list from cs.LG) [pdf, other]
Title: Efficient and Accurate Gradients for Neural SDEs
Comments: Accepted at NeurIPS 2021
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Dynamical Systems (math.DS); Machine Learning (stat.ML)
[369]  arXiv:2105.13655 (cross-list from cs.LG) [pdf, other]
Title: Scheduling Jobs with Stochastic Holding Costs
Comments: Extended abstract appeared in NeurIPS 2021
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Optimization and Control (math.OC); Machine Learning (stat.ML)
[370]  arXiv:2105.13669 (cross-list from cs.LG) [pdf, ps, other]
Title: Measuring global properties of neural generative model outputs via generating mathematical objects
Subjects: Machine Learning (cs.LG); Combinatorics (math.CO); Machine Learning (stat.ML)
[371]  arXiv:2105.13745 (cross-list from cs.LG) [pdf, other]
Title: Robust Regularization with Adversarial Labelling of Perturbed Samples
Comments: Accepted to IJCAI2021
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[372]  arXiv:2105.13810 (cross-list from cs.LG) [pdf, ps, other]
Title: A Survey on Anomaly Detection for Technical Systems using LSTM Networks
Comments: 14 pages, 6 figures, 4 tables. Accepted for publication by Computers in Industry
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[373]  arXiv:2105.13841 (cross-list from cs.LG) [src]
Title: A General Taylor Framework for Unifying and Revisiting Attribution Methods
Comments: In the current version, the author information is not complete and there are some mathematical errors in the proof. We need to correct errors and add all co-authors who contribute to the paper. Therefore, we hope to withdraw the manuscript
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[374]  arXiv:2105.13859 (cross-list from cs.LG) [pdf, other]
Title: Generative Network-Based Reduced-Order Model for Prediction, Data Assimilation and Uncertainty Quantification
Comments: arXiv admin note: text overlap with arXiv:2105.07729
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[375]  arXiv:2105.13913 (cross-list from math.OC) [pdf, other]
Title: Scalable Frank-Wolfe on Generalized Self-concordant Functions via Simple Steps
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[376]  arXiv:2105.13937 (cross-list from cs.LG) [pdf, other]
Title: Polygonal Unadjusted Langevin Algorithms: Creating stable and efficient adaptive algorithms for neural networks
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Probability (math.PR); Machine Learning (stat.ML)
[377]  arXiv:2105.13939 (cross-list from cs.LG) [pdf, other]
Title: Efficient Online-Bandit Strategies for Minimax Learning Problems
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[378]  arXiv:2105.13942 (cross-list from cs.LG) [pdf, other]
Title: Towards Deterministic Diverse Subset Sampling
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[379]  arXiv:2105.13949 (cross-list from cs.LG) [pdf, other]
Title: Latent Space Exploration Using Generative Kernel PCA
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[380]  arXiv:2105.13975 (cross-list from cs.LG) [pdf, other]
Title: Relation Matters in Sampling: A Scalable Multi-Relational Graph Neural Network for Drug-Drug Interaction Prediction
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[381]  arXiv:2105.14016 (cross-list from cs.LG) [pdf, ps, other]
Title: Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Optimization and Control (math.OC); Statistics Theory (math.ST); Machine Learning (stat.ML)
[382]  arXiv:2105.14027 (cross-list from hep-ph) [pdf, other]
Title: The Dark Machines Anomaly Score Challenge: Benchmark Data and Model Independent Event Classification for the Large Hadron Collider
Comments: v1: 54 pages, 24 figures. v2: 56 pages, citations added, extend discussion of look-elsewhere-effect, results unchanged; v3. minor typos and updated references
Journal-ref: SciPost Phys. 12, 043 (2022)
Subjects: High Energy Physics - Phenomenology (hep-ph); High Energy Physics - Experiment (hep-ex); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
[383]  arXiv:2105.14080 (cross-list from cs.LG) [pdf, other]
Title: Gotta Go Fast When Generating Data with Score-Based Models
Comments: Code is available on this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC); Machine Learning (stat.ML)
[384]  arXiv:2105.14084 (cross-list from cs.LG) [pdf, other]
Title: Support vector machines and linear regression coincide with very high-dimensional features
Comments: 34 pages, 9 figures
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[385]  arXiv:2105.14095 (cross-list from cs.LG) [pdf, ps, other]
Title: Weighted Training for Cross-Task Learning
Comments: Published as a conference paper at ICLR 2022
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[386]  arXiv:2105.14099 (cross-list from cs.LG) [pdf, other]
Title: Bridging the Gap Between Practice and PAC-Bayes Theory in Few-Shot Meta-Learning
Comments: Neural Information Processing Systems 2021
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[387]  arXiv:2105.14114 (cross-list from cs.LG) [pdf, other]
Title: Asymptotically Optimal Bandits under Weighted Information
Comments: 9 content pages, 3 references pages, 22 appendix pages, 4 figures, 34 total pages
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Machine Learning (stat.ML)
[388]  arXiv:2105.14119 (cross-list from cs.LG) [pdf, other]
Title: Towards optimally abstaining from prediction with OOD test examples
Comments: In NeurIPS 2021 (+spotlight), 24 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[389]  arXiv:2105.14141 (cross-list from cs.LG) [pdf, other]
Title: ARMS: Antithetic-REINFORCE-Multi-Sample Gradient for Binary Variables
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[390]  arXiv:2105.14146 (cross-list from cs.LG) [pdf, other]
Title: Deep Fair Discriminative Clustering
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Machine Learning (stat.ML)
[391]  arXiv:2105.14166 (cross-list from cs.LG) [pdf, other]
Title: Rejection sampling from shape-constrained distributions in sublinear time
Comments: 23 pages, 5 figures
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[392]  arXiv:2105.14172 (cross-list from cs.LG) [pdf, other]
Title: A Stochastic Alternating Balance $k$-Means Algorithm for Fair Clustering
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[393]  arXiv:2105.14203 (cross-list from cs.LG) [pdf, other]
Title: Understanding Instance-based Interpretability of Variational Auto-Encoders
Comments: NeurIPS 2021
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[394]  arXiv:2105.14244 (cross-list from cs.LG) [pdf, other]
Title: Learning Graphon Autoencoders for Generative Graph Modeling
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[395]  arXiv:2105.14260 (cross-list from cs.LG) [pdf, ps, other]
Title: Understanding Bandits with Graph Feedback
Authors: Houshuang Chen (1), Zengfeng Huang (2), Shuai Li (1), Chihao Zhang (1) ((1) Shanghai Jiao Tong University, (2) Fudan University)
Comments: To be published in NeurIPS'21
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[396]  arXiv:2105.14363 (cross-list from cs.LG) [pdf, other]
Title: On the Theory of Reinforcement Learning with Once-per-Episode Feedback
Comments: Published at NeurIPS 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[397]  arXiv:2105.14367 (cross-list from cs.LG) [pdf, other]
Title: Deconvolutional Density Network: Modeling Free-Form Conditional Distributions
Comments: 10 pages, 5 figures, 2 tables
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[398]  arXiv:2105.14397 (cross-list from math.CO) [pdf, ps, other]
Title: On the Number of Edges of the Frechet Mean and Median Graphs
Comments: 14 pages
Subjects: Combinatorics (math.CO); Social and Information Networks (cs.SI); Data Analysis, Statistics and Probability (physics.data-an); Applications (stat.AP); Machine Learning (stat.ML)
[399]  arXiv:2105.14417 (cross-list from cs.LG) [pdf, ps, other]
Title: Overparameterization of deep ResNet: zero loss and mean-field analysis
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[400]  arXiv:2105.14529 (cross-list from cs.LG) [pdf, other]
Title: On the benefits of representation regularization in invariance based domain generalization
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[401]  arXiv:2105.14559 (cross-list from cs.LG) [pdf, other]
Title: Active Learning in Bayesian Neural Networks with Balanced Entropy Learning Principle
Authors: Jae Oh Woo
Journal-ref: International Conference on Learning Representations 2023
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[402]  arXiv:2105.14573 (cross-list from cs.LG) [pdf, other]
Title: Embedding Principle of Loss Landscape of Deep Neural Networks
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[403]  arXiv:2105.14602 (cross-list from cs.LG) [pdf, other]
Title: On the geometry of generalization and memorization in deep neural networks
Comments: ICLR 2021
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (stat.ML)
[404]  arXiv:2105.14648 (cross-list from cs.LG) [pdf, ps, other]
Title: Sharper bounds for online learning of smooth functions of a single variable
Authors: Jesse Geneson
Subjects: Machine Learning (cs.LG); Discrete Mathematics (cs.DM); Machine Learning (stat.ML)
[405]  arXiv:2105.14673 (cross-list from cs.LG) [pdf, ps, other]
Title: A Minimax Lower Bound for Low-Rank Matrix-Variate Logistic Regression
Comments: 8 pages; published in Proc. 55th Asilomar Conf. Signals, Systems, and Computers, Pacific Grove, CA, Oct. 31-Nov. 3, 2021
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Statistics Theory (math.ST); Machine Learning (stat.ML)
[406]  arXiv:2105.14710 (cross-list from cs.LG) [pdf, other]
Title: Robustifying $\ell_\infty$ Adversarial Training to the Union of Perturbation Models
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[407]  arXiv:2105.14835 (cross-list from cs.LG) [pdf, ps, other]
Title: Towards Lower Bounds on the Depth of ReLU Neural Networks
Comments: Authors' accepted manuscript for SIAM Journal on Discrete Mathematics. A preliminary conference version appeared at NeurIPS 2021
Subjects: Machine Learning (cs.LG); Discrete Mathematics (cs.DM); Neural and Evolutionary Computing (cs.NE); Combinatorics (math.CO); Machine Learning (stat.ML)
[408]  arXiv:2105.14876 (cross-list from cs.LG) [pdf, other]
Title: Fast, Accurate and Interpretable Time Series Classification Through Randomization
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[409]  arXiv:2105.14890 (cross-list from cs.LG) [pdf, other]
Title: Rawlsian Fair Adaptation of Deep Learning Classifiers
Comments: 24 pages, 19 figures
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Machine Learning (stat.ML)
[410]  arXiv:2105.14900 (cross-list from cs.LG) [pdf, other]
Title: A unified view of likelihood ratio and reparameterization gradients
Comments: AISTATS2021; Earlier paper was split in two (arXiv:1910.06419). Refer to the current paper for the unified view, but see the earlier paper for discussion on an importance sampling technique
Journal-ref: In International Conference on Artificial Intelligence and Statistics (pp. 4078-4086). PMLR (2021, March)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[411]  arXiv:2105.15069 (cross-list from cs.LG) [pdf, other]
Title: On the Consistency of Max-Margin Losses
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[412]  arXiv:2105.15134 (cross-list from cs.LG) [pdf, other]
Title: Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning
Authors: Zixin Wen, Yuanzhi Li
Comments: V3 corrected related works. Accepted to ICML2021
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[413]  arXiv:2105.15183 (cross-list from cs.LG) [pdf, other]
Title: Efficient and Modular Implicit Differentiation
Comments: V3: added more related work and Jacobian precision figure
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[414]  arXiv:2105.14337 (cross-list from math.OC) [pdf, other]
Title: Optimal transport with $f$-divergence regularization and generalized Sinkhorn algorithm
Authors: Dávid Terjék (1), Diego González-Sánchez (1) ((1) Alfréd Rényi Institute of Mathematics)
Comments: AISTATS 2022 camera ready with appendix, 31 pages, 7 figures
Journal-ref: Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:5135-5165, 2022
Subjects: Optimization and Control (math.OC); Information Theory (cs.IT); Machine Learning (cs.LG); Functional Analysis (math.FA); Machine Learning (stat.ML)
[416]  arXiv:2105.14083 (cross-list from cs.LG) [pdf, other]
Title: Rethinking Noisy Label Models: Labeler-Dependent Noise with Adversarial Awareness
Comments: 9 pages, 3 figures, 3 algorithms. Currently under blind review at NeurIPS 2021
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (stat.ML)