Optimization and Control
New submissions
[ showing up to 2000 entries per page: fewer  more ]
New submissions for Thu, 9 Jul 20
 [1] arXiv:2007.03728 [pdf, other]

Title: Learning to Optimize Power Distribution Grids using SensitivityInformed Deep Neural NetworksComments: Manuscript under reviewSubjects: Optimization and Control (math.OC)
Deep learning for distribution grid optimization can be advocated as a promising solution for nearoptimal yet timely inverter dispatch. The principle is to train a deep neural network (DNN) to predict the solutions of an optimal power flow (OPF), thus shifting the computational effort from realtime to offline. Nonetheless, before training this DNN, one has to solve a large number of OPFs to create a labeled dataset. Granted the latter step can still be prohibitive in timecritical applications, this work puts forth an original technique for improving the prediction accuracy of DNNs by taking into account the sensitivities of the OPF minimizers with respect to the OPF parameters. By expanding on multiparametric programming, it is shown that although inverter control problems may exhibit dual degeneracy, the required sensitivities do exist in general and can be computed readily using the output of any standard quadratic program (QP) solver. Numerical tests showcase that sensitivityinformed deep learning can enhance prediction accuracy in terms of mean square error (MSE) by 23 orders of magnitude at minimal computational overhead. Improvements are more significant in the smalldata regime, where a DNN has to learn to optimize using a few examples. Beyond multiparametric QPs, the approach is currently being generalized to parametric (non)convex optimization problems.
 [2] arXiv:2007.03798 [pdf, ps, other]

Title: Determination of convex functions via proximal operatorsAuthors: Emilio VilchesSubjects: Optimization and Control (math.OC)
We provide comparison principles for convex functions through its proximal mappings. Consequently, we prove that the norm of the proximal operator determines a convex the function up to a constant.
 [3] arXiv:2007.03830 [pdf, ps, other]

Title: Computational SemiDiscrete Optimal Transport with General Storage FeesAuthors: Mohit BansilComments: 23 pages, comments welcome!Subjects: Optimization and Control (math.OC); Analysis of PDEs (math.AP); Numerical Analysis (math.NA)
We propose and analyze a modified damped Newton algorithm to solve the semidiscrete optimal transport with storage fees. We prove global linear convergence for a wide range of storage fee functions, the main assumption being that each warehouse's storage costs are independent. We show that if $F$ is an arbitrary storage fee function that satisfies this independence condition then $F$ can be perturbed into a new storage fee function so that our algorithm converges. We also show that the optimizers are stable under these perturbations. Furthermore, our results come with quantitative rates.
 [4] arXiv:2007.03847 [pdf, other]

Title: Fast Monte Carlo Simulation of Dynamic Power Systems Under Continuous Random DisturbancesAuthors: Yiwei Qiu (1), Jin Lin (1), Xiaoshuang Chen (1), Feng Liu (1), Yonghua Song (2 and 1) ((1) State Key Laboratory of Control and Simulation of Power Systems and Generation Equipment, Department of Electrical Engineering, Tsinghua University, (2) Department of Electrical and Computer Engineering, University of Macau)Comments: Accepted in IEEE PES General Meeting 2020Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
Continuoustime random disturbances from the renewable generation pose a significant impact on power system dynamic behavior. In evaluating this impact, the disturbances must be considered as continuoustime random processes instead of random variables that do not vary with time to ensure accuracy. Monte Carlo simulation (MCs) is a nonintrusive method to evaluate such impact that can be performed on commercial power system simulation software and is easy for power utilities to use, but is computationally cumbersome. Fast samplings methods such as Latin hypercube sampling (LHS) have been introduced to speed up sampling random variables, but yet cannot be applied to sample continuous disturbances. To overcome this limitation, this paper proposes a fast MCs method that enables the LHS to speed up sampling continuous disturbances, which is based on the It\^{o} process model of the disturbances and the approximation of the It\^{o} process by functions of independent normal random variables. A case study of the IEEE 39Bus System shows that the proposed method is 47.6 and 6.7 times faster to converge compared to the traditional MCs in evaluating the expectation and variance of the system dynamic response.
 [5] arXiv:2007.03861 [pdf, other]

Title: On the Analysis of Modelfree Methods for the Linear Quadratic RegulatorSubjects: Optimization and Control (math.OC)
Many reinforcement learning methods achieve great success in practice but lack theoretical foundation. In this paper, we study the convergence analysis on the problem of the Linear Quadratic Regulator (LQR). The global linear convergence properties and sample complexities are established for several popular algorithms such as the policy gradient algorithm, TDlearning and the actorcritic (AC) algorithm. Our results show that the actorcritic algorithm can reduce the sample complexity compared with the policy gradient algorithm. Although our analysis is still preliminary, it explains the benefit of AC algorithm in a certain sense.
 [6] arXiv:2007.03952 [pdf, ps, other]

Title: On continuous selections of polynomial functionsComments: 28 pagesSubjects: Optimization and Control (math.OC)
A continuous selection of polynomial functions is a continuous function whose domain can be partitioned into finitely many pieces on which the function coincides with a polynomial. Given a set of finitely many polynomials, we show that there are only finitely many continuous selections of it and each one is semialgebraic. Then, we establish some generic properties regarding the critical points, defined by the Clarke subdifferential, of these continuous selections. In particular, given a set of finitely many polynomials with generic coefficients, we show that the critical points of all continuous selections of it are finite and the critical values are all different, and we also derive the coercivity of those continuous selections which are bounded from below. We point out that some existing results about {\L}ojasiewicz's inequality and error bounds for the maximum function of some finitely many polynomials are also valid for all the continuous selections of them.
 [7] arXiv:2007.03960 [pdf, other]

Title: On Entropic Optimization and Path Integral ControlSubjects: Optimization and Control (math.OC); Information Theory (cs.IT)
This article is motivated by the question whether it is possible to solve optimal control (OC) or dynamic optimization problems in a similar fashion to how static optimization problems can be addressed with Evolutionary Strategies (ES). The latter maintain a sequence of Gaussian search distributions that converge to the optimum. For the moment, this question has been answered partially by a set of algorithms that are known as Path Integral Control (PIC). Those maintain a sequence of locally linear Gaussian feedback controllers. So far PIC methods have been derived solely from the theory of Linearly Solvable OC, which includes only a narrow subset of optimal control problems and has only limited application potential as a consequence. We aim to address this question within a more general mathematical setting. Therefore, we first identify the framework of entropic inference as a suitable setting to synthesise stochastic search algorithms. Therewith we establish the formal framework of entropic optimization and provide a compelling justification for the inclusion of entropy measures in stochastic optimization. From this theory follows a formal optimal search distribution sequence which converges monotonically to the Dirac delta distribution centred at the optimum. Then we demonstrate how this result can be used to derive Gaussian search distributions similar to existing ES. We then proceed to transfer these ideas from a static to a dynamic setting, therewith establishing the framework of Entropic OC which shares characteristics with entropy based Reinforcement Learning. From this theory we can construct a number of formal optimal path distribution sequences. Thence we derive the outlines of a generalised algorithmic framework complementing the existing PIC class. Our main ambition is to reveal how all of these fields are related in a most exciting fashion.
 [8] arXiv:2007.03964 [pdf, other]

Title: Responsive Safety in Reinforcement Learning by PID Lagrangian MethodsComments: ICML 2020Subjects: Optimization and Control (math.OC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Lagrangian methods are widely used algorithms for constrained optimization problems, but their learning dynamics exhibit oscillations and overshoot which, when applied to safe reinforcement learning, leads to constraintviolating behavior during agent training. We address this shortcoming by proposing a novel Lagrange multiplier update method that utilizes derivatives of the constraint function. We take a controls perspective, wherein the traditional Lagrange multiplier update behaves as \emph{integral} control; our terms introduce \emph{proportional} and \emph{derivative} control, achieving favorable learning dynamics through damping and predictive measures. We apply our PID Lagrangian methods in deep RL, setting a new state of the art in Safety Gym, a safe RL benchmark. Lastly, we introduce a new method to ease controller tuning by providing invariance to the relative numerical scales of reward and cost. Our extensive experiments demonstrate improved performance and hyperparameter robustness, while our algorithms remain nearly as simple to derive and implement as the traditional Lagrangian approach.
 [9] arXiv:2007.03982 [pdf, ps, other]

Title: On semidiscrete subpartitions of vectorvalued measuresComments: 9 pagesSubjects: Optimization and Control (math.OC)
We introduce a concept of optimal transport for vectorvalued measures and its dual formulation. In this note we concentrate on the semidiscrete case and show some fundamental differences between the scalar and vector cases. A manifestation of this difference is the possibility of nonexistence of optimal solution for the dual problem for feasible primer problems.
 [10] arXiv:2007.03983 [pdf, other]

Title: Dynamic social learning under graph constraintsSubjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Probability (math.PR)
We argue that graphconstrained dynamic choice with reinforcement can be viewed as a scaled version of a special instance of replicator dynamics. The latter also arises as the limiting differential equation for the empirical measures of a vertex reinforced random walk on a directed graph. We use this equivalence to show that for a class of positively $\alpha$homogeneous rewards, $\alpha > 0$, the asymptotic outcome concentrates around the optimum in a certain limiting sense when `annealed' by letting $\alpha\uparrow\infty$ slowly. We also discuss connections with classical simulated annealing.
 [11] arXiv:2007.03991 [pdf, ps, other]

Title: Optimal Control of the 2D Evolutionary NavierStokes Equations with Measure Valued ControlsComments: 24 pagesSubjects: Optimization and Control (math.OC)
In this paper, we consider an optimal control problem for the twodimensional evolutionary NavierStokes system. Looking for sparsity, we take controls as functions of time taking values in a space of Borel measures. The cost functional does not involve directly the control but we assume some constraints on them. We prove the wellposedness of the control problem and derive necessary and sufficient conditions for local optimality of the controls.
 [12] arXiv:2007.03999 [pdf, other]

Title: Stacked adaptive dynamic programming with unknown system modelJournalref: IFACPapersOnLine, 50(1), 41504155 (2017)Subjects: Optimization and Control (math.OC); Dynamical Systems (math.DS)
Adaptive dynamic programming is a collective term for a variety of approaches to infinitehorizon optimal control. Common to all approaches is approximation of the infinitehorizon cost function based on dynamic programming philosophy. Typically, they also require knowledge of a dynamical model of the system. In the current work, application of adaptive dynamic programming to a system whose dynamical model is unknown to the controller is addressed. In order to realize the control algorithm, a model of the system dynamics is estimated with a Kalman filter. A stacked control scheme to boost the controller performance is suggested. The functioning of the new approach was verified in simulation and compared to the baseline represented by gradient descent on the running cost.
 [13] arXiv:2007.04104 [pdf, ps, other]

Title: Lyapunov functions and finite time stabilization in optimal time for homogeneous linear and quasilinear hyperbolic systemsComments: arXiv admin note: text overlap with arXiv:2005.13269Subjects: Optimization and Control (math.OC); Analysis of PDEs (math.AP)
Hyperbolic systems in one dimensional space are frequently used in modeling of many physical systems. In our recent works, we introduced time independent feedbacks leading to the finite stabilization for the optimal time of homogeneous linear and quasilinear hyperbolic systems. In this work, we present Lyapunov's functions for these feedbacks and use estimates for Lyapunov's functions to rediscover the finite stabilization results.
 [14] arXiv:2007.04211 [pdf, ps, other]

Title: Robust feedback stabilization of Nlevel quantum spin systemsSubjects: Optimization and Control (math.OC); Mathematical Physics (mathph); Probability (math.PR); Quantum Physics (quantph)
In this paper, we consider Nlevel quantum angular momentum systems interacting with electromagnetic fields undergoing continuoustime measurements. We suppose unawareness of the initial state and physical parameters, entailing the introduction of an additional state representing the estimated quantum state. The evolution of the quantum state and its estimation is described by a coupled stochastic master equation. Here, we study the asymptotic behavior of such a system in presence of a feedback controller. We provide sufficient conditions on the feedback controller and on the estimated parameters that guarantee exponential stabilization of the coupled stochastic system towards an eigenstate of the measurement operator. Furthermore, we estimate the corresponding rate of convergence. We also provide parametrized feedback laws satisfying such conditions. Our results show the robustness of the feedback stabilization strategy considered in [21] in case of imprecise initialization of the estimated state and with respect to the unknown physical parameters.
Crosslists for Thu, 9 Jul 20
 [15] arXiv:2007.03714 (crosslist from cs.LG) [pdf, other]

Title: Towards an Understanding of Residual Networks Using Neural Tangent Hierarchy (NTH)Comments: 72 pages, 1 figureSubjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Statistics Theory (math.ST); Machine Learning (stat.ML)
Gradient descent yields zero training loss in polynomial time for deep neural networks despite nonconvex nature of the objective function. The behavior of network in the infinite width limit trained by gradient descent can be described by the Neural Tangent Kernel (NTK) introduced in \cite{Jacot2018Neural}. In this paper, we study dynamics of the NTK for finite width Deep Residual Network (ResNet) using the neural tangent hierarchy (NTH) proposed in \cite{Huang2019Dynamics}. For a ResNet with smooth and Lipschitz activation function, we reduce the requirement on the layer width $m$ with respect to the number of training samples $n$ from quartic to cubic. Our analysis suggests strongly that the particular skipconnection structure of ResNet is the main reason for its triumph over fullyconnected network.
 [16] arXiv:2007.03763 (crosslist from eess.SP) [pdf, other]

Title: Realtime Intersection Optimization for Signal Phasing, Timing, and Automated Vehicles' TrajectoriesSubjects: Signal Processing (eess.SP); Optimization and Control (math.OC)
This study aims to develop a realtime intersection optimization (RIO) control algorithm to efficiently serve traffic of Connected and Automated Vehicles (CAVs) and conventional vehicles (CNVs). This paper extends previous work to consider demand over capacity conditions and trajectory deviations by reoptimizing decisions. To jointly optimize Signal Phase and Timing (SPaT) and departure time of CAVs, we formulated a joint optimization model which is reduced to and solved as a Minimum Cost Flow (MCF) problem. The MCFbased optimization models is embedded into the RIO algorithm to operate the signal controller and to plan the movement of CAVs. Simulation experiments showed 1822% travel time decrease and up to 12% capacity improvement compared to the base scenario.
 [17] arXiv:2007.03795 (crosslist from cs.LG) [pdf, other]

Title: Conditional gradient methods for stochastically constrained convex minimizationSubjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
We propose two novel conditional gradientbased methods for solving structured stochastic convex optimization problems with a large number of linear constraints. Instances of this template naturally arise from SDPrelaxations of combinatorial problems, which involve a number of constraints that is polynomial in the problem dimension. The most important feature of our framework is that only a subset of the constraints is processed at each iteration, thus gaining a computational advantage over prior works that require full passes. Our algorithms rely on variance reduction and smoothing used in conjunction with conditional gradient steps, and are accompanied by rigorous convergence guarantees. Preliminary numerical experiments are provided for illustrating the practical performance of the methods.
 [18] arXiv:2007.03946 (crosslist from cs.DS) [pdf, ps, other]

Title: A Technique for Obtaining True Approximations for $k$Center with Covering ConstraintsSubjects: Data Structures and Algorithms (cs.DS); Optimization and Control (math.OC)
There has been a recent surge of interest in incorporating fairness aspects into classical clustering problems. Two recently introduced variants of the $k$Center problem in this spirit are Colorful $k$Center, introduced by Bandyapadhyay, Inamdar, Pai, and Varadarajan, and lottery models, such as the Fair Robust $k$Center problem introduced by Harris, Pensyl, Srinivasan, and Trinh. To address fairness aspects, these models, compared to traditional $k$Center, include additional covering constraints. Prior approximation results for these models require to relax some of the normally hard constraints, like the number of centers to be opened or the involved covering constraints, and therefore, only obtain constantfactor pseudoapproximations. In this paper, we introduce a new approach to deal with such covering constraints that leads to (true) approximations, including a $4$approximation for Colorful $k$Center with constantly many colorssettling an open question raised by Bandyapadhyay, Inamdar, Pai, and Varadarajanand a $4$approximation for Fair Robust $k$Center, for which the existence of a (true) constantfactor approximation was also open. We complement our results by showing that if one allows an unbounded number of colors, then Colorful $k$Center admits no approximation algorithm with finite approximation guarantee, assuming that $\mathrm{P} \neq \mathrm{NP}$. Moreover, under the Exponential Time Hypothesis, the problem is inapproximable if the number of colors grows faster than logarithmic in the size of the ground set.
 [19] arXiv:2007.03948 (crosslist from cs.NE) [pdf, other]

Title: Learning Efficient Search Approximation in Mixed Integer Branch and BoundSubjects: Neural and Evolutionary Computing (cs.NE); Optimization and Control (math.OC)
In line with the growing trend of using machine learning to improve solving of combinatorial optimisation problems, one promising idea is to improve node selection within a mixed integer programming branchandbound tree by using a learned policy. In contrast to previous work using imitation learning, our policy is focused on learning which of a node's children to select. We present an offline method to learn such a policy in two settings: one that is approximate by committing to pruning of nodes; one that is exact and backtracks from a leaf to use a different strategy. We apply the policy within the popular opensource solver SCIP. Empirical results on four MIP datasets indicate that our node selection policy leads to solutions more quickly than the stateoftheart in the literature, but not as quickly as the stateofpractice SCIP node selector. While we do not beat the highlyoptimised SCIP baseline in terms of solving time on exact solutions, our approximationbased policies have a consistently better optimality gap than all baselines if the accuracy of the predictive model adds value to prediction. Further, the results also indicate that, when a time limit is applied, our approximation method finds better solutions than all baselines in the majority of problems tested.
 [20] arXiv:2007.04079 (crosslist from math.PR) [pdf, ps, other]

Title: Viscosity Solutions to First Order PathDependent HamiltonJacobiBellman Equations in Hilbert SpaceAuthors: Jianjun ZhouComments: 25 pages. arXiv admin note: substantial text overlap with arXiv:2005.05309, arXiv:2004.02095Subjects: Probability (math.PR); Optimization and Control (math.OC)
In this article, a notion of viscosity solutions is introduced for first order pathdependent HamiltonJacobiBellman (PHJB) equations associated with optimal control problems for pathdependent evolution equations in Hilbert space. We identify the value functional of optimal control problems as unique viscosity solution to the associated PHJB equations. We also show that our notion of viscosity solutions is consistent with the corresponding notion of classical solutions, and satisfies a stability property.
 [21] arXiv:2007.04202 (crosslist from cs.LG) [pdf, other]

Title: Stochastic Hamiltonian Gradient Methods for Smooth GamesAuthors: Nicolas Loizou, Hugo Berard, Alexia JolicoeurMartineau, Pascal Vincent, Simon LacosteJulien, Ioannis MitliagkasComments: ICML 2020  Proceedings of the 37th International Conference on Machine LearningSubjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Optimization and Control (math.OC); Machine Learning (stat.ML)
The success of adversarial formulations in machine learning has brought renewed motivation for smooth games. In this work, we focus on the class of stochastic Hamiltonian methods and provide the first convergence guarantees for certain classes of stochastic smooth games. We propose a novel unbiased estimator for the stochastic Hamiltonian gradient descent (SHGD) and highlight its benefits. Using tools from the optimization literature we show that SHGD converges linearly to the neighbourhood of a stationary point. To guarantee convergence to the exact solution, we analyze SHGD with a decreasing stepsize and we also present the first stochastic variance reduced Hamiltonian method. Our results provide the first global nonasymptotic lastiterate convergence guarantees for the class of stochastic unconstrained bilinear games and for the more general class of stochastic games that satisfy a "sufficiently bilinear" condition, notably including some nonconvex nonconcave problems. We supplement our analysis with experiments on stochastic bilinear and sufficiently bilinear games, where our theory is shown to be tight, and on simple adversarial machine learning formulations.
Replacements for Thu, 9 Jul 20
 [22] arXiv:1812.06196 (replaced) [pdf, ps, other]

Title: Meanfield games of optimal stopping: a relaxed solution approachSubjects: Optimization and Control (math.OC); Probability (math.PR)
 [23] arXiv:1904.11626 (replaced) [pdf, ps, other]

Title: Parametric Scenario Optimization under Limited Data: A Distributionally Robust Optimization ViewSubjects: Optimization and Control (math.OC); Statistics Theory (math.ST)
 [24] arXiv:2002.00365 (replaced) [pdf, ps, other]

Title: An Improved Distributed Nonlinear Observer for LeaderFollowing Consensus Via Differential Geometry ApproachSubjects: Optimization and Control (math.OC)
 [25] arXiv:2003.06935 (replaced) [pdf, other]

Title: Control of chaos with minimal information transferAuthors: Christoph KawanSubjects: Optimization and Control (math.OC); Dynamical Systems (math.DS)
 [26] arXiv:2006.00516 (replaced) [pdf, other]

Title: Tight Probability Bounds with Pairwise IndependenceComments: 33 pages, 4 figuresSubjects: Optimization and Control (math.OC); Combinatorics (math.CO); Probability (math.PR)
 [27] arXiv:2006.13844 (replaced) [pdf, ps, other]

Title: $\mathcal{H}_2$ optimal structurepreserving model order reduction of secondorder systems by iterative rational Krylov algorithmComments: 17 pages 12 figuresSubjects: Optimization and Control (math.OC); Systems and Control (eess.SY); Metric Geometry (math.MG); Numerical Analysis (math.NA)
 [28] arXiv:2006.16936 (replaced) [pdf, other]

Title: Integral Control Barrier Functions for Dynamically Defined Control LawsSubjects: Optimization and Control (math.OC); Dynamical Systems (math.DS)
 [29] arXiv:2007.03070 (replaced) [pdf, other]

Title: Novel current actuated piezoelectric composite model with fully dynamic electromagnetic fieldComments: This work is intended to be submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Optimization and Control (math.OC)
 [30] arXiv:2007.03326 (replaced) [pdf, other]

Title: An Integer Programming Approach to Deep Neural Networks with Binary Activation FunctionsJournalref: Workshop on Beyond firstorder methods in ML systems at the 37th International Conference on Machine Learning, Vienna, Austria, 2020Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
 [31] arXiv:1910.08828 (replaced) [pdf, ps, other]

Title: Dictionary Learning with Almost Sure Error ConstraintsSubjects: Machine Learning (cs.LG); Information Theory (cs.IT); Optimization and Control (math.OC); Machine Learning (stat.ML)
 [32] arXiv:2002.04131 (replaced) [pdf, other]

Title: QLearning Algorithm for MeanField Controls, with Convergence and Complexity AnalysisSubjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
 [33] arXiv:2006.06889 (replaced) [pdf, ps, other]

Title: Fast Objective and Duality Gap Convergence for Nonconvex Stronglyconcave Minmax ProblemsComments: Zhishuai Guo, Zhuoning Yuan and Yan Yan contributed equally to this workSubjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
 [34] arXiv:2006.15977 (replaced) [pdf, other]

Title: A privacypreserving tests optimization algorithm for epidemics containmentComments: added figures fixed typos added table of notationSubjects: Social and Information Networks (cs.SI); Optimization and Control (math.OC); Physics and Society (physics.socph)
 [35] arXiv:2007.02910 (replaced) [pdf, other]

Title: A Weighted Randomized Kaczmarz Method for Solving Linear SystemsAuthors: Stefan SteinerbergerSubjects: Numerical Analysis (math.NA); Optimization and Control (math.OC)
[ showing up to 2000 entries per page: fewer  more ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, math, recent, 2007, contact, help (Access key information)