Optimization and Control
New submissions
New submissions for Mon, 30 Mar 20
 [1] arXiv:2003.12151 [pdf, ps, other]

Title: Q-Learning in Regularized Mean-field Games
Comments: 10 pages, double column. arXiv admin note: text overlap with arXiv:1912.13309
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Systems and Control (eess.SY)
In this paper, we introduce a regularized mean-field game and study learning of this game under an infinite-horizon discounted reward function. The game is defined by adding a regularization function to the one-stage reward function in the classical mean-field game model. We establish a value-iteration-based learning algorithm for this regularized mean-field game using fitted Q-learning. In general, the regularization term makes the reinforcement learning algorithm more robust, with improved exploration. Moreover, it enables us to establish an error analysis of the learning algorithm without imposing restrictive convexity assumptions on the system components, which are needed in the absence of a regularization term.
 [2] arXiv:2003.12160 [pdf]

Title: Traffic assignment models. Numerical aspects
Subjects: Optimization and Control (math.OC)
In this book we describe the BMW traffic assignment model and the Nesterov-de Palma model. We consider an entropy model for the demand matrix. Based on these models we build multistage traffic assignment models. The equilibrium in such models can be found from a convex-concave saddle-point problem. We show how to solve this problem using a special combination of the universal gradient method and Sinkhorn's algorithm.
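As a rough illustration of the Sinkhorn step mentioned in the abstract above, the sketch below balances a trip matrix against given departure and arrival totals under an entropy-type model. It is a minimal sketch and not the book's algorithm: the function name `sinkhorn`, the parameter `gamma`, the exponential kernel `exp(-cost / gamma)`, and the fixed iteration count are all assumptions made here for illustration, and the coupling with the universal gradient method is not shown.

```python
import math


def sinkhorn(cost, row_sums, col_sums, gamma=1.0, iters=500):
    """Scale K = exp(-cost / gamma) so its row and column sums match the targets.

    Assumes sum(row_sums) == sum(col_sums) and all targets are positive.
    """
    n, m = len(row_sums), len(col_sums)
    kernel = [[math.exp(-cost[i][j] / gamma) for j in range(m)] for i in range(n)]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(iters):
        # Row scaling: rows of diag(u) K diag(v) now sum to row_sums.
        u = [row_sums[i] / sum(kernel[i][j] * v[j] for j in range(m))
             for i in range(n)]
        # Column scaling: columns of diag(u) K diag(v) now sum to col_sums.
        v = [col_sums[j] / sum(kernel[i][j] * u[i] for i in range(n))
             for j in range(m)]
    return [[u[i] * kernel[i][j] * v[j] for j in range(m)] for i in range(n)]


# Hypothetical two-zone example: travel costs, departures, and arrivals.
trips = sinkhorn([[1.0, 2.0], [2.0, 1.0]], [30.0, 70.0], [60.0, 40.0])
```

The alternating row and column rescaling is the whole method; a production version would normally work in the log domain for small `gamma` and stop on a marginal-error tolerance rather than a fixed iteration count.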
 [3] arXiv:2003.12183 [pdf, other]

Title: Optimal Path Planning and Coordination for Connected and Automated Vehicles
Comments: 12 pages, 4 figures
Subjects: Optimization and Control (math.OC)
In this paper, we provide a decentralized theoretical framework for coordination of connected and automated vehicles (CAVs) in different traffic scenarios. The framework includes: (1) an upper-level optimization that yields for each CAV its optimal path, including the time, to pass through a given traffic scenario while alleviating congestion; and (2) a low-level optimization that yields for each CAV its optimal control input (acceleration/deceleration) to achieve the optimal path and time derived in the upper level. We provide a complete, analytical solution of the low-level optimization problem that includes the rear-end safety constraint, where the safe distance is a function of speed, in addition to the state and control constraints. Furthermore, we provide a geometric duality framework using hyperplanes to prove strong duality of the upper-level optimization problem. The latter implies that the optimal path and time for each CAV do not activate any of the state, control, and safety constraints of the low-level optimization, thus allowing for online implementation. We validate the effectiveness of the proposed theoretical framework through simulation.
 [4] arXiv:2003.12330 [pdf, other]

Title: Nonlinear System Identification with Prior Knowledge of the Region of Attraction
Comments: 19 pages, 2 figures
Subjects: Optimization and Control (math.OC); Signal Processing (eess.SP); Systems and Control (eess.SY)
We consider the problem of nonlinear system identification when prior knowledge is available on the region of attraction (ROA) of an equilibrium point. We propose an identification method in the form of an optimization problem, minimizing the fitting error and guaranteeing the desired stability property. The problem is approached by jointly identifying the dynamics and a Lyapunov function verifying the stability property. In this setting, the hypothesis set is a reproducing kernel Hilbert space, and with respect to each point of the given subset of the ROA, the Lie derivative inequality of the Lyapunov function imposes a constraint. The problem is a nonconvex infinite-dimensional optimization with an infinite number of constraints. To obtain a tractable formulation, only a suitably designed finite subset of the constraints is considered. The resulting problem admits a solution in the form of a linear combination of the sections of the kernel and its derivatives. An equivalent optimization problem with a quadratic cost function subject to linear and bilinear constraints is derived. A suitable change of variable gives a convex reformulation of the problem. To reduce the number of hyperparameters, the optimization problem is adapted to the case of diagonal kernels. The method is demonstrated by means of an example.
 [5] arXiv:2003.12336 [pdf, other]

Title: Convex Nonparametric Formulation for Identification of Gradient Flows
Comments: 18 pages, 2 figures
Subjects: Optimization and Control (math.OC); Signal Processing (eess.SP); Systems and Control (eess.SY)
In this paper, we develop a nonparametric system identification method for nonlinear gradient-flow dynamics. In these systems, the vector field is the gradient field of a potential energy function. This fundamental fact about the dynamics of the system plays the role of structural prior knowledge as well as a constraint in the proposed identification method. While the identification problem is by nature an estimation in the space of functions, we derive an equivalent finite-dimensional formulation, which is a convex optimization in the form of a quadratic program. This makes the problem scalable and provides the opportunity to use recently developed large-scale optimization solvers. The central idea in the proposed method is representing the energy function as a difference of two convex functions and estimating these convex functions jointly. Based on necessary and sufficient conditions for function convexity, the identification problem is formulated, and then the existence, uniqueness and smoothness of the solution are addressed. We also illustrate the method numerically on a demonstrative example.
 [6] arXiv:2003.12486 [pdf, ps, other]

Title: The General Solution for Affine Control Systems on Lie Groups
Subjects: Optimization and Control (math.OC)
The purpose of this paper is to present explicitly the solution curve for affine control systems on Lie groups under the assumption that the automorphisms associated to the linear vector fields commute. If we assume that the derivations associated to linear vector fields are inner, we obtain a simpler solution and we show some results on controllability. Finally, we work with conjugation by homomorphisms of Lie groups between affine systems.
 [7] arXiv:2003.12499 [pdf, other]

Title: Frequency theorem for the regulator problem with unbounded cost functional and its applications to nonlinear delay equations
Authors: Mikhail Anikushin
Subjects: Optimization and Control (math.OC); Dynamical Systems (math.DS)
We study the quadratic regulator problem with an unbounded cost functional of general type. The motivation comes from delay equations, which have a feedback part with discrete delays (or, in other words, delta-like measurements, which are unbounded in $L_{2}$). We treat the problem in an abstract context of a certain Hilbert space, which is rigged by a Banach space. We obtain a version of the nonsingular frequency theorem, which guarantees the existence of a unique optimal process, starting in the Banach space. We show that the optimal cost (that is, the value of the quadratic functional on the optimal process) is given by the "quadratic form" of a bounded linear operator from the Banach space to its dual, and this form can be used as a Lyapunov-like functional. For a large class of nonautonomous nonlinear delay equations in feedback form we obtain an analog of the circle criterion, which is a natural extension of the corresponding criterion for ODEs.
Cross-lists for Mon, 30 Mar 20
 [8] arXiv:2003.12134 (cross-list from cs.DS) [pdf, other]

Title: Miniature Robot Path Planning for Bridge Inspection: Min-Max Cycle Cover-Based Approach
Subjects: Data Structures and Algorithms (cs.DS); Optimization and Control (math.OC)
We study the problem of planning the deployments of a group of mobile robots. While the problem and formulation can be used for many different problems, here we use bridge inspection as the motivating application for the purpose of exposition. The robots are initially stationed at a set of depots placed throughout the bridge. Each robot is then assigned a set of sites on the bridge to inspect and, upon completion, must return to the same depot where it is stored.
The problem of robot planning is formulated as a rooted min-max cycle cover problem, in which the vertex set consists of the sites to be inspected and robot depots, and the weight of an edge captures either (i) the amount of time needed to travel from one end vertex to the other vertex or (ii) the necessary energy expenditure for the travel. In the first case, the objective function is the total inspection time, whereas in the latter case, it is the maximum energy expenditure among all deployed robots. We propose a novel algorithm with an approximation ratio of $5 + \epsilon$, where $0<\epsilon<1$. In addition, the computational complexity of the proposed algorithm is shown to be $O\big( n^2+2^{m-1} n \log(n+k) \big)$, where $n$ is the number of vertices, and $m$ is the number of depots.
 [9] arXiv:2003.12189 (cross-list from eess.SY) [pdf, other]

Title: Data-Driven Control of Complex Networks
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC); Physics and Society (physics.soc-ph)
Our ability to manipulate the behavior of complex networks depends on the design of efficient control algorithms and, critically, on the availability of an accurate and tractable model of the network dynamics. While the design of control algorithms for network systems has seen notable advances in the past few years, knowledge of the network dynamics is a ubiquitous assumption that is difficult to satisfy in practice, especially when the network topology is large and, possibly, time-varying. In this paper we overcome this limitation, and develop a data-driven framework to control a complex dynamical network optimally and without requiring any knowledge of the network dynamics. Our optimal controls are constructed using a finite set of experimental data, where the unknown complex network is stimulated with arbitrary and possibly random inputs. In addition to optimality, we show that our data-driven formulas enjoy favorable computational and numerical properties even when compared to their model-based counterpart. Finally, although our controls are provably correct for networks with deterministic linear dynamics, we also characterize their performance against noisy experimental data and for a class of nonlinear dynamics that arise when manipulating neural activity in brain networks.
 [10] arXiv:2003.12192 (cross-list from eess.SY) [pdf, other]

Title: Moving horizon-based optimal scheduling of EV charging: A power system-cognizant approach
Comments: Accepted for presentation at PES General Meeting, Montreal, 2020
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
The rapid escalation in plug-in electric vehicles (PEVs) and their uncoordinated charging patterns pose several challenges in distribution system operation. Some of the undesirable effects include overloading of transformers, rapid voltage fluctuations, and over/under voltages. While this compromises the consumer power quality, it also puts extra stress on the local voltage control devices. These challenges call for a well-coordinated and power network-aware charging approach for PEVs in a community. This paper formulates a real-time electric vehicle charging scheduling problem as a mixed-integer linear program (MILP). The problem is to be solved by an aggregator that provides charging service in a residential community. The proposed formulation maximizes the profit of the aggregator, enhancing the utilization of available infrastructure. With prior knowledge of load demand and hourly electricity prices, the algorithm uses a moving time horizon optimization approach, allowing the number of arriving vehicles to be unknown. In this realistic setting, the proposed framework ensures that power system constraints are satisfied and guarantees the desired PEV charging level within the stipulated time. Numerical tests on an IEEE 13-node feeder system demonstrate the computational and performance superiority of the proposed MILP technique.
 [11] arXiv:2003.12423 (cross-list from cs.LG) [pdf, other]

Title: A Hybrid-Order Distributed SGD Method for Non-Convex Optimization to Balance Communication Overhead, Computational Complexity, and Convergence Rate
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Information Theory (cs.IT); Optimization and Control (math.OC); Machine Learning (stat.ML)
In this paper, we propose a method of distributed stochastic gradient descent (SGD) with low communication load and computational complexity that still converges quickly. To reduce the communication load, at each iteration of the algorithm, the worker nodes calculate and communicate some scalars, which are the directional derivatives of the sample functions in some pre-shared directions. However, to maintain accuracy, after every specific number of iterations, they communicate the vectors of stochastic gradients. To reduce the computational complexity in each iteration, the worker nodes approximate the directional derivatives with zeroth-order stochastic gradient estimation, by performing just two function evaluations rather than computing a first-order gradient vector. The proposed method substantially improves the convergence rate of the zeroth-order methods, guaranteeing order-wise faster convergence. Moreover, compared to the well-known communication-efficient methods of model averaging (which perform local model updates and periodic communication of the gradients to synchronize the local models), we prove that for the general class of non-convex stochastic problems and with a reasonable choice of parameters, the proposed method guarantees the same orders of communication load and convergence rate, while having order-wise less computational complexity. Experimental results on various learning problems in neural network applications demonstrate the effectiveness of the proposed approach compared to various state-of-the-art distributed SGD methods.
 [12] arXiv:2003.12455 (cross-list from cs.LG) [pdf, other]
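The two-evaluation directional-derivative idea in the abstract above can be sketched as follows. This is an illustrative sketch under stated assumptions, not the paper's algorithm: the helper names (`shared_direction`, `directional_derivative`, `apply_scalar_update`), the use of a shared random seed to agree on a direction, the central-difference formula, and the smoothing parameter `mu` are all choices made here, and the periodic full-gradient exchange is omitted.

```python
import math
import random


def shared_direction(seed, dim):
    """Unit direction that every node can regenerate from the same seed."""
    rng = random.Random(seed)
    d = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(di * di for di in d))
    return [di / norm for di in d]


def directional_derivative(f, x, d, mu=1e-5):
    """Estimate the derivative of f at x along d from two evaluations of f."""
    x_plus = [xi + mu * di for xi, di in zip(x, d)]
    x_minus = [xi - mu * di for xi, di in zip(x, d)]
    return (f(x_plus) - f(x_minus)) / (2.0 * mu)


def apply_scalar_update(x, scalar, d, step):
    """Move x along the shared direction using only the communicated scalar."""
    return [xi - step * scalar * di for xi, di in zip(x, d)]
```

In this sketch a worker would send only the single float returned by `directional_derivative`; any node holding the same seed can rebuild the direction and apply the update, which is where the communication saving would come from.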

Title: On a minimum enclosing ball of a collection of linear subspaces
Comments: 26 pages
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC)
This paper concerns the minimax center of a collection of linear subspaces. When the subspaces are $k$-dimensional subspaces of $\mathbb{R}^n$, this can be cast as finding the center of a minimum enclosing ball on a Grassmann manifold, Gr$(k,n)$. For subspaces of different dimension, the setting becomes a disjoint union of Grassmannians rather than a single manifold, and the problem is no longer well-defined. However, natural geometric maps exist between these manifolds with a well-defined notion of distance for the images of the subspaces under the mappings. Solving the initial problem in this context leads to a candidate minimax center on each of the constituent manifolds, but does not inherently provide intuition about which candidate is the best representation of the data. Additionally, the solutions of different rank are generally not nested so a deflationary approach will not suffice, and the problem must be solved independently on each manifold. We propose and solve an optimization problem parametrized by the rank of the minimax center. The solution is computed using a subgradient algorithm on the dual. By scaling the objective and penalizing the information lost by the rank-$k$ minimax center, we jointly recover an optimal dimension, $k^*$, and a central subspace, $U^* \in$ Gr$(k^*,n)$ at the center of the minimum enclosing ball, that best represents the data.
Replacements for Mon, 30 Mar 20
 [13] arXiv:1804.02100 (replaced) [pdf, ps, other]

Title: A Restless Bandit Model for Resource Allocation, Competition and Reservation
Comments: 78 pages, 8 figures, LaTeX
Subjects: Optimization and Control (math.OC)
 [14] arXiv:1904.08787 (replaced) [pdf, other]

Title: Resilient Distributed Field Estimation
Subjects: Optimization and Control (math.OC)
 [15] arXiv:1909.10300 (replaced) [pdf, other]

Title: Conservative set valued fields, automatic differentiation, stochastic gradient method and deep learning
Subjects: Optimization and Control (math.OC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
 [16] arXiv:1910.04194 (replaced) [pdf, other]

Title: Projection-free nonconvex stochastic optimization on Riemannian manifolds
Comments: Under Review
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
 [17] arXiv:1911.02206 (replaced) [pdf, other]

Title: Resilient Load Restoration in Microgrids Considering Mobile Energy Storage Fleets: A Deep Reinforcement Learning Approach
Comments: Submitted to 2020 IEEE Power and Energy Society General Meeting
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Systems and Control (eess.SY)
 [18] arXiv:1911.04688 (replaced) [pdf, ps, other]

Title: Controllability analysis and optimal control of biomass drying with reduced order models
Comments: 20 pages, 11 figures
Subjects: Optimization and Control (math.OC)
 [19] arXiv:1911.04881 (replaced) [pdf, ps, other]

Title: An observer for partially obstructed wood particles in industrial drying processes
Comments: 21 pages, 11 figures
Subjects: Optimization and Control (math.OC)
 [20] arXiv:2003.11457 (replaced) [pdf, ps, other]

Title: A proximal bundle variant with optimal iteration-complexity for a large range of prox stepsizes
Comments: 23 pages
Subjects: Optimization and Control (math.OC)
 [21] arXiv:1906.09129 (replaced) [pdf, ps, other]

Title: Metastability of the proximal point algorithm with multi-parameters
Comments: 21 pages
Subjects: Logic (math.LO); Optimization and Control (math.OC)
 [22] arXiv:1910.06567 (replaced) [pdf, ps, other]

Title: Energy-Efficient Job-Assignment Policy with Asymptotically Guaranteed Performance Deviation
Comments: 14 pages, 10 figures
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Optimization and Control (math.OC)
 [23] arXiv:2003.07939 (replaced) [pdf, other]

Title: Neural Networks for Encoding Dynamic Security-Constrained Optimal Power Flow to Mixed-Integer Linear Programs
Authors: Andreas Venzke, Daniel Timon Viola, Jeanne Mermet-Guyennet, George S. Misyris, Spyros Chatzivasileiadis
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Optimization and Control (math.OC)
 [24] arXiv:2003.11713 (replaced) [pdf, other]

Title: Event-Driven Receding Horizon Control For Online Distributed Persistent Monitoring on Graphs
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)