We gratefully acknowledge support from
the Simons Foundation and member institutions.

Optimization and Control

New submissions

[ total of 82 entries: 1-82 ]
[ showing up to 1000 entries per page: fewer | more ]

New submissions for Tue, 25 Feb 20

[1]  arXiv:2002.09488 [pdf, other]
Title: Optimal Randomized First-Order Methods for Least-Squares Problems
Comments: arXiv admin note: text overlap with arXiv:2002.00864
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)

We provide an exact analysis of a class of randomized algorithms for solving overdetermined least-squares problems. We consider first-order methods, where the gradients are pre-conditioned by an approximation of the Hessian, based on a subspace embedding of the data matrix. This class of algorithms encompasses several randomized methods among the fastest solvers for least-squares problems. We focus on two classical embeddings, namely, Gaussian projections and subsampled randomized Hadamard transforms (SRHT). Our key technical innovation is the derivation of the limiting spectral density of SRHT embeddings. Leveraging this novel result, we derive the family of normalized orthogonal polynomials of the SRHT density and we find the optimal pre-conditioned first-order method along with its rate of convergence. Our analysis of Gaussian embeddings proceeds similarly, and leverages classical random matrix theory results. In particular, we show that for a given sketch size, SRHT embeddings exhibits a faster rate of convergence than Gaussian embeddings. Then, we propose a new algorithm by optimizing the computational complexity over the choice of the sketching dimension. To our knowledge, our resulting algorithm yields the best known complexity for solving least-squares problems with no condition number dependence.

[2]  arXiv:2002.09526 [pdf, other]
Title: Stochastic Subspace Cubic Newton Method
Comments: 29 pages, 5 figures, 1 table, 1 algorithm
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)

In this paper, we propose a new randomized second-order optimization algorithm---Stochastic Subspace Cubic Newton (SSCN)---for minimizing a high dimensional convex function $f$. Our method can be seen both as a {\em stochastic} extension of the cubically-regularized Newton method of Nesterov and Polyak (2006), and a {\em second-order} enhancement of stochastic subspace descent of Kozak et al. (2019). We prove that as we vary the minibatch size, the global convergence rate of SSCN interpolates between the rate of stochastic coordinate descent (CD) and the rate of cubic regularized Newton, thus giving new insights into the connection between first and second-order methods. Remarkably, the local convergence rate of SSCN matches the rate of stochastic subspace descent applied to the problem of minimizing the quadratic function $\frac12 (x-x^*)^\top \nabla^2f(x^*)(x-x^*)$, where $x^*$ is the minimizer of $f$, and hence depends on the properties of $f$ at the optimum only. Our numerical experiments show that SSCN outperforms non-accelerated first-order CD algorithms while being competitive to their accelerated variants.

[3]  arXiv:2002.09621 [pdf, other]
Title: Global Convergence and Variance-Reduced Optimization for a Class of Nonconvex-Nonconcave Minimax Problems
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)

Nonconvex minimax problems appear frequently in emerging machine learning applications, such as generative adversarial networks and adversarial learning. Simple algorithms such as the gradient descent ascent (GDA) are the common practice for solving these nonconvex games and receive lots of empirical success. Yet, it is known that these vanilla GDA algorithms with constant step size can potentially diverge even in the convex setting. In this work, we show that for a subclass of nonconvex-nonconcave objectives satisfying a so-called two-sided Polyak-{\L}ojasiewicz inequality, the alternating gradient descent ascent (AGDA) algorithm converges globally at a linear rate and the stochastic AGDA achieves a sublinear rate. We further develop a variance reduced algorithm that attains a provably faster rate than AGDA when the problem has the finite-sum structure.

[4]  arXiv:2002.09647 [pdf, ps, other]
Title: Appropriate Learning Rates of Adaptive Learning Rate Optimization Algorithms for Training Deep Neural Networks
Authors: Hideaki Iiduka
Subjects: Optimization and Control (math.OC)

This paper deals with a convex stochastic optimization problem in deep learning and provides appropriate learning rates with which useful adaptive learning rate optimization algorithms, such as Adam and AMSGrad, for training deep neural networks can solve the problem. In particular, concrete constant learning rates are provided to approximate a solution of the problem. Moreover, sufficient conditions for diminishing learning rates are provided to ensure that any accumulation point of the sequences generated by the adaptive learning rate optimization algorithms almost surely belongs to the solution set of the problem. The adaptive learning rate optimization algorithms are examined in numerical experiments. In particular, the experiments show that the algorithms with constant learning rates perform better than ones with diminishing learning rates.

[5]  arXiv:2002.09658 [pdf, other]
Title: An Efficient MPC Algorithm For Switched Nonlinear Systems with Minimum Dwell Time Constraints
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)

This paper presents an efficient suboptimal model predictive control (MPC) algorithm for nonlinear switched systems subject to minimum dwell time constraints (MTC). While MTC are required for most physical systems due to stability, power and mechanical restrictions, MPC optimization problems with MTC are challenging to solve. To efficiently solve such problems, the on-line MPC optimization problem is decomposed into a sequence of simpler problems, which include two nonlinear programs (NLP) and a rounding step, as typically done in mixed-integer optimal control (MIOC). Unlike the classical approach that embeds MTC in a mixed-integer linear program (MILP) with combinatorial constraints in the rounding step, our proposal is to embed the MTC in one of the NLPs using move blocking. Such a formulation can speedup on-line computations by employing recent move blocking algorithms for NLP problems and by using a simple sum-up-rounding (SUR) method for the rounding step. An explicit upper bound of the integer approximation error for the rounding step is given. In addition, a combined shrinking and receding horizon strategy is developed to satisfy closed-loop MTC. Recursive feasibility is proven using a $l$-step control invariant ($l$-CI) set, where $l$ is the minimum dwell time step length. An algorithm to compute $l$-CI sets for switched linear systems off-line is also presented. Numerical studies demonstrate the efficiency and effectiveness of the proposed MPC algorithm for switched nonlinear systems with MTC.

[6]  arXiv:2002.09684 [pdf, ps, other]
Title: Time-Varying Internal Models in Robust Output Regulation
Comments: 23 pages, 3 figures, submitted
Subjects: Optimization and Control (math.OC)

We study the robust output regulation problem for linear distributed parameter systems in the situation where the frequencies of the exogeneous signals are unknown and need to be estimated based on the reference signal. We present a generalisation of the internal model principle for time-dependent controllers whose parameters converge asymptotically, and use this general framework for controller design combining adaptive frequency estimation and a time-varying internal model. The theoretic results are used in controller design for output tracking in magnetic drug delivery.

[7]  arXiv:2002.09724 [pdf, ps, other]
Title: Stochastic production planning with regime switching
Subjects: Optimization and Control (math.OC)

This paper considers the stochastic production planning with regime switching. There are two regimes corresponding to different economic cycles. A factory is planning its production so as to minimize production costs. We analyze this problem through the value function approach. The optimal production is characterized through the solution of an elliptic system of partial differential equations which is shown to have a solution.

[8]  arXiv:2002.09743 [pdf, other]
Title: Generalized Adaptive Partition-based Method for Two-Stage Stochastic Programs with Fixed Recourse
Subjects: Optimization and Control (math.OC)

We present a method to solve two-stage stochastic problems with fixed recourse, when the uncertainty space can have either discrete or continuous distributions. Given a partition of the uncertainty space, the method solves a discrete problem with one scenarios for each element of the partition. Using the information of the duals of the second stage subproblems, we provide conditions that the partition must satisfy to obtain the optimal solution. These conditions provide a guidance on how to refine the partition, converging iteratively to the optimal solution. Computational experiments show how the method automatically refine the partition of the uncertainty space in the regions of interest for the problem. The method is a generalization of the Adaptive Partition-based Method presented by Song & Luedtke (2015) for discrete distributions, also extending its applicability to more general cases in this setting.

[9]  arXiv:2002.09744 [pdf, ps, other]
Title: Controllability results for the rolling of $2$-dim. against $3$-dim. Riemannian Manifolds
Subjects: Optimization and Control (math.OC); Differential Geometry (math.DG)

In this article, we consider the rolling (or development) of two Riemannian connected manifolds $(M,g)$ and $(\hat{M},\hat{g})$ of dimensions $2$ and $3$ respectively, with the constraints of no-spinning and no-slipping. The present work is a continuation of \cite{MortadaKokkonenChitour}, which modelled the general setting of the rolling of two Riemannian connected manifolds with different dimensions as a driftless control affine system on a fibered space $Q$, with an emphasis on understanding the local structure of the rolling orbits, i.e., the reachable sets in $Q$. In this paper, the state space $Q$ has dimension eight and we show that the possible dimensions of non open rolling orbits belong to the set $\{2,5,6,7\}$. We describe the structures of orbits of dimension $2$, the possible local structures of rolling orbits of dimension $5$ and some of dimension $7$.

[10]  arXiv:2002.09774 [pdf, ps, other]
Title: Set-Convergence and Its Application: A Tutorial
Subjects: Optimization and Control (math.OC)

Optimization problems, generalized equations, and the multitude of other variational problems invariably lead to the analysis of sets and set-valued mappings as well as their approximations. We review the central concept of set-convergence and explain its role in defining a notion of proximity between sets, especially for epigraphs of functions and graphs of set-valued mappings. The development leads to an approximation theory for optimization problems and generalized equations with profound consequences for the construction of algorithms. We also introduce the role of set-convergence in variational geometry and subdifferentiability with applications to optimality conditions. Examples illustrate the importance of set-convergence in stability analysis, error analysis, construction of algorithms, statistical estimation, and probability theory.

[11]  arXiv:2002.09788 [pdf, other]
Title: Lifting for Simplicity: Concise Descriptions of Convex Sets
Comments: 45 pages, 27 figures
Subjects: Optimization and Control (math.OC); Combinatorics (math.CO)

This paper presents a selected tour through the theory and applications of lifts of convex sets. A lift of a convex set is a higher-dimensional convex set that projects onto the original set. Many convex sets have lifts that are dramatically simpler to describe than the original set. Finding such simple lifts has significant algorithmic implications, particularly for optimization problems. We consider both the classical case of polyhedral lifts, described by linear inequalities, as well as spectrahedral lifts, defined by linear matrix inequalities, with a focus on recent developments related to spectrahedral lifts.
Given a convex set, ideally we would either like to find a (low-complexity) polyhedral or spectrahedral lift, or find an obstruction proving that no such lift is possible. To this end, we explain the connection between the existence of lifts of a convex set and certain structured factorizations of its associated slack operator. Based on this characterization, we describe a uniform approach, via sums of squares, to the construction of spectrahedral lifts of convex sets and illustrate the method on several families of examples. Finally, we discuss two flavors of obstruction to the existence of lifts: one related to facial structure, and the other related to algebraic properties of the set in question.
Rather than being exhaustive, our aim is to illustrate the richness of the area. We touch on a range of different topics related to the existence of lifts, and present many examples of lifts from different areas of mathematics and its applications.

[12]  arXiv:2002.09796 [pdf, other]
Title: A Hierarchical Optimization Architecture for Large-Scale Power Networks
Subjects: Optimization and Control (math.OC)

We present a hierarchical optimization architecture for large-scale power networks that overcomes limitations of fully centralized and fully decentralized architectures. The architecture leverages principles of multigrid computing schemes, which are widely used in the solution of partial differential equations on massively parallel computers. The top layer of the architecture uses a coarse representation of the entire network while the bottom layer is composed of a family of decentralized optimization agents each operating on a network subdomain at full resolution. We use an alternating direction method of multipliers (ADMM) framework to drive coordination of the decentralized agents. We show that state and dual information obtained from the top layer can be used to accelerate the coordination of the decentralized optimization agents and to recover optimality for the entire system. We demonstrate that the hierarchical architecture can be used to manage large collections of microgrids.

[13]  arXiv:2002.09802 [pdf, other]
Title: Computing Economic-Optimal and Stable Equilibria for Droop-Controlled Microgrids
Subjects: Optimization and Control (math.OC)

We consider the problem of computing equilibria (steady-states) for droop-controlled, islanded, AC microgrids that are both economic-optimal and dynamically stable. This work is motivated by the observation that classical optimal power flow (OPF) formulations used for economic optimization might provide equilibria that are not reachable by low-level controllers (i.e., closed-loop unstable). This arises because OPF problems only enforce steady-state conditions and do not capture the dynamics. We explain this behavior by using a port-Hamiltonian microgrid representation. To overcome the limitations of OPF, the port-Hamiltonian representation can be exploited to derive a bilevel OPF formulation that seeks to optimize economics while enforcing stability. Unfortunately, bilevel optimization with a nonconvex inner problem is difficult to solve in general. As such, we propose an alternative approach (that we call probing OPF), which identifies an economic-optimal and stable equilibrium by probing a neighborhood of equilibria using random perturbations. The probing OPF is advantageous in that it is formulated as a standard nonlinear program, in that it is compatible with existing OPF frameworks, and in that it is applicable to diverse microgrid models. Experiments with the IEEE 118-bus system reveal that few probing points are required to enforce stability.

[14]  arXiv:2002.09806 [pdf, ps, other]
Title: Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games
Comments: 21 Pages. Under review
Subjects: Optimization and Control (math.OC); Computer Science and Game Theory (cs.GT); Machine Learning (stat.ML)

We consider multi-agent learning via online gradient descent (OGD) in a class of games called $\lambda$-cocoercive games, a broad class of games that admits many Nash equilibria and that properly includes strongly monotone games. We characterize the finite-time last-iterate convergence rate for joint OGD learning on $\lambda$-cocoercive games; further, building on this result, we develop a fully adaptive OGD learning algorithm that does not require any knowledge of the problem parameter (e.g., the cocoercive constant $\lambda$) and show, via a novel double-stopping-time technique, that this adaptive algorithm achieves the same finite-time last-iterate convergence rate as its non-adaptive counterpart. Subsequently, we extend OGD learning to the noisy gradient feedback case and establish last-iterate convergence results---first qualitative almost sure convergence, then quantitative finite-time convergence rates---all under non-decreasing step-sizes. These results fill in several gaps in the existing multi-agent online learning literature, where three aspects---finite-time convergence rates, non-decreasing step-sizes, and fully adaptive algorithms---have not been previously explored.

[15]  arXiv:2002.09844 [pdf, ps, other]
Title: Exact Approaches for Competitive Facility Location with Discrete Attractiveness
Subjects: Optimization and Control (math.OC)

We study a variant of the competitive facility location problem, in which a company is to locate new facilities in a market where competitor's facilities already exist. We consider the scenario where only a limited number of possible attractiveness levels is available, and the company has to select exactly one level for each open facility. The goal is to decide the facilities' locations and attractiveness levels that maximize the profit. We apply the gravity-based rule to model the behavior of the customers and formulate a multi-ratio linear fractional 0-1 program. Our main contributions are the exact solution approaches for the problem. These approaches allow for easy implementations without the need for designing complicated algorithms and are "friendly" to the users without a solid mathematical background. We conduct computational experiments on the randomly generated datasets to assess their computational performance. The results suggest that the mixed-integer quadratic conic approach outperforms the others in terms of computational time. Besides that, it is also the most straightforward one that only requires the users to be familiar with the general form of a conic quadratic inequality. Therefore, we recommend it as the primary choice for such a problem.

[16]  arXiv:2002.09852 [pdf, other]
Title: Training Linear Neural Networks: Non-Local Convergence and Complexity Results
Authors: Armin Eftekhari
Subjects: Optimization and Control (math.OC)

Linear networks provide valuable insight into the workings of neural networks in general. In this paper, we improve the state of the art in [bah2019learning] by identifying conditions under which gradient flow successfully trains a linear network, in spite of the non-strict saddle points present in the optimization landscape. We also improve the state of the art for computational complexity of training linear networks in [arora2018convergence] by establishing non-local linear convergence rates for gradient flow.
Crucially, these new results are not in the lazy training regime, cautioned against in [chizat2019lazy,yehudai2019power]. Our results require the network to have a layer with one neuron, which corresponds to the popular spiked covariance model in statistics, and subsumes the important case of networks with a scalar output. Extending these results to all linear networks remains an open problem.

[17]  arXiv:2002.09862 [pdf, other]
Title: Distributed Optimization Over Markovian Switching Random Network
Authors: Peng Yi, Li Li
Comments: 10 pages, 2 figures
Subjects: Optimization and Control (math.OC)

In this paper, we investigate the distributed convex optimization problem over a multi-agent system with Markovian switching communication networks. The objective function is the sum of each agent's local objective function, which cannot be known by other agents. The communication network is assumed to switch over a set of weight-balanced directed graphs with a Markovian property.We propose a consensus sub-gradient algorithm with two time-scale step-sizes to handle the uncertainty due to the Markovian switching topologies and the absence of global gradient information. With a proper selection of step-sizes, we prove the almost sure convergence of all agents' local estimates to the same optimal solution when the union graph of the Markovian network' states is strongly connected and the Markovian network is irreducible. Simulations are given for illustration of the results.

[18]  arXiv:2002.09916 [pdf, ps, other]
Title: Extended formulation and valid inequalities for the multi-item inventory lot-sizing problem with supplier selection
Subjects: Optimization and Control (math.OC); Computational Complexity (cs.CC)

This paper considers the multi-item inventory lot-sizing problem with supplier selection. The problem consists in determining an optimal purchasing plan in order to satisfy dynamic deterministic demands for multiple items over a finite planning horizon, taking into account the fact that multiple suppliers are available to purchase from. As the complexity of the problem was an open question, we show that it is NP-hard. We propose a facility location extended formulation for the problem which can be preprocessed based on the cost structure and describe new valid inequalities in the original space of variables, which we denote $(l,S_j)$-inequalities. Furthermore, we study the projection of the extended formulation into the original space and show the connection between the inequalities generated by this projection and the newly proposed $(l,S_j)$-inequalities. Additionally, we present a simple and easy to implement yet very effective MIP (mixed integer programming) heuristic using the extended formulation. Computational results show that the preprocessed facility location extended formulation outperforms all other formulations for small and medium instances, as it can solve nearly all of them to optimality within the time limit. Moreover, the presented MIP heuristic is able to obtain solutions which strictly improve those achieved by a state-of-the art method for all the large benchmark instances.

[19]  arXiv:2002.10023 [pdf, other]
Title: Suboptimal Stabilization of Unknown Nonlinear Systems via Extended State Observers
Authors: Amir Shakouri
Comments: 5 pages, 2 figures
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)

This paper introduces a globally asymptotically stable, locally optimal, stabilizer for multi-input muti-output nonlinear systems of any order with totally unknown dynamics in a special form. The control scheme proposed in this paper lies at the intersection of the active disturbance rejection control (ADRC) and the state-dependent Riccati equation (SDRE) control method. It is shown that using an extended state observer, the state-dependent coefficient matrix of the nonlinear system can be estimated. The system in then stabilized by a suboptimal controller in the region where SDRE method is effective (an estimated region of attraction) and uses an ADRC outside the region as a backup for global stability assurance.

[20]  arXiv:2002.10065 [pdf, other]
Title: Stochastic Model Predictive Control for Central HVAC Plants
Comments: 34 pages, 15 figures
Subjects: Optimization and Control (math.OC)

We present a stochastic model predictive control (MPC) framework for central heating, ventilation, and air conditioning (HVAC) plants. The framework uses real data to forecast and quantify uncertainty of disturbances affecting the system over multiple timescales (electrical loads, heating/cooling loads, and energy prices). We conduct detailed closed-loop simulations and systematic benchmarks for the central HVAC plant of a typical university campus. Results demonstrate that deterministic MPC fails to properly capture disturbances and that this translates into economic penalties associated with peak demand charges and constraint violations in thermal storage capacity (overflow and/or depletion). Our results also demonstrate that stochastic MPC provides a more systematic approach to mitigate uncertainties and that this ultimately leads to cost savings of up to 7.5% and to mitigation of storage constraint violations. Benchmark results also indicate that these savings are close to ideal savings (9.6%) obtained under MPC with perfect information.

[21]  arXiv:2002.10079 [pdf, other]
Title: A Distributed Architecture for Real-time Hybrid Traffic Light Control in Urban Transportation Networks
Authors: Yicheng Zhang
Comments: This is a brief summary of the talk presented during the IEEE ITSS Young Professionals Workshop affiliated with IEEE Intelligent Transportation Systems Conference 2019 at Auckland, New Zealand
Subjects: Optimization and Control (math.OC)

A macroscopic model is proposed to depict the traffic dynamics involved in urban traffic systems. The link dynamics are described based on the cell-transmission model and bounded by the link capacities, while the flow dynamics are proposed based on the discharge headways and saturation flow at intersections. To fulfill the requirement of a closed-loop traffic light control strategy, an approach to estimate the branching ratios at intersections is proposed and simulations show that the convergence would be achieved under constant cyclic flow profiles. Furthermore, a system partitioning approach is proposed based congestion level identification, which is achieved via a machine learning method and a hybrid traffic network control strategy is proposed to integrate different traffic light control schemes together.

[22]  arXiv:2002.10090 [pdf]
Title: Multi-objective beetle antennae search algorithm
Comments: 5 figures and 1 table
Subjects: Neural and Evolutionary Computing (cs.NE); Optimization and Control (math.OC)

In engineering optimization problems, multiple objectives with a large number of variables under highly nonlinear constraints are usually required to be simultaneously optimized. Significant computing effort are required to find the Pareto front of a nonlinear multi-objective optimization problem. Swarm intelligence based metaheuristic algorithms have been successfully applied to solve multi-objective optimization problems. Recently, an individual intelligence based algorithm called beetle antennae search algorithm was proposed. This algorithm was proved to be more computationally efficient. Therefore, we extended this algorithm to solve multi-objective optimization problems. The proposed multi-objective beetle antennae search algorithm is tested using four well-selected benchmark functions and its performance is compared with other multi-objective optimization algorithms. The results show that the proposed multi-objective beetle antennae search algorithm has higher computational efficiency with satisfactory accuracy.

[23]  arXiv:2002.10124 [pdf, other]
Title: Reformulation of the M-stationarity conditions as a system of discontinuous equations and its solution by a semismooth Newton method
Subjects: Optimization and Control (math.OC)

We show that the Mordukhovich-stationarity system associated with a mathematical program with complementarity constraints (MPCC) can be equivalently written as a system of discontinuous equations which can be tackled with a semismooth Newton method. We show that the resulting algorithm can be interpreted as an active set strategy for MPCCs. Local fast convergence of the method is guaranteed under validity of an MPCC-tailored version of LICQ and a suitable second-order condition. In case of linear-quadratic MPCCs, the LICQ-type constraint qualification can be replaced by a weaker condition which depends on the underlying multipliers. We discuss a suitable globalization strategy for our method. Some numerical results are presented in order to illustrate our theoretical findings.

[24]  arXiv:2002.10140 [pdf, ps, other]
Title: Continuity of Chen-Fliess Series for Applications in System Identification and Machine Learning
Comments: 17 pages, 1 figure, 24th International Symposium on Mathematical Theory of Networks and Systems, (MTNS 2020)
Subjects: Optimization and Control (math.OC); Functional Analysis (math.FA)

Model continuity plays an important role in applications like system identification, adaptive control, and machine learning. This paper provides sufficient conditions under which input-output systems represented by locally convergent Chen-Fliess series are jointly continuous with respect to their generating series and as operators mapping a ball in an $L_p$-space to a ball in an $L_q$-space, where $p$ and $q$ are conjugate exponents. The starting point is to introduce a class of topological vector spaces known as Silva spaces to frame the problem and then to employ the concept of a direct limit to describe convergence. The proof of the main continuity result combines elements of proofs for other forms of continuity appearing in the literature to produce the desired conclusion.

[25]  arXiv:2002.10153 [pdf, ps, other]
Title: Last-mile Delivery: Optimal Locker Location Under Multinomial Logit Choice Model
Subjects: Optimization and Control (math.OC)

One innovative solution to the last-mile delivery problem is the self-service locker system. Motivated by a real case in Singapore, we consider a POP-Locker Alliance who operates a set of POP-stations and wishes to improve the last-mile delivery by opening new locker facilities. We propose a quantitative approach to determine the optimal locker location with the objective to maximize the overall service provided by the alliance. Customer's choices regarding the use of facilities are explicitly considered. They are predicted by a multinomial logit model. We then formulate the location problem as a multi-ratio linear-fractional 0-1 program and provide two solution approaches. The first one is to reformulate the original problem as a mixed-integer linear program, which is further strengthened using conditional McCormick inequalities. This approach is an exact method, developed for small-scale problems. For large-scale problems, we propose a Suggest-and-Improve framework with two embedded algorithms. Numerical studies indicated that our framework is an efficient approach that yields high-quality solutions. Finally, we conducted a case study. The results highlighted the importance of considering the customers' choices. Under different parameter values of the multinomial logit model, the decisions could be completely different. Therefore, the parameter value should be carefully estimated in advance.

[26]  arXiv:2002.10205 [pdf, other]
Title: Velocity-aided IMU-based Attitude Estimation
Subjects: Optimization and Control (math.OC)

This paper addresses the problem of estimating the attitude of a rigid body, which is subject to high accelerations and equipped with inertial measurement unit (IMU) and sensors providing the body velocity (expressed in the reference frame attached to the body). That issue can be treated differently depending on the level of confidence in the measurements of the magnetometer of the IMU, particularly with regard to the observation of the inclination component with respect to the vertical direction, rendering possible to describe the interaction with gravity. Two cases are then studied: either (i) the magnetometer is absent and only the inclination can be estimated, (ii) the magnetometer is present, giving redundancy and full attitude observability. In the latter case, the presented observer allows to tune how much the inclination estimation is influenced by the magnetometer. All state estimators are proposed with proof of almost global asymptotic stability and local exponential convergence. Finally, these estimators are compared with state-of-the-art solutions in clean and noisy simulations, allowing recommended solutions to be drawn for each case.

[27]  arXiv:2002.10291 [pdf, other]
Title: Estimation-aware model predictive path-following control for a general 2-trailer with a car-like tractor
Comments: Submitted to IEEE Transactions on Robotics. arXiv admin note: text overlap with arXiv:2002.06874
Subjects: Optimization and Control (math.OC); Robotics (cs.RO)

The design of the path-following controller is crucial to enable reliable autonomous vehicle operation. This design problem is especially challenging for a general 2-trailer with a car-like tractor due to the tractor's curvature limitations and the vehicle's structurally unstable joint-angle kinematics in backward motion. Additionally, to make the control system independent of any sensor mounted on the trailer, advanced sensors placed in the rear of the tractor have been proposed to solve the joint-angle estimation problem. Since these sensors typically have a limited field of view, the proposed estimation solution introduces restrictions on the joint-angle configurations that can be estimated with high accuracy. To model and explicitly consider these constraints in the controller, a model predictive path-following control approach is proposed. Two approaches with different computation complexity and performance are presented. In the first approach, the constraint on the joint angles is modeled as a union of convex polytopes, making it necessary to incorporate binary decision variables. The second approach avoids binary variables at the expense of a more restrictive approximation of the joint-angle constraints. In simulations and field experiments, the performance of the proposed path-following control approach in terms of suppressing disturbances and recovering from non-trivial initial states is compared with a previously proposed control strategy where the joint-angle constraints are neglected.

[28]  arXiv:2002.10338 [pdf]
Title: Extended Convex Hull-Based Distributed Operation of Integrated Electric-Gas Systems
Comments: 8 pages, 5 figures
Subjects: Optimization and Control (math.OC)

Distributed operation of integrated electricity and gas systems (IEGS) receives much attention since it respects data security and privacy between different agencies. This paper proposes an extended convex hull (ECH) based method to address the distributed optimal energy flow (OEF) problem in the IEGS. First, a multi-block IEGS model is obtained by dividing it into N blocks according to physical and regional differences. This multi-block model is then convexified by replacing the nonconvex gas transmission equation with its ECH-based constraints. The Jacobi-Proximal alternating direction method of multipliers (J-ADMM) algorithm is adopted to solve the convexified model and minimize its operation cost. Finally, the feasibility of the optimal solution for the convexified problem is checked, and a sufficient condition is developed. The optimal solution for the original nonconvex problem is recovered from that for the convexified problem if the sufficient condition is satisfied. Test results reveal that this method is tractable and effective in obtaining the feasible optimal solution for radial gas networks.

[29]  arXiv:2002.10369 [pdf, ps, other]
Title: On the stabilizability radius of linear switched systems
Comments: 14 pages
Subjects: Optimization and Control (math.OC)

We investigate the stabilizability of discrete-time linear switched systems, when the sole control action of the controller is the switching signal, and when the controller has access to the state of the system in real time. Despite their importance in many control settings, no algorithm is known that allows to decide the stabilizability of such systems, and very simple examples have been known for long, for which the stabilizability question is open.
We provide new results allowing us to bound the so-called stabilizability radius, which characterizes the stabilizability property of discrete-time linear switched systems. These results allow us to improve significantly the computation of the stabilizability radius for the above-mentioned examples. As a by-product, we exhibit a discontinuity property for this problem, which brings theoretical understanding of its complexity.

[30]  arXiv:2002.10386 [pdf]
Title: A Novel Decomposition Solution Approach for the Restoration Problem in Distribution Networks
Subjects: Optimization and Control (math.OC)

The distribution network restoration problem is by nature a mixed integer and non-linear optimization problem due to the switching decisions and Optimal Power Flow (OPF) constraints, respectively. The link between these two parts involves logical implications modelled through big-M coefficients. The presence of these coefficients makes the relaxation of the mixed-integer problem using branch-and-bound method very poor in terms of computation burden. Moreover, this link inhibits the use of classical Benders algorithm in decomposing the problem because the resulting cuts will still depend on the big-M coefficients. In this paper, a novel decomposition approach is proposed for the restoration problem named Modified Combinatorial Benders (MCB). In this regard, the reconfiguration problem and the OPF problem are decomposed into master and sub problems, which are solved through successive iterations. In the case of a large outage area, the numerical results show that the MCB provides, within a short time (after a few iterations), a restoration solution with a quality that is close to the proven optimality when it can be exhibited.

[31]  arXiv:2002.10393 [pdf]
Title: Optimal Load Restoration in Active Distribution Networks Complying with Starting Transients of Induction Motors
Subjects: Optimization and Control (math.OC)

Large horsepower induction motors play a critical role as industrial drives in production facilities. The operational safety of distribution networks during the starting transients of these motor loads is a critical concern for the operators. In this paper, an analytical and convex optimization model is derived representing the starting transients of the induction motor in a semi-static fashion. This model is used to find the optimal energization sequence of different loads (static and motor loads) following an outage in a distribution network. The optimization problem includes the optimal control of the converter-based DGs and autotransformers that are used for the induction motor starting. These models together with the semi-static model of the induction motor are integrated into a relaxed power flow formulation resulting in a Mixed-Integer Second Order Cone Programming (SOCP) problem. This formulation represents the transient operational limits that are imposed by different protection devices both in the motor side and network side. The functionality of the proposed optimization problem is evaluated in the case of a large-scale test study and under different simulation scenarios. The feasibility and accuracy of the optimization results are validated using I) off-line time-domain simulations, and II) a Power Hardware-In-the-Loop experiment.

[32]  arXiv:2002.10404 [pdf, other]
Title: An Outer-approximation Guided Optimization Approach for Constrained Neural Network Inverse Problems
Authors: Myun-Seok Cheon
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)

This paper discusses an outer-approximation guided optimization method for constrained neural network inverse problems with rectified linear units. The constrained neural network inverse problems refer to an optimization problem to find the best set of input values of a given trained neural network in order to produce a predefined desired output in presence of constraints on input values. This paper analyzes the characteristics of optimal solutions of neural network inverse problems with rectified activation units and proposes an outer-approximation algorithm by exploiting their characteristics. The proposed outer-approximation guided optimization comprises primal and dual phases. The primal phase incorporates neighbor curvatures with neighbor outer-approximations to expedite the process. The dual phase identifies and utilizes the structure of local convex regions to improve the convergence to a local optimal solution. At last, computation experiments demonstrate the superiority of the proposed algorithm compared to a projected gradient method.

[33]  arXiv:2002.10421 [pdf, other]
Title: Dual Mirror Descent for Online Allocation Problems
Subjects: Optimization and Control (math.OC)

We consider online allocation problems with concave revenue functions and resource constraints, which are central problems in revenue management and online advertising. In these settings, requests arrive sequentially during a finite horizon and, for each request, a decision maker needs to choose an action that consumes a certain amount of resources and generates revenue. The revenue function and resource consumption of each request are drawn independently and at random from a probability distribution that is unknown to the decision maker. The objective is to maximize cumulative revenues subject to a constraint on the total consumption of resources.
We design a general class of algorithms that achieve sub-linear expected regret compared to the hindsight optimal allocation. Our algorithms operate in the Lagrangian dual space: they maintain a dual multiplier for each resource that is updated using online mirror descent. By choosing the reference function accordingly, we recover dual sub-gradient descent and dual exponential weights algorithm. The resulting algorithms are simple, efficient, and shown to attain the optimal order of regret when the length of the horizon and the initial number of resources are scaled proportionally. We discuss applications to online bidding in repeated auctions with budget constraints and online proportional matching with high entropy.

Cross-lists for Tue, 25 Feb 20

[34]  arXiv:2002.09539 (cross-list from cs.LG) [pdf, other]
Title: Overlap Local-SGD: An Algorithmic Approach to Hide Communication Delays in Distributed SGD
Comments: Accepted to ICASSP 2020
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Optimization and Control (math.OC); Machine Learning (stat.ML)

Distributed stochastic gradient descent (SGD) is essential for scaling the machine learning algorithms to a large number of computing nodes. However, the infrastructures variability such as high communication delay or random node slowdown greatly impedes the performance of distributed SGD algorithm, especially in a wireless system or sensor networks. In this paper, we propose an algorithmic approach named Overlap-Local-SGD (and its momentum variant) to overlap the communication and computation so as to speedup the distributed training procedure. The approach can help to mitigate the straggler effects as well. We achieve this by adding an anchor model on each node. After multiple local updates, locally trained models will be pulled back towards the synchronized anchor model rather than communicating with others. Experimental results of training a deep neural network on CIFAR-10 dataset demonstrate the effectiveness of Overlap-Local-SGD. We also provide a convergence guarantee for the proposed algorithm under non-convex objective functions.

[35]  arXiv:2002.09718 (cross-list from cs.LG) [pdf, other]
Title: Safe Screening for the Generalized Conditional Gradient Method
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)

The conditional gradient method (CGM) has been widely used for fast sparse approximation, having a low per iteration computational cost for structured sparse regularizers. We explore the sparsity acquiring properties of a generalized CGM (gCGM), where the constraint is replaced by a penalty function based on a gauge penalty; this can be done without significantly increasing the per-iteration computation, and applies to general notions of sparsity. Without assuming bounded iterates, we show $O(1/t)$ convergence of the function values and gap of gCGM. We couple this with a safe screening rule, and show that at a rate $O(1/(t\delta^2))$, the screened support matches the support at the solution, where $\delta \geq 0$ measures how close the problem is to being degenerate. In our experiments, we show that the gCGM for these modified penalties have similar feature selection properties as common penalties, but with potentially more stability over the choice of hyperparameter.

[36]  arXiv:2002.09795 (cross-list from cs.LG) [pdf, ps, other]
Title: Periodic Q-Learning
Authors: Donghwan Lee, Niao He
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)

The use of target networks is a common practice in deep reinforcement learning for stabilizing the training; however, theoretical understanding of this technique is still limited. In this paper, we study the so-called periodic Q-learning algorithm (PQ-learning for short), which resembles the technique used in deep Q-learning for solving infinite-horizon discounted Markov decision processes (DMDP) in the tabular setting. PQ-learning maintains two separate Q-value estimates - the online estimate and target estimate. The online estimate follows the standard Q-learning update, while the target estimate is updated periodically. In contrast to the standard Q-learning, PQ-learning enjoys a simple finite time analysis and achieves better sample complexity for finding an epsilon-optimal policy. Our result provides a preliminary justification of the effectiveness of utilizing target estimates or networks in Q-learning algorithms.

[37]  arXiv:2002.09880 (cross-list from cs.DS) [pdf, other]
Title: Mixed Integer Programming for Searching Maximum Quasi-Bicliques
Comments: This paper draft is stored here for self-archiving purposes
Journal-ref: Springer Proceedings in Mathematics & Statistics, vol 315. Springer, Cham (2020)
Subjects: Data Structures and Algorithms (cs.DS); Artificial Intelligence (cs.AI); Discrete Mathematics (cs.DM); Social and Information Networks (cs.SI); Optimization and Control (math.OC)

This paper is related to the problem of finding the maximal quasi-bicliques in a bipartite graph (bigraph). A quasi-biclique in the bigraph is its "almost" complete subgraph. The relaxation of completeness can be understood variously; here, we assume that the subgraph is a $\gamma$-quasi-biclique if it lacks a certain number of edges to form a biclique such that its density is at least $\gamma \in (0,1]$. For a bigraph and fixed $\gamma$, the problem of searching for the maximal quasi-biclique consists of finding a subset of vertices of the bigraph such that the induced subgraph is a quasi-biclique and its size is maximal for a given graph. Several models based on Mixed Integer Programming (MIP) to search for a quasi-biclique are proposed and tested for working efficiency. An alternative model inspired by biclustering is formulated and tested; this model simultaneously maximizes both the size of the quasi-biclique and its density, using the least-square criterion similar to the one exploited by triclustering \textsc{TriBox}.

[38]  arXiv:2002.09889 (cross-list from stat.ML) [pdf, other]
Title: Investigating the interaction between gradient-only line searches and different activation functions
Comments: 37 pages, 9 figures, submitted for journal review
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)

Gradient-only line searches (GOLS) adaptively determine step sizes along search directions for discontinuous loss functions resulting from dynamic mini-batch sub-sampling in neural network training. Step sizes in GOLS are determined by localizing Stochastic Non-Negative Associated Gradient Projection Points (SNN-GPPs) along descent directions. These are identified by a sign change in the directional derivative from negative to positive along a descent direction. Activation functions are a significant component of neural network architectures as they introduce non-linearities essential for complex function approximations. The smoothness and continuity characteristics of the activation functions directly affect the gradient characteristics of the loss function to be optimized. Therefore, it is of interest to investigate the relationship between activation functions and different neural network architectures in the context of GOLS. We find that GOLS are robust for a range of activation functions, but sensitive to the Rectified Linear Unit (ReLU) activation function in standard feedforward architectures. The zero-derivative in ReLU's negative input domain can lead to the gradient-vector becoming sparse, which severely affects training. We show that implementing architectural features such as batch normalization and skip connections can alleviate these difficulties and benefit training with GOLS for all activation functions considered.

[39]  arXiv:2002.09996 (cross-list from stat.ML) [pdf, other]
Title: ConBO: Conditional Bayesian Optimization
Comments: 10 pages, 7 pages appendix
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)

Bayesian optimization is a class of data efficient model based algorithms typically focused on global optimization. We consider the more general case where a user is faced with multiple problems that each need to be optimized conditional on a state variable, for example we optimize the location of ambulances conditioned on patient distribution given a range of cities with different patient distributions. Similarity across objectives boosts optimization of each objective in two ways: in modelling by data sharing across objectives, and also in acquisition by quantifying how all objectives benefit from a single point on one objective. For this we propose ConBO, a novel efficient algorithm that is based on a new hybrid Knowledge Gradient method, that outperforms recently published works on synthetic and real world problems, and is easily parallelized to collecting a batch of points.

[40]  arXiv:2002.10069 (cross-list from cs.LG) [pdf, other]
Title: Robust Learning-Based Control via Bootstrapped Multiplicative Noise
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Dynamical Systems (math.DS); Optimization and Control (math.OC); Machine Learning (stat.ML)

Despite decades of research and recent progress in adaptive control and reinforcement learning, there remains a fundamental lack of understanding in designing controllers that provide robustness to inherent non-asymptotic uncertainties arising from models estimated with finite, noisy data. We propose a robust adaptive control algorithm that explicitly incorporates such non-asymptotic uncertainties into the control design. The algorithm has three components: (1) a least-squares nominal model estimator; (2) a bootstrap resampling method that quantifies non-asymptotic variance of the nominal model estimate; and (3) a non-conventional robust control design method using an optimal linear quadratic regulator (LQR) with multiplicative noise. A key advantage of the proposed approach is that the system identification and robust control design procedures both use stochastic uncertainty representations, so that the actual inherent statistical estimation uncertainty directly aligns with the uncertainty the robust controller is being designed against. We show through numerical experiments that the proposed robust adaptive controller can significantly outperform the certainty equivalent controller on both expected regret and measures of regret risk.

[41]  arXiv:2002.10110 (cross-list from math.NA) [pdf, ps, other]
Title: Revisiting EXTRA for Smooth Distributed Optimization
Authors: Huan Li, Zhouchen Lin
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG); Optimization and Control (math.OC)

EXTRA is a popular method for the dencentralized distributed optimization and has broad applications. This paper revisits the EXTRA. Firstly, we give a sharp complexity analysis for EXTRA with the improved $O\left(\left(\frac{L}{\mu}+\frac{1}{1-\sigma_2(W)}\right)\log\frac{1}{\epsilon(1-\sigma_2(W))}\right)$ communication and computation complexities for $\mu$-strongly convex and $L$-smooth problems, where $\sigma_2(W)$ is the second largest singular value of the weight matrix $W$. When the strong convexity is absent, we prove the $O\left(\left(\frac{L}{\epsilon}+\frac{1}{1-\sigma_2(W)}\right)\log\frac{1}{1-\sigma_2(W)}\right)$ complexities. Then, we use the Catalyst framework to accelerate EXTRA and obtain the $O\left(\sqrt{\frac{L}{\mu(1-\sigma_2(W))}}\log\frac{ L}{\mu(1-\sigma_2(W))}\log\frac{1}{\epsilon}\right)$ communication and computation complexities for strongly convex and smooth problems and the $O\left(\sqrt{\frac{L}{\epsilon(1-\sigma_2(W))}}\log\frac{1}{\epsilon(1-\sigma_2(W))}\right)$ complexities for non-strongly convex ones. Our communication complexities of the accelerated EXTRA are only worse by the factors of $\left(\log\frac{L}{\mu(1-\sigma_2(W))}\right)$ and $\left(\log\frac{1}{\epsilon(1-\sigma_2(W))}\right)$ from the lower complexity bounds for strongly convex and non-strongly convex problems, respectively.

[42]  arXiv:2002.10113 (cross-list from cs.LG) [pdf, other]
Title: APAC-Net: Alternating the Population and Agent Control via Two Neural Networks to Solve High-Dimensional Stochastic Mean Field Games
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA); Optimization and Control (math.OC); Machine Learning (stat.ML)

We present APAC-Net, an alternating population and agent control neural network for solving stochastic mean field games (MFGs). Our algorithm is geared toward high-dimensional instances MFGs that are beyond reach with existing solution methods. We achieve this in two steps. First, we take advantage of the underlying variational primal-dual structure that MFGs exhibit and phrase it as a convex-concave saddle point problem. Second, we parameterize the value and density functions by two neural networks, respectively. By phrasing the problem in this manner, solving the MFG can be interpreted as a special case of training a generative adversarial generative network (GAN). We show the potential of our method on up to 50-dimensional MFG problems.

[43]  arXiv:2002.10172 (cross-list from cs.AI) [pdf, other]
Title: Optimal strategies in the Fighting Fantasy gaming system: influencing stochastic dynamics by gambling with limited resource
Authors: Iain G. Johnston
Comments: Keyword: stochastic game; Markov decision problem; stochastic simulation; dynamic programming; resource allocation; stochastic optimal control; Bellman equation
Subjects: Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Optimization and Control (math.OC)

Fighting Fantasy is a popular recreational fantasy gaming system worldwide. Combat in this system progresses through a stochastic game involving a series of rounds, each of which may be won or lost. Each round, a limited resource (`luck') may be spent on a gamble to amplify the benefit from a win or mitigate the deficit from a loss. However, the success of this gamble depends on the amount of remaining resource, and if the gamble is unsuccessful, benefits are reduced and deficits increased. Players thus dynamically choose to expend resource to attempt to influence the stochastic dynamics of the game, with diminishing probability of positive return. The identification of the optimal strategy for victory is a Markov decision problem that has not yet been solved. Here, we combine stochastic analysis and simulation with dynamic programming to characterise the dynamical behaviour of the system in the absence and presence of gambling policy. We derive a simple expression for the victory probability without luck-based strategy. We use a backward induction approach to solve the Bellman equation for the system and identify the optimal strategy for any given state during the game. The optimal control strategies can dramatically enhance success probabilities, but take detailed forms; we use stochastic simulation to approximate these optimal strategies with simple heuristics that can be practically employed. Our findings provide a roadmap to improving success in the games that millions of people play worldwide, and inform a class of resource allocation problems with diminishing returns in stochastic games.

[44]  arXiv:2002.10246 (cross-list from cs.CE) [pdf, other]
Title: A subtractive manufacturing constraint for level set topology optimization
Journal-ref: Structural and Multidisciplinary Optimization (2020)
Subjects: Computational Engineering, Finance, and Science (cs.CE); Optimization and Control (math.OC)

We present a method for enforcing manufacturability constraints in generated parts such that they will be automatically ready for fabrication using a subtractive approach. We primarily target multi-axis CNC milling approaches but the method should generalize to other subtractive methods as well. To this end, we take as user input: the radius of curvature of the tool bit, a coarse model of the tool head and optionally a set of milling directions. This allows us to enforce the following manufacturability conditions: 1) surface smoothness such that the radius of curvature of the part does not exceed the milling bit radius, 2) orientation such that every part of the surface to be milled is visible from at least one milling direction, 3) accessibility such that every surface patch can be reached by the tool bit without interference with the tool or head mount. We will show how to efficiently enforce the constraint during level set-based topology optimization modifying the advection velocity such that at each iteration the topology optimization maintains a descent optimization direction and does not violate any of the manufacturability conditions. This approach models the actual subtractive process by carving away material accessible to the machine at each iteration until a local optimum is achieved.

[45]  arXiv:2002.10255 (cross-list from cs.CE) [pdf, other]
Title: Ambiguous phase assignment of discretized 3D geometries in topology optimization
Subjects: Computational Engineering, Finance, and Science (cs.CE); Optimization and Control (math.OC)

Level set-based immersed boundary techniques operate on nonconforming meshes while providing a crisp definition of interface and external boundaries. In such techniques, an isocontour of a level set field interpolated from nodal level set values defines a problem's geometry. If the interface is explicitly tracked, the intersected elements are typically divided into sub-elements to which a phase needs to be assigned. Due to loss of information in the discretization of the level set field, certain geometrical configurations allow for ambiguous phase assignment of sub-elements, and thus ambiguous definition of the interface. The study presented here focuses on analyzing these topological ambiguities in embedded geometries constructed from discretized level set fields on hexahedral meshes. The analysis is performed on three-dimensional problems where several intersection configurations can significantly affect the problem's topology. This is in contrast to two-dimensional problems where ambiguous topological features exist only in one intersection configuration and identifying and resolving them is straightforward. A set of rules that resolve these ambiguities for two-phase problems is proposed, and algorithms for their implementations are provided. The influence of these rules on the evolution of the geometry in the optimization process is investigated with linear elastic topology optimization problems. These problems are solved by an explicit level set topology optimization framework that uses the extended finite element method to predict physical responses. This study shows that the choice of a rule to resolve topological features can result in drastically different final geometries. However, for the problems studied in this paper, the performances of the optimized design do not differ.

[46]  arXiv:2002.10298 (cross-list from physics.soc-ph) [pdf, ps, other]
Title: Reducing Urban Traffic Congestion Due To Localized Routing Decisions
Subjects: Physics and Society (physics.soc-ph); Optimization and Control (math.OC)

Balancing traffic flow by influencing drivers' route choices to alleviate congestion is becoming increasingly more appealing in urban traffic planning. Here, we introduce a discrete dynamical model comprising users who make their own routing choices on the basis of local information and those who consider routing advice based on localized inducement. We identify the formation of traffic patterns, develop a scalable optimization method for identifying control values used for user guidance, and test the effectiveness of these measures on synthetic and real-world road networks.

Replacements for Tue, 25 Feb 20

[47]  arXiv:1706.09741 (replaced) [src]
Title: A Differential Game Model of Opinion Dynamics: Accord and Discord as Nash Equilibria
Comments: It is uploaded as a second version of arXiv:1706.09741. However, it is a different paper, and we would like to submit it as such
Subjects: Optimization and Control (math.OC)
[48]  arXiv:1808.02121 (replaced) [pdf, other]
Title: Composite Convex Optimization with Global and Local Inexact Oracles
Comments: 28 pages, 6 figures, and 2 tables
Journal-ref: Computational Optimization and Applications, 2020
Subjects: Optimization and Control (math.OC)
[49]  arXiv:1809.08271 (replaced) [pdf, other]
Title: Asymptotically Optimal Inventory Control for Assemble-to-Order Systems
Comments: 63 pages
Subjects: Optimization and Control (math.OC)
[50]  arXiv:1809.08554 (replaced) [pdf, other]
Title: An explicit solution for a multimarginal mass transportation problem
Comments: 31 pages, 4 figures. The paper was completely rewritten. Heuristic considerations to find a solution of the primal problem added. Algorithm to find the primal problem solution numerically added (arbitrary marginals). The construction was generalized for a C(ln x + ln y + ln z), C is convex. Measure on the triangle was found with the support singular with respect to the Lebesgue measure
Subjects: Optimization and Control (math.OC)
[51]  arXiv:1812.00885 (replaced) [pdf, ps, other]
Title: AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Discounted Markov Decision Processes with Near-Optimal Sample Complexity
Comments: Accepted by AISTATS 2020
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[52]  arXiv:1812.10278 (replaced) [pdf, ps, other]
Title: Primal-dual interior-point Methods for Semidefinite Programming from an algebraic point of view, or: Using Noncommutativity for Optimization
Authors: Konrad Schrempf
Comments: 49 pages, 10 figures, 3 tables, 5 Octave/Matlab files; slightly updated version
Subjects: Optimization and Control (math.OC)
[53]  arXiv:1901.07217 (replaced) [pdf, ps, other]
Title: Reiterated periodic homogenization of integral functionals with convex and nonstandard growth integrands
Subjects: Optimization and Control (math.OC)
[54]  arXiv:1903.01786 (replaced) [pdf, other]
Title: Managing Randomization in the Multi-Block Alternating Direction Method of Multipliers for Quadratic Optimization
Comments: Expanded and streamlined theoretical sections. Added comparisons with other multi-block ADMM variants. Updated Computational Studies Section on continuous problems -- reporting primal and dual residuals instead of objective value gap. Added selected machine learning problems (ElasticNet/Lasso and Support Vector Machine) to Computational Studies Section
Subjects: Optimization and Control (math.OC)
[55]  arXiv:1903.02124 (replaced) [pdf, other]
Title: Experimenting in Equilibrium
Subjects: Optimization and Control (math.OC); Econometrics (econ.EM); Methodology (stat.ME)
[56]  arXiv:1903.08754 (replaced) [pdf, ps, other]
Title: Stability and Error Analysis for Optimization and Generalized Equations
Subjects: Optimization and Control (math.OC)
[57]  arXiv:1907.13428 (replaced) [pdf, ps, other]
Title: Fast Solution Methods for Convex Quadratic Optimization of Fractional Differential Equations
Subjects: Optimization and Control (math.OC)
[58]  arXiv:1908.00697 (replaced) [pdf, ps, other]
Title: Model-Free Stochastic Reachability Using Kernel Distribution Embeddings
Journal-ref: in IEEE Control Systems Letters, vol. 4, no. 2, pp. 512-517, April 2020
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
[59]  arXiv:1911.06273 (replaced) [pdf, other]
Title: RLC Circuits based Distributed Mirror Descent Method
Subjects: Optimization and Control (math.OC)
[60]  arXiv:1911.12859 (replaced) [pdf, other]
Title: Decomposed Structured Subsets for Semidefinite and Sum-of-Squares Optimization
Comments: 15 pages, 14 figures, 15 tables
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
[61]  arXiv:1912.02865 (replaced) [pdf, ps, other]
Title: On the construction of maximal $p$-cyclically monotone operators
Subjects: Optimization and Control (math.OC)
[62]  arXiv:1912.13016 (replaced) [pdf]
Title: Global optimization of multivariable functions satisfying the Vanderbei condition
Comments: 23 pages, 4 tables, 31 figure, in Russian. v2 adds corrections to Algorithm 2 and a proposition on its convergence
Subjects: Optimization and Control (math.OC)
[63]  arXiv:2001.04286 (replaced) [pdf, other]
Title: Nonparametric Continuous Sensor Registration
Comments: 19 pages. arXiv admin note: substantial text overlap with arXiv:1904.02266
Subjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[64]  arXiv:1806.04225 (replaced) [pdf, other]
Title: PAC-Bayes Control: Learning Policies that Provably Generalize to Novel Environments
Comments: Extended version of paper presented at the 2018 Conference on Robot Learning (CoRL)
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC)
[65]  arXiv:1809.01674 (replaced) [pdf, other]
Title: Hierarchical Selective Recruitment in Linear-Threshold Brain Networks -- Part I: Single-Layer Dynamics and Selective Inhibition
Subjects: Systems and Control (eess.SY); Neural and Evolutionary Computing (cs.NE); Optimization and Control (math.OC)
[66]  arXiv:1809.02493 (replaced) [pdf, other]
Title: Hierarchical Selective Recruitment in Linear-Threshold Brain Networks -- Part II: Multi-Layer Dynamics and Top-Down Recruitment
Subjects: Systems and Control (eess.SY); Neural and Evolutionary Computing (cs.NE); Optimization and Control (math.OC)
[67]  arXiv:1901.00137 (replaced) [pdf, ps, other]
Title: A Theoretical Analysis of Deep Q-Learning
Comments: 65 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[68]  arXiv:1906.05204 (replaced) [pdf, other]
Title: Model-Free Practical Cooperative Control for Diffusively Coupled Systems
Comments: 12 pages, 7 figures
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
[69]  arXiv:1907.04409 (replaced) [pdf, other]
Title: Global Optimality Guarantees for Nonconvex Unsupervised Video Segmentation
Comments: Proceedings of the 57th Annual Allerton Conference on Communication, Control, and Computing, 2019; added funding source information and notation definitions
Journal-ref: Proceedings of the 57th Annual Allerton Conference on Communication, Control, and Computing, pp. 965--972, 2019
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC); Machine Learning (stat.ML)
[70]  arXiv:1908.06634 (replaced) [pdf, other]
Title: Cluster-based Distributed Augmented Lagrangian Algorithm for a Class of Constrained Convex Optimization Problems
Subjects: Multiagent Systems (cs.MA); Optimization and Control (math.OC)
[71]  arXiv:1909.09511 (replaced) [pdf, other]
Title: Optimal Dividend Strategy for an Insurance Group with Contagious Default Risk
Comments: Keywords: Insurance group, credit default contagion, optimal group dividend, default-state-modulated barriers, recursive system of HJBVIs
Subjects: Risk Management (q-fin.RM); Optimization and Control (math.OC)
[72]  arXiv:1910.03020 (replaced) [pdf, other]
Title: Joint Grid Topology Reconfiguration and Design of Watt-VAR Curves for DERs
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
[73]  arXiv:1910.10818 (replaced) [pdf, ps, other]
Title: Stochastic Reachability for Systems up to a Million Dimensions
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
[74]  arXiv:1911.11854 (replaced) [pdf, other]
Title: Compressed MRI Reconstruction Exploiting a Rotation-Invariant Total Variation Discretization
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
[75]  arXiv:1912.01032 (replaced) [pdf, other]
Title: FourierSAT: A Fourier Expansion-Based Algebraic Framework for Solving Hybrid Boolean Constraints
Comments: The paper was accepted by Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020). V2 (Feb 24): Typos corrected
Subjects: Logic in Computer Science (cs.LO); Information Theory (cs.IT); Machine Learning (cs.LG); Optimization and Control (math.OC)
[76]  arXiv:2001.00080 (replaced) [pdf, other]
Title: Review on Set-Theoretic Methods for Safety Verification and Control of Power System
Subjects: Systems and Control (eess.SY); Algebraic Geometry (math.AG); Optimization and Control (math.OC)
[77]  arXiv:2001.00218 (replaced) [pdf, other]
Title: Lossless Compression of Deep Neural Networks
Comments: CPAIOR 2020 (to appear)
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Optimization and Control (math.OC); Machine Learning (stat.ML)
[78]  arXiv:2001.11988 (replaced) [pdf, other]
Title: Consensus-based Optimization on the Sphere II: Convergence to Global Minimizers and Machine Learning
Subjects: Machine Learning (cs.LG); Analysis of PDEs (math.AP); Numerical Analysis (math.NA); Optimization and Control (math.OC); Machine Learning (stat.ML)
[79]  arXiv:2001.11994 (replaced) [pdf, other]
Title: Consensus-Based Optimization on the Sphere I: Well-Posedness and Mean-Field Limit
Subjects: Analysis of PDEs (math.AP); Machine Learning (cs.LG); Optimization and Control (math.OC)
[80]  arXiv:2002.06277 (replaced) [pdf, other]
Title: A mean-field analysis of two-player zero-sum games
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Probability (math.PR); Machine Learning (stat.ML)
[81]  arXiv:2002.06694 (replaced) [pdf, other]
Title: Structures of Spurious Local Minima in $k$-means
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC); Statistics Theory (math.ST)
[82]  arXiv:2002.08000 (replaced) [pdf, other]
Title: Action-Manipulation Attacks Against Stochastic Bandits: Attacks and Defense
Comments: 13 pages, 7 figures, submitted to IEEE Transaction on Signal Processing
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Optimization and Control (math.OC); Machine Learning (stat.ML)
[ total of 82 entries: 1-82 ]
[ showing up to 1000 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, math, recent, 2002, contact, help  (Access key information)