Neural and Evolutionary Computing
New submissions
[ showing up to 2000 entries per page: fewer  more ]
New submissions for Mon, 24 Feb 20
 [1] arXiv:2002.09063 [pdf, other]

Title: RealTime Optimal Guidance and Control for Interplanetary Transfers Using Deep NetworksSubjects: Neural and Evolutionary Computing (cs.NE); Systems and Control (eess.SY)
We consider the EarthVenus massoptimal interplanetary transfer of a lowthrust spacecraft and show how the optimal guidance can be represented by deep networks in a large portion of the state space and to a high degree of accuracy. Imitation (supervised) learning of optimal examples is used as a network training paradigm. The resulting models are suitable for an onboard, realtime, implementation of the optimal guidance and control system of the spacecraft and are called G&CNETs. A new general methodology called Backward Generation of Optimal Examples is introduced and shown to be able to efficiently create all the optimal state action pairs necessary to train G&CNETs without solving optimal control problems. With respect to previous works, we are able to produce datasets containing a few orders of magnitude more optimal trajectories and obtain network performances compatible with real missions requirements. Several schemes able to train representations of either the optimal policy (thrust profile) or the value function (optimal mass) are proposed and tested. We find that both policy learning and value function learning successfully and accurately learn the optimal thrust and that a spacecraft employing the learned thrust is able to reach the target conditions orbit spending only 2 permil more propellant than in the corresponding mathematically optimal transfer. Moreover, the optimal propellant mass can be predicted (in case of value function learning) within an error well within 1%. All G&CNETs produced are tested during simulations of interplanetary transfers with respect to their ability to reach the target conditions optimally starting from nominal and offnominal conditions.
 [2] arXiv:2002.09106 [pdf, other]

Title: An Evolutionary Deep Learning Method for Shortterm Wind Speed Prediction: A Case Study of the Lillgrund Offshore Wind FarmAuthors: Mehdi Neshat, Meysam Majidi Nezhad, Ehsan Abbasnejad, Lina Bertling Tjernberg, Davide Astiaso Garcia, Bradley Alexander, Markus WagnerSubjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG); Signal Processing (eess.SP)
Accurate shortterm wind speed forecasting is essential for largescale integration of wind power generation. However, the seasonal and stochastic characteristics of wind speed make forecasting a challenging task. This study uses a new hybrid evolutionary approach that uses a popular evolutionary search algorithm, CMAES, to tune the hyperparameters of two Long shortterm memory(LSTM) ANN models for wind prediction. The proposed hybrid approach is trained on data gathered from an offshore wind turbine installed in a Swedish wind farm located in the Baltic Sea. Two forecasting horizons including tenminutes ahead (absolute short term) and onehour ahead (short term) are considered in our experiments. Our experimental results indicate that the new approach is superior to five other applied machine learning models, i.e., polynomial neural network (PNN), feedforward neural network (FNN), nonlinear autoregressive neural network (NAR) and adaptive neurofuzzy inference system (ANFIS), as measured by five performance criteria.
 [3] arXiv:2002.09227 [pdf, other]

Title: Recent Trends in the Use of Statistical Tests for Comparing Swarm and Evolutionary Computing Algorithms: Practical Guidelines and a Critical ReviewComments: 52 pages, 10 figures, 19 tablesSubjects: Neural and Evolutionary Computing (cs.NE); Methodology (stat.ME)
A key aspect of the design of evolutionary and swarm intelligence algorithms is studying their performance. Statistical comparisons are also a crucial part which allows for reliable conclusions to be drawn. In the present paper we gather and examine the approaches taken from different perspectives to summarise the assumptions made by these statistical tests, the conclusions reached and the steps followed to perform them correctly. In this paper, we conduct a survey on the current trends of the proposals of statistical analyses for the comparison of algorithms of computational intelligence and include a description of the statistical background of these tests. We illustrate the use of the most common tests in the context of the Competition on singleobjective real parameter optimisation of the IEEE Congress on Evolutionary Computation (CEC) 2017 and describe the main advantages and drawbacks of the use of each kind of test and put forward some recommendations concerning their use.
Crosslists for Mon, 24 Feb 20
 [4] arXiv:2002.09259 (crosslist from eess.IV) [pdf, other]

Title: Binary Probability Model for Learning Based Image CompressionAuthors: Théo Ladune (IETR), Pierrick Philippe, Wassim Hamidouche (IETR), Lu Zhang (IETR), Olivier Deforges (IETR)Journalref: International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020, 2020Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Signal Processing (eess.SP)
In this paper, we propose to enhance learned image compression systems with a richer probability model for the latent variables. Previous works model the latents with a Gaussian or a Laplace distribution. Inspired by binary arithmetic coding , we propose to signal the latents with three binary values and one integer, with different probability models. A relaxation method is designed to perform gradientbased training. The richer probability model results in a better entropy coding leading to lower rate. Experiments under the Challenge on Learned Image Compression (CLIC) test conditions demonstrate that this method achieves 18% rate saving compared to Gaussian or Laplace models.
 [5] arXiv:2002.09285 (crosslist from cs.CV) [pdf, other]

Title: A Convolutional Neural Network into graph spaceComments: arXiv admin note: text overlap with arXiv:1611.08402 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Convolutional neural networks (CNNs), in a few decades, have outperformed the existing state of the art methods in classification context. However, in the way they were formalised, CNNs are bound to operate on euclidean spaces. Indeed, convolution is a signal operation that are defined on euclidean spaces. This has restricted deep learning main use to euclideandefined data such as sound or image. And yet, numerous computer application fields (among which network analysis, computational social science, chemoinformatics or computer graphics) induce noneuclideanly defined data such as graphs, networks or manifolds. In this paper we propose a new convolution neural network architecture, defined directly into graph space. Convolution and pooling operators are defined in graph domain. We show its usability in a backpropagation context. Experimental results show that our model performance is at state of the art level on simple tasks. It shows robustness with respect to graph domain changes and improvement with respect to other euclidean and noneuclidean convolutional architectures.
 [6] arXiv:2002.09286 (crosslist from eess.AS) [pdf, other]

Title: Efficient Trainable FrontEnds for Neural Speech EnhancementComments: 5 pages, 5 figures, ICASSP 2020Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Sound (cs.SD); Machine Learning (stat.ML)
Many neural speech enhancement and source separation systems operate in the timefrequency domain. Such models often benefit from making their ShortTime Fourier Transform (STFT) frontends trainable. In current literature, these are implemented as large Discrete Fourier Transform matrices; which are prohibitively inefficient for lowcompute systems. We present an efficient, trainable frontend based on the butterfly mechanism to compute the Fast Fourier Transform, and show its accuracy and efficiency benefits for lowcompute neural speech enhancement models. We also explore the effects of making the STFT window trainable.
 [7] arXiv:2002.09398 (crosslist from cs.LG) [pdf, other]

Title: It's Not What Machines Can Learn, It's What We Cannot TeachSubjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Can deep neural networks learn to solve any task, and in particular problems of high complexity? This question attracts a lot of interest, with recent works tackling computationally hard tasks such as the traveling salesman problem and satisfiability. In this work we offer a different perspective on this question. Given the common assumption that $\textit{NP} \neq \textit{coNP}$ we prove that any polynomialtime sample generator for an $\textit{NP}$hard problem samples, in fact, from an easier subproblem. We empirically explore a case study, Conjunctive Query Containment, and show how common data generation techniques generate biased datasets that lead practitioners to overestimate model accuracy. Our results suggest that machine learning approaches that require training on a dense uniform sampling from the target distribution cannot be used to solve computationally hard problems, the reason being the difficulty of generating sufficiently large and unbiased training sets.
Replacements for Mon, 24 Feb 20
 [8] arXiv:1911.05266 (replaced) [pdf, other]

Title: Learning NonParametric Invariances from Data with Permanent Random ConnectomesComments: Preprint (accepted at NeurIPS SVRHM 2019 Workshop)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
 [9] arXiv:1912.01137 (replaced) [pdf, ps, other]

Title: Mixing autoencoder with classifier: conceptual data visualizationAuthors: Pitoyo HartonoSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
 [10] arXiv:1912.05671 (replaced) [pdf, other]

Title: Linear Mode Connectivity and the Lottery Ticket HypothesisComments: This submission subsumes 1903.01611 ("Stabilizing the Lottery Ticket Hypothesis" and "The Lottery Ticket Hypothesis at Scale")Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[ showing up to 2000 entries per page: fewer  more ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, cs, recent, 2002, contact, help (Access key information)