Distributed Learning and its Application for Time-Series Prediction

Nguyen, Nhuong V.; Legitime, Sybille

Full-text links:

Download:

Computer Science > Machine Learning

Title: Distributed Learning and its Application for Time-Series Prediction

Authors: Nhuong V. Nguyen, Sybille Legitime

(Submitted on 6 Jun 2021 (v1), last revised 10 Jun 2021 (this version, v2))

Abstract: Extreme events are occurrences whose magnitude and potential cause extensive damage on people, infrastructure, and the environment. Motivated by the extreme nature of the current global health landscape, which is plagued by the coronavirus pandemic, we seek to better understand and model extreme events. Modeling extreme events is common in practice and plays an important role in time-series prediction applications. Our goal is to (i) compare and investigate the effect of some common extreme events modeling methods to explore which method can be practical in reality and (ii) accelerate the deep learning training process, which commonly uses deep recurrent neural network (RNN), by implementing the asynchronous local Stochastic Gradient Descent (SGD) framework among multiple compute nodes. In order to verify our distributed extreme events modeling, we evaluate our proposed framework on a stock data set S\&P500, with a standard recurrent neural network. Our intuition is to explore the (best) extreme events modeling method which could work well under the distributed deep learning setting. Moreover, by using asynchronous distributed learning, we aim to significantly reduce the communication cost among the compute nodes and central server, which is the main bottleneck of almost all distributed learning frameworks.
We implement our proposed work and evaluate its performance on representative data sets, such as S&P500 stock in $5$-year period. The experimental results validate the correctness of the design principle and show a significant training duration reduction upto $8$x, compared to the baseline single compute node. Our results also show that our proposed work can achieve the same level of test accuracy, compared to the baseline setting.

Comments:	8 pages, 10 figures, and 2 tables
Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
Cite as:	arXiv:2106.03211 [cs.LG]
	(or arXiv:2106.03211v2 [cs.LG] for this version)

Submission history

From: Nhuong Nguyen [view email]
[v1] Sun, 6 Jun 2021 18:57:30 GMT (934kb,D)
[v2] Thu, 10 Jun 2021 22:04:36 GMT (934kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2106.03211

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Distributed Learning and its Application for Time-Series Prediction

Submission history