We gratefully acknowledge support from
the Simons Foundation and member institutions.


New submissions

[ total of 11 entries: 1-11 ]
[ showing up to 2000 entries per page: fewer | more ]

New submissions for Tue, 2 Mar 21

[1]  arXiv:2103.00060 [pdf, ps, other]
Title: Simultaneous Bandwidths Determination for DK-HAC Estimators and Long-Run Variance Estimation in Nonparametric Settings
Subjects: Econometrics (econ.EM)

We consider the derivation of data-dependent simultaneous bandwidths for double kernel heteroskedasticity and autocorrelation consistent (DK-HAC) estimators. In addition to the usual smoothing over lagged autocovariances for classical HAC estimators, the DK-HAC estimator also applies smoothing over the time direction. We obtain the optimal bandwidths that jointly minimize the global asymptotic MSE criterion and discuss the trade-off between bias and variance with respect to smoothing over lagged autocovariances and over time. Unlike the MSE results of Andrews (1991), we establish how nonstationarity affects the bias-variance trade-o?. We use the plug-in approach to construct data-dependent bandwidths for the DK-HAC estimators and compare them with the DK-HAC estimators from Casini (2021) that use data-dependent bandwidths obtained from a sequential MSE criterion. The former performs better in terms of size control, especially with stationary and close to stationary data. Finally, we consider long-run variance estimation under the assumption that the series is a function of a nonparametric estimator rather than of a semiparametric estimator that enjoys the usual T^(1/2) rate of convergence. Thus, we also establish the validity of consistent long-run variance estimation in nonparametric parameter estimation settings.

[2]  arXiv:2103.00557 [pdf, ps, other]
Title: Algorithmic subsampling under multiway clustering
Subjects: Econometrics (econ.EM)

This paper proposes a novel method of algorithmic subsampling (data sketching) for multiway cluster dependent data. We establish a new uniform weak law of large numbers and a new central limit theorem for the multiway algorithmic subsample means. Consequently, we discover an additional advantage of the algorithmic subsampling that it allows for robustness against potential degeneracy, and even non-Gaussian degeneracy, of the asymptotic distribution under multiway clustering. Simulation studies support this novel result, and demonstrate that inference with the algorithmic subsampling entails more accuracy than that without the algorithmic subsampling. Applying these basic asymptotic theories, we derive the consistency and the asymptotic normality for the multiway algorithmic subsampling generalized method of moments estimator and for the multiway algorithmic subsampling M-estimator. We illustrate an application to scanner data.

[3]  arXiv:2103.01115 [pdf, other]
Title: Structural models for policy-making: Coping with parametric uncertainty
Subjects: Econometrics (econ.EM)

The ex-ante evaluation of policies using structural microeconometric models is based on estimated parameters as a stand-in for the truth. This practice ignores uncertainty in the counterfactual policy predictions of the model. We develop an approach that deals with parametric uncertainty and properly frames model-informed policy-making as a decision problem under uncertainty. We use the seminal human capital investment model by Keane and Wolpin (1997) as a well-known, influential, and empirically-grounded test case. We document considerable uncertainty in their policy predictions and highlight the resulting policy recommendations from using different formal rules on decision-making under uncertainty.

[4]  arXiv:2103.01201 [pdf, other]
Title: Can Machine Learning Catch the COVID-19 Recession?
Subjects: Econometrics (econ.EM); Applications (stat.AP); Machine Learning (stat.ML)

Based on evidence gathered from a newly built large macroeconomic data set for the UK, labeled UK-MD and comparable to similar datasets for the US and Canada, it seems the most promising avenue for forecasting during the pandemic is to allow for general forms of nonlinearity by using machine learning (ML) methods. But not all nonlinear ML methods are alike. For instance, some do not allow to extrapolate (like regular trees and forests) and some do (when complemented with linear dynamic components). This and other crucial aspects of ML-based forecasting in unprecedented times are studied in an extensive pseudo-out-of-sample exercise.

Cross-lists for Tue, 2 Mar 21

[5]  arXiv:2103.00264 (cross-list from q-fin.ST) [pdf, other]
Title: Forecasting high-frequency financial time series: an adaptive learning approach with the order book data
Comments: Key words: forecasting methods, statistical learning, high-frequency order book
Subjects: Statistical Finance (q-fin.ST); Econometrics (econ.EM); Trading and Market Microstructure (q-fin.TR); Applications (stat.AP)

This paper proposes a forecast-centric adaptive learning model that engages with the past studies on the order book and high-frequency data, with applications to hypothesis testing. In line with the past literature, we produce brackets of summaries of statistics from the high-frequency bid and ask data in the CSI 300 Index Futures market and aim to forecast the one-step-ahead prices. Traditional time series issues, e.g. ARIMA order selection, stationarity, together with potential financial applications are covered in the exploratory data analysis, which pave paths to the adaptive learning model. By designing and running the learning model, we found it to perform well compared to the top fixed models, and some could improve the forecasting accuracy by being more stable and resilient to non-stationarity. Applications to hypothesis testing are shown with a rolling window, and further potential applications to finance and statistics are outlined.

[6]  arXiv:2103.00366 (cross-list from q-fin.ST) [pdf]
Title: Confronting Machine Learning With Financial Research
Subjects: Statistical Finance (q-fin.ST); Machine Learning (cs.LG); Econometrics (econ.EM)

This study aims to examine the challenges and applications of machine learning for financial research. Machine learning algorithms have been developed for certain data environments which substantially differ from the one we encounter in finance. Not only do difficulties arise due to some of the idiosyncrasies of financial markets, there is a fundamental tension between the underlying paradigm of machine learning and the research philosophy in financial economics. Given the peculiar features of financial markets and the empirical framework within social science, various adjustments have to be made to the conventional machine learning methodology. We discuss some of the main challenges of machine learning in finance and examine how these could be accounted for. Despite some of the challenges, we argue that machine learning could be unified with financial research to become a robust complement to the econometrician's toolbox. Moreover, we discuss the various applications of machine learning in the research process such as estimation, empirical discovery, testing, causal inference and prediction.

[7]  arXiv:2103.00711 (cross-list from stat.ML) [pdf, ps, other]
Title: Panel semiparametric quantile regression neural network for electricity consumption forecasting
Comments: 30
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Econometrics (econ.EM)

China has made great achievements in electric power industry during the long-term deepening of reform and opening up. However, the complex regional economic, social and natural conditions, electricity resources are not evenly distributed, which accounts for the electricity deficiency in some regions of China. It is desirable to develop a robust electricity forecasting model. Motivated by which, we propose a Panel Semiparametric Quantile Regression Neural Network (PSQRNN) by utilizing the artificial neural network and semiparametric quantile regression. The PSQRNN can explore a potential linear and nonlinear relationships among the variables, interpret the unobserved provincial heterogeneity, and maintain the interpretability of parametric models simultaneously. And the PSQRNN is trained by combining the penalized quantile regression with LASSO, ridge regression and backpropagation algorithm. To evaluate the prediction accuracy, an empirical analysis is conducted to analyze the provincial electricity consumption from 1999 to 2018 in China based on three scenarios. From which, one finds that the PSQRNN model performs better for electricity consumption forecasting by considering the economic and climatic factors. Finally, the provincial electricity consumptions of the next $5$ years (2019-2023) in China are reported by forecasting.

[8]  arXiv:2103.01126 (cross-list from stat.ML) [pdf, ps, other]
Title: BERT based patent novelty search by training claims to their own description
Subjects: Machine Learning (stat.ML); Computation and Language (cs.CL); Machine Learning (cs.LG); Econometrics (econ.EM)

In this paper we present a method to concatenate patent claims to their own description. By applying this method, BERT trains suitable descriptions for claims. Such a trained BERT (claim-to-description- BERT) could be able to identify novelty relevant descriptions for patents. In addition, we introduce a new scoring scheme, relevance scoring or novelty scoring, to process the output of BERT in a meaningful way. We tested the method on patent applications by training BERT on the first claims of patents and corresponding descriptions. BERT's output has been processed according to the relevance score and the results compared with the cited X documents in the search reports. The test showed that BERT has scored some of the cited X documents as highly relevant.

Replacements for Tue, 2 Mar 21

[9]  arXiv:1803.00798 (replaced) [pdf]
Title: Permutation Tests for Equality of Distributions of Functional Data
Comments: 48 pages, 5 figures, 5 tables
Subjects: Econometrics (econ.EM); Methodology (stat.ME)
[10]  arXiv:1904.11060 (replaced) [pdf, ps, other]
Title: Normal Approximation in Large Network Models
Subjects: Econometrics (econ.EM); Statistics Theory (math.ST)
[11]  arXiv:2012.09627 (replaced) [pdf]
Title: United States FDA drug approvals are persistent and polycyclic: Insights into economic cycles, innovation dynamics, and national policy
Authors: Iraj Daizadeh
Subjects: Econometrics (econ.EM)
[ total of 11 entries: 1-11 ]
[ showing up to 2000 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, econ, recent, 2103, contact, help  (Access key information)