New submissions

[ total of 25 entries: 1-25 ]

New submissions for Tue, 2 Mar 21

[1]  arXiv:2103.00060 [pdf, ps, other]
Title: Simultaneous Bandwidths Determination for DK-HAC Estimators and Long-Run Variance Estimation in Nonparametric Settings
Subjects: Econometrics (econ.EM)

We consider the derivation of data-dependent simultaneous bandwidths for double kernel heteroskedasticity and autocorrelation consistent (DK-HAC) estimators. In addition to the usual smoothing over lagged autocovariances for classical HAC estimators, the DK-HAC estimator also applies smoothing over the time direction. We obtain the optimal bandwidths that jointly minimize the global asymptotic MSE criterion and discuss the trade-off between bias and variance with respect to smoothing over lagged autocovariances and over time. Unlike the MSE results of Andrews (1991), we establish how nonstationarity affects the bias-variance trade-off. We use the plug-in approach to construct data-dependent bandwidths for the DK-HAC estimators and compare them with the DK-HAC estimators from Casini (2021) that use data-dependent bandwidths obtained from a sequential MSE criterion. The former performs better in terms of size control, especially with stationary and close to stationary data. Finally, we consider long-run variance estimation under the assumption that the series is a function of a nonparametric estimator rather than of a semiparametric estimator that enjoys the usual T^(1/2) rate of convergence. Thus, we also establish the validity of consistent long-run variance estimation in nonparametric parameter estimation settings.

[2]  arXiv:2103.00095 [pdf]
Title: The Cost of Pollution in the Upper Atoyac River Basin: A Systematic Review
Comments: 14 pages, 6 figures
Subjects: General Economics (econ.GN)

The Atoyac River is one of the two most polluted rivers in Mexico. Water quality in the Upper Atoyac River Basin (UARB) has been devastated by industrial and municipal wastewater, as well as by effluents from local dwellers, which receive little to no treatment, affecting health, production, ecosystems and property value. We did a systematic review and mapping of the costs that pollution imposes on different sectors and localities in the UARB, and initially found 358 studies, of which 17 were of particular interest. We focus on estimating the cost of pollution through different valuation methods such as averted costs, hedonic pricing, and contingent valuation, and for that we use only 10 studies. Costs range from less than a million to over $16 million a year, depending on the sector, with agriculture, industry and tourism yielding the highest costs. This exercise is the first of its kind in the UARB that maps costs for the sectors and localities affected, and it sheds light on the need for additional research to estimate the total cost of pollution throughout the basin. This information may help design further research in the region.

[3]  arXiv:2103.00173 [pdf, other]
Title: Deciphering Bitcoin Blockchain Data by Cohort Analysis
Subjects: General Economics (econ.GN); Numerical Analysis (math.NA); Computational Finance (q-fin.CP); Computation (stat.CO)

Bitcoin is a peer-to-peer electronic payment system that has rapidly gained popularity in recent years. Usually, we need to query the complete history of Bitcoin blockchain data to acquire variables with economic meaning. This has become increasingly difficult, with over 1.6 billion historical transactions on the Bitcoin blockchain. It is thus important to query Bitcoin transaction data in a way that is more efficient and provides economic insights. We apply cohort analysis that interprets Bitcoin blockchain data using methods developed for population data in social science. Specifically, we query and process the Bitcoin transaction input and output data within each daily cohort, which enables us to create datasets and visualizations for some key indicators of Bitcoin transactions, including the daily lifespan distributions of spent transaction output (STXO) and the daily age distributions of the accumulated unspent transaction output (UTXO). We provide a computationally feasible approach to characterize Bitcoin transactions, which paves the way for future economic studies of Bitcoin.
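
Illustrative note (not from the paper): the two indicators named in this abstract can be sketched on toy data as below. The record layout, the function names `stxo_lifespans` and `utxo_ages`, and the choice to group spent outputs by their spending day are all assumptions made for illustration; the paper's actual queries and cohort definition may differ.

```python
from collections import defaultdict
from datetime import date

# Hypothetical toy records: (creation day, spending day or None if still unspent).
outputs = [
    (date(2021, 1, 1), date(2021, 1, 3)),
    (date(2021, 1, 1), date(2021, 1, 3)),
    (date(2021, 1, 2), date(2021, 1, 3)),
    (date(2021, 1, 2), None),
    (date(2021, 1, 3), None),
]

def stxo_lifespans(outputs):
    """For each spending day, count spent outputs by lifespan in days."""
    cohorts = defaultdict(lambda: defaultdict(int))
    for created, spent in outputs:
        if spent is not None:
            cohorts[spent][(spent - created).days] += 1
    return {day: dict(hist) for day, hist in cohorts.items()}

def utxo_ages(outputs, as_of):
    """Count outputs that exist and are still unspent at the end of `as_of`, by age in days."""
    hist = defaultdict(int)
    for created, spent in outputs:
        if created <= as_of and (spent is None or spent > as_of):
            hist[(as_of - created).days] += 1
    return dict(hist)

if __name__ == "__main__":
    print(stxo_lifespans(outputs))
    print(utxo_ages(outputs, date(2021, 1, 3)))
```

Each function makes one pass over the records, so a daily cohort can be processed without revisiting the full transaction history, which is the kind of saving the abstract alludes to.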

[4]  arXiv:2103.00231 [pdf]
Title: A Comparison of Indonesia E-Commerce Sentiment Analysis for Marketing Intelligence Effort
Journal-ref: The 8th International Conference on Sustainable Collaboration in Business, Technology, Information and Innovation, 2017
Subjects: General Economics (econ.GN)

The rapid growth of the e-commerce market in Indonesia has given rise to many e-commerce companies and intense competition among them. Marketing intelligence is an important activity to measure competitive position. One element of marketing intelligence is to assess customer satisfaction. Many Indonesian customers express their satisfaction or dissatisfaction with a company through social media. Hence, using social media data provides a new practical way to measure marketing intelligence effort. This research performs sentiment analysis using the naive Bayes classifier with TF-IDF weighting. We compare sentiment towards the three most-visited e-commerce sites: Bukalapak, Tokopedia, and Elevenia. We use Twitter data for sentiment analysis because it is faster, cheaper, and easier for both the customer and the researcher. The purpose of this research is to find out how to process the large volume of customer sentiment on Twitter into useful information for e-commerce companies, and which of the three companies has the highest level of customer satisfaction. The experimental results show that the method can automatically classify customer sentiment on Twitter and that Elevenia has the highest customer satisfaction.

[5]  arXiv:2103.00254 [pdf]
Title: How to Issue a Central Bank Digital Currency
Comments: Swiss National Bank Working Paper 3/2021
Subjects: General Economics (econ.GN); Cryptography and Security (cs.CR)

With the emergence of Bitcoin and recently proposed stablecoins from BigTechs, such as Diem (formerly Libra), central banks face growing competition from private actors offering their own digital alternative to physical cash. We do not address the normative question whether a central bank should issue a central bank digital currency (CBDC) or not. Instead, we contribute to the current research debate by showing how a central bank could do so, if desired. We propose a token-based system without distributed ledger technology and show how earlier-deployed, software-only electronic cash can be improved upon to preserve transaction privacy, meet regulatory requirements in a compelling way, and offer a level of quantum-resistant protection against systemic privacy risk. Neither monetary policy nor financial stability would be materially affected because a CBDC with this design would replicate physical cash rather than bank deposits.

[6]  arXiv:2103.00557 [pdf, ps, other]
Title: Algorithmic subsampling under multiway clustering
Subjects: Econometrics (econ.EM)

This paper proposes a novel method of algorithmic subsampling (data sketching) for multiway cluster dependent data. We establish a new uniform weak law of large numbers and a new central limit theorem for the multiway algorithmic subsample means. Consequently, we discover an additional advantage of the algorithmic subsampling that it allows for robustness against potential degeneracy, and even non-Gaussian degeneracy, of the asymptotic distribution under multiway clustering. Simulation studies support this novel result, and demonstrate that inference with the algorithmic subsampling entails more accuracy than that without the algorithmic subsampling. Applying these basic asymptotic theories, we derive the consistency and the asymptotic normality for the multiway algorithmic subsampling generalized method of moments estimator and for the multiway algorithmic subsampling M-estimator. We illustrate an application to scanner data.

[7]  arXiv:2103.00565 [pdf]
Title: Modelling Optimal Policies of Demand Responsive Transport and Interrelationships between Occupancy Rate and Costs
Subjects: General Economics (econ.GN)

This paper presents a model addressing welfare-optimal policies for demand responsive transportation services, where passengers cause external travel time costs for other passengers due to route changes. Optimal pricing and trip production policies are modelled both on the aggregate level and on the network level. The aggregate model is an extension of Jokinen (2016) with a flat pricing model, but the occupancy rate is now modelled as an endogenous variable depending on demand and capacity levels. The network model makes it possible to describe differences between routes from the viewpoint of occupancy rate and efficient trip combining. Moreover, the model defines the optimal differentiated pricing for routes.

[8]  arXiv:2103.00591 [pdf, other]
Title: Epidemics with Behavior
Subjects: General Economics (econ.GN); Theoretical Economics (econ.TH)

We study equilibrium distancing during epidemics. Distancing reduces the individual's probability of getting infected but comes at a cost. It creates a single-peaked epidemic, flattens the curve and decreases the size of the epidemic. We examine more closely the effects of distancing on the outset, the peak and the final size of the epidemic. First, we define a behavioral basic reproduction number and show that it is concave in the transmission rate. The infection, therefore, spreads only if the transmission rate is in the intermediate region. Second, the peak of the epidemic is non-monotonic in the transmission rate. A reduction in the transmission rate can lead to an increase of the peak. On the other hand, a decrease in the cost of distancing always flattens the curve. Third, both an increase in the infection rate as well as an increase in the cost of distancing increase the size of the epidemic. Our results have important implications on the modeling of interventions. Imposing restrictions on the infection rate has qualitatively different effects on the trajectory of the epidemics than imposing assumptions on the cost of distancing. The interventions that affect interactions rather than the transmission rate should, therefore, be modeled as changes in the cost of distancing.

[9]  arXiv:2103.00680 [pdf]
Title: Consequential LCA for territorial and multimodal transportation policies: method and application to the free-floating e-scooter disruption in Paris
Journal-ref: Journal of Cleaner Production, Volume 273, 2020, 122898
Subjects: General Economics (econ.GN)

The indirect environmental impacts of transport disruptions in urban mobility are frequently overlooked due to a lack of appropriate assessment methods. Consequential Life Cycle Assessment (CLCA) is a method to capture the environmental consequences of the entire cause and effect chain of these disruptions but has never been adapted to transport disruptions at the city scale. This paper proposes a mathematical formalization of CLCA applied to a territorial mobility change. The method is applied to quantify the impact on climate change of the breakthrough of free-floating e-scooters (FFES) in Paris. A FFES user survey is conducted to estimate the modal shifts due to FFES. Trip substitutions from all the Parisian modes concerned are considered: personal or shared bicycles and motor scooters, private car, taxi and ride-hailing, bus, streetcar, metro and RER (the Paris metropolitan area mass rapid transit system). All these Parisian modes are assessed for the first time using LCA. Final results estimate that over one year, the FFES generated an extra thirteen thousand tons of CO2eq under an assumption of one million users, mainly due to major shifts coming from lower-emitting modes (60% from the metro and the RER, 22% from active modes). Recommendations are given to improve their carbon footprint. A scenario analysis shows that increasing the lifetime mileage is insufficient to get a positive balance: drastically reducing servicing emissions is also required. A sensitivity analysis switching the French electricity mix for eleven other country mixes suggests a better climate change effect of the FFES in similar metropolitan areas with higher electricity carbon intensity, such as in Germany and China. Finally, the novelty and the limits of the method are discussed, as well as the results and the role of e-scooters, micromobility, and shared vehicles towards sustainable mobility.

[10]  arXiv:2103.00734 [pdf, other]
Title: Welfare v. Consent: On the Optimal Penalty for Harassment
Comments: 38 pages, 3 figures
Subjects: General Economics (econ.GN); Theoretical Economics (econ.TH)

The economic approach to determine optimal legal policies involves maximizing a social welfare function. We propose an alternative: a consent-approach that seeks to promote consensual interactions and deter non-consensual interactions. The consent-approach does not rest upon inter-personal utility comparisons or value judgments about preferences. It does not require any additional information relative to the welfare-approach. We highlight the contrast between the welfare-approach and the consent-approach using a stylized model inspired by seminal cases of harassment and the #MeToo movement. The social welfare maximizing penalty for harassment in our model can be zero under the welfare-approach but not under the consent-approach.

[11]  arXiv:2103.01115 [pdf, other]
Title: Structural models for policy-making: Coping with parametric uncertainty
Subjects: Econometrics (econ.EM)

The ex-ante evaluation of policies using structural microeconometric models is based on estimated parameters as a stand-in for the truth. This practice ignores uncertainty in the counterfactual policy predictions of the model. We develop an approach that deals with parametric uncertainty and properly frames model-informed policy-making as a decision problem under uncertainty. We use the seminal human capital investment model by Keane and Wolpin (1997) as a well-known, influential, and empirically-grounded test case. We document considerable uncertainty in their policy predictions and highlight the resulting policy recommendations from using different formal rules on decision-making under uncertainty.

[12]  arXiv:2103.01201 [pdf, other]
Title: Can Machine Learning Catch the COVID-19 Recession?
Subjects: Econometrics (econ.EM); Applications (stat.AP); Machine Learning (stat.ML)

Based on evidence gathered from a newly built large macroeconomic data set for the UK, labeled UK-MD and comparable to similar datasets for the US and Canada, it seems the most promising avenue for forecasting during the pandemic is to allow for general forms of nonlinearity by using machine learning (ML) methods. But not all nonlinear ML methods are alike. For instance, some cannot extrapolate (like regular trees and forests) and some can (when complemented with linear dynamic components). This and other crucial aspects of ML-based forecasting in unprecedented times are studied in an extensive pseudo-out-of-sample exercise.

Cross-lists for Tue, 2 Mar 21

[13]  arXiv:2103.00045 (cross-list from math.OC) [pdf, ps, other]
Title: Repeated Games with Switching Costs -- Stationary vs History Independent Strategies
Subjects: Optimization and Control (math.OC); Theoretical Economics (econ.TH)

We study zero-sum repeated games where the minimizing player has to pay a certain cost each time he changes his action. Our contribution is twofold. First, we show that the value of the game exists in stationary strategies, which depend solely on the previous action of the player (and not the entire history), and we provide a full characterization of the value and the optimal strategies. The strategies exhibit a robustness property and typically do not change with a small perturbation of the switching costs. Second, we consider a case where the player is limited to playing completely history-independent strategies and provide a full characterization of the value and optimal strategies in this case. Naturally, this limitation worsens his situation. We deduce a bound on his loss in the general case as well as more precise bounds when more assumptions regarding the game or the switching costs are introduced.

[14]  arXiv:2103.00264 (cross-list from q-fin.ST) [pdf, other]
Title: Forecasting high-frequency financial time series: an adaptive learning approach with the order book data
Comments: Key words: forecasting methods, statistical learning, high-frequency order book
Subjects: Statistical Finance (q-fin.ST); Econometrics (econ.EM); Trading and Market Microstructure (q-fin.TR); Applications (stat.AP)

This paper proposes a forecast-centric adaptive learning model that engages with past studies on the order book and high-frequency data, with applications to hypothesis testing. In line with the past literature, we produce brackets of summaries of statistics from the high-frequency bid and ask data in the CSI 300 Index Futures market and aim to forecast the one-step-ahead prices. Traditional time series issues, e.g. ARIMA order selection and stationarity, together with potential financial applications, are covered in the exploratory data analysis, which paves the way for the adaptive learning model. By designing and running the learning model, we found it to perform well compared to the top fixed models, and some could improve the forecasting accuracy by being more stable and resilient to non-stationarity. Applications to hypothesis testing are shown with a rolling window, and further potential applications to finance and statistics are outlined.

[15]  arXiv:2103.00295 (cross-list from cond-mat.stat-mech) [pdf, other]
Title: Nash equilibrium mapping vs Hamiltonian dynamics vs Darwinian evolution for some social dilemma games in the thermodynamic limit
Comments: 15 pages, 4 figures, 2 tables
Subjects: Statistical Mechanics (cond-mat.stat-mech); Theoretical Economics (econ.TH); Physics and Society (physics.soc-ph); Populations and Evolution (q-bio.PE)

How cooperation evolves and manifests itself in the thermodynamic or infinite player limit of social dilemma games is a matter of intense speculation. Various analytical methods have been proposed to analyse the thermodynamic limit of social dilemmas. In a previous work [Chaos, Solitons and Fractals 135, 109762 (2020)] involving one of us, two of those methods, Hamiltonian Dynamics (HD) and Nash equilibrium (NE) mapping, were compared. The inconsistency and incorrectness of the HD approach vis-a-vis NE mapping was brought to light. In this work we compare a third analytical method, i.e., Darwinian evolution (DE), with NE mapping and a numerical agent based approach. For completeness, we give results for the HD approach as well. In contrast to HD, which involves maximisation of payoffs of all individuals, in DE the payoff of a single player is maximised with respect to its nearest neighbour. While HD utterly fails as compared to NE mapping, the DE method gives a false positive for game magnetisation -- the net difference between the fraction of cooperators and defectors -- when payoffs obey the condition a+d=b+c, wherein a, d represent the diagonal elements and b, c the off-diagonal elements in symmetric social dilemma games. When either a+d ≠ b+c or when one looks at average payoff per player, the DE method fails much like the HD approach. NE mapping and the numerical agent based method, on the other hand, agree really well for both game magnetisation and average payoff per player for the social dilemmas in question, i.e., the Hawk-Dove game and the Public goods game. This paper thus brings to light the inconsistency of the DE method vis-a-vis both NE mapping and a numerical agent based approach.

[16]  arXiv:2103.00366 (cross-list from q-fin.ST) [pdf]
Title: Confronting Machine Learning With Financial Research
Subjects: Statistical Finance (q-fin.ST); Machine Learning (cs.LG); Econometrics (econ.EM)

This study aims to examine the challenges and applications of machine learning for financial research. Machine learning algorithms have been developed for certain data environments which substantially differ from the one we encounter in finance. Not only do difficulties arise due to some of the idiosyncrasies of financial markets, there is a fundamental tension between the underlying paradigm of machine learning and the research philosophy in financial economics. Given the peculiar features of financial markets and the empirical framework within social science, various adjustments have to be made to the conventional machine learning methodology. We discuss some of the main challenges of machine learning in finance and examine how these could be accounted for. Despite some of the challenges, we argue that machine learning could be unified with financial research to become a robust complement to the econometrician's toolbox. Moreover, we discuss the various applications of machine learning in the research process such as estimation, empirical discovery, testing, causal inference and prediction.

[17]  arXiv:2103.00711 (cross-list from stat.ML) [pdf, ps, other]
Title: Panel semiparametric quantile regression neural network for electricity consumption forecasting
Comments: 30
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Econometrics (econ.EM)

China has made great achievements in the electric power industry during the long-term deepening of reform and opening up. However, owing to complex regional economic, social and natural conditions, electricity resources are not evenly distributed, which accounts for the electricity deficiency in some regions of China. It is desirable to develop a robust electricity forecasting model. Motivated by this, we propose a Panel Semiparametric Quantile Regression Neural Network (PSQRNN) by utilizing the artificial neural network and semiparametric quantile regression. The PSQRNN can explore potential linear and nonlinear relationships among the variables, interpret the unobserved provincial heterogeneity, and maintain the interpretability of parametric models simultaneously. The PSQRNN is trained by combining the penalized quantile regression with LASSO, ridge regression and the backpropagation algorithm. To evaluate the prediction accuracy, an empirical analysis is conducted to analyze the provincial electricity consumption from 1999 to 2018 in China based on three scenarios. The results show that the PSQRNN model performs better for electricity consumption forecasting by considering the economic and climatic factors. Finally, forecasts of provincial electricity consumption in China for the next 5 years (2019-2023) are reported.

[18]  arXiv:2103.00911 (cross-list from cs.GT) [pdf, ps, other]
Title: Distortion in Social Choice Problems: The First 15 Years and Beyond
Comments: Survey
Subjects: Computer Science and Game Theory (cs.GT); Theoretical Economics (econ.TH)

The notion of distortion in social choice problems has been defined to measure the loss in efficiency -- typically measured by the utilitarian social welfare, the sum of utilities of the participating agents -- due to having access only to limited information about the preferences of the agents. We survey the most significant results of the literature on distortion from the past 15 years, and highlight important open problems and the most promising avenues of ongoing and future work.

[19]  arXiv:2103.01126 (cross-list from stat.ML) [pdf, ps, other]
Title: BERT based patent novelty search by training claims to their own description
Subjects: Machine Learning (stat.ML); Computation and Language (cs.CL); Machine Learning (cs.LG); Econometrics (econ.EM)

In this paper we present a method to concatenate patent claims to their own description. By applying this method, BERT trains suitable descriptions for claims. Such a trained BERT (claim-to-description BERT) could be able to identify novelty-relevant descriptions for patents. In addition, we introduce a new scoring scheme, relevance scoring or novelty scoring, to process the output of BERT in a meaningful way. We tested the method on patent applications by training BERT on the first claims of patents and corresponding descriptions. BERT's output has been processed according to the relevance score and the results compared with the cited X documents in the search reports. The test showed that BERT has scored some of the cited X documents as highly relevant.

Replacements for Tue, 2 Mar 21

[20]  arXiv:1801.03680 (replaced) [pdf, other]
Title: The time interpretation of expected utility theory
Comments: 8 pages, 3 figures
Subjects: General Economics (econ.GN)
[21]  arXiv:1803.00798 (replaced) [pdf]
Title: Permutation Tests for Equality of Distributions of Functional Data
Comments: 48 pages, 5 figures, 5 tables
Subjects: Econometrics (econ.EM); Methodology (stat.ME)
[22]  arXiv:1904.11060 (replaced) [pdf, ps, other]
Title: Normal Approximation in Large Network Models
Subjects: Econometrics (econ.EM); Statistics Theory (math.ST)
[23]  arXiv:2011.00498 (replaced) [pdf, other]
Title: Price of Anarchy of Simple Auctions with Interdependent Values
Subjects: Computer Science and Game Theory (cs.GT); Theoretical Economics (econ.TH)
[24]  arXiv:2012.09627 (replaced) [pdf]
Title: United States FDA drug approvals are persistent and polycyclic: Insights into economic cycles, innovation dynamics, and national policy
Authors: Iraj Daizadeh
Subjects: Econometrics (econ.EM)
[25]  arXiv:2102.10909 (replaced) [pdf, ps, other]
Title: Optimal Transport of Information
Subjects: General Economics (econ.GN); Optimization and Control (math.OC); Other Statistics (stat.OT)