New submissions for Mon, 30 Mar 20

[1]  arXiv:2003.12178
Title: Separable and Semiparametric Network-based Counting Processes applied to the International Combat Aircraft Trades
Subjects: Applications (stat.AP)

We propose a novel tie-oriented model for longitudinal event network data. The generating mechanism is assumed to be a multivariate Poisson process that governs the onset and repetition of yearly observed events with two separate intensity functions. We apply the model to a network obtained from the number of international deliveries of combat aircraft between 1950 and 2017. Based on a modified trade gravity approach, we identify economic and political factors that impede or facilitate transfers. Extensive dynamics as well as country heterogeneity require the specification of semiparametric time-varying effects as well as random effects.

[2]  arXiv:2003.12447
Title: Anchor Attention for Hybrid Crowd Forecasts Aggregation
Subjects: Applications (stat.AP); Multiagent Systems (cs.MA)

Forecasting the future is a notoriously difficult task. To overcome this challenge, state-of-the-art forecasting platforms are "hybridized": they gather forecasts from a crowd of humans as well as from one or more machine models. However, how to optimally combine forecasts from these pools into a single forecast remains an open challenge. We propose anchor attention for this type of sequence summary problem. Each forecast is represented by a trainable embedding vector, and the computed anchor attention score is used as its combination weight. We evaluate our approach using data from real-world forecasting tournaments and show that our method outperforms the current state-of-the-art aggregation approaches.
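
The weighting idea in this abstract can be illustrated with a minimal sketch. It is not the authors' architecture: the single anchor vector, the scaled dot-product score, the softmax normalization, and the function name `anchor_attention_aggregate` are all assumptions made for illustration, and the embeddings here are random rather than trained.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D array.
    e = np.exp(x - np.max(x))
    return e / e.sum()

def anchor_attention_aggregate(forecasts, embeddings, anchor):
    """Combine forecasts using attention weights against one anchor vector.

    forecasts:  shape (n,), one probability forecast per source (human or model)
    embeddings: shape (n, d), one embedding per forecast (trainable in practice)
    anchor:     shape (d,), hypothetical anchor vector (an assumption of this sketch)
    """
    d = embeddings.shape[1]
    scores = embeddings @ anchor / np.sqrt(d)   # scaled dot-product scores
    weights = softmax(scores)                   # non-negative, sum to 1
    return float(weights @ forecasts), weights

# Illustrative inputs only; no real tournament data is used.
rng = np.random.default_rng(0)
forecasts = np.array([0.2, 0.6, 0.7])
embeddings = rng.normal(size=(3, 4))
anchor = rng.normal(size=4)
combined, weights = anchor_attention_aggregate(forecasts, embeddings, anchor)
```

Because the weights are a softmax output, the combined forecast is a convex combination of the inputs and always lies between the smallest and largest individual forecast.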

Cross-lists for Mon, 30 Mar 20

[3]  arXiv:1804.00049 (cross-list from q-bio.NC)
Title: Gaussian graphical models reveal inter-modal and inter-regional conditional dependencies of brain alterations in Alzheimer's disease
Comments: 24 pages, 9 figures, 2 tables, supporting material
Subjects: Neurons and Cognition (q-bio.NC); Applications (stat.AP)

Alzheimer's disease (AD) is characterized by a sequence of pathological changes, which are commonly assessed in vivo using MRI and PET. Currently, most approaches to analyzing statistical associations between brain regions rely on Pearson correlation. However, these are prone to spurious correlations arising from uninformative shared variance. Notably, there are no appropriate multivariate statistical models available that can easily integrate dozens of variables derived from such data and exploit the additional information provided by combining data sources. Gaussian graphical models (GGMs) can estimate the conditional dependency from given data, which is expected to reflect the underlying causal relationships. We applied GGMs to assess multimodal regional brain alterations in AD. We obtained data from N=972 subjects from the Alzheimer's Disease Neuroimaging Initiative. The mean amyloid load (AV45-PET), glucose metabolism (FDG-PET), and gray matter volume (MRI) were calculated. GGMs were estimated using a Bayesian framework for the combined multimodal data to obtain conditional dependency networks. Conditional dependency matrices were much sparser (10% density) than Pearson correlation matrices (50% density). Within modalities, conditional dependency networks yielded clusters connecting anatomically adjacent regions. For associations between different modalities, only a few region-specific connections remained. Graph-theoretical network statistics were significantly altered between groups, with a biphasic U-shaped trajectory. GGMs removed shared variance among multimodal measures of regional brain alterations in MCI and AD, and yielded sparser matrices compared to Pearson correlation networks. Therefore, GGMs may be used as an alternative to the thresholding approaches typically applied to correlation networks to obtain the most informative relations between variables.
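
The contrast between Pearson correlation and conditional dependency can be shown on simulated data. The sketch below is not the Bayesian estimation used in the paper: it swaps in the plain inverse of the sample covariance matrix, and the chain structure X -> Y -> Z is a made-up example chosen so that X and Z are correlated only through Y.

```python
import numpy as np

def partial_correlations(data):
    """Partial correlation matrix from the inverse sample covariance.

    data: shape (n_samples, n_variables). Entry (i, j) is the correlation
    between variables i and j after conditioning on all other variables.
    """
    precision = np.linalg.inv(np.cov(data, rowvar=False))
    scale = np.sqrt(np.diag(precision))
    pcorr = -precision / np.outer(scale, scale)
    np.fill_diagonal(pcorr, 1.0)
    return pcorr

# Simulated chain X -> Y -> Z (illustrative data, not ADNI measurements).
rng = np.random.default_rng(0)
n = 5000
x = rng.normal(size=n)
y = x + rng.normal(size=n)
z = y + rng.normal(size=n)
data = np.column_stack([x, y, z])

pearson = np.corrcoef(data, rowvar=False)
pcorr = partial_correlations(data)

# X and Z are clearly correlated marginally, but nearly independent given Y.
print("Pearson  corr(X, Z):", round(pearson[0, 2], 3))
print("Partial  corr(X, Z | Y):", round(pcorr[0, 2], 3))
```

In this toy setting the Pearson matrix is dense while the partial correlation matrix has a near-zero X-Z entry, which is the kind of sparsification the abstract reports for the multimodal brain data.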

[4]  arXiv:2003.12405 (cross-list from eess.SP)
Title: Bayesian Sequential Joint Detection and Estimation under Multiple Hypotheses
Comments: 13 pages, 2 figures, with supplementing materials, submitted to the IEEE Transactions on Signal Processing
Subjects: Signal Processing (eess.SP); Information Theory (cs.IT); Applications (stat.AP)

We consider the problem of jointly testing multiple hypotheses and estimating a random parameter of the underlying distribution. This problem is investigated in a sequential setup under mild assumptions on the underlying random process. The optimal test minimizes the expected number of samples while ensuring that the average detection/estimation errors do not exceed a certain level. After converting the constrained problem to an unconstrained one, we characterize the general solution by a non-linear Bellman equation, which is parametrized by a set of cost coefficients. A strong connection between the derivatives of the cost function with respect to the coefficients and the detection/estimation errors of the sequential procedure is derived. Based on this fundamental property, we further show that for suitably chosen cost coefficients the solutions of the constrained and the unconstrained problem coincide. We present two approaches to finding the optimal coefficients. In the first approach, the final optimization problem is converted to a linear program, whereas the second approach solves it with projected gradient ascent. To illustrate the theoretical results, we consider two problems for which the optimal tests are designed numerically. Monte Carlo simulations confirm that the numerical results agree with the theory.

[5]  arXiv:2003.12540 (cross-list from stat.ME)
Title: A super scalable algorithm for short segment detection
Comments: To be published in Statistics in Biosciences
Subjects: Methodology (stat.ME); Applications (stat.AP); Computation (stat.CO)

In many applications such as copy number variant (CNV) detection, the goal is to identify short segments on which the observations have different means or medians from the background. Those segments are usually short and hidden in a long sequence, and hence are very challenging to find. We study a super scalable short segment (4S) detection algorithm in this paper. This nonparametric method clusters the locations where the observations exceed a threshold for segment detection. It is computationally efficient and does not rely on a Gaussian noise assumption. Moreover, we develop a framework to assign significance levels to detected segments. We demonstrate the advantages of our proposed method through theoretical, simulation, and real-data studies.
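
The threshold-then-cluster idea described in this abstract can be sketched in a few lines. This is an illustration under stated assumptions, not the published 4S algorithm: the background is assumed to be centered at zero, and the parameters `max_gap` and `min_points` (and the function name) are hypothetical choices for how exceedance locations are grouped and filtered.

```python
import numpy as np

def detect_short_segments(x, threshold, max_gap=1, min_points=3):
    """Group locations where |x| exceeds a threshold into candidate segments.

    Exceedance locations closer than or equal to max_gap apart are merged
    into one cluster; clusters with fewer than min_points locations are
    dropped. Returns a list of (start_index, end_index) pairs, inclusive.
    """
    idx = np.flatnonzero(np.abs(x) > threshold)
    segments = []
    if idx.size == 0:
        return segments
    start = prev = idx[0]
    count = 1
    for i in idx[1:]:
        if i - prev <= max_gap:
            count += 1
        else:
            # Gap too large: close the current cluster and start a new one.
            if count >= min_points:
                segments.append((int(start), int(prev)))
            start = i
            count = 1
        prev = i
    if count >= min_points:
        segments.append((int(start), int(prev)))
    return segments

# Toy sequence: a flat background, one short shifted segment, one isolated spike.
x = np.zeros(50)
x[20:25] = 3.0   # short segment with a shifted mean
x[40] = 3.0      # isolated exceedance, filtered out by min_points
print(detect_short_segments(x, threshold=2.0))  # prints [(20, 24)]
```

The single pass over the exceedance locations keeps the cost linear in the sequence length, which is in the spirit of the scalability the abstract emphasizes.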

Replacements for Mon, 30 Mar 20

[6]  arXiv:1901.10399 (replaced)
Title: Optimal Replacement Policy under Cumulative Damage Model and Strength Degradation with Applications
Subjects: Applications (stat.AP); Computation (stat.CO); Methodology (stat.ME)
