We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Methodology

Title: Sparse multivariate regression with missing values and its application to the prediction of material properties

Abstract: In the field of materials science and engineering, statistical analysis and machine learning techniques have recently been used to predict multiple material properties from an experimental design. These material properties correspond to response variables in the multivariate regression model. This study conducts a penalized maximum likelihood procedure to estimate model parameters, including the regression coefficients and covariance matrix of response variables. In particular, we employ $l_1$-regularization to achieve a sparse estimation of regression coefficients and the inverse covariance matrix of response variables. In some cases, there may be a relatively large number of missing values in response variables, owing to the difficulty in collecting data on material properties. A method to improve prediction accuracy under the situation with missing values incorporates a correlation structure among the response variables into the statistical model. The expectation and maximization algorithm is constructed, which enables application to a data set with missing values in the responses. We apply our proposed procedure to real data consisting of 22 material properties.
Comments: 18 pages
Subjects: Methodology (stat.ME)
MSC classes: 62D10, 62J05, 62J07, 65K10
Cite as: arXiv:2103.09619 [stat.ME]
  (or arXiv:2103.09619v1 [stat.ME] for this version)

Submission history

From: Keisuke Teramoto [view email]
[v1] Wed, 17 Mar 2021 13:09:06 GMT (563kb,D)

Link back to: arXiv, form interface, contact.