DYNAMIC CUR, AN ALTERNATIVE TO VARIABLE SELECTION IN CUR DECOMPOSITION

  1. María Purificación Galindo Villardón 1
  2. Nerea González García 1
  3. Carlos Manuel Martín Barreiro 2
  4. Greibin Villegas Barahona 3
  5. Sergio Hernández González 4
  6. Mercedes Sánchez Barba 1
  1. 1 Universidad de Salamanca, España
  2. 2 ESPOL Polytechnic University, ESPOL, FCNM, Guayaquil, Ecuador
  3. 3 Universidad Estatal a Distancia, Costa Rica
  4. 4 Universidad Veracruzana, México
Journal:
Investigación Operacional

ISSN: 2224-5405

Year of publication: 2019

Volume: 40

Issue: 3

Pages: 391-399

Type: Article

More publications in: Investigación Operacional

Abstract

CUR decomposition is one of the matrix decomposition techniques proposed in the literature for the selection of rows and/or columns of a data matrix. Dynamic CUR is proposed as an alternative to the selection criteria of the CUR decomposition based on probabilistic criteria. This alternative tries to fit the most adequate theoretical probability distribution to the empirical distribution of the leverages obtained from the start and based on it, automatically determines not only the individuals and/or variables that need to be selected, but also their numbers. In this way, Dynamic CUR sets itself apart from CUR in the information selection criteria, dynamizing the calculation of the approximation error starting from an optimal initial selection of parameters based on the most adequate probability distribution. Lastly, with the purpose of facilitating the use of this new method in any practical context, the Dynamic CUR algorithm has been developed in C#.NET and R languages. KEYWORDS: Multivariate analysis, Principal component analysis, CUR decomposition, Correlation, Singular Value Decomposition. MSC: 62E17, 62G30, 49M27