IDEAS home Printed from https://ideas.repec.org/p/nbr/nberwo/0165.html
   My bibliography  Save this paper

Rosetak Document 4: Rank Degeneracies and Least Square Problems

Author

Listed:
  • Gene Golub
  • Virginia Klema
  • G. W. Stewart

Abstract

In this paper we shall be concerned with the following problem. Let A be an m x n matrix with m being greater than or equal to n, and suppose that A is near (in a sense to be made precise later) a matrix B whose rank is less than n. Can one find a set of linearly independent columns of A that span a good approximation to the column space of B? The solution of this problem is important in a number of applications. In this paper we shall be chiefly interested in the case where the columns of A represent factors or carriers in a linear model which is to be fit to a vector of observations b. In some such applications, where the elements of A can be specified exactly (e.g. the analysis of variance), the presence of rank degeneracy in A can be dealt with by explicit mathematical formulas and causes no essential difficulties. In other applications, however, the presence of degeneracy is not at all obvious, and the failure to detect it can result in meaningless results or even the catastrophic failure of the numerical algorithms being used to solve the problem. The organization of this paper is the following. In the next section we shall give a precise definition of approximate degeneracy in terms of the singular value decomposition of A. In Section 3 we shall show that under certain conditions there is associated with A a subspace that is insensitive to how it is approximated by various choices of the columns of A, and in Section 4 we shall apply this result to the solution of the least squares problem. Sections 5, 6, and 7 will be concerned with algorithms for selecting a basis for the stable subspace from among the columns of A.

Suggested Citation

  • Gene Golub & Virginia Klema & G. W. Stewart, 1977. "Rosetak Document 4: Rank Degeneracies and Least Square Problems," NBER Working Papers 0165, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:0165
    as

    Download full text from publisher

    File URL: http://www.nber.org/papers/w0165.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Douglas M. Hawkins, 1973. "On the Investigation of Alternative Regressions by Principal Component Analysis," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 22(3), pages 275-286, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Claudia García-García & Catalina B. García-García & Román Salmerón, 2021. "Confronting collinearity in environmental regression models: evidence from world data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(3), pages 895-926, September.
    2. David A. Belsley, 1976. "Multicollinearity: Diagnosing its Presence and Assessing the Potential Damage It Causes Least Squares Estimation," NBER Working Papers 0154, National Bureau of Economic Research, Inc.
    3. Enache, Daniel & Weihs, Claus, 2004. "Importance Assessment of Correlated Predictors in Business Cycles Classification," Technical Reports 2004,66, Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen.
    4. Norman Fickel, 2001. "Sequential Regression: A Neodescriptive Approach to Multicollinearity," EERI Research Paper Series EERI_RP_2001_09, Economics and Econometrics Research Institute (EERI), Brussels.
    5. Bauer, Jan O. & Drabant, Bernhard, 2023. "Regression based thresholds in principal loading analysis," Journal of Multivariate Analysis, Elsevier, vol. 193(C).
    6. Norman Fickel, 2000. "Sequential Regression: A Neodescriptive Approach to Multicollinearity," Econometrics 0004009, University Library of Munich, Germany.
    7. Fickel, Norman, 2000. "Sequential regression: a neodescriptive approach to multicollinearity," Discussion Papers 33/2000, Friedrich-Alexander University Erlangen-Nuremberg, Chair of Statistics and Econometrics.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberwo:0165. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.