Singular Value Decomposition and Its Visualization

Lingsong Zhang1, J. S. Marron, Haipeng Shen and Zhengyuan Zhu

Abstract

Singular Value Decomposition (SVD) is a useful tool in Functional Data Analysis (FDA). Comparing to Principal Component Analysis (PCA), SVD is more fundamental, because SVD simultaneously provides the PCAs in both row and column spaces. We compare SVD and PCA from the FDA view point, and extend the usual SVD to variations by considering different recenterings. A general scree plot is proposed to select an appropriate recentering in practice. Several matrix views of the SVD components are introduced to explore different features in data, including SVD surface plot, rotation movie, curve movie and image plot. These methods visualize both column and row information of a two-way matrix simultaneously, relate the matrix to relevant curves, show local variations and interactions between columns and rows. Several toy examples are designed to compare as well as reveal the different variations of SVD, and real data examples are used to illustrate the usefulness of the visualization methods.
Key words: Exploratory Data Analysis, Functional Data Analysis, Principal Component Analysis.

Introduction

Functional Data Analysis (FDA) is the study of curves (and more complex objects) as data [Ramsay and Silverman, 1997,Ramsay and Silverman, 2002]. Methods related to Principal Component Analysis (PCA) have provided many insights. Comparing to the PCA method, Singular Value Decomposition (SVD) is more fundamental, because SVD not only provides a direct approach to calculate the principal components (PCs), but also derives the PCAs in row and column spaces simultaneously. In this paper, we view a set of curves as a two-way data matrix, explore the connections and differences between SVD and PCA from a FDA view point, and propose several visualization methods for the SVD components.
SVD provides a useful factorization of the data matrix X, while PCA provides a nearly parallel factoring, via eigen-analysis of the covariance matrix, i.e. XTX, when X is column centered at 0. The eigenvalues for XTX are then the squares of the singular values for X, and the eigenvectors for XTX are the singular rows for X. In this paper, we extend the usual (column centered) PCA method into a general SVD framework, and consider four types of SVDs based on different recenterings: SVD, column SVD (CSVD), row SVD (RSVD) and double SVD (DSVD) (cf. Section for details). Several criteria are discussed in this paper for model selection, i.e. selecting the appropriate type of SVD, including model simplicity, approximation performance, and interpretability etc. We introduce a general scree plot, which combines model simplicity and approximation performance together, and provides a visual aid for model selection. See Section for details. Several toy examples in Section are designed to illustrate the general scree plot and the differences of the four types of recenterings.
Visualization methods always help to find underlying features of a data set. In the context of PCA or SVD, common visualization methods include the biplot [Gabriel, 1971], scatter plots between singular columns or singular rows [Shen and Huang, 2005], etc. The biplot shows the relations between the rows and columns, and the scatter plot can be used to cluster the rows or columns. However these plots fail to show the functional curves.
In the FDA field, besides the above plots, it is common to plot singular columns or singular rows as curves [Ramsay and Silverman, 1997]. Marron et al., [2004] provided a visualization method for functional data (using functional PCA, basically CSVD in our paper), which shows the functional objects (curves), projections on the PCs and the residuals. These methods can also be applied in the SVD framework, but to understand all the structure in the data, they need to be applied twice, once for the rows and once for the columns. When the data set is a time series of curves, the method in Marron et al., [2004] uses different colors to show the time ordering. However, if the time series structure is complicated, color coding curves might not be enough to reveal the time effect.
In our paper, we propose several matrix views of the SVD components, which provide new underlying features of the data set. The major visualization tool is a set of surface plots. Related to the surface plot, we introduce SVD rotation movie, SVD curve movie, and SVD image plot. The surface plots for the SVD components show the functional curves of rows and columns simultaneously. SVD rotation movie shows the surface plots from different view angles, which makes it easier to find underlying information. In fact, the visualization method in Marron et al., [2004] (for example, the first column in Figure 3 of their paper) can be viewed as the surface plots from a special angle. Another special view angle, the overlook view of the surface plots, becomes the image plot, which can show the relative variation of the surface, and highlight the interaction between columns and rows. Motivated by the definition of SVD components (cf. Section 3), we introduce the SVD curve movie which can show the time varying features of the functions. One motivating example is an Internet traffic data set, which is discussed in Section 2 to illustrate the usefulness of the surface plot, the rotation movie and the curve movie. Two other real applications are reported in Section 6 as well. One chemometrics data set is used to show that a zoomed version of the surface plot and the SVD movies highlight local behaviors. A Spanish mortality data set is analyzed to illustrate that the image plot highlights the cohort effect (i.e., the interaction between age groups and years).
The rest of the paper is organized as follows. The motivating example in network traffic analysis is in Section 2. Section 3 gives a brief introduction of SVD, and compares it with PCA. Section 4 describes in detail about the generation of the plots. Section 5 shows several toy examples to illustrate the four types of recenterings. Two more real applications in chemometrics and demography are reported in Section 6.

Discussions

In this paper, we propose four types of recenterings for SVD, and describe a general scree plot to help decide which recentering is appropriate in practice. Various SVD visualization tools, such as the SVD surface plot, the image plot, the SVD curve movie and the SVD rotation movie, are proposed to find insightful features in a two way data matrix, especially a functional data set. These visualization methods simultaneously show the rows, columns, local variations, and interactions, which help to find underlying features. Furthermore, when the data set has time series structure in one direction, our method shows the time varying features clearly.
With noisy data, we notice that the SVD components contain a lot of noise. If one prefers, one can pre-smooth the data matrix using his favorite smoother, and then apply our programs to gain some insights, and then apply our program to get additional insights. We are working on incorporating some regularized SVD method [Huang et al., 2005] into our program, so that the exploration of noisy data will be easier.
Our notion of the two-way data matrix requires that the data are observed on common grid points. In cases with irregularly observed data points or missing data, one could preprocess the data using some smoothing or interpolation methods to obtain regular observed data matrices, before applying our visualization methods.

Acknowledgement

The authors gratefully acknowledge Dr. Andrés M. Alonso Fernández of Departamento de Estadística in Universidad Carlos III de Madrid, for providing the Spanish mortality data to us. Thanks also go to two referees and an Associate Editor for their helpful comments and constructive suggestions.

References

[Gabriel 1971]
Gabriel, K. R. (1971), "The biplot graphic display of matrices with application to principal component analysis," Biometrika, 58, 453-467.
[Huang et al. 2005]
Huang, J. Z., Shen, H., and Buja, A. (2005), Analysis of Two-Way Functional Data Using Regularized Singular Value Decomposition, working paper.
[Marron et al. 2004]
Marron, J. S., Wendelberger, J. R., and Kober, E. M. (2004), Time Series Functional Data Analysis, Los Alamos National Lab, no. LA-UR-04-3911.
[Ramsay and Silverman 1997]
Ramsay, J. O. and Silverman, B. W. (1997), Functional Data Analysis, Springer-Verlag.
[Ramsay and Silverman 2002]
- (2002), Applied Functional Data Analysis, Methods and Case Studies, Springer-Verlag.
[Shen and Huang 2005]
Shen, H. and Huang, J. Z. (2005), "Analysis of Call Center Arrival Data Using Singular Value Decomposition," Applied Stochastic Models in Business and Industry, 21, 251-263.

Footnotes:

1Correspondence to Lingsong Zhang, Department of Statistics and Operations Research, University of North Carolina, Chapel Hill, NC, 27599-3260. Email: lszhang@email.unc.edu


File translated from TEX by TTH, version 3.67.
On 25 Apr 2006, 02:30.