Professional Documents
Culture Documents
∑
1
the one-order moment x = aij and the two-order The matrix A of a linear transformation T relative
n i, j =1
to a given basis {ε1 , ε 2 ,", ε n } is unique. The
n
eigenvalues of A , is same as ones of T , indicate the
∑
1
moment s 2 = (aij − x ) 2 of the slices. This characteristics of the linear transformation T . For
n − 1 i , j =1
example, the equation Kx = ω 2 x is a vibration system,
method is useful and efficient. Note that the sources of
data are multiform. It is difficult to find the difference the vibration frequency ω 2 is just the eigenvalue of the
among slices using the moment of order one or two if system and K represents the stiffness matrix in the
the slices have the similar data elements but the dynamic system.[8]
different data distribution at an target dimension in a
data cube. As an example, two slices are displayed in 5. Eigenvalues Mining
Figure 2.
Let A ∈ R m×n be a m × n matrix, R m×n be the
1 2 3 8 9 7 linear space of m × n real matrices. The eigenvalues
4 5 6 6 5 4 of any n× n real matrix are usually complex. In
7 8 9 3 2 1 addition, the matrices generated by slices are also in
R m×n . So, the main work in this paper focuses on
Figure 2. Two slices
mining eigenvalues which exhibit the traits of
movement from the slices in the system.
From the two slices, we only have x1 = x2 = 5 and
s 21 = s 2 2 = 0 .
41
Definition 1. Assume that A ∈ C m×n , and the
numbers λ1 , λ2 , ", λr are the eigenvalues of A Η A .
The nonnegative numbers
σ i = λi , (i = 1, 2, ", r ) are called the singular values
of the matrix A . Where C m×n is the collection of m× n
complex matrices, A Η is the transpose of A , and r is
the rank of A .
Definition 2. Let A ∈ C m×n . The matrix A is called
the unitary matrix if A Η A = E . And we use Σ n×n to
denotes the collection of the unitary matrices.
Theorem 1.(Singular Value Decomposition Theorem) Figure 3. Results(1)
m×n
Let A ∈ Cr . Then there exists two unitary
m×m
matrices U ∈ Σ and V ∈ Σ n×n such that
D 0 r ×n− r .
U Η AV = r
0 m− r×r 0 m− r×n −r
Where Dr = diag(σ 1 , σ 2 , ", σ r ) , σ 1 ≥ σ 2 ≥ " ≥ σ r > 0 and
r is the rank of A .
Theorem 2. Let A ∈ C m×n . Then σ 1 =|| A || 2 is the
maximum singular value of A , called spectral norm of
A .[9]
According to Theorem 1, applying singular value
decomposition to slices is an effective method of Figure 3 . Results(2)
algebraic feature extraction. Using this method can
find the basic structure and the algebraic essence of the Analysis of Results
data matrix. Form 1951 to 2001, Population of the ten blocks
Theorem 2 indicates that the first singular value of increased from 3,400,000 to 6,920,000, area from
the matrix is maximum, and it can present the algebraic 2,412 m2 to 2,963 m2 and population density from
feature of the matrix. Therefore, searching for the 1,410 people per m2 to 2,336 people per m2 .
maximum singular value from the slices in the same
According to the distribution of the singular values, we
class does not influence the similarity and the
have the following:
abnormality of the slices. Furthermore, the
(1) The curve of singular values in outskirts which
computation to find the singular value is not difficult.
has the stable area is growing, such as in Nanhui and
We can easily find the singular value of the two slices
Congming. Especially in Congming located in the delta
shown in Figure 2 is (16.85,1.07,0.00) and
of the Yangtse Rive, the population is increasing while
(15.36,7.00,0.28), respectively.
the area does not change.
(2) The curve of singular values of the areas in the
6. Experiment and Analysis centre of the city tends to decrease such as in Hongkou,
Zhabei, Yangpu, Luwan and Xuhui. The change focus
Hardware and Software: 1.3GHz CPU, 128M on the years 1960 and 1980 since there was an area re-
Memory, Matlab 6.5 division in Shanghai in 1960 and 1984. The curve of
Data Source: To analyze the singular value, we have singular values exhibits the period and the efficiency of
chosen the data including the populations, families, this regulation.
areas and population density involving 10 blocks of 19 (3) The most abnormal area lies in Huangpu. Being
regions in Shanghai City from 1951 to 2001. Among the centre of economy and culture in Shanghai,
them, 60 percent of blocks are located in the centre of Huangpu area is always very susceptible to economy
the city, 40 percent of blocks are located in outskirt. and polity.
Choose the time as the target dimension and the size of In this example, by analyzing the singular values of
the subcube as each 3 year. Finding the maximum the slices and searching for the similarity and
singular values of 17 slices in each region have taken abnormality, we knew the tendency of movement and
1.05 seconds. The results are demonstrated in Figure 3. found why .it was so. The results of this example
42
indicate that it is valuable to research the singular Data Bases, Edinburgh: Morgan Kaufmann Publishers, 1999,
values of the slices in data mining. pp. 42-53.
8. References
43