Professional Documents
Culture Documents
I. INTRODUCTION
The speaker verification is regarded as a subcategory of
automatic speaker recognition (ASR) system and can apply
to determine whether a person is who he/she claims to be.
Therefore, the problem of speaker verification is a true-false
(accept-reject) question [1-6]. The speaker verification is
desirable widely in many speech related applications, such as
banking by telephone, voice dialing, and biometric security
system [1-6]. Meanwhile, depended on the differences of
recognition target, the systems of speaker verification fall
into two types: text-dependent and text-independent. The
former one requires that the speaker should provide
keywords or sentences of the same text for both training and
recognition, while the latter one dose not depend on the
specific text being spoken [1-6]. For security consideration,
this paper will focus on the problem of the text-dependent
speaker verification.
Several methods have been proposed for speaker
verification. It follows from [1-6] that a typical speaker
verification system consists of two tasks: enrollment and
verification as shown in Fig. 1. Enrollment is the task to
construct a speaker model. This step will capture the speaker
characteristics or features. Most of the present-day systems
use the speaker-specific vocal tract information like MFCCs
or linear prediction coefficients (LPCs) as speaker features
for speaker verification. Then these speaker features are used
to build a model that could authenticate the speaker during
the verification phase. In the speaker verification task, the
This work was supported in part by the National Science Council under
Grant NSC 97 -2221 -E -366 -010 -MY3. Shi-Huang Chen* and
Yu-Ren Luo are with the Department of Computer Science Information
Engineering, Shu-Te University, Kaohsiung, 824, Taiwan.
(*Corresponding author, Tel: +886-7-6158000 ext. 4607, Fax:
+886-7-6152559, e-mail: shchen@mail.stu.edu.tw)
ISBN: 978-988-17012-2-0
Proceedings of the International MultiConference of Engineers and Computer Scientists 2009 Vol I
IMECS 2009, March 18 - 20, 2009, Hong Kong
| S (k ) | =
s (n) e
Fig. 2 show the mel space filter bank with M=40 [10].
(6)
MFCC
1
is the
boundary frequencies for the entire filter bank. f mel
j 2nk
Q p =0
2 Q
where 0 m M-1. Fig. 3 shows the summary of MFCC
calculation process.
(2)
n =1
e (i ) = | S ( k ) | H i ( k )
2
(3)
k =1
,
( f b (i ) f b (i 1) )
H i (k ) =
( f b (i +1) k )
,
( f b (i +1) f b (i ) )
0,
f (x) = i yi K (x, x i ) + b
for k < f b (i 1)
for
f b (i 1) k < f b (i )
for
f b (i ) k < f b (i +1)
(8)
i =1
(4)
In (4), fb(i) are the boundary points of the filters and are
depended on the sampling points Fs and the number of points
N in DFT:
f (mel)low
f
N 1
f mel(low) + i (mel)high
. (5)
f b ( i ) = f mel
M +1
Fs
Here, fmel(low) and fmel(high) are respectively the low and high
ISBN: 978-988-17012-2-0
i yi = 0 , and
i =1
where < , > denotes the inner product and maps the input
space X to another high dimensional feature space F. With
suitably chosen , the given nonlinearly separable samples S
may be linearly separated in F, as shown in Fig. 4. An
improved SVM called soft-margin SVM can tolerate minor
IMECS 2009
Proceedings of the International MultiConference of Engineers and Computer Scientists 2009 Vol I
IMECS 2009, March 18 - 20, 2009, Hong Kong
(+1)
+1
+1
(1) (1)
(+1)
+1
+1
(1)
(+1)
1
+1
1
+1
(1)
(+1)
(+1)
(+1)
(1)
MFCCs
Calculation
SVM
Decision
Reject
User
Claimed
Speaker Model
(9)
Imposter
Model
hyperplane H (x )
ISBN: 978-988-17012-2-0
FAR =
(10)
(11)
IMECS 2009
Proceedings of the International MultiConference of Engineers and Computer Scientists 2009 Vol I
IMECS 2009, March 18 - 20, 2009, Hong Kong
EER
12.2%
5.8%
2.2%
2.7%
2.0%
0.7%
1.3%
0.4%
0.0%
0.2%
0.0%
0.0%
0.4%
80
70
FRR
FAR
60
FRR
FAR
70
60
50
Error Rate
Error Rate
50
40
30
40
30
20
20
FAR[X:97,Y:0]
10
10
20
30
40
50
60
Threshold value
70
80
90
100
ISBN: 978-988-17012-2-0
EER[X:88,Y:0]
FRR[X:96,Y:0]
FAR[X:88,Y:0]
10
FRR[X:94,Y:0]
EER[X:95,Y:2.2222]
10
20
30
40
50
60
Threshold value
70
80
90
100
Fig. 8. A ROC plot of FRR and FAR with MFCC order = 22.
IMECS 2009