Abstract
Our approach in text independent Speaker Verification (SV) proposes to integrate different aspects of the speech signal which convey information about the speaker’s identity using Graphical Models (GM). Prosodic, spectral and source information obtained from the residue of linear prediction analysis are modeled in a probabilistic framework with a system based on Bayesian Networks (BN). The structure, or conditional independencies between the variables, is learned directly from the data using two different algorithms. In particular, the interpretation and comparation of the structures is presented. Some experiments conducted on the NIST 2003 one speaker text-independent data base have been conducted to demonstrate the feasibility of this approach.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Chow, C.K., Liu, C.N.: Approximating discrete probability distributions with dependence trees. IEEE Transactions on Information Theory 14(3), 462–467 (1968)
Cooper, G.F., Herskovits, E.: A Bayesian Method for the Induction of Probabilistic Networks from Data. Machine Learning 9, 309–347 (1992)
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society 34, 1–38 (1997)
Heckerman, D.: A tutorial on Learning with Bayesian Network Structures. Lerning in Graphical Models. MIT Press, Cambridge (1998)
Switchboard Corpora LDC, http://www.ldc.upenn.edu/
Pearl, J.: Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, San Diego (1988)
Reynolds, D.A.: A Gaussian Mixture Modeling Approach to Text-Independent Speaker Identification. PhD thesis, Georgia Institute of Technology (1992)
Rissanen, J.: Modeling by shortest data description. Automatica 14, 465–471 (1978)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sánchez-Soto, E., Sigelle, M., Chollet, G. (2005). Graphical Models for Text-Independent Speaker Verification. In: Chollet, G., Esposito, A., Faundez-Zanuy, M., Marinaro, M. (eds) Nonlinear Speech Modeling and Applications. NN 2004. Lecture Notes in Computer Science(), vol 3445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11520153_26
Download citation
DOI: https://doi.org/10.1007/11520153_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27441-4
Online ISBN: 978-3-540-31886-6
eBook Packages: Computer ScienceComputer Science (R0)Springer Nature Proceedings Computer Science
Keywords
- Bayesian Network
- Bayesian Information Criterion
- Speech Signal
- Conditional Independency
- Minimal Description Length
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
