Graphical Models for Text-Independent Speaker Verification

Sánchez-Soto, Eduardo; Sigelle, Marc; Chollet, Gérard

doi:10.1007/11520153_26

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3445)

Included in the following conference series:

International School on Neural Networks, Initiated by IIASS and EMFCSC

Abstract

Our approach to text-independent Speaker Verification (SV) uses Graphical Models (GM) to integrate different aspects of the speech signal that convey information about the speaker’s identity. Prosodic, spectral, and source information (the latter obtained from the residual of linear prediction analysis) is modeled in a probabilistic framework with a system based on Bayesian Networks (BN). The structure, that is, the set of conditional independencies between the variables, is learned directly from the data using two different algorithms. In particular, the resulting structures are interpreted and compared. Experiments on the NIST 2003 one-speaker text-independent database demonstrate the feasibility of this approach.
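
The abstract does not name the two structure-learning algorithms, so the following is only a minimal sketch of what learning a structure directly from data can look like, assuming a dependence-tree learner in the style of Chow and Liu: pairwise mutual information is estimated from discrete observations, and a maximum-weight spanning tree over the variables is kept. The variable names (`pitch`, `spectral`, `residual`) and the toy observations are hypothetical stand-ins for quantized prosodic, spectral, and source features; they are not taken from the paper.

```python
import math
from collections import Counter
from itertools import combinations


def mutual_information(xs, ys):
    """Empirical mutual information (in nats) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * math.log(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def chow_liu_tree(data):
    """Return the edges of a maximum-weight spanning tree over the variables.

    `data` maps a variable name to its list of discrete observations.
    Edge weights are pairwise mutual informations; the tree is built with
    Kruskal's algorithm and a small union-find structure.
    """
    names = list(data)
    weighted = sorted(
        ((mutual_information(data[a], data[b]), a, b)
         for a, b in combinations(names, 2)),
        reverse=True,  # strongest dependencies first
    )
    parent = {v: v for v in names}

    def find(v):
        # Follow parent links to the component root, compressing the path.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    edges = []
    for _, a, b in weighted:
        ra, rb = find(a), find(b)
        if ra != rb:  # adding this edge does not create a cycle
            parent[ra] = rb
            edges.append((a, b))
    return edges


# Hypothetical, made-up binary observations (one value per frame).
toy = {
    "pitch":    [0, 0, 1, 1, 0, 1, 0, 1],
    "spectral": [0, 0, 1, 1, 0, 1, 1, 1],
    "residual": [0, 1, 0, 1, 1, 0, 0, 1],
}
tree_edges = chow_liu_tree(toy)
print(tree_edges)
```

The returned edges are undirected; choosing a root and orienting edges away from it would give one possible Bayesian network with that tree skeleton. A score-based search over general directed graphs is a different family of methods and is not shown here.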

Author information

Authors and Affiliations

Département de Traitement du Signal et des Images, CNRS UMR LTCI, École Nationale Supérieure des Télécommunications, 46, rue Barrault, 75634, Paris Cedex 13, France
Eduardo Sánchez-Soto, Marc Sigelle & Gérard Chollet

Editor information

Editors and Affiliations

CNRS LTCI/TSI Paris, 46 rue Barrault, 75634, Paris Cedex 13, France
Gérard Chollet
Department of Psychology, Second University of Naples, and IIASS, Via Pellegrino 19, 84019, Vietri sul Mare, SA, Italy
Anna Esposito
Escola Universitària Politècnica de Mataró, Universitat Politècnica de Catalunya, Barcelona, Spain
Marcos Faundez-Zanuy
Dipartimento di Fisica “E.R. Caianiello”, Università degli Studi di Salerno, Via S. Allende, 84081, Baronissi, SA, Italy
Maria Marinaro

About this paper

Cite this paper

Sánchez-Soto, E., Sigelle, M., Chollet, G. (2005). Graphical Models for Text-Independent Speaker Verification. In: Chollet, G., Esposito, A., Faundez-Zanuy, M., Marinaro, M. (eds) Nonlinear Speech Modeling and Applications. NN 2004. Lecture Notes in Computer Science, vol 3445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11520153_26

DOI: https://doi.org/10.1007/11520153_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27441-4
Online ISBN: 978-3-540-31886-6
eBook Packages: Computer Science; Computer Science (R0); Springer Nature Proceedings Computer Science
