Table of contents
1 Introduction
1.1 Applications of speech production knowledge
1.2 Purpose and motivation
1.3 Global overview
1.3.1 Techniques to capture articulatory information
1.3.2 Speech synthesis approaches
1.3.3 Articulatory data augmentation
1.3.4 Generic speaker modeling
1.4 Thesis organization
2 Databases
2.1 Requirements for a database
2.2 MRI databases for speech production research
2.2.1 Vowels MRI database
2.2.2 ATR MRI database
2.2.3 rtMRI-TIMIT database
2.2.4 rtMRI database for Portuguese
2.2.5 USC-EMO-MRI corpus
2.2.6 USC Speech and Vocal Tract Morphology MRI database
2.2.7 "Seeing Speech" database
2.3 ArtSpeechMRIfr
2.3.1 General description of the ArtSpeechMRIfr database
2.3.2 Data acquisition
2.3.3 Database description
2.3.4 Applications
2.4 Conclusion of Databases
3 Acoustic Simulations
3.1 Comparison between various types of simulations
3.1.1 Introduction about acoustic simulations
3.1.2 Data acquisition
3.1.3 Data processing
3.1.4 Acoustic simulations
3.1.5 Electrical simulation
3.1.6 Experiments
3.1.7 Discussion about various types of simulations
3.2 Impact of head position on phonation
3.2.1 Introduction about the effect of head position on phonation
3.2.2 Experiments
3.2.3 Discussion about the effect of head position on phonation
3.3 Impact of approximation at the level of velum and epiglottis
3.3.1 Introduction about geometric simplifications of the vocal tract
3.3.2 Experiments
3.3.3 Discussion about the effect of velum and epiglottis simplification
3.4 Discussion about acoustic simulations
4 2D to 3D extension
4.1 Introduction about 2D to 3D extension
4.2 Dynamic 3D vocal tract shape generation
4.2.1 Acquiring the data
4.2.2 Phonetic alignment of sound recordings
4.2.3 Image transformation
4.2.4 Denoising procedure
4.2.5 Experiments on 3D shape generation
4.2.6 Conclusions about dynamic 3D vocal tract shape generation
4.3 Further extensions
4.3.1 Vocal tract sagittal slices estimation from MRI midsagittal slices
4.3.2 Synthesizing MRI vocal tract data using "silence" MR images
4.4 Discussion about 2D to 3D extension
5 Generic speaker model
5.1 Method
5.1.1 Subjects
5.1.2 Data acquisition
5.1.3 Vocal tract measurements
5.1.4 Atlas construction
5.2 Results
5.3 Discussion about generic speaker model
6 Discussion
6.1 Contributions of thesis
6.2 Selection of unexplored research questions
6.3 Directions to expand this thesis
7 Detailed summary in French
7.1 Introduction
7.2 Databases
7.3 Acoustic simulations
7.4 2D to 3D transformation
7.5 Generic speaker model
7.6 Discussion
Bibliography