Automatic speech recognition system

somdn_product_page

(Downloads - 0)

Catégorie :

For more info about our services contact : help@bestpfe.com

Table of contents

List of Figures
List of Tables
List of Acronyms
Résumé
Introduction
I Background
1 Automatic and human speech recognition
1.1 Automatic speech recognition system
1.1.1 Voice mechanism
1.1.2 Brief history of ASR
1.1.3 ASR architecture
1.2 Pronunciation variations
1.2.1 Pronunciation variation modeling for ASR
1.2.2 Pronunciation variation modeling for French
1.3 Errors
1.3.1 Errors by ASR
1.3.2 Errors by humans
1.4 Conclusion
2 Prosody
2.1 General definition of prosody
2.1.1 Prosody of French
2.1.2 Prosody for speech technology
2.2 Acoustic correlation of prosody
2.2.1 Fundamental frequency (f0)/Pitch
2.2.2 Intensity/Loudness
2.2.3 Duration/Length
2.2.4 Formant/Timbre
2.2.5 Pauses
2.3 Prosodic structure
2.3.1 Prosodic structure of French
2.4 Prosody in perception
2.5 Conclusion
II Realized works
3 Corpora and methodology
3.1 Corpora
3.1.1 ESTER corpus
3.1.2 PFC corpus
3.2 Methodology
3.2.1 Automatic speech alignment system
3.2.2 Extraction f0, F1, F2, F3 and intensity
3.3 Summary and Conclusion
4 Classification for homophone words
4.1 Automatic transcription errors
4.2 Automatic classification
4.2.1 Corpora for automatic classification
4.2.2 Measurements of acoustic parameters
4.2.3 Considered parameters
4.2.4 Automatic homophone classification
4.3 Perceptual transcription test
4.3.1 Corpus for perceptual evaluation
4.3.2 Perceptual evaluation
4.3.3 Discussion on perceptual evaluation
4.4 Summary and conclusion
5 Large-scale prosodic analyses of French words and phrases
5.1 Corpora and methodology
5.1.1 Corpora
5.1.2 Methodology
5.2 Lexical versus grammatical words
5.2.1 f0 profiles
5.2.2 Duration profiles
5.2.3 Intensity profiles
5.2.4 Short versus long duration impact
5.3 Noun versus noun phrase
5.3.1 f0 profiles
5.3.2 Duration profiles
5.3.3 Intensity profiles
5.3.4 Intervocalic measurements
5.3.5 Homophone noun phrases: fine phonetic detail?
5.4 Conclusion
Conclusions
III Appendix
A 62 selected attributes
A.1 Intra-phonemic attributes: 40 attributes
A.2 Inter-phonemic attributes: 22 attributes
B Homophone classification results
C Average prosodic parameters
C.1 Fundamental frequency and intensity
C.2 Duration
D f0 Profiles in Terms of POS
E f0 Profiles: PFC text reading
Author’s publications
References

Laisser un commentaire

Votre adresse e-mail ne sera pas publiée. Les champs obligatoires sont indiqués avec *