Glottal activity region based processing for speech synthesis

Show simple item record

dc.contributor.author Adiga, Nagaraj
dc.date.accessioned 2019-03-29T11:59:03Z
dc.date.available 2019-03-29T11:59:03Z
dc.date.issued 2017
dc.identifier.other ROLL No. 11610235
dc.identifier.uri http://gyan.iitg.ernet.in/handle/123456789/1038
dc.description Supervisor: S R M Prasanna en_US
dc.description.abstract Statistical parametric speech synthesis (SPSS) is the mostly preferred synthesizer compared to concatenative synthesis system, due to small footprint and flexibility. However, the naturalness and intelligibility of SPSS are still lagging behind the concatenative synthesis system. In this thesis, glottal activity region based processing for speech synthesis is proposed to improve the quality of speech. Glottal activity regions are perceptually important and constitute the majority of speech sounds. The major contributions of the present thesis are (I) Glottal activity region detection using features like strength of excitation, normalized autocorrelation peak strength, and higher order statistics. (ii) Vocal-tract smoothed spectral envelope computation by applying Riesz transform in the 2-D domain. (iii) Source model is designed with representation for aperiodic and phase components using integrated LP residual. (iv) The combination of suprasegmental, system, and source features for modeling together in SPSS to improves the prosody, naturalness, and intelligibility of SPSS. en_US
dc.language.iso en en_US
dc.relation.ispartofseries TH-1840;
dc.subject ELECTRONICS AND ELECTRICAL ENGINEERING en_US
dc.title Glottal activity region based processing for speech synthesis en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Browse

My Account