Text to Speech System for the Hindi Language

About This Presentation

Title:

Description:

Number of Views:1367

Avg rating:3.0/5.0

Slides: 17

Provided by: kanupri

Category:

more less

Transcript and Presenter's Notes

Title: Text to Speech System for the Hindi Language

1
Text to Speech System for the Hindi Language

2
Formato de la Presentación

3
Introducción

4
Motivación

In the movie 2001 A Space Odyssey A computer
HAL talks like a human being. It was a brilliant
imagination in 1960s. Now this dream has come
true for English what for hindi movies ???? .
Dependence of human computer interaction on
written texts and images, makes the use of
computers impossible for visually and physically
impaired and illiterate masses
TTS has not been extensively dealt with in the
Hindi Domain

5
La Arquitectura
Source CDAC home page
6
Continue

7
Advantages and problems with Hindi

Hindi resembles Sanskrit. Deterministic rules
for a language like Sanskrit - which is called
loyal - in the sense that people speak what they
write. .
Schwa deletion The effect of not pronouncing
a in certain positions as in the terminal a
for Himalaya.
Anusvara ambiguity This is caused because the
grapheme for anusvara can represent any nasal
sound. eg aMbara ( sky ) , aMda ( egg )

8
Previous Works

Grapheme to Phoneme mapping
1. Media lab in IIT kgp
Rule based Grapheme to Phoneme mapping
by Monojit choudhary
Approach
three category C Half
V Full not the matras
CV Full
Observations from our day-today speaking , few
rules are made to delete schwa and for Anuswara
and chandra-bindu .

9
Example

10
Speech Synthesis

Quality
Intelligibility measured with phonemes,
syllables, words, phrases
Naturalness Difficult to get
1. Formant In this kind of TTS Systems
Voice is generated by the simulation of the
behaviour of human vocal cord . Robotic voice
2. Concatenation syllables are
concatenated at run time and they produce
phonetic representation of text. large storage
3. Sinusoidal speech signal can be
represented as a sum of sine waves with time
varying amplitude and frequencies.

11
Work already done

CDAC has used concatenation method to develop
natural voice
Anuvacchaka
Uchharaka
Bhavna
Bilss technology hybridization of Formant
and Concatenated. We called it RecSimCat .

12
ANN approach

neural networks are trained from actual speech
samples, they have the potential to generate more
natural sounding speech than other synthesis
technologies.
ANN structure
First ANN
Generation of segment durations from
phonetic descriptions
Second ANN
Generation of acoustic
information from phonetic and timing information
speech parameters