Sound Structure I: Phonetics

1
Sound Structure I: Phonetics
  • Acoustic phonetics
  • Jan. 27

2
Make vowel sounds from household materials
[Figure: a buzzing sound source plus resonators yields vowels, e.g. [a] and [i]]
From Mark Huckvale: http://www.phon.ucl.ac.uk/home/mark/vowels

3
The source in speech production
  • The production of speech consists of two kinds of
    operations: (1) the generation of sound sources
    (the raw material), and (2) the
    processing/resonating of these sources by the
    vocal tract (the resonator).
  • Vocal fold vibration generates a buzzing sound,
    which is the raw material (source) for producing
    vowel sounds.

4
The vibration of the vocal folds
  • The aerodynamics of vocal fold vibration:
  • With the glottis (the space between the vocal
    folds) closed and an airstream issuing from the
    lungs, the excess pressure below the glottis
    (sub-glottal pressure) forces the vocal folds
    apart. The air passing between the folds
    generates a Bernoulli force that quickly closes
    the glottis. The sub-glottal pressure then builds
    up again, forcing the vocal folds apart again.
  • The vibration of the vocal folds is periodic.
    The frequency of the vibration is determined by
    the air pressure in the lungs and by the vocal
    folds' mechanical properties.

5
Periodicity and frequency
  • Vocal fold vibration is a complex periodic
    motion; it can be decomposed into many Simple
    Harmonic Motions.
  • Simple Harmonic Motion is the simplest periodic
    motion, in which a body oscillates about an
    equilibrium position (rest position) in a
    sinusoidal pattern. A vibrating tuning fork is
    one example.

6
Simple Harmonic Motion
  • Properties of Simple Harmonic Motion:
  • displacement: momentary distance from the rest
    point
  • cycle: one complete oscillation
  • amplitude: maximum displacement
  • frequency: number of cycles per second (Hz)
  • period: number of seconds per cycle
  • What is the difference?
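
The properties listed above can be tied together in a short sketch. This is only an illustration: the function names and the example values (100 Hz, amplitude 2.0) are assumptions, not part of the original slides. It shows that the period is the reciprocal of the frequency, and that the displacement of a sinusoidal oscillation reaches the amplitude a quarter of the way through a cycle.

```python
import math

def period_from_frequency(frequency_hz):
    """Period (seconds per cycle) is the reciprocal of frequency (cycles per second)."""
    return 1.0 / frequency_hz

def shm_displacement(amplitude, frequency_hz, t_seconds):
    """Displacement from the rest point at time t for a sinusoidal oscillation."""
    return amplitude * math.sin(2.0 * math.pi * frequency_hz * t_seconds)

frequency_hz = 100.0  # assumed example value
amplitude = 2.0       # assumed example value, arbitrary units
period_s = period_from_frequency(frequency_hz)

# A quarter of the way through a cycle the displacement equals the amplitude.
peak = shm_displacement(amplitude, frequency_hz, period_s / 4.0)
print("period (s):", period_s)
print("displacement at a quarter period:", peak)
```

Doubling the frequency halves the period, while the amplitude is independent of both.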

7
Spectrum
  • A complex periodic motion can be represented as a
    spectrum showing:
  • Frequency of each simple harmonic component on
    the x-axis
  • Amplitude of each simple harmonic component on
    the y-axis
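
As a rough illustration of how such a spectrum can be obtained, the sketch below builds a complex periodic signal from three sinusoids and recovers their frequencies and amplitudes with a naive discrete Fourier transform. The sample rate, signal length, and component values are assumptions chosen so that each component falls exactly on an analysis frequency; real speech analysis would normally use an FFT library.

```python
import cmath
import math

def amplitude_spectrum(samples, sample_rate):
    """Return (frequency_hz, amplitude) pairs for bins 0..N/2 using a naive DFT."""
    n = len(samples)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(samples))
        # Scale so a sinusoid of amplitude A appears with height A.
        # The DC bin and (for even N) the Nyquist bin are not doubled.
        is_edge_bin = k == 0 or (n % 2 == 0 and k == n // 2)
        scale = 1.0 / n if is_edge_bin else 2.0 / n
        spectrum.append((k * sample_rate / n, abs(coeff) * scale))
    return spectrum

sample_rate = 800  # Hz, assumed
n_samples = 80     # 0.1 s of signal, so bins are 10 Hz apart
components = [(100.0, 1.0), (200.0, 0.5), (300.0, 0.25)]  # (Hz, amplitude), assumed

signal = [sum(a * math.sin(2.0 * math.pi * f * i / sample_rate) for f, a in components)
          for i in range(n_samples)]

for freq, amp in amplitude_spectrum(signal, sample_rate):
    if amp > 0.01:
        print(freq, round(amp, 3))
```

Only the three component frequencies carry noticeable amplitude; every other bin is numerically zero.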

8
The source spectrum in speech production
  • As the vocal folds open and close, puffs of air
    flow through the glottal opening. The frequency
    of these pulses determines the fundamental
    frequency (F0) and contributes to the perceived
    pitch of the produced sound.
  • Besides F0, the glottal air flow also contains a
    large number of harmonics (overtones), whose
    frequencies are integer multiples of F0. The
    amplitude of the harmonics decreases gradually
    as frequency increases.
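
A minimal sketch of such a source spectrum follows. The harmonic frequencies are simply integer multiples of F0. The roll-off of 12 dB per octave is an assumption used here as a common idealization of the glottal source, not a value taken from the slides, and the function name is hypothetical.

```python
import math

def source_harmonics(f0_hz, max_freq_hz, rolloff_db_per_octave=-12.0):
    """List (frequency_hz, level_db) for each harmonic up to max_freq_hz.

    Levels are relative to the first harmonic (F0 itself, at 0 dB).
    The roll-off value is an assumed idealization.
    """
    harmonics = []
    n = 1
    while n * f0_hz <= max_freq_hz:
        level_db = rolloff_db_per_octave * math.log2(n)
        harmonics.append((n * f0_hz, level_db))
        n += 1
    return harmonics

# Example: a voice with F0 = 100 Hz (assumed value)
for freq, level in source_harmonics(100.0, 500.0):
    print(freq, round(level, 1))
```

Each doubling of frequency (one octave) lowers the level by the same number of decibels, which gives the smoothly sloping envelope of the source spectrum.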

9
The vocal tract is a resonator
  • The vocal tract is a resonator. The resonance
    frequencies of the vocal tract, called formants,
    depend on its shape. The first resonance
    frequency is F1, the second resonance frequency
    is F2, and so on.
  • The first two or three formants (their
    frequencies, not amplitudes) are most important
    in speech communication.

10
Final product: vowels
  • The presence of the formants disrupts the
    uniformly sloping envelope of the source spectrum,
    imposing peaks at the formant frequencies.
    Formant frequencies determine vowel quality (/i/
    vs. /a/ vs. /u/, etc.).
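
The interaction of source and resonator can be sketched as follows. This is a toy model under stated assumptions: the source rolls off at an assumed 12 dB per octave, and each formant is represented by a simple bell-shaped gain curve with an assumed peak height and bandwidth. It is not a physical vocal tract model; it only shows how resonances impose peaks on a sloping source spectrum.

```python
import math

def source_level_db(harmonic_number, rolloff_db_per_octave=-12.0):
    """Level of the n-th source harmonic relative to the first (assumed roll-off)."""
    return rolloff_db_per_octave * math.log2(harmonic_number)

def resonance_gain_db(freq_hz, formants_hz, peak_db=20.0, bandwidth_hz=100.0):
    """Toy resonance curve: one bell-shaped boost per formant (assumed shape)."""
    return sum(peak_db / (1.0 + ((freq_hz - f) / bandwidth_hz) ** 2)
               for f in formants_hz)

def vowel_spectrum(f0_hz, formants_hz, max_freq_hz):
    """Harmonic levels after the resonator has shaped the source spectrum."""
    spectrum = []
    n = 1
    while n * f0_hz <= max_freq_hz:
        freq = n * f0_hz
        level = source_level_db(n) + resonance_gain_db(freq, formants_hz)
        spectrum.append((freq, level))
        n += 1
    return spectrum

# Example: F0 = 100 Hz with two resonances at 730 Hz and 1090 Hz (assumed values)
for freq, level in vowel_spectrum(100.0, [730.0, 1090.0], 1500.0):
    print(freq, round(level, 1))
```

In the printed levels, the harmonics closest to the two resonance frequencies stand out above their neighbours, even though the source alone would fall steadily.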

[Figure: source spectrum, vocal tract resonance, and the resulting vowel spectrum]

11
Acoustic vowel space
12
Vowel formants
  • Vowel height is closely related to the first
    formant frequency:
  • low vowels -> higher F1
  • Vowel frontness/backness is closely related to
    the second formant frequency:
  • front vowels -> higher F2
  • Formant frequencies of American English vowels
    (average):

            Male                Female
            F1       F2         F1       F2
    /i/     270 Hz   2290 Hz    310 Hz   2790 Hz
    /a/     730 Hz   1090 Hz    850 Hz   1220 Hz
    /u/     300 Hz   870 Hz     370 Hz   950 Hz
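
To illustrate how F1 and F2 together locate a vowel, the sketch below stores the average values listed above and picks the vowel whose averages lie closest to a measured (F1, F2) pair. The nearest-neighbour rule and the plain distance in Hz are assumptions made for illustration; they are not a method described in the slides.

```python
import math

# Average formant frequencies (Hz) from the slide: vowel -> (F1, F2)
AVERAGE_FORMANTS_HZ = {
    "male": {"i": (270, 2290), "a": (730, 1090), "u": (300, 870)},
    "female": {"i": (310, 2790), "a": (850, 1220), "u": (370, 950)},
}

def nearest_vowel(f1_hz, f2_hz, speaker="male"):
    """Return the vowel whose average (F1, F2) is closest to the measurement.

    Uses straight-line distance in Hz, which is an assumed simplification.
    """
    table = AVERAGE_FORMANTS_HZ[speaker]
    return min(table,
               key=lambda v: math.hypot(f1_hz - table[v][0], f2_hz - table[v][1]))

# Hypothetical measurements
print(nearest_vowel(280, 2200))            # low F1, high F2
print(nearest_vowel(750, 1100))            # high F1
print(nearest_vowel(350, 900, "female"))   # low F1, low F2
```

A low F1 with a high F2 lands on the high front vowel, a high F1 on the low vowel, and a low F1 with a low F2 on the high back vowel, matching the two generalizations above.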

13
Measuring vowel formants
  • Formants can be displayed by computing a
    spectrogram, which shows the energy in the signal
    at different frequencies.
  • A spectrogram is a plot of the intensity of the
    frequency content of a signal as time progresses.
    The y-axis is frequency, the x-axis is time, and
    the intensity is shown on a grey-scale.
  • Praat: doing phonetics by computer.
    http://www.fon.hum.uva.nl/praat/
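
The idea behind a spectrogram can be sketched with a short-time Fourier analysis: cut the signal into overlapping frames, and compute the energy at each frequency within each frame. The code below is a minimal pure-Python illustration, not how Praat computes its spectrograms; the frame length, hop size, window, and test signal are all assumptions.

```python
import cmath
import math

def spectrogram(samples, sample_rate, frame_len, hop):
    """Return (times_s, freqs_hz, magnitudes) where magnitudes[t][k] is the
    DFT magnitude of frame t at frequency freqs_hz[k]."""
    # Hann window to reduce the effect of cutting the signal abruptly
    window = [0.5 - 0.5 * math.cos(2.0 * math.pi * i / (frame_len - 1))
              for i in range(frame_len)]
    freqs = [k * sample_rate / frame_len for k in range(frame_len // 2 + 1)]
    times, magnitudes = [], []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = [samples[start + i] * window[i] for i in range(frame_len)]
        mags = [abs(sum(x * cmath.exp(-2j * math.pi * k * i / frame_len)
                        for i, x in enumerate(frame)))
                for k in range(len(freqs))]
        times.append((start + frame_len / 2.0) / sample_rate)  # frame centre
        magnitudes.append(mags)
    return times, freqs, magnitudes

# Test signal (assumed): 100 Hz for the first half, 300 Hz for the second half
sample_rate = 1000
signal = [math.sin(2.0 * math.pi * (100.0 if i < 250 else 300.0) * i / sample_rate)
          for i in range(500)]

times, freqs, mags = spectrogram(signal, sample_rate, frame_len=100, hop=50)
for t, frame in zip(times, mags):
    strongest = freqs[frame.index(max(frame))]
    print(round(t, 3), strongest)
```

Each row of the result corresponds to one column of a spectrogram image; plotting the magnitudes as grey levels, with time on the x-axis and frequency on the y-axis, gives the familiar display.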

14
The acoustics of consonants
  • The sources of consonant sounds are either
    turbulent airflow (fricatives) or the release of
    compressed air behind a closure (stops). The
    sources are aperiodic, i.e., noise.
  • Note: White noise contains an equal
    distribution of energy spread over all
    frequencies, just as white light is composed of
    all of the colors of the spectrum of visible
    light.
  • The resonator is the front cavity, i.e., the part
    of the oral cavity that is in front of the
    place of constriction. The nasal cavity is used
    as a resonator in producing nasal sounds.

15
Fricatives
  • Fricatives have relatively long durations of
    noise, and it is this lengthy interval of
    aperiodic energy that distinguishes fricatives as
    a sound class. [s, ʃ] have more energy than
    [f, θ].

16
Stops
  • Stops are dynamic: they involve a series of
    articulatory postures.
  • Gap: during closure, the only possible source
    is voicing (shown as a voice bar).
  • Burst: a transient noise produced when the
    oral closure is released.
  • Aspiration: a diffuse noise generated at
    the larynx and possibly the lower pharynx. Its
    spectrum resembles that of the fricative [h].
  • Examples: tie, dye, style.

[Figure labels: gap, burst, aspiration, voice bar]

17
VOT
  • To further distinguish voiced/voiceless and
    aspirated/unaspirated stops, we can use Voice
    Onset Time (VOT).
  • VOT is the duration of the period of time between
    the release of a plosive/stop and the beginning
    of vocal fold vibration. This period is usually
    measured in milliseconds (ms).
  • It is useful to distinguish at least three types
    of VOT, which are shown in the schematic diagram
    below.
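
The definition of VOT translates directly into a small calculation. In the sketch below, the function names, the three category labels (voicing lead, short lag, long lag), and the 30 ms boundary are assumptions chosen for illustration; boundaries between categories differ across languages and are not given in the slides.

```python
def voice_onset_time_ms(release_time_s, voicing_onset_s):
    """VOT in milliseconds: time from stop release to the start of voicing.

    Negative values mean that voicing began before the release.
    """
    return (voicing_onset_s - release_time_s) * 1000.0

def vot_category(vot_ms, long_lag_threshold_ms=30.0):
    """Sort a VOT value into one of three broad types (assumed threshold)."""
    if vot_ms < 0:
        return "voicing lead"
    if vot_ms < long_lag_threshold_ms:
        return "short lag"
    return "long lag"

# Hypothetical measurements: (release time, voicing onset), in seconds
examples = [(0.100, 0.020), (0.100, 0.110), (0.100, 0.160)]
for release, onset in examples:
    vot = voice_onset_time_ms(release, onset)
    print(round(vot, 1), vot_category(vot))
```

Voicing that starts before the release gives a negative VOT, voicing that starts shortly after the release gives a small positive VOT, and a long delay filled with aspiration gives a large positive VOT.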

18
Speech prosody
  • Segments characterize speech sounds.
    Suprasegmental features (prosody) are superimposed
    on segments or segmental sequences.
  • Acoustic cues of prosody:
  • Pitch, determined by fundamental frequency
  • Length, determined by duration
  • Loudness, determined by amplitude
  • Stress, tone, intonation, rhythm, etc.

19
Tones
  • Tone languages use pitch to make lexical
    contrasts (to distinguish words).
  • Contour tones (predominating in Chinese and the
    languages of Southeast Asia): the tones are made
    up of dynamic pitch changes.
  • Register tones (predominating in African
    languages): the tones are made up of a few pitch
    registers: high, low, and maybe mid.

20
Intonation in English
  • A: What types of foods are a good source of
    vitamins?
  • B1: Legumes are a good source of vitamins.
  • B2: Legumes are a good source of vitamins.
  • A: I'd like to fly to Davenport, Iowa on TWA.
  • B: TWA doesn't fly there ...
  • B1: They fly to Des Moines.
  • B2: They fly to Des Moines.
  • A1: I met Mary and Elena's mother at the mall
    yesterday.
  • A2: I met Mary and Elena's mother at the mall
    yesterday.

21
Intonation in English
legumes are a good source of VITAMINS
LEGUMES are a good source of vitamins
22
Intonation in Mandarin Chinese
  • Interaction between tone and intonation

23
Rhythm
  • Rhythm refers to the temporal organization of
    speech elements. It depends on many factors.
  • At the segmental level, for example, vowels are
    longer before voiced stops than before voiceless
    ones. The effect is very large in English (bad
    vs. bat), but smaller in many other languages
    such as French.
  • At the level of syllables, languages have
    different phonotactic constraints on syllable
    shape. For example, many dialects of Chinese
    allow only alveolar and velar nasals in the coda
    position.

24
Rhythm
  • In natural speech, over 70% of French and Spanish
    syllables were open (e.g., CV and V), while
    more than 50% of English syllables were closed
    (e.g., CVC and VC).
  • In Hawaiian there is never more than a single
    consonant between any two vowels, but in English
    there can be seven, as in "texts spread". In Polish
    there are even more.
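
These two measures can be sketched with consonant/vowel (CV) skeletons, where each segment is written as C or V. The function names and the example inputs are assumptions; the skeleton for "texts spread" assumes the transcription t-e-k-s-t-s-s-p-r-e-d, i.e. seven consonants between the two vowels.

```python
def is_open_syllable(shape):
    """A syllable is open if it ends in a vowel (e.g. CV, V), closed otherwise."""
    return shape.endswith("V")

def open_syllable_share(shapes):
    """Proportion of open syllables in a list of syllable shapes."""
    return sum(is_open_syllable(s) for s in shapes) / len(shapes)

def longest_medial_consonant_run(skeleton):
    """Longest run of consonants between two vowels in a CV skeleton."""
    runs = skeleton.split("V")[1:-1]  # drop the runs before the first and after the last vowel
    return max((len(run) for run in runs), default=0)

# Hypothetical sample of syllable shapes
print(open_syllable_share(["CV", "V", "CVC", "CV"]))

# "texts spread": C V CCCCCCC V C (assumed transcription)
print(longest_medial_consonant_run("CV" + "C" * 7 + "VC"))

# A word with strictly alternating consonants and vowels
print(longest_medial_consonant_run("CVCVCV"))
```

Counting over a transcribed corpus in this way is one route to figures like the percentages above, though the slides do not say how those figures were obtained.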