Title: Acoustic Phonetics How to read spectrograms
1Acoustic PhoneticsHow to read spectrograms
Credits Bert Vaux, UW-Milwaukee Andrew
Faulkner, University College London
2Acoustic Phonetics
- Spectrogram Basic
- Spectral Cues
- Vowels
- Consonants
- Continuous Speech
- Epilogue
3I. Spectrogram Basics
4Source-Filter Theory of Speech Production
Source
Filter
Speech
- Speech production can be divided into two
independent parts - Source of sound, such as the larynx
- Filters that modify the source, such as the vocal
tract
5Spectrograms
Spectrograms of "hee" "ha" and "who".
6Spectrograms
X-axis (horizontal) Time (milliseconds) Y-axis
(vertical) Frequency (Hertz) Z-axis (3-d
basically, coming right toward you) Amplitude
--gt amplitude corresponds to signal strength,
perceived as volume/loudness higher amplitudes
show up as darker marks on the spectrogram
7Fundamental Frequency
- measure of vocal fold vibration (Hz)
- Hz hertz cycles per second
-
- Males 120 Hz
- Females 255 Hz
- Children 265 Hz
8II. Spectral Cues
9A. Vowels
- Produced with no significant constriction of oral
tract - Articulations change resonances of the vocal
tract - Moving the tongue, lips and jaw change the shape
of the vocal tract - Resonances of the vocal tract are called formants
- Changing the shape of the vocal tract changes the
- formant frequencies
10Vowels Formants
11Vowels Formants
12Vowels Formants
13Vocal Folds
14The Vocal Folds in Action(very slow motion)
http//www.humnet.ucla.edu/humnet/linguistics/faci
liti/demos/demos.html
15Fundamental Frequency
- measure of vocal fold vibration (Hz)
- Hz hertz cycles per second
-
- Males 120 Hz
- Females 255 Hz
- Children 265 Hz
16Vowels Formants
17B. Consonants
- Stops
- Voicing
- Aspiration
- Nasals
- Fricatives
- Voicing
- Approximants
- (liquids and glides)
181. Stops (e.g., /b/,/p/,/d/,/t/,/k/,/g/)
- Complete closure of the vocal tract
- Blocks flow of air through the oral cavity
- During voiced stops, only a voice bar can be
produced during the closure
19Stops Voice Bar
idi
Voice Bar
20Stops (e.g., /b/,/p/,/d/,/t/,/k/,/g/)
- Complete closure of the vocal tract
- Blocks flow of air through the oral cavity
- During voiced stops, only a voice bar can be
produced during the closure - Burst created at consonant release
- Created as closure is partially opened
- Frication noise created by blowing air through
partial opening - Very short duration
21Stops Burst
idi
Burst
22Stops Voicing
- Voiced Consonants
- Voicing starts less than about 30 ms after
release - Voicing can occur during closure
- Unvoiced Consonants
- Voicing starts more than about 50 ms after
release - Voicing cannot occur during closure
23Stops Voicing
AtA vs. AdA?
24Stops Voicing
AdA
AtA
25Voicing and Aspiration
- Voiced Consonants
- Voicing starts less than about 30 ms after
release - Voicing can occur during closure
- Unvoiced Consonants
- Voicing starts more than about 50 ms after
release - Voicing cannot occur during closure
- Duration between release (i.e. burst) and start
of voicing is - called Voice Onset Time (VOT).
-
- long VOT gt aspiration
26Stops Aspiration
AdA
AtHA
Aspiration
27American English Aspiration
282. Nasals (e.g., /m/, /n/)
- Similar to voiced stops, except air also flows
through nasal cavity
- Velum controls airflow through nasal cavity
- Airflow allows voicing to continue during closure
- Stoppage of airflow in oral tract results in
decreased energy, characterized by nasal zeroes
(regions of extremely low energy)
29Nasals
/ana/ vs. /ada/?
30Nasals
ada
ana
313. Fricatives (e.g., /s/,/z/,/f/,/v/)
- Created by forcing air through a small
constriction - Noise is produced due to turbulence
32Fricatives
afa
asa
33Voiced Fricatives (e.g., /z/,/v/)
- Vocal folds can vibrate at the same time that
fricative energy is produced - Creates a voice bar during the fricative
34Voiced vs. Voiceless Fricatives (1)
/T/ vs. /D/?
35Voiced vs. Voiceless Fricatives (1)
aTa
aDa
36Voiced Vs. Voiceless Fricatives (2)
37Voiced Vs. Voiceless Fricatives (3)
38Approximants (e.g., /l/,
/r/, /j/, /w/)
- More open constriction than for fricatives
- Free flow of air produces no turbulence
- Voicing continues during consonant
39Approximants (e.g., /l/, /r/, /j/, /w/)
- More open constriction than for fricatives
- Free flow of air produces no turbulence
- Voicing continues during consonant
- Similar to vowels
- Approximants have lower F1 than for vowels
- Glides tend to have more formant movement than
vowels
40Approximants
awa
41Continuous Speech
42Continuous Speech
Im an idiot
43Continuous Speech
ajm n Idijt
44Continuous Speech, Segmented
a
a
45Summary of Spectral Cues
- Vowels (open oral tract)
- identified using formant structure (esp. F1 and
F2) - Consonants (differing degrees of constriction in
the oral tract) - Stops stoppage of air voice bar aspiration
- Nasals vowel-like, with nasal zeros
- Fricatives noise
- Approximants vowel-like formant movement
46IV. Epilogue
47Broad vs. Narrow Transcription
lta teegt lta rheegt
Broad tij rij
Narrow tHij rij
48Broad vs. Narrow Transcription
lta teegt lta treegt lta churchgt
Broad tij trij
tSrtS
Narrow tHij tSrij
tSrtS
49l "clean" vs. "lean"
50l "clean" vs. "lean"
klijn
lijn
51"Stoughton"
52"Stoughton"
stHo?n
stHow?n