Speech Detection - PowerPoint PPT Presentation


Slides: 18

Provided by: clay2

Learn more at: http://web.cs.wpi.edu


1
Speech Detection

Project 1

2
Outline

Motivation
Problem Statement
Details
Hints

3
Motivation

Word recognition needs to detect word boundaries
in speech

Silence Is Golden
4
Motivation

Recognizing silence can reduce
Network bandwidth
Processing load
Easy in soundproof room, with digitized tape
Measure energy level in digitized voice

5
Research Problem

Noisy computer room has loud background noise,
making some edges difficult

Five
6
Research Problem

Computer audio often for interactive applications
Voice commands
Teleconferencing
Needs to be done in real-time

7
Project Solution

Implement end-point algorithm by Rabiner and
Sambur [RS75]
(Paper for class, next)
Implementation in Linux or Windows
Basis for audioconference/Internet phone
(Project 2)

8
Details

Voice-quality
8000 samples/second
8 bits per sample
One channel
Record sound, write files
sound.all - audio plus silence
sound.speech - audio no silence
sound.data - text-based data: audio data, energy,
zero crossings (one line per sample)
128 10 3
127 12 4
127 20 3
Other features allowed
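
The energy and zero-crossing columns of sound.data could be computed per frame along the lines below. This is a minimal sketch, not the method from the paper: it assumes unsigned 8-bit samples centered on 128, takes "energy" to mean the sum of absolute deviations from that midpoint, and the function names frame_energy and frame_zero_crossings are hypothetical.

```c
#include <stddef.h>
#include <stdlib.h>

#define SAMPLE_MIDPOINT 128  /* assumption: zero level of unsigned 8-bit PCM */

/* Sum of |sample - midpoint| over one frame of samples. */
long frame_energy(const unsigned char *samples, size_t count)
{
    long energy = 0;
    for (size_t i = 0; i < count; i++)
        energy += labs((long)samples[i] - SAMPLE_MIDPOINT);
    return energy;
}

/* Number of sign changes of the centered signal within one frame.
   Samples exactly at the midpoint are skipped. */
int frame_zero_crossings(const unsigned char *samples, size_t count)
{
    int crossings = 0;
    int prev_sign = 0;  /* 0 until the first off-center sample is seen */
    for (size_t i = 0; i < count; i++) {
        int centered = (int)samples[i] - SAMPLE_MIDPOINT;
        int sign = (centered > 0) - (centered < 0);
        if (sign != 0) {
            if (prev_sign != 0 && sign != prev_sign)
                crossings++;
            prev_sign = sign;
        }
    }
    return crossings;
}
```

Calling both functions on each recorded frame gives the second and third values of a sound.data line.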

9
Sound in Windows

Microsoft Visual C++
See Web page for basic tutorials
Use sound device - WAVEFORMATEX
wFormatTag set to WAVE_FORMAT_PCM
nChannels, nSamplesPerSec, wBitsPerSample set to
voice quality audio settings
nBlockAlign set to number of channels times the
number of bytes per sample
nAvgBytesPerSec set to the number of samples per
second times the nBlockAlign value
cbSize set to zero
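
The arithmetic for the derived fields can be checked with a small sketch. To keep it compilable without the Windows headers, it uses a hypothetical stand-in struct (voice_format) that mirrors the WAVEFORMATEX fields named on this slide; in the real project the same values would go into a WAVEFORMATEX variable. The helper name make_voice_format is also an assumption.

```c
#include <stdint.h>

/* Hypothetical stand-in mirroring the WAVEFORMATEX fields, so the
   arithmetic can be checked without <mmsystem.h>. */
struct voice_format {
    uint16_t wFormatTag;
    uint16_t nChannels;
    uint32_t nSamplesPerSec;
    uint32_t nAvgBytesPerSec;
    uint16_t nBlockAlign;
    uint16_t wBitsPerSample;
    uint16_t cbSize;
};

#define FORMAT_TAG_PCM 1  /* numeric value of WAVE_FORMAT_PCM */

/* Fill in the voice-quality settings: 8000 samples/second, 8 bits, mono. */
struct voice_format make_voice_format(void)
{
    struct voice_format f;
    f.wFormatTag = FORMAT_TAG_PCM;
    f.nChannels = 1;
    f.nSamplesPerSec = 8000;
    f.wBitsPerSample = 8;
    /* channels times bytes per sample */
    f.nBlockAlign = (uint16_t)(f.nChannels * (f.wBitsPerSample / 8));
    /* samples per second times nBlockAlign */
    f.nAvgBytesPerSec = f.nSamplesPerSec * f.nBlockAlign;
    f.cbSize = 0;
    return f;
}
```

With these settings nBlockAlign is 1 and nAvgBytesPerSec is 8000.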

10
Sound in Windows

waveInOpen()
a device handle (HWAVEIN)
the device number (1 in the movie lab)
the WAVEFORMATEX variable
a callback function
- gets invoked when the sound device has a sample
of audio

11
Sound in Windows

Sound device needs buffers to fill
LPWAVEHDR
lpData for raw data samples
dwBufferLength set to nBlockAlign times the
length (in bytes) of the sound chunk you want
waveInAddBuffer() to give buffer to sound device
Give it device
Buffer (LPWAVEHDR)
Size of variable
When callback invoked, buffer (lpData) has raw
data to analyze
Must give it another via waveInAddBuffer() again

12
Sound in Windows

Useful header files
#include <windows.h>
#include <stdio.h>
#include <stdlib.h>
#include <mmsystem.h>
#include <winbase.h>
#include <memory.h>
#include <string.h>
#include <signal.h>
extern "C"

Useful data types
HWAVEOUT
writing audio device
HWAVEIN
reading audio device
WAVEFORMATEX
sound format structure
LPWAVEHDR
buffer
MMRESULT
Return type from wave system calls

See the online documentation from Visual C++ for
more information

13
Sound in Linux

Linux audio device just like a file
/dev/dsp
open("/dev/dsp", O_RDWR)
Recording and Playing by
read() to record
write() to play
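
Because read() may return fewer bytes than requested, recording a fixed-size chunk usually needs a loop. The sketch below shows one way to do that; the helper name record_chunk is hypothetical, and fd is assumed to be a descriptor already obtained from open("/dev/dsp", O_RDWR).

```c
#include <errno.h>
#include <stddef.h>
#include <sys/types.h>
#include <unistd.h>

/* Read up to len bytes from fd into buf, retrying on short reads.
   Returns the number of bytes read (less than len only at end of
   input), or -1 if read() fails. */
ssize_t record_chunk(int fd, unsigned char *buf, size_t len)
{
    size_t done = 0;
    while (done < len) {
        ssize_t n = read(fd, buf + done, len - done);
        if (n == -1) {
            if (errno == EINTR)
                continue;      /* interrupted by a signal: retry */
            return -1;
        }
        if (n == 0)
            break;             /* end of input */
        done += (size_t)n;
    }
    return (ssize_t)done;
}
```

Playback would mirror this with write() in place of read().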

14
Sound Parameters

Use ioctl() to change sound card parameters
To change sample size to 8 bits
fd = open("/dev/dsp", O_RDWR);
arg = 8;
ioctl(fd, SOUND_PCM_WRITE_BITS, &arg);
Remember to error check all system calls!

15
Sound Parameters

The parameters you will be interested in are
SOUND_PCM_WRITE_BITS
the number of bits per sample
SOUND_PCM_WRITE_CHANNELS
mono or stereo
SOUND_PCM_WRITE_RATE
sample/playback rate
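
The three parameters can be set together, with each call checked for errors. This is a sketch under assumptions: the function names set_voice_quality and open_voice_device are hypothetical, the values are the voice-quality settings from the Details slide, and the constants are taken from <sys/soundcard.h>. Note that ioctl() takes a pointer to the argument.

```c
#include <fcntl.h>
#include <stdio.h>
#include <sys/ioctl.h>
#include <sys/soundcard.h>
#include <unistd.h>

/* Apply 8-bit, mono, 8000 samples/second to an open sound device.
   Returns 0 on success, -1 if any ioctl() call fails. */
int set_voice_quality(int fd)
{
    int bits = 8;
    int channels = 1;
    int rate = 8000;

    if (ioctl(fd, SOUND_PCM_WRITE_BITS, &bits) == -1) {
        perror("SOUND_PCM_WRITE_BITS");
        return -1;
    }
    if (ioctl(fd, SOUND_PCM_WRITE_CHANNELS, &channels) == -1) {
        perror("SOUND_PCM_WRITE_CHANNELS");
        return -1;
    }
    if (ioctl(fd, SOUND_PCM_WRITE_RATE, &rate) == -1) {
        perror("SOUND_PCM_WRITE_RATE");
        return -1;
    }
    return 0;
}

/* Open /dev/dsp and configure it; returns the descriptor or -1. */
int open_voice_device(void)
{
    int fd = open("/dev/dsp", O_RDWR);
    if (fd == -1) {
        perror("open /dev/dsp");
        return -1;
    }
    if (set_voice_quality(fd) == -1) {
        close(fd);
        return -1;
    }
    return fd;
}
```

The driver may write back a different value than the one requested, so a stricter version could compare bits, channels, and rate against the requested settings after each call.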

16
Program Template (Linux)

open sound device
set sound device parameters
record silence
set algorithm parameters
while(1)
  record sound
  compute algorithm stuff
  detect speech
  write data to file
  write sound to file
  if speech, write speech to file
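
The "record silence", "set algorithm parameters", and "detect speech" steps of the template might look like the sketch below. It is deliberately simplified: a single energy threshold stands in for the full Rabiner and Sambur end-point algorithm, which the project actually requires. The frame size of 80 samples (10 ms at 8000 samples/second), the threshold margin, and the names calibrate_threshold and keep_speech are all assumptions. The sketch works on in-memory buffers and leaves file output to the caller.

```c
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

#define FRAME_SAMPLES 80   /* assumption: 10 ms frames at 8000 samples/second */
#define MIDPOINT 128       /* assumption: zero level of unsigned 8-bit PCM */

/* Sum of absolute deviations from the midpoint over one frame. */
static long energy_of(const unsigned char *frame)
{
    long e = 0;
    for (size_t i = 0; i < FRAME_SAMPLES; i++)
        e += labs((long)frame[i] - MIDPOINT);
    return e;
}

/* "record silence" + "set algorithm parameters":
   derive a threshold from frames known to contain only background noise. */
long calibrate_threshold(const unsigned char *silence, size_t frames)
{
    long peak = 0;
    for (size_t f = 0; f < frames; f++) {
        long e = energy_of(silence + f * FRAME_SAMPLES);
        if (e > peak)
            peak = e;
    }
    return 2 * peak + 1;   /* hypothetical safety margin over the noise peak */
}

/* "detect speech" + "if speech, write speech":
   copy frames whose energy exceeds the threshold into speech_out,
   which must hold at least frames * FRAME_SAMPLES bytes.
   Returns the number of frames kept. */
size_t keep_speech(const unsigned char *sound, size_t frames,
                   long threshold, unsigned char *speech_out)
{
    size_t kept = 0;
    for (size_t f = 0; f < frames; f++) {
        const unsigned char *frame = sound + f * FRAME_SAMPLES;
        if (energy_of(frame) > threshold) {
            memcpy(speech_out + kept * FRAME_SAMPLES, frame, FRAME_SAMPLES);
            kept++;
        }
    }
    return kept;
}
```

In the main loop, every recorded frame would go to sound.all, while only the frames returned by keep_speech would go to sound.speech.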

17
Hand In

Online turnin (see Web page)
Turn in
Code
Makefile/Project file
Via email
