You are on page 1of 10

INTRODUCTION

Voice recognition is the automated recognition of human speech. Digital Signal Processing take real-world signals like voice, audio, video, etc that can be digitized and then mathematically manipulated. The simulation focuses on feature extraction followed by feature matching.

FUNDAMENTAL APPROACHES TO VOICE RECOGNITION

1.template matching:
Simple, accurate , uses analog-to-digital converters , vocabulary based.

2.feature analysis:
Speaker independent, first processes the voice input using "Fourier transforms" or "linear predictive coding (LPC)", then finds characteristic similarities between the expected inputs and the actual digitized voice input.

FEATURE EXTRACTION
First step in voice recognition

A DSP contains these key components: 1.Program Memory 2. Data Memory 3. Compute Engine 4. Input/Output

Two main processes in feature extraction 1. Recording 2. Playback

WORD RATE &SAMPLING RATE

WORD SIZE :
The number of bits used to represent a single audio wave affects the achievable noise level of a signal recorded

SAMPLING RATE:
The sampling frequency used is equal to 11025 Hz. The sample rate is even more important a consideration than the word size. If the sample rate is too low, the sampled signal cannot be reconstructed to the original sound signal.

TEMPLATE MATCHING
1. The user to speak a word or phrase into a microphone 2. The electrical signal from the microphone is digitized by an "analog-to-digital (A/D) converter", and is stored in memory. 3. The computer matches the input with a digitized voice sample, or template, that has a known meaning. 4. The program contains the input template to match appropriately. 5. the program is "trained" with a new user's voice input before that user's voice can be recognized by the program. 6. Limited to vocabulary that is stored.

FEATURE ANALYSIS
1. It is "speaker-independent" voice recognition. 2. processes the voice input using "Fourier transforms" or "linear predictive coding (LPC) 3. Find characteristic similarities between the expected inputs and the actual digitized voice input. 4. Need not be trained by each new user.

APPLICATIONS
Healthcare: Speech recognition can be implemented in front-end or back-end of the medical documentation process. Military: In High-performance fighter aircraft, Helicopters, Training air traffic controllers. Telephony and other domains Hands-free computing: Speech recognition computer user interface Home automations Interactive voice response

Mobile telephony , including mobile email Multimodal interaction Pronunciation evaluation in computer-aided language learning applications Robotics Speech-to-text reporter (transcription of speech into text, video captioning, Court reporting ) Telematics (e.g., vehicle Navigation Systems) Transcription (digital speech-to-text) Video games

You might also like