You are on page 1of 102

Sound Forge and the Intervoice System Audio File Guide

60001198Second Edition

Sound Forge and the Intervoice System Audio File Guide

Document No. 60001198 Second Edition

Copyright 2004 Intervoice, Inc. All rights reserved. Trademarks of Intervoice, Inc. and/or its affiliates:

Names and marks for products provided by the Companys suppliers are used herein for identification purposes and may be trademarks of their respective companies. Windows NT and Windows 2000 are registered trademarks of Microsoft. The Apache Software License, Version 1.1 Copyright 20002001 The Apache Software Foundation. All rights reserved. Java Runtime Environment Versions 1.2, 1.3, and 1.4 are products of Sun Microsystems, Inc. All rights reserved. Please refer to the license agreement on your system. Sound Forge is a registered trademark of Sonic Foundry. No part of this document may be reproduced without the written permission of Intervoice, Inc.
The information in this document explains how to create audio files for InterSoft version 3 systems with IQTalk or VoiceXML Browser. The runtime .wav files created here can also be used on MTC 2.x systems. This document includes information on Sound Forge 6.0 and 7.0, Sonic Foundry Batch Converter 5.0, and CRVoice2. If earlier software versions are used, refer to Using Sound Forge for InterVoice Systems (document number 60000259) for Sound Forge 4.0 or Sound Forge on InterSoft Systems Users Guide (document number 60000984) for Sound Forge 5.0. This document was previously released in December 2002 titled, Sound Forge and the Omvia Media Server Audio File Guide. This is the second edition of this document, November 2004.

Any comments or suggestions concerning this publication should be directed to: Technical Documentation Department Intervoice, Inc. 17811 Waterview Parkway Dallas, Texas 75252 Telephone: (972) 454-8000 Fax: (972) 454-8505 E-mail: document.feedback@intervoice.com For more information about Intervoice, Inc., call (972) 454-8000, or visit the Intervoice Web site at www.intervoice.com.

Documentation Comment Form


Our goal is to provide accurate and efficient information to our customers. Please complete this form to help us improve the quality and usability of this product documentation. List the document you used by name, number, and edition.

____________________________________________________________________ ____________________________________________________________________ Did the document provide the information needed?

Yes No

Explain______________________________________________________________ ____________________________________________________________________ ____________________________________________________________________ Were you able to locate the information in a timely manner?

Yes No

Explain______________________________________________________________ ____________________________________________________________________ ____________________________________________________________________ How accurate did you find the information in the document to be?

Very Accurate Accurate Somewhat Accurate Not Accurate


When you need to find information, where is the first place you usually look?

Table of Contents Index Headings Scan the text


Was the index thorough?

Yes No

How could it be improved?

____________________________________________________________________ ____________________________________________________________________ If you completed the steps in this document, how easy were they to follow?

Very Easy Easy Somewhat Easy Not Easy


Overall, how helpful did you find the document to be?

Very Helpful Helpful Somewhat Helpful Not Helpful


Are there any technical errors in the document you would like to report? ____________________________________________________________________ ____________________________________________________________________ List any additional topics you would like to see covered in the documentation. ____________________________________________________________________ ____________________________________________________________________ Fax to: (972) 454-8505 E-mail to: document.feedback@intervoice.com

Table of Contents

List of Figures List of Tables About This Document

v vii ix

Purpose and Audience ......................................................................... ix Chapters ................................................................................................ x Intervoice Documentation.................................................................... xi Documentation Formats................................................................. xi Adobe Acrobat .pdf Files......................................................... xi Microsoft HTML Help .chm Files........................................... xi Documentation Access .................................................................. xi Product CD-ROMs .................................................................. xi Web Site .................................................................................. xii Entire Documentation Set................................................. xii Site Preparation and Installation Document Set ............... xii Hard Copy Documents............................................................ xii Related Intervoice Documents........................................................... xiii InterSoft Version 3 ....................................................................... xiii Sound Forge ................................................................................. xiii Conventions ....................................................................................... xiv Information Symbols ................................................................... xiv Technical Support ............................................................................... xv Western Hemisphere ..................................................................... xv Eastern Hemisphere ...................................................................... xv
Overview 1-1

Summary of Changes......................................................................... 1-2 Voice Recording Concept Diagram ................................................... 1-3 Recommended Hardware................................................................... 1-4 Specific Hardware........................................................................ 1-4 Recommended Software .................................................................... 1-5

60001198, Second Edition

Table of Contents

Setting Up the Studio

2-1

Studio Setup Checklist....................................................................... 2-2 Installing the Hardware...................................................................... 2-3 Connecting the Sound Card to the Speakers................................ 2-4 Installing the Sonic Foundry Batch Converter .................................. 2-6 Installing CRVoice2 ........................................................................... 2-7 Installing Sound Forge....................................................................... 2-7 Making Quality Recordings............................................................... 2-8 Setting Microphone Levels .......................................................... 2-8 Placing the Microphone ............................................................... 2-9 Avoiding Clipping...................................................................... 2-10 Performing the Noise Floor Test...................................................... 2-12
Defining Audio File Formats 3-1

Intervoice Master Format................................................................... 3-2


Using Sound Forge to Record Messages 4-1

Setting Up Automatic Labeling ......................................................... 4-2 Setting Recording Attributes ............................................................. 4-5 Recording Process Checklist ............................................................. 4-6 Recording Methods............................................................................ 4-7 Continuous Recording ................................................................. 4-8 Multiple Takes (No Regions)................................................. 4-8 Message-by-Message Recording ................................................. 4-8 Multiple Takes Creating Regions........................................... 4-8 Create a New Window for Each Take.................................... 4-8 Recording Messages ..........................................................................4-9 Continuous Recording ................................................................. 4-9 Multiple Takes (No Regions)................................................. 4-9 Message-by-Message Recording ............................................... 4-12 Multiple Takes Creating Regions......................................... 4-12 Extracting Regions......................................................... 4-14 Create a New Window for Each Take.................................. 4-15 Reviewing Messages........................................................................4-17 Removing Intervoice Batch Scripts ................................................. 4-18
Postprocessing 5-1

Batch Converter Overview ................................................................ 5-2 Downloading the Batch Converter .............................................. 5-3 Using the Sample Batch Script .................................................... 5-3 Converting the .wav Files to Runtime Format Overview ............ 5-4 Using the Batch Converter................................................................. 5-6 Modifying the Sample Batch Script................................................... 5-8

ii

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Naming and Converting .wav to .ivc to m.### Files

6-1

Filename Formats (.ivc to m.###)...................................................... 6-2 Voice File Naming Convention.................................................... 6-3 Using the CRVoice2 Utility ............................................................... 6-4
Maintaining Audio Quality 7-1

Voice File Quality Issues ................................................................... 7-2


Frequently Asked Questions A-1

FAQs ................................................................................................. A-2 Recording Steps .................................................................... A-2 Defining DC Offset............................................................... A-2 Producing 64-KB Messages.................................................. A-3 Converting Messages............................................................ A-3 Using the Gain Leveling Preset ............................................ A-4 Installing the Audiophile 2496 Sound Card ......................... A-4 Turning Off AGC.................................................................. A-5 Normalizing the Message Volume ........................................ A-5 Filtering Messages ................................................................ A-6 Reducing Pops and Clicks in Messages................................ A-6
Glossary Index Glossary-1 Index-1

60001198, Second Edition

Table of Contents

iii

List of Figures

Figure 1-1. Figure 2-1. Figure 2-2. Figure 2-3. Figure 4-1. Figure 5-1. Figure 5-2. Figure 6-1. Figure 6-2.

Voice Recording Concept Diagram............................. 1-3 Voice Recording Hardware ......................................... 2-3 Voice Recording Without Clipping ........................... 2-10 Voice Recording with Clipping................................. 2-11 Recording Process....................................................... 4-7 .wav to .ivc Batch Conversion Process ....................... 5-2 .wav to runtime .wav Batch Conversion Process........ 5-2 Filename Formats (.ivc to m.###)............................... 6-2 Transforming .wav Files to .ivc Files.......................... 6-3

60001198, Second Edition

List of Figures

List of Tables

Table 2-1. Table 3-1. Table 4-1.

Studio Setup Checklist ................................................. 2-2 Format Differences....................................................... 3-3 Recording Process Checklist........................................ 4-6

60001198, Second Edition

List of Tables

vii

About This Document

Purpose and Audience


The Sound Forge and the Intervoice System Audio File Guide provides information on creating audio files using Sound Forge. Topics include studio set up, file formats and file naming conventions, post-processing using the Batch Converter, and troubleshooting. This document also provides detailed instructions on using the CRVoice2 utility to rename audio files to the Intervoice runtime format. Sound Forge can be used for recording and editing, but it is required for voice file postprocessing, noise floor testing, and complex editing. The audience for this guide includes voice talents and speech developers interested in an efficient method of postprocessing voice files, testing the noise floor, and editing complex voice recordings.

Sound Forge Versions and Screen Appearance


This document includes information for using Sound Forge version 6 or 7 for creating audio files for use on the Intervoice system. The appearance of the graphical user interface (GUI) screens pictured in this document may vary slightly from those on your system.

60001198, Second Edition

About This Document

ix

Chapters
This document contains the chapters briefly described below.
Chapter 1Overview

Provides the overview for this document. It contains a purpose statement for the entire document. It also contains a concept diagram and a list of required hardware and software.
Chapter 2Setting Up the Studio

Describes the elements necessary to properly set up the recording studio. It explains how to install the voice recording hardware and configure volume controls. It also discusses how to set recording attributes and the microphone level, and how to perform the noise floor test.
Chapter 3Defining Audio File Formats

Describes the voice file formats and explains the master format differences.
Chapter 4Using Sound Forge to Record Messages

Describes how to set up the automatic labeling functions and describes the voice recording process, the methods of recording voice messages, and the use of Sound Forge to record voice messages.
Chapter 5Postprocessing

Explains the steps to convert multiple voice files to the voice file format using the Sonic Foundry Batch Converter 5.0.
Chapter 6Naming and Converting .wav to .ivc to m.### Files

Provides the Intervoice InterSoft IQTalk voice file naming convention and explains how to use the CRVoice2 utility to rename .ivc files to m.###.
Chapter 7Maintaining Audio Quality

Explains the purpose of master voice files and stresses the importance of editing the files before compressing them. It also stresses the importance of retaining original master voice files.

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Intervoice Documentation
All Intervoice standard product documentation is available online in various locations. Documents are included on the product CD-ROMs, on the product GUI Help menu, as well as on the Intervoice Web site. The online Core Product Documentation Guide, document number 60000021, includes descriptions of all available documents.

Documentation Formats
Intervoice provides its standard product documentation in two online formats: Adobe Acrobat .pdf files and Microsoft HTML Help .chm files.
Adobe Acrobat .pdf Files

The Adobe Acrobat Reader with the Search tool plug-in enables viewing and full-text searches of .pdf documents. Use Adobe Acrobat Reader with Search version 4 or later. For more information, go to their Web site: http://www.adobe.com. Microsoft HTML Help is the help file standard for Windows. This file typically serves as the product online help as well as a standalone online document file. The compressed .chm file contains the entire document. The viewer includes a Table of Contents, Index, and Search. Use Internet Explorer version 4 or later. For more information, go to their Web site: http://msdn.microsoft.com.

Microsoft HTML Help .chm Files

Documentation Access
Intervoice provides its standard product documentation in these online locations:
Product CD-ROMs

Product CD-ROMs Web Site Hard Copy

Intervoice product CD-ROMs include the product documentation online files. Sometimes the product CD-ROMs are a mix of Intervoice products as well as other vendor partner product CD-ROMs. The most common locations to find online documentation are:
Documentation or Docs Help

directory on product CD-ROM

menu on product GUI window directory on the system hard drive

INTV\DOCS

60001198, Second Edition

About This Document

xi

Web Site

Entire Documentation Set

Intervoice provides an online source of its documentation through the company Web site. RealCare support customers, customers under warranty, channel partners, and distributors can visit the site to register for access to the documentation.
1 2 3 4

Go to http://www.intervoice.com/support/techinfo/ Select Technical Documents. Log in with your user ID and password. Choose an applicable document category or type in a keyword to search. To find a list of documents that have been uploaded to the Web recently, view the Web Upload Report in the Document Update category.

Site Preparation and Installation Document Set 1 2 3 Hard Copy Documents

Go to http://www.intervoice.com/support/techinfo/ Select Site Preparation Information. (no password required) View the list to determine which document applies to your system.

Because Intervoice primarily provides its product information as online documents on product CD-ROMs and the Web, only a limited number of documents are available as hard copy for shipments. These include product quick reference cards for hardware, system installation, as well as vendor partner products. Contact your Sales Representative for more information.

xii

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Related Intervoice Documents


The following documents contain useful information about your product. The documents are listed in functional order.

InterSoft Version 3
Loading InterSoft Software (InterSoft version 3) Doc Number 60001263

Describes how to install the latest InterSoft version 3 software. It also describes how to edit configuration files and apply system utilities.
InterSoft System Administration (InterSoft version 3) Doc Number 60001092

Explains how to prepare a system for the runtime production environment. It also provides commands and explanations of the basic system administration functions of an InterSoft version 3 system, including starting and stopping the runtime system, monitoring system status, creating and editing the various application tables, and working with voice messages.
InterSoft System Utilities (InterSoft version 3) Doc Number 60001094

Describes the system utilities used to prepare, operate, and monitor a system in runtime.
Troubleshooting InterSoft Systems (InterSoft version 3) Doc Number 60001078

Explains how to isolate problems on an InterSoft version 3 runtime system and troubleshoot once a problem has been isolated. It describes the various tools available for problem isolation, explains how to use each tool (or what document to reference for instructions), and explains how to analyze the results of the diagnostic tests.

Sound Forge
Sound Forge Manual

Provides overview and installation information. It describes how to use and optimize Sound Forge. See the product CD-ROM and Web site: http://mediasoftware.sonypictures.com/download/.

60001198, Second Edition

About This Document

xiii

Conventions
This document uses the following keyboard, command prompt, and menu conventions:
Example
<Enter> <Ctrl><Esc> [d:\appl]dir <Enter>

Instructs you to...


Press the Enter key. Hold the Control key while pressing the Escape key. Enter a command at a Windows command prompt.
Note These commands are not case-sensitive.

Edit > Paste > Save

Select Edit from the menu bar, choose Paste, and then select Save from its menu.

Information Symbols
Intervoice documents use the following symbols to help you quickly identify the type of information contained in a note box:
Symbol Name
Warning

Description
Warns you about actions that can create electrical hazards. Cautions you about actions that can cause data loss. Provides additional general information about the topic being discussed. Provides a useful tip about the topic being discussed. The tip is located in the margin of the page for easy reference. Refers you to a related document for further information on the product.

Caution

General

Tip

Document Reference

xiv

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Technical Support
Intervoice wants its customers to be satisfied with its products. Your service contract and/or support guidelines specify which regional support office to contact. If you need assistance, contact Intervoice technical support at the numbers listed below.

Western Hemisphere
For 24-hour support, please telephone or fax the following regional support office:
USA

(outside USA) (within USA)

Tel: +1 800 955-4688 or +1 972 454-8130 Tel: (800) 955-4688 or (972) 454-8130

Eastern Hemisphere
During normal office hours, please telephone or fax one of the following regional support offices:
England

(outside UK) (within UK)


Germany

Tel: +44 161 495 1234 Fax: +44 161 495 1007 Tel: 0161 495 1234 Fax: 0161 495 1007

(outside Germany) Tel: +49 611 184 440 Fax: +49 611 184 4444 (within Germany) Tel: 0611 184 440 Fax: 0611 184 4444
Singapore

(outside Singapore) Tel: +65 395 4300 (within Singapore) Tel: 0395 4300
United Arab Emirates

Fax: +65 395 4277 Fax: 0395 4277 Fax: +9714 332 5580 Fax: 04 332 5580

(outside UAE) (within UAE)

Tel: +9714 332 5480 Tel: 04 332 5480

If out of hours support is included in your contract, please telephone: (outside UK) Tel: +44 208 750 6387 (within UK) Tel: 0208 750 6387 Out of hours calls are directed to a support desk, where you can record your problem details. You will be contacted as soon as possible.

60001198, Second Edition

About This Document

xv

1
In this chapter...
M M

Overview
The Sound Forge and the Intervoice System Audio File Guide provides information on creating audio files using Sound Forge. Topics include studio set up, file formats and file naming conventions, post-processing using the Batch Converter, and troubleshooting. This document also provides detailed instructions on using the CRVoice2 utility to rename audio files to the Intervoice runtime format. Sound Forge can be used for recording and editing, but it is required for voice file postprocessing, noise floor testing, and complex editing. The audience for this guide includes voice talents and speech developers interested in an efficient method of postprocessing voice files, testing the noise floor, and editing complex voice recordings. This chapter provides the overview for this document. It contains a purpose statement for the entire document. It also contains a concept diagram and a list of required hardware and software. For more information on preparing voice files, refer to the Sound Forge documents and Intervoices InterSoft runtime documents. For more information on Sound Forge, visit theWeb site: http://mediasoftware.sonypictures.com/support/, or visit the Intervoice Web site at www.intervoice.com/support/techinfo/voicerecording.

Summary of Changes, 1-2 Voice Recording Concept Diagram, 1-3 Recommended Hardware, 1-4 Recommended Software, 1-5

60001198, Second Edition

1-1

Summary of Changes
The major revisions to this document are summarized below:
Changes

Updated the Web site URL for downloading Sound Forge and Batch Converter. Reorganized chapters into a more functional order. Replaced Omvia Media Server term with Intervoice runtime system (or more specifically with InterSoft IQTalk runtime system or VoiceXML Browser system).

Additions

Added information about Sound Forge version 7. Added information about recommended audio file formats for VoiceXML Browser systems.

1-2

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Voice Recording Concept Diagram

Voice Recording Concept Diagram


Figure 1-1 illustrates the process of creating voice recordings.
Computer

Voice Recording Software

Recording
Preamplifier

Speakers Microphone

Sound Card

Master .wav files

Processing

Sound Forge Batch Converter

Intervoice format .ivc files for InterSoft IQTalk

Runtime .wav files for VoiceXML

Renaming

CRVoice2 Utility

m.###

runtime files

NA for VoiceXML

Intervoice System

Figure 1-1. Voice Recording Concept Diagram

60001198, Second Edition

Overview

1-3

Recommended Hardware
The following hardware is recommended: Intel Pentium or compatible computer Sound card Preamplifier Amplified speakers with cables High-quality, unidirectional, low-impedance microphone XLR Cable (connects the microphone to the preamplifier) Y-adapter audio cable (connects the preamplifier to the sound card)

Specific Hardware
Intervoice recommends the following hardware for the best quality when recording: M-Audio Audiophile 2496 sound card Midiman Audio Buddy preamplifier Sennheiser E845 supercardioid vocal microphone
Note In noisy environments, a head-mounted,

noise-canceling electret microphone, such as the Shure WH20XLR, is recommended. Microphone base Horizon 10-ft. XLR microphone cable (connects the microphone to the preamplifier) 6-ft shielded audio cable (1/4-inch to RCA jack) (Radio Shack 42-2373A) 6-ft Y-adapter audio cable (1/8-inch to RCA (phono) or 3.5 mm stereo plug to two RCA plugs (connects the preamplifier to the sound card) 3.5 mm stereo jack-to-jack shield adapter Labtec speakers (AC powered)

1-4

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Recommended Software

Recommended Software
The following software is recommended: Sound Forge 6: Microsoft Windows 98, ME, or 2000 Intervoice CRVoice2 utility Sonic Foundry Batch Converter 5.0

Sound Forge 7: Microsoft Windows 2000 or XP Intervoice CRVoice2 utility Sonic Foundry Batch Converter 5.0

60001198, Second Edition

Overview

1-5

2
In this chapter...
M

Setting Up the Studio


This chapter describes the elements necessary to properly set up the recording studio. It explains how to install the voice recording hardware and configure volume controls. It also discusses how to set recording attributes and the microphone level, and how to perform the noise floor test.

Studio Setup Checklist, 2-2 Installing the Hardware, 2-3 Installing the Sonic Foundry Batch Converter, 2-6 Installing CRVoice2, 2-7 Installing Sound Forge, 2-7 Making Quality Recordings, 2-8 Performing the Noise Floor Test, 2-12

M M

60001198, Second Edition

2-1

Studio Setup Checklist


Use the checklist in Table 2-1 to ensure that you set up the studio properly.
Table 2-1. Studio Setup Checklist Studio Setup Checklist
Set up hardware.

Install the sound card, following the manufacturers instructions. Connect the preamplifier. Verify that the preamplifier is attached to a line-level input from the sound card. This may be labeled line or aux. Connect the microphone.

Note Using a penny, remove the gold plated screw in the microphone mount.

Then attach the microphone mount to the top of the microphone base. Connect the speakers. Turn on the computer. Refer to the OEM documentation for installation instructions. Turn on the preamplifier.

Install software. Sonic Foundry Batch Converter 5.0 CRVoice2 utility Sound Forge 6.0 or 7.0

Set up audio. On some sound cards, you must select correct audio input using the sound card control application. The recommended sound card, M-Audio Audiophile 2496, does not use an audio control application on the PC. In this configuration, the audio volume is controlled through the preamplifier. Set up audio levels. The input level should be set as loud as possible without clipping the .wav file. Perform noise floor tests.

2-2

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Installing the Hardware

Installing the Hardware


The following hardware is required for voice recording:

Sennheiser E845 Microphone

XLR Microphone Cable

Midiman Audio Buddy Preamplifier

Pre
Audio Cable 1/4-inch to RCA Jack Power Cord

Jack-to-Jack Shield Adapter

Y-adapter Audio Cable

Labtec Speakers AC Powered with Cables

M-Audio Audiophile 2496 Sound Card

Figure 2-1. Voice Recording Hardware

60001198, Second Edition

Setting Up the Studio

2-3

Connecting the Sound Card to the Speakers


To connect the speakers to the sound card, follow these steps:
1

You will need the Y-Adapter audio cable, the stereo jack-to-jack shield adapter, the speakers, and the sound card installed in your computer. Insert the speaker connector into one end of the stereo jack.

Insert the Y-cable into the other end of the stereo jack-to-jack adapter.

2-4

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Installing the Hardware

Insert the Y-cable connectors into the sound card connectors in your computer.
Input Jack to Preamplifier Audio Cable Y-Cable to Speakers Output Jack

Use the manufacturers instructions to install the other components.

60001198, Second Edition

Setting Up the Studio

2-5

Installing the Sonic Foundry Batch Converter


If Batch Converter 5.0 is not on the Sound Forge CD, Sound Forge 6.0 and 7.0 customers with a valid serial number can download Batch Converter 5.0 from this Web site URL:
http://mediasoftware.sonypictures.com/download/step2.asp?DID=339& #Return/. It is important that you register your copy of Sound Forge and

the Batch Converter to obtain technical support.


Note Sound Forge 7.0 users should contact Intervoice

Customer Support if the serial number supplied with their copy of Sound Forge 7.0 fails to enable the Batch Converter 5.0. Periodic updates are available to Sound Forge and the Batch Converter. Intervoice recommends checking the Web site first for the latest updates to the Batch Converter. For more Sound Forge information, contact technical support at http://mediasoftware.sonypictures.com/support. For Intervoice 24-hour support, contact the following support office: (outside USA) Tel: +1 800 955-4688 or +1 972 454-8130 (within USA) Tel: (800) 955-4688 or (972) 454-8130 To install the Batch Converter:
1

Run the Batch Converter installation file. For example, if it was downloaded to your desktop, double-click it. Or, if it is on a CDROM, go to the directory and start the .exe file. Follow the installation screens. After installing the Batch Converter, download the Intervoice Example Batch Script file at: www.intervoice.com/support/techinfo/voicerecording. Save the IVBExampleBatchScriptV6.zip file to the Batch Converter directory. Unzip the batch script, IVBExampleBatchScriptV6.bcs. Refer to the topic Using the Sample Batch Script on page 5-3 for more information.

2 3

5 6

2-6

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Installing CRVoice2

Installing CRVoice2
Numerical file extensions are not recognized correctly by Windows, so Intervoice created a proprietary extension, .ivc, to help identify Intervoice files on Windows systems. However, when .ivc files are transferred to an Intervoice InterSoft IQTalk system, the .ivc extensions (for example BankABC502.ivc) must be replaced with numeric extensions such as m.502. The CRVoice2 utility renames the .ivc files to m.### files. To install CRVoice2:
1

Download the file from the Intervoice Web site at:


www.intervoice.com/support/techinfo/voicerecording.

Save the file to the directory where you will be converting your .wav master files to .ivc files for InterSoft IQTalk systems.

Refer to Using the CRVoice2 Utility on page 6-4 for more information.

Installing Sound Forge


To install Sound Forge:
1

Ensure that the Batch Converter, sample Intervoice script, and CRVoice2 utility have already been installed. Insert the Sound Forge CD-ROM in the CD-ROM drive. The installation program should autostart. Follow the installation instruction screens. Go to the Web site and download the latest update to Sound Forge:
http://mediasoftware.sonypictures.com/download/step2.asp?DID=4 65. (This update fixes an .ivc file issue.)

3 4

Run the downloaded file (such as soundforge70b.exe) and follow the onscreen installation instructions. Refer to the Making Quality Recordings on page 2-8 and the other topics in this document for information about using Sound Forge.

60001198, Second Edition

Setting Up the Studio

2-7

Making Quality Recordings


To ensure that recordings are of the highest quality, set the microphone levels correctly, place the microphone properly, and avoid clipping. As a general rule, microphone levels should be as loud as possible without causing any clipping. You can adjust final volume levels in message files after they have all been recorded. Use the volume or the dynamic graphic preset. The best signals in the noise ratio are achieved when a loud, unclipped recording is made.

Setting Microphone Levels


To set microphones levels correctly, follow these steps:
1 2

Set the recording attributes using File > New as 11,025, 16 bit, Mono. (Refer to Setting Recording Attributes on page 4-5). In the main Sound Forge window, select the Record button the left of the button bar. The Record window appears. on

Set the microphone volume to the highest level at which you can speak at a normal volume without clipping. This produces the best signal-to-noise ratio.

In Sound Forge 6, check the Monitor checkbox beneath the Reset button to turn on the record meters.

2-8

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Making Quality Recordings

Speak into the microphone in your loudest normal voice. Adjust the microphone volume on the preamplifier so that clipping does not occur even when you speak at your loudest volume. (Clipping occurs when the level meters are pushed into the red zone and the word Clip appears.) If clipping occurs, click the Reset button. Remember, if clipping occurs during the recording process, the recording is ruined. On the preamplifier, lower the microphone level. Repeat step 4 through step 6 until clipping no longer occurs. The microphone level is then set correctly.

6 7

Placing the Microphone


Intervoice recommends a supercardioid vocal microphone (such as the Sennheiser E845) to help eliminate background noise during recording. When using a desk-mounted microphone, ensure that the distance between the voice talents mouth and the microphone is fairly constant for each recording. Most commercial microphones do not have an output signal level that is high enough to drive the sound card. Therefore, they require a microphone preamplifier. External microphone preamplifiers improve signal-to-noise ratios by keeping high-gain amplification stages out of the PCs electrical noise fields. For more information about the Sennheiser E845 microphone, go to www.allprosound.com or www.fullcompass.com.

60001198, Second Edition

Setting Up the Studio

2-9

Avoiding Clipping
Sound Forge clips a recording when it exceeds the maximum storable amplitude or when any value beyond the maximum is cut off, resulting in audible distortion. Clipping destroys a recordings quality; therefore, clipped recordings must be rerecorded. Clipping occurs when the level meters are pushed into the red zone (see Figure 2-3 on page 2-11). Figure 2-2 illustrates a voice recording without clipping.
No clipping

No clipping

Figure 2-2. Voice Recording Without Clipping

To prevent clipping, set the recording input level just low enough to enable the voice talent to speak at normal volume without pushing the level meters into the red zone.

2-10

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Making Quality Recordings

The flat tops in Figure 2-3 illustrate clipping.


Clipping

Clipping

Figure 2-3. Voice Recording with Clipping

60001198, Second Edition

Setting Up the Studio

2-11

Performing the Noise Floor Test


Always check the background noise and a noise floor of the recording studio and equipment before starting a recording session. Recordings made with high background noise will not provide a clear voice sound during playback. Sound Forge must be installed to perform this test. A high noise floor is a common cause of poor audio quality and clicks in compressed files. (See Using Sound Forge to Record Messages, Chapter 4 for more information.) To check for noise using Sound Forge tools, follow these steps:
1

Ensure that the microphone levels on the preamplifier are set correctly for a recording session, that is, as loud as possible without causing clipping (see Setting Microphone Levels on page 2-8). Verify that the recording attributes are 11,025 Hz (or 11.025 KHz), 16 bit, Mono. Record five seconds of silence, at the standard volume level. Do not speak while recording. The recording studio should be as quiet as possible during this test. Stop recording. Close the Record dialog box. With a sound file window open, go to the main Sound Forge window and select Process > DC Offset. The DC Offset <filename> window appears.

In Sound Forge 6, set the Preset field to Intervoice DC Offset Removal. In Sound Forge 7, set the Preset field to Remove DC offset....

2-12

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Performing the Noise Floor Test

Select the Automatically detect and remove option to remove the DC offset, and click OK. To view statistics, select Tools > Statistics. The Statistics window appears.

Verify that the RMS power in the statistics dialog box is below 60 dB (for example, 65 dB). Verify that the Average value (DC offset) statistic is 0.

Perform this test regularly on systems used to record voice messages. The voice message fails the test if the noise floor is louder than 60 dB. Louder noise is represented by lower negative number. For example, a 55dB noise floor is louder than 60 dB. The test may fail because the room noise is too loud (due to air conditioning, PC fan, paging system), or the sound card preamplifier is noisy. Correct any problems before continuing to record. Noise floors greater than 60 dB (for example, 55 or 50 dB) can be heard in the playback process and cause pops and other audible degradation during the compression process.

60001198, Second Edition

Setting Up the Studio

2-13

3
In this chapter...
M

Intervoice Master Format, 3-2

Defining Audio File Formats


This chapter describes the voice file formats and explains the master format differences. The voice files are recorded as a standard Intervoice master .wav files, then depending on the runtime system, the files are converted to runtime files: .wav or .ivc formats.

60001198, Second Edition

3-1

Intervoice Master Format


Intervoice uses the .wav format for recording and saving all voice files. A .wav file is a standard format in which messages are recorded on a computer. Almost any audio recording tool designed for computers can be used to create the .wav file. The Intervoice standard master format uses .wav files because this format supports high-quality recordings, and the equipment required to record and play .wav files is commonly available. The Intervoice master .wav format is 11,025 samples per second, 16 bits per sample, linear encoding, which equates to a data rate of 22,000 bytes per second. The Intervoice master format keeps the master files in a much higher quality format than the older compressed -law masters. The Intervoice format makes the file sizes somewhat larger, but the voice quality of subsequent copies of this master is much better.

Intervoice Standard Master Format


The Intervoice standard format for recording master voice files is a 11.025 KHz (or 11,025 Hz), 16-bit, monophonic, linear .wav file. This format is the master format for storage of Intervoice voice files.

Although higher sampling rates are available, Intervoice uses the 11.025 KHz (or 11,025 Hz) sample rate for master files because any advantage to using a higher sample rate is lost through bandwidth limitations of phone line transmission. The higher the sample rate, the wider the audio bandwidth, and the larger the file size. Regardless of the master format used, the InterSoft IQTalk runtime .ivc format is compressed using ADPCM techniques to 8,000 samples per second and 4 bits per sample. This is also referred to as 8KHz ADPCM or 32KBps (8 KHz x 4 bits). The VoiceXML runtime .wav format is 64KBps (8 KHz x 8bits). KBps is kilobits per second.

3-2

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Intervoice Master Format

Table 3-1 charts the differences between the InterSoft master format and the standard runtime format.
Table 3-1. Format Differences Format Name
Windows Master Format InterSoft IQTalk Runtime Format VoiceXML Runtime Format

Sample Rate
11,025 smp/sec 8,000 smp/sec

Sample Size (Mono)


16 bits/sample

Data Format
Linear .wav

Size of 10 sec Message


220 KB

4 bits/sample

ADPCM (.ivc) G.721

40 KB

8,000 smp/sec

8 bits/sample

A-law or -law PCM


.wav

80 KB

After messages are recorded, they must be compressed from master .wav files into ADPCM (.ivc) or A-law or -law PCM .wav files. Sound Forge provides a Batch Converter that enables you to easily modify, edit, and convert large groups of messages. See Batch Converter Overview on page 5-2 for more information.

60001198, Second Edition

Defining Audio File Formats

3-3

4
In this chapter...
M

Setting Up Automatic Labeling, 4-2 Setting Recording Attributes, 4-5 Recording Process Checklist, 4-6 Recording Methods, 4-7 Recording Messages, 4-9 Reviewing Messages, 4-17 Removing Intervoice Batch Scripts, 4-18

Using Sound Forge to Record Messages


This chapter explains how to set up the automatic labeling functions and describes the voice recording process, the methods of recording voice messages, and the use of Sound Forge to record voice messages.

M M M

Sample Screens
This document contains representations of application screens. These screens are used as examples only and may not appear exactly as those on your system.

60001198, Second Edition

4-1

Setting Up Automatic Labeling


Windows 2000 allows long filenames. Assigning long filenames (BANKABC.1stGreeting.1048, for example) to message files may help keep track of messages and associate them with the correct applications. Long filenames make it easier to identify message files to be renamed for use in runtime. (See Filename Formats (.ivc to m.###) on page 6-2 and .) The automatic labeling function provided by Sound Forge can be used to assign descriptive filenames to master voice files. It is also used to automatically name and number regions or files during the recording process. Intervoice recommends that you set up the automatic labeling function before recording messages.

Automatic versus Manual Labeling


If you do not set up the automatic labeling function before recording messages, you must manually number the regions or takes after recording.

To set up Sound Forges automatic labeling function before recording your messages, follow these steps:
1 2

Open Sound Forge. From the main Sound Forge window, open the Record window by either clicking the Record button Transport > Record. or selecting Special >

4-2

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Setting Up Automatic Labeling

The Record window appears.

3 4

Right-click the Record windows title bar. Select Automatic Labeling from the menu that appears. The Automatic Labeling window opens.

60001198, Second Edition

Using Sound Forge to Record Messages

4-3

If you are recording in Create a new window for each take mode, type the prefix to be assigned to all files in the New window title prefix field in the Sound windows area. Modify the fields immediately following it, as explained in step 6 and step 7. If you are recording in Multiple takes creating regions mode, type the prefix to be assigned to all files in the New region prefix field in the Regions area. Modify the fields below it, as explained in step 6 and step 7.

Select the Use counter and start at (0 to 65,000) checkbox, and change the number in the field to the number at which the numbers should begin (500, for example). For information on Intervoice voice file naming conventions, see page 6-3. Make sure to select the Insert leading zeros in field width of (2 to 24) checkbox, and set leading zeros to 3. Click OK. Each region or window is numbered starting at the number entered in the Use counter and start at (0 to 65,000) field in step 5.

Do not include any numeric characteristics at the end of your filenames unless you want them to be included in the final message number. The names assigned to the regions or files are converted to message names and numbers in the runtime system.

4-4

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Setting Recording Attributes

Setting Recording Attributes


The recording attributes control how Sound Forge stores the master files. Sound Forge remembers these settings for new recordings, so you set attributes only once. To set recording attributes, follow these steps:
1 2

Open Sound Forge. From the main Sound Forge window, select File > New. The New Window dialog box appears:

Enter the appropriate recording attributes. The Intervoice standard settings are: Sample rate: 11,025 Hz Sample size: 16-bit Channels: Mono Click OK. These settings are stored as the default. Therefore, you should create them the first time you record.

60001198, Second Edition

Using Sound Forge to Record Messages

4-5

Recording Process Checklist


Use the checklist in Table 4-1 to ensure that you complete all the steps in the recording process.
Table 4-1. Recording Process Checklist Recording Process Checklist

Complete the noise floor test. Record the master file using the voice recording software. Use the Batch Converter to postprocess the master file (remove pause, adjust gain, and so on) and compress the file into applicable runtime format. Run the CRVoice2 utility (for the InterSoft IQTalk runtime system only). Move the files to the Intervoice system.

4-6

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Recording Methods

Recording Methods
Two major methods of recording are used: Continuous (recording continuously while reading a script) Message-by-message (stopping the recording after each message)

In Sound Forge, the continuous recording mode is called Multiple takes (no Regions). The two modes of message-by-message recording are called Multiple takes creating Regions and Create a new window for each take. Regardless of which recording method you use, you must separate all messages into individual files during the final process of message creation. Figure 4-1 illustrates the recording process and recording methods.
Recording Files
Manually mark regions Extract regions

Postprocessing

Continuous
(Record all messages in one continuous session)

1. Review masters. 2. Archive masters. 3. Complete batch post processing. 4. Use CRVoice2 utility. (InterSoft IQTalk systems only) 5. Place files in runtime.

Message by Message

Regions

Rename regions*
(if required)

Extract regions

Files
(Record each message separately, stopping after each)

Rename files*
(if required)

Extract regions

Figure 4-1. Recording Process

Refer to the Sound Forge document for detailed information about recording modes. * Renaming is required if the automatic naming or numbering function was not set up or if the messages are recorded out of sequence.

60001198, Second Edition

Using Sound Forge to Record Messages

4-7

Continuous Recording
The continuous recording method is appropriate when voice talents prefer to read continuously from a script or prefer not to operate the computer while recording. The voice talent records a group of all the messages at once. Then a subsequent editing process marks the individual recordings using region markers.
Multiple Takes (No Regions)

The Multiple takes (no Regions) mode in Sound Forge allows multiple takes to be recorded in one file, but avoids defining any regions in the Regions List. Recording starts at the position shown in the Start field when the Record button is selected and continues until the Stop button is selected. Any data that is currently after the position in the Start field is replaced. When recording is stopped, the start position remains at the end of the take allowing the next take to be recorded immediately. After all messages are recorded, the editor must manually place the regions.

Message-by-Message Recording
The message-by-message method means that the user stops recording after each message, creating multiple regions or new files for each take.
Multiple Takes Creating Regions

The Multiple takes creating Regions mode allows multiple takes to be recorded with each take automatically defining a region in the Regions List. Recording starts at the position shown in the Start field when the Record button is selected and continues until the Stop button is selected. Any data that is currently after the position in the Start field is replaced. When recording is stopped, the start position remains at the end of the take allowing the next take to be recorded immediately. When the Record button is selected, the automatic naming function increments the region name. The Create a new window for each take mode is similar to the Multiple takes creating Regions mode. However, instead of creating new regions with each take, the system creates a new file for each take. This proves helpful in recording data where each take is intended to be saved into its own file. This mode is useful if the messages in a script are not in sequence (such as m501, m504, m503, m502).

Create a New Window for Each Take

4-8

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Recording Messages

Recording Messages
The recording procedures described here are divided into continuous and message-by-message recording.

Continuous Recording
The continuous recording method is appropriate when voice talents prefer to read continuously from a script or prefer not to operate the computer while recording. The voice talent records a group of all the messages at once. Then a subsequent editing process marks the individual recordings using region markers.
Multiple Takes (No Regions)

The Multiple takes (no Regions) mode in Sound Forge allows multiple takes to be recorded in one file, but no regions are defined in the Regions List. The editor must manually place the regions. To record messages in the Multiple takes (no Regions) manner, follow these steps:
1 2

Open Sound Forge. From the button bar, select the Record The Record <Sound x> window appears. button.

3 4

Select Multiple takes (no Regions) in the Mode field. In the Mode field, select the recording mode.
Note Make sure that DC adjust, in the lower right of the

Record window, is NOT checked.


5

In Sound Forge 6, select the Monitor checkbox to display the level of the recording input on the meters. To prevent clipping, use the Monitor function to set the appropriate recording volume. When a recording is clipped, the word Clip appears above each meter. Clipping destroys a recordings quality.

60001198, Second Edition

Using Sound Forge to Record Messages

4-9

Set the recording volume using either the sound cards or the operating systems volume control application. The volume should be set at the highest level at which you can speak at normal volume without clipping. Select the Record button in the Record <Sound x> window.

Always record as loudly as possible without clipping. The volume of the master files can be adjusted after recording is complete.

7 8 9

Record successive messages without stopping between them. Click the Stop button.

The messages appear as one long interrupted set of wave formations.

10 Highlight the area of wave that you want to include in your first

region.

11 From the grey bar directly above the waves, right click and select
Insert Marker/Region.

4-10

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Recording Messages

The Insert Marker/Region window appears.

12 Enter a name for the region in the Name field. (Alternately, you can

set up automatic labeling for your windows if you want to give the same prefix to all of them. See Setting Up Automatic Labeling on page 4-2.)
13 Select the Region button. 14 Click OK. 15 Repeat step 10 through step 14 on page 4-11 for each region you

wish to create. Your waveforms are now divided by regions.

60001198, Second Edition

Using Sound Forge to Record Messages

4-11

Message-by-Message Recording
The message-by-message method means that the user stops recording after each message, creating multiple regions or new files for each take.
Multiple Takes Creating Regions

The Multiple takes creating Regions mode allows multiple takes to be recorded with each take automatically defining a region in the Regions List. If you have the Batch Converter plug-in, Multiple takes creating Regions is the easiest recording mode to use. This mode places all takes in a single window, marking the beginning and end of each new take with a region marker and a vertical black line. Later, using the Extract Regions tool (available only with the Batch Converter plug-in), automatically saves each region as a separate file. When recording large numbers of messages (over 50), you may want to divide the regions into several different files for easier manipulation. To record messages in the Multiple takes creating Regions manner, follow these steps:
1 2

Open Sound Forge. From the button bar, select the Record The Record <Sound x> window appears. button.

Select Multiple takes creating Regions in the Mode field.


Note Make sure that DC adjust, in the lower right of the

Record window, is NOT checked.


4

In Sound Forge 6, select the Monitor checkbox to display the level of the recording input on the meters. To prevent clipping, use the Monitor function to set the appropriate recording volume. When a recording is clipped, the word Clip appears above each meter. Clipping destroys a recordings quality.

4-12

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Recording Messages

Checking for Clipping


After recording each region or take, check the input level meters. If the Clip indicator appears above them, you must rerecord the region or take.

5 6 7

Select the Record

button in the Record <Sound x> window. button after it.

Record a first message, and click the Stop

Continue creating messages and clicking Stop between each. Your messages are automatically labeled for you.

60001198, Second Edition

Using Sound Forge to Record Messages

4-13

Extracting Regions

If you have a file composed of multiple regions, you must extract the regions to use them in a runtime system. After recording is complete, each message is marked with its own region. They can be subsequently separated into different files through the Batch Converter plug-in in Sound Forge. To extract regions in the Multiple takes creating Regions manner, follow these steps:
1

In Sound Forge, complete recording in the Multiple takes creating Regions manner. From the menu bar, select Tools > Extract Regions. The Extract Regions window appears.

3 4

Select the regions to extract in the Regions to extract field. Click Extract.

4-14

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Recording Messages

Create a New Window for Each Take

The Create a new window for each take mode is similar to the Multiple takes creating Regions mode. However, instead of creating new regions with each take, the system creates a new file for each take. This proves helpful in recording data where each take is intended to be saved into its own file. This mode is useful if the messages in a script are not in sequence (such as m501, m504, m503, m502). If you do not have the Batch Converter plug-in, use New window for each take mode. This mode places each take in its own data window. To view all your takes, choose Window > Cascade > Tile Horizontally, or Window > Cascade > Tile Vertically. To record messages in the Create a new window for each take manner, follow these steps:
1 2

Open Sound Forge. From the button bar, select the Record The Record <Sound x> window appears. button.

3 4

Select Create a new window for each take in the Mode field. In Sound Forge 6, select the Monitor checkbox to display the level of the recording input on the meters. To prevent clipping, use the Monitor function to set the appropriate recording volume. When a recording is clipped, the word Clip appears above each meter. Clipping destroys a recordings quality.

Checking for Clipping


After recording each region or take, check the input level meters. If the Clip indicator appears above them, you must rerecord the region or take.

Set the recording volume using either the sound cards or the operating systems volume control application. The volume should be set at the highest level at which you can speak at normal volume without clipping. Click the Prepare button.

60001198, Second Edition

Using Sound Forge to Record Messages

4-15

This function allows time for you to create and rename the new window before Sound Forge begins recording. If you do not use the Prepare button, the beginning of each message may be cut off. It is not necessary to use this function if you are using the Multiple takes creating Regions mode. After you click Prepare, a flashing green box that reads Prepare appears to the left of the Sync button.
7 8 9

Select the Record

button in the Record <Sound x> window. button after it.

Record a first message, and click the Stop

Continue creating messages and clicking Stop between each. Each message appears with its own window, as shown below:

4-16

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Reviewing Messages

Reviewing Messages
Review all messages after recording and check for noises, clicks, pops, and clipping. After a message is recorded, the data window for that recording appears in the Sound Forge main window. To review your messages, follow these steps:
1

To listen to your messages, click the Play button in the data window. (Or, place the cursor at the beginning of the message to be played, then press the spacebar. Press the spacebar again to stop.) The data windows buttons are labeled below:

Play button

Stop button

Go to Start button

Go to End button

For instructions on manually adjusting region markers, refer to the Sound Forge 6.0 Document.

If you recorded in Multiple takes creating Regions mode, you can manually adjust the region markers to remove leading and trailing silences. Or, use the Auto Trim/Crop process to trim the regions after they have been extracted. After completing these steps, if you recorded in Multiple takes creating Regions mode or if you recorded in Create a new window for each take mode, proceed to Batch Converter Overview on page 5-2.

60001198, Second Edition

Using Sound Forge to Record Messages

4-17

Removing Intervoice Batch Scripts


Before installing the new sample batch scripts, remove the old batch scripts. To remove the old sample batch scripts, follow these steps:
1 2

Open Sound Forge. From the main Sound Forge window, select Tools > Batch Converter. The Batch Converter window opens, and the list of batch scripts appears.

4 5 6

Select a script and right-click. Click Remove to delete the batch script. Repeat step 4 and step 5 to remove old batch scripts.

4-18

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

5
In this chapter...
M

Postprocessing
This chapter provides the steps to convert multiple voice files to the voice file format using the Sonic Foundry Batch Converter 5.0. The voice talents responsibility is to provide individual master voice files recorded with high quality. The voice talent may or may not have the expertise to postprocess audio files and troubleshoot problems. Either the voice talent or the application engineer must postprocess the master .wav files to compress, enhance, and normalize volume levels to complete the message creation process.

Batch Converter Overview, 5-2 Using the Batch Converter, 5-6 Modifying the Sample Batch Script, 5-8

60001198, Second Edition

5-1

Batch Converter Overview


To prepare files for the InterSoft IQTalk runtime system, the Sonic Foundry Batch Converter 5.0 takes the master .wav format and converts it to the .ivc file ADPCM format.
Master Format 11 KHz, 16-bit Mono .wav Sonic Foundry Batch Converter InterSoft IQTalk Format 8 KHz, 4-bit ADPCM .ivc

Figure 5-1. .wav to .ivc Batch Conversion Process

To prepare files for the VoiceXML Browser runtime system, the Sonic Foundry Batch Converter 5.0 takes the master .wav format and converts it to the runtime .wav file format.
Master Format 11 KHz, 16-bit Mono .wav Sonic Foundry Batch Converter VoiceXML Format 8 KHz, 8-bit A-law/-law .wav

Figure 5-2. .wav to runtime .wav Batch Conversion Process

The batch script, Intervoice Example Batch Script, automates most of the postprocessing steps for voice files. The sample script shows how most of the steps can be automated in a single batch process using the Batch Converter. (The Intervoice Example Batch Script is for demonstration purposes. You must modify the preset parameters to suit the specific circumstance.)

5-2

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Batch Converter Overview

Downloading the Batch Converter


Sony Pictures Digital Media offers periodic updates to the Batch Converter software. Intervoice recommends that you check their Web site. You can download the Batch Converter from Sony Pictures Digital Media at:
http://mediasoftware.sonypictures.com/download/step2.asp?DID=339& #Return/

Be sure to register your copy of the Batch Converter to obtain technical support. See Installing the Sonic Foundry Batch Converter on page 2-6.
Note Sound Forge 7.0 users should contact Intervoice

Customer Support if the serial number supplied with their copy of Sound Forge 7.0 fails to enable the Batch Converter 5.0.

Using the Sample Batch Script


The batch script described below is available on the Intervoice Web site at www.intervoice.com/support/techinfo/voicerecording. To use the sample batch script in the Batch Converter 5.0, follow these steps:
1 2 3

Open the Batch Converter. Select File > Open Script. The Open dialog box appears. Navigate to the Intervoice Example Batch Script file, and click Open to install the batch script.

The Intervoice Example Batch Script creates 32 KBps compressed messages for -law playback and 12 dB volume normalization. You may obtain other options, such as -law playback, different volumes, and 32 KB compression, by editing the script in the Sonic Foundry Batch Converter. The Intervoice Example Batch Script is for demonstration purposes. You must modify the preset parameters to suit the specific circumstance.

60001198, Second Edition

Postprocessing

5-3

Converting the .wav Files to Runtime Format Overview


A series of transformational steps is required to convert a .wav file into the runtime file format. All of these steps can be automated into a single batch process in the Batch Converter. The Batch Converter can apply this process to a large number of master .wav files in a single step, creating a set of runtime files, one for each master .wav file. To convert a master .wav file into a runtime file, the Batch Converter must perform a specific set of processes in a specific order on the original .wav master file. In version 5.0, Sonic Foundry calls these steps plug-ins (presets in Sound Forge 4.5). The final script is called a plugin chain. The basic recording process provides a clean audio signal digitized to the systems hard drive in .wav file format. However, there are several other processing steps that must be applied to an audio file after it is recorded, but before it can be placed on an Intervoice system. The Intervoice sample Batch Converter postprocessing procedures are described below:
DC Offset removal

Removes any DC offset before messages are compressed. Most sound cards have some DC offset that can affect the compression process.
Front and Back silence trimming with 20-ms fades

Removes excess silence on the front and back of messages to tighten the message delivery. Generally the messages are trimmed to 20 milliseconds or less of silence on both ends. Noise at the beginning or end of messages (such as mouse or keyboard clicks) causes the auto-trimming process to trim too early or too late, leaving extra silence in the message. These messages must be hand-trimmed.
Front and Back silence buffering with 50-ms buffers

Prevents the destabilization of the ADPCM decoder when two messages are played back to back. Once the arbitrary silences are trimmed from the message, this step places a standard 50-millisecond silence buffer on the beginning and end of every message.
Message level normalization

Converts all the messages in a single application to a single standard volume level, so that listeners hear every message at the same volume.
8 KHz resampling

Converts the 11.025 KHz sample rate of the master file to an 8 KHz sampling rate.

5-4

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Batch Converter Overview

Equalization

Performs the frequency manipulation required to make high-quality message playback over telephone lines.
ADPCM compression for A-law playback and Intervoice formatting

Compresses the linear format to ADPCM to reduce the size of the message files on the Intervoice system. This step is where you specify whether a message is to be played as -law or A-law, or to be compressed to 32 KB.

60001198, Second Edition

Postprocessing

5-5

Using the Batch Converter


To use the Batch Converter, follow these steps:
1

From the main Sonic Foundry Batch Converter window, select File > Open Script. Navigate to and open the Intervoice Example Batch Script, IVBExampleBatchScriptV6.bcs. The Sonic Foundry Batch Converter IVBExampleBatchScriptV6.bcs window appears in the Script tab view.

3 4 5

Do not select any of the options on the window. Select the directory where you want the output files saved. In the File format area, set the Format and Template.
Note If you change any selection, save the changes as

a new batch script and retain the original sample batch script.

5-6

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Using the Batch Converter

To choose the files that are to be converted, go to the main Sonic Foundry Batch Converter window and select File > Add Media, or click Add Media on the toolbar. In the Add Media dialog box, select all the .wav files to be converted and click Open to continue The Files to Convert window tab view appears.

Backup Master Files Before Converting


Be sure to make a backup copy of all master .wav files and store them in a separate directory away from any editing or converting activity. Do this to prevent losing master files by accidently converting the master files into the compressed .ivc or .wav format. The compressed files cannot be converted back to masters, due to quality loss during conversion.

Select Convert Files. The status changes to complete.

60001198, Second Edition

Postprocessing

5-7

Modifying the Sample Batch Script


The Intervoice Example Batch Script file provides the Batch Converter with an example of the appropriate presets to be used during the conversion of master voice files into ADPCM (.ivc) or A-law or -law PCM .wav files. The batch script described below is available on the Intervoice Web site at www.intervoice.com/support/techinfo/voicerecording. This topic explains how to modify the sample batch script for different requirements. The basic script is designed to resample and compress .wav files into the runtime format. This sample script also includes settings to trim and normalize volume. The settings that can be modified are message level normalization and equalization. See page 5-9 for the settings that cannot be modified. To modify the batch script, follow these steps:
1 2

This is a general batch script explanation only. You need to change formats and volume levels for real runtime systems. For more information on batch scripts, refer to the Sound Forge document.

Open the Batch Converter. From the main Sonic Foundry Batch Converter window, select File > Open Script. Double-click the settings you want to change.
Note If you change any selection, save the changes as

a new batch script and retain the original sample batch script.
4

Rename and save the changed batch script.

5-8

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Modifying the Sample Batch Script

Batch Script Settings That Cannot Be Modified


Some batch script settings should not be modified, as they control critical parameters in the conversion process. Other batch script parameters can be modified to adjust the runtime messages for various use. The settings that should not be modified are: DC offset removal Front and back silence trimming Front and back silence buffering 8 KHz resampling ADPCM compress for A-law playback Intervoice formatting

60001198, Second Edition

Postprocessing

5-9

6
In this chapter...
M

Filename Formats (.ivc to m.###), 6-2 Using the CRVoice2 Utility, 6-4

Naming and Converting .wav to .ivc to m.### Files


This chapter describes the Intervoice InterSoft IQTalk voice file naming convention and explains how to use the CRVoice2 utility to rename .ivc files to m.###. Most audio recording applications accept message files with long filenames such as BankABCgreeting503.wav. Messages can be given long filenames for easy identification. The Intervoice InterSoft IQTalk system only accepts messages in a specific format. The format is m.###. All InterSoft runtime messages must start with m, and end with three numeric digits. Message numbers larger than 999 are represented by placing more digits to the left of the suffix and to the right of the m. For example, message number 2046 is renamed to m2.048 and message number 48, 423 would be m48.423. The CRVoice2 utility can be used to quickly convert long filenames to the InterSoft runtime standard of m.###.

60001198, Second Edition

6-1

Filename Formats (.ivc to m.###)


Intervoice master files are .wav files. They have filenames of naming convention xxxxx.wav, where xxxxx can be alphabetic, numeric, alphanumeric, or other characters except special characters (for example, slashes and colons). The filenames have a limit of 253 characters, plus 3 character extension. Intervoice InterSoft IQTalk compressed files have two different naming conventions: xxxxx.ivc and m.###. Microsoft Windows programs identify the .ivc extension as the Intervoice file extension. The m.### naming convention is used to identify Intervoice messages placed on the Intervoice runtime system. However, Windows does not correctly identify files named m.###, so these files are not used on the Windows development or recording systems. The m.### files are used only on the Intervoice system. Figure 6-1 shows examples of accepted filename formats, including .wav, .ivc, m.###, and m#.### formats.
Original Master
BankABC502.wav

Original Master
Firstmenu2046.wav

Compress to Intervoice Format


BankABC502.ivc

Compress to Intervoice Format


Firstmenu2046.ivc

Rename for Runtime (CRVoice2 Utility)


m.502

Rename for Runtime (3-character Extension)


m2.046

Figure 6-1. Filename Formats (.ivc to m.###)

6-2

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Filename Formats (.ivc to m.###)

Voice File Naming Convention


Before voice files can be used on an Intervoice system, the files must conform to Intervoices voice file naming conventions. InterSoft IQTalk runtime voice files must be named in the following format: m.###. The smallest message number allowed is m.000. Once m.999 is reached, begin renumbering at m1.000, m1.001, and so on. The highest message number supported in runtime is m2147483.647. Windows 2000 also requires all media files to have three-character extensions to identify the file type. Numeric file extensions are not recognized by Microsoft Windows, so Intervoice created a new extension, .ivc, to identify Intervoice files on Windows systems. However, when .ivc files are transferred to an Intervoice system, the .ivc extensions must be replaced with numeric extensions. The CRVoice2 utility renames the .ivc files to the m.### format. For Intervoice purposes, the .wav file extension indicates a master voice file, while the .ivc file extension indicates a file that has been compressed for use on an Intervoice InterSoft IQTalk system. Once you have compressed the messages into .ivc files, use the CRVoice2 utility (see page 6-4) to copy and rename all the .ivc files to the m.### format. The CRVoice2 utility leaves the original long filenames on the .wav and .ivc files for easy reference later. Figure 6-2 illustrates the progression from .wav files to .ivc files then to m.### files.
Voice Recording Software
BankABC501.wav BankABC502.wav BankABC503.wav

Batch Converter
BankABC501.ivc BankABC502.ivc BankABC503.ivc

(original masters)

(trimmed, EQd, normalized, compressed) Same files, same content, different names CRVoice2 utility
m.501 m.502 m.503

Figure 6-2. Transforming .wav Files to .ivc Files

60001198, Second Edition

Naming and Converting .wav to .ivc to m.### Files

Using the CRVoice2 Utility


Before the .ivc voice files can be used on an Intervoice InterSoft IQTalk system, they must be renamed to the Intervoice voice filename format, m.###. The CRVoice2 utility automatically copies and renames all the .ivc compressed voice files created in Sound Forge to the Intervoice m.### file naming convention. CRVoice2 leaves the original long filenames on the .wav and .ivc files for easy reference later. The CRVoice2 utility also checks and corrects file lengths to ensure that all the files are exact multiples of four bytes. The CRVoice2 utility does not change the contents of a file in any way. If the .ivc filename includes the message number (for example, BankABCMainMenu503.ivc), the CRVoice2 utility copies and renames all message files quickly and correctly. This feature allows you to apply the full capabilities of long filenames to help identify message files before renaming them for placement on a runtime system. To use the default operation of the CRVoice2 utility, place the CRVoice2.exe file in the directory that contains the .ivc files to be renamed.

The CRVoice2 utility renames only .ivc files in the directory where the utility is located.

CRVoice2 Utility Options


To view the list of the options available in the CRVoice2 utility, start a command prompt in Windows, then navigate to the directory where the CRVoice2 utility is located. Type CRVoice2 -? <Enter>. A list of all of the CRVoice2 utility options is displayed.

6-4

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Using the CRVoice2 Utility

To start the CRVoice2 utility from an icon, follow these steps:


1

Copy the CRVoice2.exe utility file from the Intervoice Web site to the directory containing the .ivc files to be copied and renamed. Make sure to copy the actual CRVoice2.exe file rather than a shortcut to it. (Hold down the <Ctrl> key to drag and drop files.) Double-click the copy of the CRVoice2 icon.

The CRVoice2 utility copies all the .ivc files in the current directory and renames the copies with the m.### convention. CRVoice2 affects only the files in the local directory with the .ivc extension. To start the CRVoice2 utility from a command prompt, type the following command in this format:
CRVoice2 <-rename> <source file spec> <target directory>

The CRVoice2 utility provides the following options:


-h

Displays the available parameters in the CRVoice2 utility.


<-rename> (optional)

Renames the files instead of copying them. The <target directory> parameter is ignored when another one is specified. This renames the existing .ivc files to m.500 instead of duplicating them, so the original .ivc filenames are lost.
<source file spec>

Indicates the files to be renamed or copied (wildcards are allowed) and the location of the files. The command copies and renames all .ivc files in the current directory.
<target directory>

Indicates the directory in which the renamed copies are placed. The default is the current directory, if this parameter is not used. Example: The following sample CRVoice2 command copies all the .ivc files in the c:\audio directory that begin with the bankabc prefix and renames them to the appropriate m.### filenames. The copies are placed in the c:\audio\bankabc directory. [c:\audio]Crvoice2 bankabc*.ivc c:\audio\bankabc

60001198, Second Edition

Naming and Converting .wav to .ivc to m.### Files

7
In this chapter...
M

Maintaining Audio Quality


This chapter explains the purpose of master voice files and stresses the importance of editing the files before compressing them. It also stresses the importance of retaining original master voice files.

Voice File Quality Issues, 7-2

60001198, Second Edition

7-1

Voice File Quality Issues


Master .wav voice files are intended to store messages in a high-quality format for editing and archiving original messages. To prevent loss of quality, do not edit and reuse compressed messages. After an Intervoice master.wav file has been compressed into the .ivc, m.###, or runtime .wav format, it must not be changed or edited. If you need to change a voice file, edit the Intervoice .wav master file and then compress the new edited file into the .ivc, m.###, or runtime .wav format. The Intervoice .wav master format is an 11.025 KHz (or 11,025 Hz), 16-bit per sample, linear format. It provides a high-quality method of voice storage for editing and manipulation before compression. All editing and volume control adjustments should be made on the 16-bit per sample master .wav file before compression into the .ivc or .wav runtime format. After all editing of the master .wav voice file is complete, the file should be processed using the Batch Converter. No editing of any type should be done on the compressed file.
Keep two versions of the master file: one original and one for editing. Do not discard or edit the original master.

Note that the original masters of the .wav voice files should be retained for future use. After a .wav voice file is compressed into the .ivc format, some quality is lost. Even though the Batch Converter and Sound Forge can convert the .ivc or runtime .wav files back to master .wav file size, a substantial loss of quality occurs in the new master .wav files, and even more quality is lost in the second compression step. For this reason, all re-editing should be done on the original 16-bit master .wav file.

Loss of Voice Quality


Severe loss of voice quality occurs if you convert Intervoice .ivc files to .wav files, edit those .wav files, and try to convert the .wav files back to .ivc files. To correctly edit files, save a copy of the original 16-bit master .wav file, edit the original 16-bit master .wav file, and then convert the newly edited .wav into .ivc and m.### (or 8-bit runtime .wav) for use on the Intervoice system.

7-2

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

A
In this appendix...
M

FAQs, A-2

Frequently Asked Questions


This appendix provides some frequently asked message recording questions and the answers to those questions. Users should consult this chapter before calling for technical support. This chapter will be updated as new questions are encountered. Contact Sound Forge technical support at:
http://mediasoftware.sonypictures.com/support. (Intervoice recommends contacting Sonic Foundry first for any technical support issues, as Intervoice does not have specific support personnel for Sound Forge.)

For Intervoice technical support, see Technical Support on page xv. For a list of message recording FAQs, visit the Intervoice Web site at www.intervoice.com/support/techinfo/voicerecording.

60001198, Second Edition

A-1

FAQs
The following questions and answers pertain to recording messages.
Recording Steps
What are the main steps to record a voice message?

The primary steps in the recording process are as follows:


1 2 3 4 5

Set levels. Perform the noise floor test (see page 2-12). Record messages (Refer to the voice recording software). Apply the batch script (see page 5-8). Run the CRVoice2 utility (see page 6-4), for InterSoft IQTalk only. Move the files to the Intervoice system.

6 Defining DC Offset

What is the DC offset and what does it do?

DC offset is the average vertical offset from 0 dB of the recorded wave form. This is normally visible in the Sound Forge window only in silent sections of the wave form. Most sound cards have some DC offset when you are recording. Usually you cannot see this offset in the Sound Forge window unless you expand the vertical gain nearly to the limit and look into a silent portion of the recorded wave form. DC offset can cause problems in compressing voice files using the Intervoice ADPCM format. It also causes errors when you are trying to measure the noise floor of a recording. The Sound Forge Tools > Statistics command measures the average value or DC offset of a wave form along with several other important statistics about a wave form. This is the command used to check the noise floor of your recording setup. Since virtually all sound cards have some DC offset, it should be routinely removed as part of the standard processing of 11.025 KHz master files for compression into Intervoice formatted messages. A preset to remove DC offset should be placed in the standard Intervoice batch script that is applied to all Sound Forge voice messages prepared for runtime systems.

A-2

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

FAQs

Sound Forge already has a preset (Process/DC Offset) that removes the DC offset from any recording. Select Process > DC Offset in the Sound Forge main window. The DC Offset window opens. Ensure that the Automatically detect & remove option is selected and that the Compute from first 5 seconds only option is unchecked. Save this preset as Intervoice DC Removal Preset and place it in the standard batch script.
Producing 64-KB Messages
Is it possible to produce 64-KB -law or A-law messages with the Sound Forge Tool?

Yes, you can create 64-KB messages with Sound Forge, as follows: Before you can save a Sound Forge file as an Intervoice 64-KB message, you must make sure the file is sampled at exactly 8.0 KHz. If the file is not sampled at 8 KHz, you must resample the message in Sound Forge to change the sample rate to 8 KHz. Remember that if you are down-sampling (for example, 22 KHz to 8 KHz), you must use the antialias filter in the resample preset. Up-sampling (for example, 6 KHz to 8 KHz) does not require the antialias filter. (See Batch Converter Overview on page 5-2 for more information.) Once you have an 8-KHz message, select File > Save as. The Save As dialog box appears. In the Save as type field, select the Intervoice (*.ivc) format. In the Format selection field, select the G.711 -law or A-law format. Click the Save button. You can also put all of this into a batch converter script to convert several .wav voice files to Intervoice 64-KB master files in a single step.

Converting Messages

Can I convert Dialogic IVR messages to Intervoice messages?

Yes. Just open the Dialogic messages in Sound Forge, resample to 8 KHz (if not already there), and save as an .ivc file. You can save the file as an Intervoice format 24-KB, 32-KB, or 64-KB, -law or A-law message, depending on the Save As format option you select. You can also put all of this in a batch converter script to convert many .wav or Dialogic voice files to Intervoice 64-KB master files in a single step.

60001198, Second Edition

Frequently Asked Questions

A-3

Using the Gain Leveling Preset

When should I use the Intra-message Gain Leveling Preset?

The Intra-message Gain Leveling Preset is intended to be used only for a specific poorly recorded single message where the voice talents dynamics are too wide. (That is why it is called intra-message.) This usually occurs if the voice talent leans too close and then too far away from the microphone while recording a single message. That is to say, if a specific single message has passages that are too soft, while other parts of the same message are too loud, then you can use this preset to level the audio within that message. This preset will not equalize the level of a group of messages. The Normalize preset is included in the utilities package and it is the correction to adjust the levels of a group of messages to a consistent level. That is why it is included in the Sound Forge utilities batch scripts, and the Intra-message Gain Leveling Preset is not.

Installing the Audiophile 2496 Sound Card

When I installed the M-Audio Audiophile 2496 sound card, the software disabled the Windows volume controls on my system.

This is not an error or problem. The software is working correctly. Once the sound card is installed, the volume is controlled through the Midiman Audio Buddy preamplifier.

A-4

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

FAQs

Turning Off AGC

How do I turn off the Automatic Gain Control (AGC) on my sound card?

The AGC that is on many sound cards must be turned off for any Intervoice voice message recordings. Unfortunately, Creative Labs turns on the Automatic Gain Control on all of their sound cards by default (except the AWE Gold). You can turn off AGC only by using the Creative mixer applet that comes with the Sound Blaster card in the application software package (ctmix32.exe). The standard Microsoft Windows 95 mixer applet (yellow speaker) does not have the ability to change this setting on Creative Labs cards. To turn off the AGC, follow these steps:
1 2 3 4 5

Install Creative Labs Audio Mixer applet (ctmix32.exe). Start ctmix32.exe. Right-click the mixer face. Select the Input/Output settings. Uncheck Automatic Gain Control.

You should not make recordings with the Sound Card AGC enabled. It can cause background noise as well as clicks and pops in the final recording.
Normalizing the Message Volume
When is the best time to normalize the volume so that it matches the messages already on the system?

The messages should be saved in 11, 025 KHz, 16-bit, monophonic, linear .wav format and recorded as loud as possible without clipping. All the subsequent processing can be performed using the Batch Converter. Generally, reducing the volume of the message is one of the last steps before compressing the .wav file into the runtime format.

60001198, Second Edition

Frequently Asked Questions

A-5

Filtering Messages

When I am converting messages to the Intervoice ADPCM format, what filtering or optimization is necessary for the telephony network?

Converting the .wav Files to Runtime Format Overview on page 5-4 describes the steps for postprocessing a .wav file to InterSoft .ivc file format. The most critical step for voice clarity is equalization. By using the parametric equalization function in the Batch Converter to change the low frequency cutoff of the recorded voice, you can significantly change the clarity of the voice over the phone. Most users find that they can achieve their voice quality requirements by modifying the low-shelf frequency of the paragraphic EQ step. Usually they raise the low-shelf to between 200 and 300 Mhz for improved clarity. You may find that a modification of the high-shelf parameter also will improve some types of voices. If you want, you can boost a small portion of the audio spectrum centered at 800 Hz up a few decibels to get extra clarity voice. Do not improve the voice by boosting the bass. While this may seem improved on a hi-fi speaker, the voice quality does not work well over a phone handset. The primary information carrying bandwidth of the human voice is between 400 and 2000 Hz, and boosting frequencies outside of this range actually lowers the listeners comprehension.
Reducing Pops and Clicks in Messages
I have heard that the CRVoice2 utility fixes the pops and clicks in messages. Is this a result of making sure the message is in 4-byte increments, or does something else resolve this problem? And what does the CRVoice2 utility do?

Sound Forge version 4.5 and later versions of the Batch Converter with the Example Batch Script Vx.0.bcs file create InterSoft .ivc files with the correct byte count lengths. If you convert your .wav message masters using Sound Forge 4.5 or later versions of the Batch Converter, which is the correct way to convert messages, you do not have problems with pops caused by message length. Other wave editors do not process or compress the message files correctly and should not be used to create InterSoft .ivc files. Other editors can be used to record the .wav master files, but only the Sonic Foundry Batch Converter can convert them.

A-6

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

FAQs

The CRVoice2 utility checks the message length and corrects it when necessary. However, it should not be used for that purpose because it does not perform the other operations on the .wav file that the Batch Converter does. The CRVoice2 utility is used strictly to rename the .ivc files to m.### for Intervoice systems. That is all it is designed to do. Checking the message length was included in this utility to fix a bug in an earlier version of Sound Forge. This earlier version is no longer available.

60001198, Second Edition

Frequently Asked Questions

A-7

Glossary

-law (pronounced mu-law)

A Pulse Code Modulation (PCM) voice-encoding standard in which analog voice is band-limited to 4-KHz bandwidth and then sampled at twice its highest frequency (4000 x 2), with the sample then converted as an eight-bit binary number. Samples must be taken 8000 times per second, so each sample is 1/8000 of a second, or 125 microseconds. A standard telephone voice channel is transmitted at 64 KBps, or eight samples in 125 microseconds. The -law standard is used on T1 systems in the United States, Japan, Mexico, and Canada. See also A-law. A method of encoding digitized voice that requires fewer bits than linear or -law encoding (voice compression). The Intervoice system has a standard file format for voice messages using this compression method at 32 KBps and 24 KBps.

adaptive differential pulse code modulation (ADPCM)

ADPCM

See adaptive differential pulse code modulation.


A-law

The method of encoding sampled audio waveforms used in the 2048-KB, 30-channel PCM primary system. A-law is the standard widely used outside of North America on E1 systems. See also -law.
automatic labeling

The function of Sound Forge that allows the user to assign descriptive filenames to master voice files. This function should be set up before messages are recorded.
bandwidth

The range of electrical frequencies a device can employ.

60001198, Second Edition

Glossary-1

Batch Converter

The Sonic Foundry program that allows the processing of multiple audio files in a single process.
click

A sound, or noise, that occurs in voice recordings; a type of audible interference that must be corrected in voice recordings.
command prompt

An operating system prompt that appears in an operating system window or full-screen session. Commands that run programs are entered at this prompt.
CRVoice2 utility

A utility that assists in the naming of multiple voice files for use in InterSoft runtime.
dBm

A decibel below 1 milliwatt (mW). See also decibel.


DC offset

The average of all of the positive and negative integers in a waveform. This number can be positive or negative. Zero is referred to as no DC offset.
decibel (dB)

A unit for measuring the power of a sound or the strength of a signal.


direct memory access

See DMA.
DMA (direct memory access)

A technique in which an adapter bypasses a computers CPU and handles the transfer of data between itself and the systems memory directly.
equalization

The frequency manipulation of a sound using adjustable filters.


filename

A name given by the user to electronically stored data for purposes of saving it and retrieving it at a later time. Filenames may be of varying lengths depending on the operating system used. The three-character extension (in the 8.3 format) designates the type of file: text (.txt), word processing (.doc), executable program (.exe), and so on. For multiple-operating system compatibility, the system requires the 8.3 format and recommends lowercase filenaming.

Glossary-2

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

imsgmod4.exe utility

A utility that truncates messages to a length divisible by 4 bytes. This truncation eliminates clicks and pops within messages used on the system. This utility is not recommended by Intervoice.
InterSoft system

An Intervoice system that operates on the InterSoft runtime environment with IQTalk or the VoiceXML Browser.
InterSoft version 3

The core software program used to bring applications into IQTalk voice browser runtime. InterSoft version 3 is designed to run on Windows NT and Windows 2000 operating systems. Intervoice IQ Environment [VoiceXML Browser System] A completely integrated environment that is bundled with the Intervoice InterSoft runtime system. However, it can also have speech recognition and text-to-speech servers. It offers a complete solution including design, development, and deployment tools as well as operation administration and management tools. It also includes data integration options to connect to legacy, enterprise server, and Web-based data sources. Browser options are IQTalk or VoiceXML, and operating system options are Windows NT or Windows 2000.
Intervoice runtime system

One or more Intervoice voice browser systems and other hardware that perform call processing, audiotex, speech recognition, call directing, voice messaging, and interactive voice response. This system is the interface between telephone callers and a host computer, the Internet, a local area network (LAN), a wide area network (WAN), or telecommunication network, and local or remote databases. The Intervoice voice browser systems can be integrated in one box or distributed in multiple boxes. This can also be called IVR and was formerly called Omvia Media Server.
IQTalk

The InterSoft runtime engine and voice browser that interprets and processes the InVision application. It can also interface with userdefined programs written in C or Assembly.
KBps

Kilobits per second. This is the number of bits passing a specific point per second in an audio file/voice message.
message

An audio file that, combined with other messages, comprises the spoken information in an application.

60001198, Second Edition

Glossary-3

Mu-law

See -law. A process used in the Intervoice Example Batch V2.sfz file to adjust the volume of all the audio files to the same level.

normalization

Omvia Media Server

See Intervoice runtime system.


pop

A sound, or noise, that occurs in voice recordings; a type of audible interference that must be corrected in voice recordings.
preset

An audio processing function in Sound Forge.


RealCare

The remote diagnostic and troubleshooting service available to customers as a contractual option. RealCare operates a national WATS line at 1-800-955-4688.
runtime

The mode in which the system answers customer calls and communicates with the host; the system in the production environment.
Sound Forge

A program, created by Sonic Foundry, and distributed by Sony Media Software, Inc., that performs postprocessing and complex editing. It can also be used to record and edit messages.
trimming

The action of eliminating or cutting silence from the beginnings and ends of voice messages.
voice compression

A method for reducing the bandwidth required for storing or transmitting digital voice signals.
voice file set

A set of individual voice files that contain the recorded messages for an application. These voice files are contained in a subdirectory of the application directory.

Glossary-4

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

voice message A digital .wav file used when an individual message is changed in a voice file set (.vfs). While the message file is being changed, any

application playing the current message continues to play it. The new message is played at the next message request.
voice message file numbering

The method of naming voice message files m.###. They must begin with m to be recognized, for example:
m.001 message number 1 m.777 message number 777 m2.010 message number 2010

Voice Pool Set

A type of VoiceSet that contains a collection of voice messages for applications that record large amounts of voice, such as LeaveWord recording. Messages from a Voice Pool Set (as in an InVision application) are used in place of messages from the voice file set (.vfs). Voice Pool Sets are contained in a subdirectory of the application directory. See also VSUTIL utility.
Voice Set utility

See VSUTIL utility.


voice talent

The person whose voice is used to produce voice recordings.


VSUTIL utility (VoiceSet utility)

The program that creates VoiceSet directories based on the information it reads in the .app file (stored in the application directory). Actual pools are created in the Voice Pool Set for use with LeaveWord and Advanced LeaveWord, as well as in the Alternate Message table for use by YourVoice. It also creates any necessary indirection files. VSUTIL should be started from a command prompt in the application directory.
.wav

A standard format for audio files. A file that contains Windows-compliant audio data. The file can be played on any Windows system with audio hardware. Intervoice has defined an 11.025 KHz, 16-bit .wav file as its master format.

60001198, Second Edition

Glossary-5

Windows 2000

A network operating system built from Microsoft Windows NT technology. Windows 2000 provides support for both peer-to-peer networking as a work group and centralized administration networking as a domain. Windows 2000 has two major versions: Windows 2000 Server and Windows 2000 Professional. For IVRs, Intervoice recommends Windows 2000 Server, which supports communication between a larger number of systems. Windows 2000 Professional can be used to run applications on business desktops and laptop computers.

Glossary-6

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

Index

.ivc files 6-4, 6-5 naming 6-3 quality of 7-2 .wav files 6-4 clipping 2-2 conversion and reconversion 7-2 converting 5-2 defined Glossary-5 described 3-2 editing and compressing 7-2 master format 7-2 keeping two copies 7-2 renaming 6-3 updating 7-2 -law compressed data format, explained 3-2 A AC-powered speakers with cables, recommended hardware 2-3 adaptive pulse code modulation (ADPCM) 5-2 .ivc data format 3-3, 5-8 decoder 5-4 ADPCM. see adaptive differential pulse code modulation A-law ADPCM compression 5-5 defined Glossary-1 applet, ctmix32.exe A-5 audience for this document ix audio cable recommended hardware 2-3 Y-adapter 2-4

audio file format InterSoft IQTalk 3-3 VoiceXML 3-3 Automatic Gain Control (ACG) A-5 automatic labeling and Sound Forge 4-2 defined Glossary-1 setting up 4-2 B background noise 2-12 bandwidth, defined Glossary-1 bass boosting A-6 Batch Converter 3-3, 4-6 defined Glossary-2 downloading 2-6 installing 5-2 modifying sample batch script 5-8 parametric equalization function A-6 postprocessing steps 8 KHz resampling 5-4 DC Offset Removal 5-4 Front and Back silence buffering 5-4 Front and Back silence trimming 5-4 Message level normalization 5-4 registering 2-6 removing batch scripts 4-18 using 5-6 using sample batch script 5-3 batch scripts modifying 5-3, 5-8 removing 4-18 buffering, silence 5-4

60001198, Second Edition

Index-1

buttons Record 2-8, 4-2 Reset 2-8 Stop 4-8 C caution note xiv checklists recording process 4-6 studio set up 2-2 click defined Glossary-2 reducing A-6 clipping described 2-9 fixing 2-9 preventing 2-10 command prompt 6-4 defined Glossary-2 connecting the sound card and speakers 2-4 conventions used in this document xiv converting .wav file to.ivc file 5-4 CRVoice2 utility 4-6 and .ivc files 2-7 command 6-5 defined Glossary-2 install directory 6-4 options <-rename> (optional) 6-5 <source file spec> 6-5 <target directory> 6-5 -h 6-5 renaming files 2-7 starting from a command prompt 6-5 starting from an icon 6-5 using 6-4 CRVoice2.exe file 6-4, 6-5 ctmix32.exe A-5 customer support xv

D dB(decibel), defined Glossary-2 dBm, defined Glossary-2 DC offset defined Glossary-2 removal 2-13, 5-4 statistics, viewing 2-13 window 2-12 diagram conversion process 5-2 creating audio recordings 1-3 from .wav to .ivc 6-3 DMA (direct memory access), defined
Glossary-2

document reference xiv documentation access xi formats xi Intervoice xi OEM for installation instructions 2-2 downloading from Web site Batch Converter 2-6 E equalization 5-5 defined Glossary-2 example Batch Script 5-2 F FAQs converting messages A-3 defining DC offset A-2 filtering messages A-6 installing the Audiophile 2496 card A-4 normalizing the message volume A-5 producing 64-KB messages A-3 recording steps A-2 reducing pops and clicks A-6 turning off Automatic Gain Control (AGC)
A-5

using the Gain Leveling Preset A-4 figures, list of v filename defined Glossary-2 formats 6-2

Index-2

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

files .ivc 6-5 .wav defined Glossary-5 CRVoice2.exe 6-4, 6-5 Intervoice Example Batch Script 5.0 5-3 filtering messages A-6 formats, standard InterSoft runtime 3-3 G Gain Leveling Preset, described A-4 general note xiv H hardware, recommended 1-4 I information symbols used in this document xiv installing Batch Converter 2-6 CRVoice2 2-7 hardware 2-3 InterSoft file naming conventions 6-1 master format, differences between Windows master format 3-3 runtime format 3-2 runtime messages, format 6-1 InterSoft system, defined Glossary-3 InterSoft version 3, defined Glossary-3 Intervoice documentation xi Example Batch Script 5.0 file 5-3 Example Batch Script file contents 5-3 Example Batch V2.sfz file 5-2 master format 3-2 standard master format 3-2 technical support xv voice file, naming conventions 6-3 Web site 1-1, 5-3, 5-8, 6-5, A-1 Intervoice system defined Glossary-3 IQTalk, defined Glossary-3

J jack-to-jack shield adapter, recommended hardware 2-3 K KBps defined Glossary-3 kilobits per second Glossary-3 L list of figures v list of tables vii M master format .wav files 7-2 Windows 3-3 M-audio sound card, recommended 2-3 message defined Glossary-3 file, final volume levels 2-8 filtering A-6 volume, normalizing A-5 microphone 2-2 cable, recommended hardware 2-3 commercial and preamplifiers 2-9 connecting 2-2 level on preamplifier 2-9, 2-12 placement 2-9 recommended hardware 2-3 recommended type, supercardioid 2-9 Sennheiser, more information 2-9 setting levels 2-8 modifying the example batch script 5-8 mu law, defined Glossary-1 N noise floor test 2-2, 2-12, 4-6 failure causes 2-13 normalization 5-4 defined Glossary-4 Normalize preset A-4

60001198, Second Edition

Index-3

note boxes caution xiv document reference xiv general xiv tip xiv warning xiv O Omvia Media Server defined Glossary-4 optimization of messages A-6 P parametric equalization function A-6 playback 5-5 plug-in chain 5-4 pop defined Glossary-4 reducing A-6 postprocessing 5-4 power cord, recommended hardware 2-3 preamplifier 2-2 and commercial microphones 2-9 line-level input aux 2-2 line 2-2 recommended hardware 2-3 preset defined Glossary-4 Gain Leveling A-4 Normalize A-4 prompt, command, defined Glossary-2 purpose of this document ix R RealCare xv defined Glossary-4 recommended hardware AC-powered speakers with cables 2-3, 2-4 audio cable 2-3 jack-to-jack shield adapter 2-3, 2-4 M-audio sound card 2-3 microphone 2-3 microphone cable 2-3 power cord 2-3 preamplifier 2-3

sound card 2-4 Y-adapter audio cable 2-3 Record button 2-8, 4-2 recording attributes 2-8 Intervoice standard settings 4-5 setting 4-5 continuous methods 4-7, 4-8, 4-9 Multiple takes (no Regions) 4-8, 4-9 message-by-message methods 4-7, 4-8, 4-12 Create a new window for each take 4-8,
4-15

Multiple takes creating Regions 4-8,


4-12

messages in Sound Forge 4-9 process checklist 4-6 resampling 5-4 Reset button 2-8 RMS power 2-13 runtime, defined Glossary-4 S sample batch script, using into Batch Converter
5-3

setting recording attributes 4-5 setting up automatic labeling 4-2 signal-to-noise ratio 2-8 silence buffering 5-4 trimming 5-4 software, recommended 1-5 sound card 2-2, 5-4 audio input 2-2 Automatic Gain Control (ACG) A-5 connecting to speakers 2-4 installing A-4 volume controls disabled upon installation
A-4

Sound Forge automatic labeling function 4-2 Batch Converter 5-2 defined Glossary-4 importance of registering 2-6 level monitors 4-9, 4-12, 4-15 recording messages 4-9 recording mode 4-9

Index-4

Sound Forge and the Intervoice System Audio File Guide

60001198, Second Edition

technical support A-1 speakers 2-2 connecting to sound card 2-4 Stop button 4-8 studio setup 2-2 summary of changes 1-2 system utilities CRVoice2 utility 6-4 VSUTIL utility (VoiceSet utility)
Glossary-5

defined Glossary-5 reading preference 4-8, 4-9 Voice Pool Set, defined Glossary-5 VSUTIL utility (VoiceSet utility) defined Glossary-5 W warning note xiv Web sites Intervoice A-1 Windows 2000, defined Glossary-6 Y Y-adapter audio cable 2-4 recommended hardware 2-3

T tables, list of vii technical support xv Sound Forge A-1 tip xiv trimming defined Glossary-4 silence 5-4 U updating the Batch Converter 2-6 using Example Batch Script 5-3 utilities batch scripts A-4 CRVoice2 6-4 defined Glossary-2 VSUTIL (VoiceSet utility) Glossary-5 defined Glossary-5 V viewing DC offset statistics 2-13 voice compression, defined Glossary-4 file set, defined Glossary-4 files converting Dialogic A-3 quality issues 7-2 message file numbering, defined
Glossary-5 message, defined Glossary-5 quality 7-2 talent A-4

and individual master files 5-1 and microphone 2-9

60001198, Second Edition

Index-5

You might also like