
VoiceXML: Introduction to Developing Speech Applications
13.1 For each of the following applications, write a multimodal dialog
scenario using notation similar to that of Figure 13.4.
A. Present directions to a restaurant in New York City
Step | Prompt to User: Spoken | Prompt to User: Displayed | User Response: Spoken | User Response: Keyboard/Mouse |
1 | Let's practice the new vocabulary. What is this? | Display a picture of a red book | That is a book | |
2 | Good. What color is the book? | | The color of the book is red | |
3 | Put the book into the bookcase | | OK | Manipulates the mouse to grab the book and drop it on the bookcase |
C. Shopping for clothes in an online clothing store
Step | Prompt to User: Spoken | Prompt to User: Displayed | User Response: Spoken | User Response: Keyboard/Mouse |
1 | What would you like to buy? | | A shirt | |
2 | Here are three shirt styles. Which do you like? | Display three shirt styles | This one | Points to desired shirt style |
3 | This style comes in three colors. Which do you like? | Display three shirt colors and the words that describe each color | Peach color | |
13.2 For each of the following input/output modes, describe an
application that uses the mode and illustrate part of a multimodal
dialog scenario using notation similar to that of Figure 13.4.
A. Sequential input
Step | Prompt to User: Spoken | Prompt to User: Displayed | User Response: Spoken | User Response: Keyboard/Mouse |
1 | What would you like to buy? | Display assortment of vegetables | Four apples | |
2 | | | | Bag of peanuts |
3 | | | Two big grapefruit | |
B. Uncoordinated simultaneous input
Step | Prompt to User: Spoken | Prompt to User: Displayed | User Response: Spoken | User Response: Keyboard/Mouse |
1 | What is your account number? | | 18576 | 18576 |
C. Coordinated simultaneous input
Step | Prompt to User: Spoken | Prompt to User: Displayed | User Response: Spoken | User Response: Keyboard/Mouse |
1 | What should I do next? | Display a scene of objects and locations | Put this | Points to an object |
2 | | | here | Points to a location |
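The coordinated input above only makes sense once the spoken words and the pointing gestures are combined into one command. The following is a minimal sketch of one way that combination might work, assuming every spoken word and every pointing gesture carries a timestamp. The names SpokenWord, PointEvent, and fuse, the timestamps, and the targets "red book" and "bookcase" are all hypothetical; they are not taken from the text or from any multimodal platform.

```python
from dataclasses import dataclass


@dataclass
class SpokenWord:
    text: str
    time: float  # seconds since the start of the user's turn (assumed)


@dataclass
class PointEvent:
    target: str  # whatever the pointer was over (hypothetical names)
    time: float


# Words that need a pointing gesture to be understood (assumed list).
DEICTIC_WORDS = {"this", "that", "here", "there"}


def fuse(words, points):
    """Pair each deictic word with the unused pointing gesture nearest in time."""
    remaining = list(points)
    fused = []
    for word in sorted(words, key=lambda w: w.time):
        if word.text.lower() in DEICTIC_WORDS and remaining:
            nearest = min(remaining, key=lambda p: abs(p.time - word.time))
            remaining.remove(nearest)
            fused.append((word.text, nearest.target))
        else:
            # Ordinary words carry no pointing target.
            fused.append((word.text, None))
    return fused


# "Put this" while pointing to an object, then "here" while pointing to a location.
words = [SpokenWord("Put", 0.0), SpokenWord("this", 0.4), SpokenWord("here", 1.6)]
points = [PointEvent("red book", 0.5), PointEvent("bookcase", 1.7)]
result = fuse(words, points)
print(result)
```

Pairing by nearest timestamp is only one possible design; a real system might instead use time windows or grammar constraints to decide which gesture belongs to which word.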
D. Sequential output
Step | Prompt to User: Spoken | Prompt to User: Displayed | User Response: Spoken | User Response: Keyboard/Mouse |
1 | | Welcome to Ajax CD and movie rental | | |
2 | Are you interested in music or movies? | | CDs | |
3 | | Display list of "hot" CDs | | |
E. Uncoordinated simultaneous output
Step | Prompt to User: Spoken | Prompt to User: Displayed | User Response: Spoken | User Response: Keyboard/Mouse |
1 | Welcome to Ajax CD and movie rental | Display list of "hot" CDs and movies | | |
2 | Are you interested in music or movies? | Display another list of "hot" CDs and movies | | |
F. Coordinated or synchronized output
Step | Prompt to User: Spoken | Prompt to User: Displayed | User Response: Spoken | User Response: Keyboard/Mouse |
1 | Start here | Display starting point on a map | OK | |
2 | Go two blocks north | Draw an arrow on the map pointing northward | OK | |
3 | Turn left and go three blocks | Draw an arrow on the map pointing westward | Yes | |
4 | Here is the restaurant | Place an X on the map where the restaurant is located | Thanks | |
13.3 Write two multimodal versions of Exercise 5.2.
A. Write a multimodal dialog scenario for a "point and speak" dialog.
(Use notation similar to that of Figure 13.4.)
Step | Prompt to User: Spoken | Prompt to User: Displayed | User Response: Spoken | User Response: Keyboard/Mouse |
1 | | | | Points to date field |
2 | | Date field highlighted | December 18, 2002 | |
3 | | December 18, 2002 in date field | | Points to time field |
4 | | Time field highlighted | Three thirty PM | |
5 | | 3:30 pm in time field | | |
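The "point and speak" scenario above can be read as a small state machine: pointing selects the active field, and the next utterance fills it. The sketch below simulates that behavior in plain Python. It is not extended VoiceXML or SALT code, and it does not answer parts B or C. The class PointAndSpeakForm and its methods are hypothetical names, and the speech recognizer is assumed to have already turned "Three thirty PM" into the text "3:30 pm".

```python
class PointAndSpeakForm:
    """Simulates a form in which pointing picks a field and speech fills it."""

    def __init__(self, field_names):
        self.values = {name: "" for name in field_names}
        self.active = None  # the highlighted field, if any

    def point(self, field_name):
        """The user points to a field; the field becomes highlighted."""
        if field_name not in self.values:
            raise KeyError(field_name)
        self.active = field_name

    def speak(self, recognized_text):
        """The user speaks; the text goes into the highlighted field."""
        if self.active is None:
            return False  # nothing is highlighted, so the speech is ignored
        self.values[self.active] = recognized_text
        self.active = None  # the highlight is cleared once the field is filled
        return True


# Replay the five steps of the scenario.
form = PointAndSpeakForm(["date", "time"])
form.point("date")                  # step 1
form.speak("December 18, 2002")     # steps 2 and 3
form.point("time")                  # step 3
form.speak("3:30 pm")               # steps 4 and 5
print(form.values)
```

Ignoring speech when no field is highlighted is a design choice made for this sketch; a real application might instead prompt the user to point to a field first.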
B. Write extended VoiceXML code for a "point and speak" dialog.
C. Write HTML plus SALT code for a "point and speak" dialog.
D. Compare the two versions with respect to programming style and lines
of code required.
Extended VoiceXML is declarative code, while HTML plus SALT is
event-driven procedural code.