Phoneme shapes
 
 
 

Phonemes are the smallest individual units of sound that are combined to form speech. A phoneme shape refers to the position of the lips, teeth, tongue, and jaw required to utter a phoneme, or a specific sound of speech.

For example, the phoneme shape for the phoneme /m/ consists of lips that are pressed together, whereas the phoneme shape for /l/ consists of lips that are not round and that are slightly apart, with the tip of the tongue pressed lightly to the gums behind the upper teeth.

A. /m/ phoneme shape B. /l/ phoneme shape

Phoneme shapes are needed when you use the Voice device, in conjunction with the Character Face asset, to drive your facial animation. Refer to the Various Phoneme shapes figure as a guide for creating phoneme shapes supported by the Voice device.

However, you can also drive phoneme shapes as you would custom shapes, using keyframes or devices such as joysticks and keyboards.

Create as many phoneme shapes as you need. If you create only a few phoneme shapes, they can be blended to simulate other shapes. However, the more phoneme shapes you create for a model, the better quality the final animation.

In speech, vowels are particularly important because they comprise the majority of visual cues in spoken language.

How vowels are articulated depends on the different movements of the mouth and tongue, such as the front or back placement of the tongue and the movement of the lips.

The following figure shows the English vowel system (partial) in a scale ranging from front unrounded (IY) to back rounded (UW) vowels. Most other vowels fall in between these two extremes (IY and UW). AX is approximately in the middle, and represents the sound that is made when the tongue and mouth are at complete rest.

Partial English Vowel System

Phonemes are not always pronounced in the same way and may fall between vowels. To compensate for this, or if your model was created without a specific shape, MotionBuilder weights the nearest vowels to generate a mix of shapes that approximates the position of your model’s lips.

If your model says a vowel that falls outside the range that your shapes cover, MotionBuilder chooses the closest vowel.

You should select the vowels that correspond to your shapes. Ideally, you should create your model with the shapes for the extreme ends of the vowel range, then uniformly space more vowels in between. See the following figure for a visual example of phonemes and their shapes.

Various phoneme shapes

For example, if you want your character to slightly open and close its mouth, you could use only one vowel such as AX (as in “the”). If you want more subtle movement to represent the sound between different vowels, use more than one. Accordingly, if your model has three shapes that say “beat”, “bat”, and “boot”, use IY, AE, and UW. If your model has six shapes saying “bit”, “bet”, “bob”, “boat”, “book”, and “seat”, use IH, EH, AA, OW, UH, and S respectively.

The following table lists the phoneme shapes available in MotionBuilder and gives an example illustrating their pronunciation.

Phoneme Example
AE cat
AO fought
AX alas
B bike
D dam
F fox
G gate
H hat
IY heat
K cake
KG concur
L lock
M moon
N new
OW goat
P point
S site
SH shape
T tap
UH look
UW loot
V veer
Z zoo
ZH pleasure