The pronunciation of a word depends on the type of text being spoken. For instance, when speaking mathematics, it is important to say ``cap gamma'' when rendering [tex2html_wrap5606], whereas when rendering plain text, upper case is not significant. Similarly, if an English document contains French phrases, these should be pronounced according to French pronunciation rules.
We model pronunciation as a separate component of the audio formatter. The corresponding state space is discrete, with points representing different pronunciation modes. Notice that the pronunciation component does not correspond to a separate hardware component of the audio formatter.