During database creation, each recorded utterance is segmented into some or all of the following: individual phones, syllables, morphemes, words, phrases, and sentences.
[1] An index of the units in the speech database is then created based on the segmentation and acoustic parameters like the fundamental frequency (pitch), duration, position in the syllable, and neighbouring phones.
Unit selection provides the greatest naturalness, because it applies digital signal processing (DSP) to the recorded speech only at concatenation points.
Gleason appeared in Microsoft's Super Bowl XLVIII commercial praising the power of technology, using his synthetic voice to narrate.
This process was featured in a BBC Radio 4 documentary, Giving the Critic Back His Voice, broadcast in August 2011.
CereProc's client/server system cServer, aimed principally at the corporate IVR market, can be run on Windows and Linux.