
Since the list of phonemes in English is fixed, it should be possible to come up with one or more sample texts that together contain all possible biphones, for use in text-to-speech synthesis.

Does anyone have a sample text for this purpose?

alvas
  • The question is not fully determined, as the pronunciation of biphones is a matter of dialect. I think you are looking for a text that will cover all biphones in all dialects, which is a little more to ask. – user7358 Dec 29 '13 at 16:41
  • technically, even a list of all biphones of every language is very limited, since the total number of IPA signs is sort of limited to a handful, no? – alvas Dec 31 '13 at 15:51
  • Covering all the biphones is easy. The hard part in text to speech is getting all the different ways people pronounce each phone. – Aaron Dec 31 '13 at 16:25
  • if data size is large enough, wouldn't it be a normalization problem to get the most representative sound file that suits a certain biphone? – alvas Dec 31 '13 at 16:28

1 Answer


The "Harvard sentences" corpus is a classic:

https://en.wikipedia.org/wiki/Harvard_sentences
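
To check how well any candidate text covers the biphone inventory, one option is to transcribe each sentence with a pronunciation dictionary and count the adjacent phone pairs. The sketch below is only an illustration: the tiny `LEXICON` and the function names `sentence_biphones` and `greedy_cover` are hypothetical, and a real setup would load a full dictionary (for example CMUdict) for the dialect of interest. It greedily picks sentences that add the most unseen biphones, which is a common heuristic rather than a guaranteed minimal selection.

```python
# Toy pronunciation lexicon (hypothetical, ARPAbet-like symbols); a real
# system would load a full pronunciation dictionary instead.
LEXICON = {
    "the": ["DH", "AH"],
    "cat": ["K", "AE", "T"],
    "sat": ["S", "AE", "T"],
    "at": ["AE", "T"],
}


def sentence_biphones(sentence, lexicon=LEXICON):
    """Return the set of adjacent phone pairs in a sentence, including
    pairs that span word boundaries. Unknown words raise KeyError."""
    phones = []
    for word in sentence.lower().split():
        phones.extend(lexicon[word])
    return set(zip(phones, phones[1:]))


def greedy_cover(sentences, lexicon=LEXICON):
    """Greedily pick sentences until no remaining sentence adds a new biphone.

    Returns the chosen sentences and the set of biphones they cover.
    """
    covered, chosen = set(), []
    remaining = {s: sentence_biphones(s, lexicon) for s in sentences}
    while remaining:
        # Sentence contributing the most biphones not yet covered.
        best = max(remaining, key=lambda s: len(remaining[s] - covered))
        gain = remaining.pop(best) - covered
        if not gain:
            break  # nothing left adds coverage
        covered |= gain
        chosen.append(best)
    return chosen, covered


if __name__ == "__main__":
    chosen, covered = greedy_cover(["the cat", "the cat sat", "cat"])
    print(chosen)
    print(sorted(covered))
```

Comparing `covered` against the full set of phone pairs that are legal in the target dialect shows which biphones still need a sentence of their own.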

Matthias Odisio