Alternative handling of non-Latin characters for TTS

v4u6h4n · May 13, 2026

I'm wondering whether an alternative approach to handling non-Latin characters in the TTS engine would be possible. Currently, when it encounters a non-Latin character, it reads "<language> letter <number>", for example "Thai letter 324", for every character in a non-Latin word, like in this document.

I would prefer that either the entire word is simply read out, or that, instead of reading each letter separately, something like "Thai letters" is used to indicate that multiple characters are present.

dstillman · May 13, 2026

What voice? Thai characters seem to work fine for me with Premium voices. Standard voices don't support multilingual text.

v4u6h4n · May 13, 2026

Standard Voice 1.

Premium Voices seem fine; they pronounce the words normally.

dstillman · May 13, 2026

Right, so that's expected.

dstillman · May 13, 2026

We can look into stripping unpronounceable spans of text for Standard voices, but mostly we just don't make any claim that multilingual text will be readable.
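
To make the idea concrete, here is a rough sketch of what collapsing or stripping such spans could look like as a text preprocessing step before the text reaches the voice. It is not how the reader or any TTS engine actually works. The function names `script_of` and `collapse_non_latin` are hypothetical, and guessing the script from the first word of the Unicode character name is a crude heuristic assumed only for illustration.

```python
import unicodedata
from itertools import groupby
from typing import Optional

# Scripts (as guessed below) that are passed through to the voice unchanged.
PASS_THROUGH = {"LATIN", "COMBINING"}


def script_of(ch: str) -> Optional[str]:
    """Guess a character's script from its Unicode name, e.g. 'THAI'.

    Returns None for spaces, digits, punctuation, and unnamed characters.
    This is a heuristic, not the real Unicode Script property.
    """
    is_letter_or_mark = ch.isalpha() or unicodedata.category(ch).startswith("M")
    if not is_letter_or_mark:
        return None
    name = unicodedata.name(ch, "")
    return name.split(" ")[0] if name else None


def collapse_non_latin(text: str, strip: bool = False) -> str:
    """Replace each run of same-script non-Latin characters.

    With strip=False, a run becomes a short label such as 'Thai letters'.
    With strip=True, the run is removed entirely.
    """
    out = []
    for script, run in groupby(text, key=script_of):
        chunk = "".join(run)
        if script is None or script in PASS_THROUGH:
            out.append(chunk)  # keep Latin text, spaces, punctuation as-is
        elif not strip:
            out.append(f"{script.capitalize()} letters")
    return "".join(out)


print(collapse_non_latin("Hello สวัสดี world"))
```

With `strip=True` the run is dropped instead of labeled, which leaves the surrounding whitespace in place, so a real implementation would probably also want to normalize spaces afterwards.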

v4u6h4n · May 13, 2026

Yeah, that's totally reasonable. My complaint is just the tedious reading of each character. I don't have any expectation that it actually reads the different alphabets, even though it's nice when it does.