Mimic 3 TTS - speech metadata

Martin_Miksik · September 24, 2022, 4:51pm

Hello,

Is obtaining metadata about the generated speech supported? I would like to get timestamps for where the pronunciation of each individual word starts and ends. I cannot find anything about it in the documentation.

If not, are there any plans to add support for it?

Thanks!