I have been playing around with some custom voices and built-in voices in piper and have found that I would like to have different settings for the length scale and speaking cadence depending on the voice I’m using. Would it be possible to either have configuration options per voice in the add-on configuration (even if only possible in YAML) or be able to specify these specific parameters when calling the tts.speak action with piper as additional options?
Something like this:
action: tts.speak
target:
entity_id: tts.piper
data:
cache: true
media_player_entity_id: media_player.study_speaker
message: I enjoy the sight of humans on their knees... That was a joke.
options:
voice: edi
length_scale: 1.5
speaking_cadence: 0.5
This would allow the use of different settings for each voice simultaneously and make them all clearer and more natural-sounding, especially if you wish to use multiple voices at the same time.