speechd-discuss
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: What is the best way to control espeak?


From: Samuel Thibault
Subject: Re: What is the best way to control espeak?
Date: Sat, 11 Jun 2022 22:10:48 +0200
User-agent: NeoMutt/20170609 (1.8.3)

Hello,

Yes, ssml is definitely the way to go, you can use espeak-ng -m -w
to directly what you want, or you can pass ssml to speech-dispatcher,
and that'll be passed on to espeak-ng.

Samuel

Jeremy Whiting, le ven. 10 juin 2022 15:08:13 -0600, a ecrit:
> Hello Rastislav,
> 
> From looking at espeak --help here I think I can answer some of this.
> 
> 1) To output to a wav file instead of speaking on audio device use the -w
> filename parameter.
> 2) To have espeak interpret SSML, use the -m argument to parse SSML (makes it
> ignore <> characters also though, so those would need to be escaped if by
> themselves in your text.)
> 3) For sound effects I'm not sure if espeak supports sound icons, but if it
> does those could be used for sound effects.
> 
> SSML can also be used for manipulating the voice parameters throughout the 
> text
> with tags that specify how the voice should change or even to change voices
> completely as far as I understand it.
> 
> BR,
> Jeremy Whiting
> 
> On Fri, Jun 10, 2022 at 2:23 PM Rastislav Kish <[1]
> rastislav.kish@protonmail.com> wrote:
> 
>     Hello list,
> 
>     this question would likely be better suited for the espeak mailing list,
>     though I'm not subscribed there and it's partially also related to
>     Speech dispatcher, so I'm posting it here.
> 
> 
>     When synthesizing text, I would like espeak-ng to do these things:
> 
>     * Change pitch in specified areas of the text
> 
>     * Change the voice variant in specified areas of the text, preferably
>     without changing the on-going intonation, though I suppose it's not an
>     absolute necessity
> 
>     * Mix in some kind of sound effect. Note that I don't mean specific
>     sounds to be played, but rather generation of various noises, brummings,
>     tones etc. in the exact time the marked text is being spoken in the audio.
> 
> 
>     It would be best if it was possible to do these things simply by some
>     markup, for example the SSML speechd modules use, though especially in
>     case of the last mentioned, that would likely be difficult to do in this
>     way, so I could do it programmatically as far as espeak could somehow
>     tell me the exact time marks an area of text is spoken, so I could
>     include the effect there.
> 
>     And of course, as far as it could give me the audio. :)
> 
> 
>     Thank you in advance!
> 
> 
>     Best regards
> 
> 
>     Rastislav
> 
> 
> 
> 
> 
> 
> References:
> 
> [1] mailto:rastislav.kish@protonmail.com

-- 
Samuel
---
Pour une évaluation indépendante, transparente et rigoureuse !
Je soutiens la Commission d'Évaluation de l'Inria.



reply via email to

[Prev in Thread] Current Thread [Next in Thread]