speechd-discuss
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: How to get rid of lengthy pause at end of sentences for speechSynthe


From: guest271314
Subject: Re: How to get rid of lengthy pause at end of sentences for speechSynthesis.speak()?
Date: Sat, 24 Aug 2024 15:49:20 -0700

I posted the error message. There is no "GenericDelimiters", the
module name apparently needs to be used.

Why would I need a speech synthesis interface when I can already get
the audio directly in the browser?

What I'm actually working on is adjusting the code in piper.conf to
send the raw audio to the currently focused tab when
window.speechSynthesis.speak() is executed in the focused tab.

The I can use Web Audio API to play the s16le as Float32Array's directly.

On Sat, Aug 24, 2024 at 3:29 PM Samuel Thibault
<samuel.thibault@ens-lyon.org> wrote:
>
> guest271314, le sam. 24 août 2024 15:03:26 -0700, a ecrit:
> > That option throws the same error when placed in /modules.piper.conf.
>
> "Works for me".
>
> Again, details matter. Please paste the actual error message and the
> actual configuration files that you have used.
>
> > That would solve the issue because then we would get the output
> > directly from the speech synthesis engine to do with whatever we want,
>
> You're here changing the interface: peeking audio from the synthesis,
> and not running an external pipeline like the generic module interface
> currently does.
>
> Really, it's not a question of putting stuff in wasm, but getting away
> from the generic module interface, which is convenient to get something
> quick&dirty, but by construction brings synchronization.
>
> We could introduce a different kind of generic module, from which we
> peek audio samples. The difficult part is knowing what format is used,
> which'd need looking at metadata, or using explicit configuration in the
> case where some synthesizers would print just raw data.
>
> https://github.com/brailcom/speechd/issues/952
>
> Also, you'd probably like to get the audio into the browser, so the
> browser can play it at will,
>
> https://github.com/brailcom/speechd/issues/348
>
> again, no need for wasm here, it's the interfaces that matter, not the
> way software is written.
>
> Samuel



reply via email to

[Prev in Thread] Current Thread [Next in Thread]