Building a Vocal Synthesizer in Rust

I'm writing a vocal synthesizer in Rust!

Some initial scaffolding:

All it has is a very simple WAV writer for mono 44.1 kHz 16-bit audio. Offline rendering to disk is very helpful for prototyping and debugging.
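A minimal sketch of what such a writer can look like, using only the standard library. This is not the code from the project; the names `write_wav_mono16` and `to_i16` are hypothetical, and the layout is the plain 44-byte PCM header.

```rust
use std::io::{self, Write};

const SAMPLE_RATE: u32 = 44_100;

/// Writes mono 16-bit PCM samples as a complete WAV stream to any writer.
fn write_wav_mono16<W: Write>(out: &mut W, samples: &[i16]) -> io::Result<()> {
    let channels: u16 = 1;
    let bits_per_sample: u16 = 16;
    let block_align: u16 = channels * bits_per_sample / 8; // bytes per frame
    let byte_rate: u32 = SAMPLE_RATE * u32::from(block_align);
    let data_len: u32 = (samples.len() * 2) as u32; // 2 bytes per sample

    // RIFF container header
    out.write_all(b"RIFF")?;
    out.write_all(&(36 + data_len).to_le_bytes())?;
    out.write_all(b"WAVE")?;

    // Format chunk: 16 bytes, format tag 1 = uncompressed PCM
    out.write_all(b"fmt ")?;
    out.write_all(&16u32.to_le_bytes())?;
    out.write_all(&1u16.to_le_bytes())?;
    out.write_all(&channels.to_le_bytes())?;
    out.write_all(&SAMPLE_RATE.to_le_bytes())?;
    out.write_all(&byte_rate.to_le_bytes())?;
    out.write_all(&block_align.to_le_bytes())?;
    out.write_all(&bits_per_sample.to_le_bytes())?;

    // Data chunk: little-endian samples
    out.write_all(b"data")?;
    out.write_all(&data_len.to_le_bytes())?;
    for s in samples {
        out.write_all(&s.to_le_bytes())?;
    }
    Ok(())
}

/// Converts a float sample to 16-bit PCM, clamping to [-1, 1] first.
fn to_i16(x: f32) -> i16 {
    (x.clamp(-1.0, 1.0) * 32767.0) as i16
}
```

To render to disk, the writer can be a `std::io::BufWriter` wrapped around a `std::fs::File`; because the function accepts any `Write`, a `Vec<u8>` works too, which is handy for tests.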

29 May at 14:56 | Open on merveilles.town

19 comments

patchlore

Next up: porting a glottal source model. The one I use is based on the LF model with pulsed glottal noise for breathiness.

Original C code here:

https://git.sr.ht/~pbatch/mnodes/tree/master/item/glot/glot.c

30 May at 1:25 | Open on merveilles.town

patchlore

The glottal component of my singing synthesizer has now been ported from C to Rust [0]. Rust likes to use x.exp() instead of exp(x) for math funcs, so that tripped me up a few times.

The sound is pretty unremarkable [1], and will need a filter (tract) component to give it vowels.

0: https://github.com/PaulBatchelor/voxbox/blob/main/src/glot.rs

1: https://github.com/PaulBatchelor/voxbox/blob/main/examples/glot_simple.rs
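For readers coming from C, a small illustration of the method-call math style mentioned in this post. The function names are hypothetical and are not taken from glot.rs. One detail worth knowing is that a method call binds tighter than unary minus, so parentheses matter.

```rust
/// Exponential decay e^(-t / tau).
/// In C this would read `expf(-t / tau)`; in Rust, `exp` is a method on the float.
fn decay(t: f32, tau: f32) -> f32 {
    (-t / tau).exp()
}

/// Shows how unary minus interacts with method calls.
fn precedence_demo() -> (f32, f32) {
    let x: f32 = 2.0;
    let a = -x.abs(); // parsed as -(x.abs()), which is -2.0
    let b = (-x).abs(); // negate first, then abs, which is 2.0
    (a, b)
}
```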

30 May at 18:15 | Open on merveilles.town

patchlore

I have ported my tract filter to Rust now [0]. I couldn't resist garnishing my "simple" example a little bit [1] to make it more musical and sing-y.

In addition to the bare minimum tract filter processing the glottal source, I've also added some controls over vibrato, amplitude, and tract shape (for vowel morphing). I tuned the tract shapes by ear using the distinct region model.

0: https://github.com/PaulBatchelor/voxbox/blob/main/src/tract.rs

1: https://github.com/PaulBatchelor/voxbox/blob/main/examples/tract_simple.rs
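As a rough illustration of two of the controls described in this post, here is a sketch of vowel morphing as a linear blend between two tract shapes, plus a sinusoidal vibrato on pitch. Both functions and their parameters are hypothetical and are not the voxbox API; the shape is assumed to be a plain slice of per-region values.

```rust
use std::f32::consts::TAU;

/// Linearly interpolates between two tract shapes, region by region.
/// `t` = 0.0 returns shape `a`, `t` = 1.0 returns shape `b`.
fn morph_shape(a: &[f32], b: &[f32], t: f32) -> Vec<f32> {
    assert_eq!(a.len(), b.len(), "shapes must have the same number of regions");
    a.iter().zip(b).map(|(x, y)| x + (y - x) * t).collect()
}

/// Pitch in Hz with sinusoidal vibrato applied.
/// `depth_semitones` is the peak deviation; `rate_hz` is the vibrato speed.
fn vibrato_freq(base_hz: f32, depth_semitones: f32, rate_hz: f32, time_s: f32) -> f32 {
    let lfo = (TAU * rate_hz * time_s).sin();
    // Scale in semitones so the deviation is musically symmetric around the base pitch.
    base_hz * 2.0_f32.powf(depth_semitones * lfo / 12.0)
}
```

Sweeping `t` slowly from 0.0 to 1.0 while rendering gives a glide from one vowel to the other, assuming both shapes were tuned for the same tract model.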

1 June at 13:42 | Open on merveilles.town

patchlore

dev logs I made while coding up the implementation: https://pbat.ch/recurse/tasks/implement_tract/

1 June at 13:45 | Open on merveilles.town

patchlore

An initial interactive demo inside the browser! Hit begin, then use the sliders to control glottal and tract params:

https://pbat.ch/recurse/demos/singer_test/

2 June at 22:22 | Open on merveilles.town

MarcatoMarc

@patchlore great start! How does this run in the browser? Webasm?

2 June at 22:37 | Open on merveilles.town

patchlore

There has been a game jam this week, so it's distracted me from some of this vocal synth work.

Velum support comes next, which is what is needed to get nasal sounds.

Devlogs so far: https://pbat.ch/recurse/tasks/implement_velum/

#rust #dsp #vocalsynth

6 June at 14:15 | Open on merveilles.town

th4

@patchlore it is absolutely fascinating seeing you implement this.
Do you have any pointers for someone who would like to learn more about the theory? (I reckon it's your domain of specialty, right?)

6 June at 15:08 | Open on post.lurk.org

patchlore

Porting the nasal sounds has been a bit of a bumpy ride.

I have introduced a NaN somewhere. This kills the DSP. Now I need to track down where it has been introduced.

My approach has been to use panic and counters to iteratively bisect and find the earliest instance of a NaN. It's slow and tedious work.
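The panic-and-counter idea could be sketched roughly as below. This is an assumed reconstruction, not the code actually used in the hunt: `check_nan`, `first_nan`, and `render_checked` are hypothetical helpers, and the "source" and "filter" stages are stand-ins for real DSP.

```rust
/// Panics with a stage label and sample index as soon as a NaN shows up;
/// otherwise passes the value through unchanged.
fn check_nan(label: &str, sample_index: usize, x: f32) -> f32 {
    if x.is_nan() {
        panic!("NaN at stage '{}', sample {}", label, sample_index);
    }
    x
}

/// Non-panicking variant: index of the first NaN in a rendered buffer, if any.
fn first_nan(buf: &[f32]) -> Option<usize> {
    buf.iter().position(|x| x.is_nan())
}

/// Toy render loop with a check after each stage.
fn render_checked(n: usize) -> Vec<f32> {
    (0..n)
        .map(|i| {
            let src = check_nan("source", i, (i as f32 * 0.01).sin());
            check_nan("filter", i, src * 0.5)
        })
        .collect()
}
```

Moving the checks earlier in the signal chain after each panic narrows down which stage first produces the NaN, and the sample index tells you how far into the render it happens.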

The NaNhunt will resume tomorrow.

7 June at 21:33 | Open on merveilles.town

crop

@patchlore
Your problem made me search for "better" floats. ... maybe this crate would help in your case: https://lib.rs/crates/typed_floats

8 June at 16:17 | Open on hachyderm.io

patchlore

@crop nice! this looks very helpful indeed. I will check it out!

8 June at 17:15 | Open on merveilles.town

patchlore

With some trial and error, I managed to get throat singing working!

#vocalsynthesis #dsp #rust

9 June at 18:36 | Open on merveilles.town

Adrian Cochrane

@patchlore Sounds beautiful! With lots of promise!

9 June at 18:43 | Open on floss.social

MarcatoMarc

@patchlore Awesome!

9 June at 19:11 | Open on merveilles.town

patchlore

I made an oopsie and now the particular shapes I sculpted for this demo don't work anymore. Had to sculpt some new ones. They are okay enough, though not as loud and pronounced as this one.

9 June at 20:53 | Open on merveilles.town

patchlore

Anyways, nasal/velum control seems to work I think. It's now a slider on the demo page. I've also turned on 2x oversampling, which will hopefully smooth things out a bit:

https://pbat.ch/recurse/demos/singer_test/
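For context, a deliberately crude sketch of what 2x oversampling means structurally: the inner process runs twice per output sample and the two results are reduced back to one. The function `oversample_2x` is hypothetical, and how the demo actually does its resampling is not stated in this thread. The sketch assumes a zero-order hold going up and a plain average coming down, where a real implementation would likely use better filters.

```rust
/// Runs `process` twice for each output sample and averages the results.
/// Assumes `process` has been configured for twice the output sample rate.
fn oversample_2x<F: FnMut(f32) -> f32>(input: f32, mut process: F) -> f32 {
    let a = process(input); // first half-step (input held)
    let b = process(input); // second half-step (input held)
    0.5 * (a + b) // crude lowpass and decimation back to the output rate
}
```

Passing `&mut closure` instead of the closure itself lets the same stateful process be reused from one output sample to the next.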

9 June at 20:56 | Open on merveilles.town

Reilly Spitzfaden (they/them) replied to patchlore

@patchlore this project is super cool! My only experience with DSP/synths is in C++/JUCE, but I've been wanting to learn how to do it in Rust. I'll have to give this project a look

9 June at 21:04 | Open on hachyderm.io

patchlore replied to Reilly Spitzfaden (they/them)

@reillypascal still learning things. the actual DSP programming feels pretty similar to how I do it in C, at least with my current coding style. I'm sure I'll get rustier as I go. Rust is more strict about number types (float vs int), and has some funny notation for math functions (x.sin() instead of sin(x)). There is a huge performance difference between debug and release builds.
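A small sketch of the number-type strictness mentioned here, for anyone coming from C or C++. The function `sine_buffer` is a hypothetical example; the point is that integer-to-float conversions have to be written out with `as`.

```rust
use std::f32::consts::TAU;

/// Fills a buffer with a sine wave; every int/float conversion is explicit.
fn sine_buffer(freq_hz: f32, sample_rate: u32, len: usize) -> Vec<f32> {
    (0..len)
        .map(|n| {
            // `n` is a usize and `sample_rate` a u32; neither converts to f32 implicitly,
            // so `n / sample_rate` or `freq_hz * n` would not compile.
            let t = n as f32 / sample_rate as f32;
            (TAU * freq_hz * t).sin()
        })
        .collect()
}
```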

9 June at 21:09 | Open on merveilles.town