Spectral Vocoder

Tim Kleinert

Hi,

I've been reading up on DFT and FFT and pondered how much of it could be applied to the G2. It was obvious pretty quickly that a complete frequency-domain representation of a given audio signal and consecutive processing was totally out of the question. However, by severely limiting the amount of frequency bins, simple stuff could be achieved.

My idea was that a monophonic, harmonic and highly periodic signal can be represented by the sum of it's partials and their magnitudes. So, let's say, if I want to graft the spectral information of a source signal onto this monophonic pitch, I only need to extract from the source signal the magnitudes of the sinusoids that directly correspond to the frequencies of these partials. This limits the required frequency bins down to a very small amount -which the G2 can handle. Smile

The result of that idea is this patch, which I call a spectral vocoder. In some ways it is similar to the "classic" vocoder, in other ways not at all. The classic vocoder is based on analysis and resynthesis bandpass filter banks, which arbitrarily filter and process the source and carrier signals without knowing much about them. But in this case I know that the result shall be a pitched monophonic signal, so therefore I know a) exactly which information I want from the source signal (the magnitudes of the sinusoids that correspond to the partials of my monophonic pitched signal), and b)how to best represent this signal: by the sum and magnitudes of it's partials. In other words: by additive synthesis.

Since the standard computer DSP programming approach to measuring the magnitudes of sinusoids in a source signal is impossible to realise on the G2, I opted for an old "analog" technique, which is to heterodyne the source signal with the sine and cosine of the frequency to be measured, and taking the square root of the quadratures of the results. (This actually is nothing else but transforming rectangular coordinates into polar coordinates, with the polar radius being the magnitude of the sinusoid). Quadratures are easy on the G2, and thanks to a building block by Mike Estlick, square roots are possible too.

Patched in the most economical way possible, the process described above uses close to 25% of a DSP. And since I always wanted to build a patch that completely maxes out an expanded G2 Laughing

, here finally was the opportunity. Smile

But instead of tediously filling up all VA and FX slots with these analysis stages one by one, I used the G2s polyphony feature, each voice conveniently acting as a macro for a single analysis/resynthesis band. These bands are controlled by the FX area via interslot busses. So I get 30 bands in total from an expanded G2, which is as good as it gets. Of course you can use less bands by turning down the voice count (or using an unexpanded G2 Laughing

), but the resulting sound will be increasingly dull.

So, at last, a single monophonic voice that maxes out your expanded G2! Laughing

The controls:
INPUT: Select channel for the source audio
INPUT COMP: Compress the source audio to taste
Resp1 and Resp2: Response characteristics of the analysis stages. Controls the sensitivity, speed and immunity to inharmonic frequency content.
FORMANT: Shift formants up and down (simply by offsetting the analysis and resynthesis frequency bands)
HI THRU Freq/Level: Just like classic vocoders, let's you add some high frequency content of the source signal, especially good for sibilants and "air".
KEY PLAY: Attack and Release times of the keyboard envelope.
PRE-ANALYSIS EQ: Tweak the source audio before it goes to the analysis stage.
POST-RESYNTHESIS EQ: Tweak the resynthesized result.

...and there was DSP power left for a delay module (yay! Laughing

).

There's a short demo mp3 of this patch too, with me talking into the microphone and playing some keys. G2 goes Antares. Laughing

This is probably as close as you can get the G2 into frequency-domain territory. (But never say never Wink

)

cheers,
tim

SpectraVocoderTK.pch2

Description:

Monophonic vocoding by realtime spectral analysis and additive resynthesis. Monophonic monster patch that completely maxes out an expanded G2.

Download (listen)

Filename:

SpectraVocoderTK.pch2

Filesize:

3.08 KB

Downloaded:

2680 Time(s)

SpectraVocoder_demo.mp3

Description:

Demo of the spectal vocoder patch. G2 goes Antares. :o)

Download (listen)

Filename:

SpectraVocoder_demo.mp3

Filesize:

657.18 KB

Downloaded:

2109 Time(s)

jamos · Posted: Wed Feb 03, 2010 9:07 pm Post subject:

Cool, cool. Very Happy

I use on of my G2's primarily for vocal processing, so I'll throw this at my singer and see what he thinks.

Roland Kuit · Posted: Wed Feb 03, 2010 10:05 pm Post subject:

wow....nice 1 tim. thanks for charing

dorremifasol · Posted: Thu Feb 04, 2010 9:34 am Post subject:

Impressive!!!

Wan · Posted: Fri Feb 05, 2010 12:53 am Post subject:

Amazing patch again! Shocked

Tim Kleinert · Posted: Fri Feb 05, 2010 5:35 am Post subject:

Glad you like it. Smile

It was quite an adventure. In the beginning I had a pitch-tracker as well to track the pitch of the incoming signal to extract the partials thereof and port those to the pitch of choice in the additive resynthesis stage. This resulted in the formants being shifted as well, producing the "mickey mouse" effect. However, we all know that the pitch-tracker is so-so, so it caused occasional warbles in the spectral distribution, and anyway -who want's mickey mouse? Wink

.

It was so obvious to extract the sinusoid magnitudes from the input signal directly (without pitch-detection) that I didn't think of it doh

-hence porting to the desired pitch only those magnitudes that correspond to it's partials, and therefore retaining formants. Easier this way, no pitch-track warbles, and no mickey-mouse.

You can hear the high-frequency partials ring a little bit, which makes it sound more artificial than it should. This is due to the square root function being based on feedback, which is noticeable especially in the higher partials which have high frequency and low amplitude. There's no way around this unfortunately. It would sound better otherwise.

cheers,t

EDIT: If you want some fun, try twiddling the partial quantizer module, which defines the partial spacing. You can tweak it eg. to only odd harmonics, sounds pretty weird. Laughing

fairplay · Posted: Fri Feb 05, 2010 5:47 am Post subject:

...that's embarrasing! - how do you make things like this? - all the time?? - again and again???...i feel pretty depressed now {being unable to do things like this myself}...

Wink

...thank you for sharing!...

Tim Kleinert · Posted: Fri Feb 05, 2010 8:05 am Post subject:

Too much spare time I guess Rolling Eyes

...

klangumsetzer · Posted: Fri Feb 05, 2010 12:34 pm Post subject:

3phase · Posted: Tue Feb 09, 2010 4:12 pm Post subject:

blue hell · Posted: Tue Feb 09, 2010 5:15 pm Post subject:

Inventor · Posted: Tue Feb 09, 2010 6:20 pm Post subject:

The main part that I don't understand is how does the patch determine what the fundamental frequency of the voice is? I don't know the G2, so looking at the patch won't help me there. Anyone understand that part of it?

Les

seraph · Posted: Wed Feb 10, 2010 12:53 am Post subject:

Tim Kleinert · Posted: Wed Feb 10, 2010 5:46 am Post subject:

Oh, all these "shocked" emoticons... what did I do. Embarassed

blue hell · Posted: Wed Feb 10, 2010 1:09 pm Post subject:

drapdap · Posted: Sat Feb 13, 2010 10:00 am Post subject:

this one is so much fun, thanks Tim!

i made my engine boot with this one for now... Very Happy

it was just what the doctor ordered, as i added an xlr input recently with a knob, so i can vocode with just a mic and the engine. and a keyboard, erh, you lucky G2 key owners.

thanks again, Tim...

seraph · Posted: Sun Feb 14, 2010 6:05 am Post subject:

Tim Kleinert · Posted: Mon Feb 15, 2010 4:50 pm Post subject:

Thank you Carlo. Seeing/hearing ones patches used for artistic expression is the ultimate compliment.

cheers,
t

seraph · Posted: Tue Feb 16, 2010 1:07 am Post subject:

Inventor · Posted: Wed Feb 17, 2010 2:44 am Post subject:

I played Carlo's song on my radio show and mentioned the g2 patch of tim's after the song. You guys are collaborating so well it makes for great radio!

Les

iPassenger · Posted: Wed Feb 17, 2010 6:35 am Post subject:

seraph · Posted: Wed Feb 17, 2010 6:44 am Post subject:

G2egory · Posted: Wed Feb 17, 2010 5:13 pm Post subject:

The Spectral Vocoder is dynamite. Thanks for sharing the patch.

peterkadar · Posted: Thu Feb 18, 2010 2:30 am Post subject:

wow, this is incredible! Thanks so much for all your hard work, and for sharing it with the rest of us.

Your Kung Fu is strong!!

seraph · Posted: Sun Feb 28, 2010 6:19 am Post subject:

one more example of microtonal spectral vocoding using Tim's patch Very Happy