Category Archives: Development

Adding in Phase

I’ve already got data on the phase of harmonics in the synSinger database, so I figured that I’d apply it to the new sine() synthesis code and see how much difference it made. Here’s a screenshot from Praat showing the … Continue reading

Posted in Development | Tagged , , | Leave a comment

More Vocoding

I haven’t had as much time to work on synSinger as I’d like, but I’ve continued trying to get the bugs out of the vocoder. To help debugging, I created a version of the vocoder code that constructed a spectral … Continue reading

Posted in Development | 5 Comments

Demo

Here’s a demo of synSinger with the old standard Twinkle, Twinkle, Little Star: This version uses sampled consonants. The are a number of problems with this, but it does show off the clarity of the new spectral synthesis rendering engine.

Posted in Development | Tagged , | Leave a comment

Synthesizing Frication

synSinger has been using sampled phonemes for voiced and unvoiced fricatives such as /V/ and /SH/. This works well, but I’ve been looking into synthesizing the consonants so they are better integrated into the rendering framework. Frication noise is made … Continue reading

Posted in Development | Tagged , , , | Leave a comment

Identifying Formants

Praat generally does an excellent job of identifying formants, in contrast to my own feeble attempts. Nevertheless, I continue to get sidetracked on the task of correctly identifying formants. My current method is fairly simple: I identify all the peaks … Continue reading

Posted in Development | Tagged , | Leave a comment

Interpolating Peaks

I’ve added some simple logic that checks to see if a peak can be interpolated by looking at the intersection of the slopes between points. Here’s an example of it in action: The results look very good – peaks move … Continue reading

Posted in Development | Tagged , , | Leave a comment

New Rendering Logic

The core of the new rendering engine is an inverse DFT (Discrete Fourier Transform). Each voiced phoneme target is described as a piecemeal spectral envelope, which when given a frequency, returns the amplitude and phase of that frequency. Rendering is … Continue reading

Posted in Development | Tagged , , , , , | Leave a comment