Title Clam for speech
Student Ebrahim "Abe" Kazemzadeh
Mentor Xavier Amatriain
Abstract
This application will detail my interests in the CLAM project and
explain what I can offer the project as a participant in the summer of
code. My main interest is in speech processing and recognition. This
gives me a partial overlap in interests in the audio processing
capabilities of CLAM, as well as other interests that could add to
CLAM's capabilities and user base. Specifically, I am familiar with
many of the DSP techniques used in CLAM, but with more of an emphasis
on speech than music. This slight difference in focus applies both at
the DSP level, as well as the user interface level, since linguists
and speech scientists have different needs from a user interface as
well as the DSP behind it: at the DSP level, there are warpings to the
spectrum to simulate the auditory system, and the LPC model is based
on an articulatory model, for example, and at the user interface
level, there are certain displays that linguistics prefer, like black
and white spectrograms and multi-tiered transcriptions, to name a few.
Given my background in linguistics (BA), computer science (MS, working
on PhD), and EE/signal processing (2 speech processing courses), the
project idea that I am most interested in is the vowel synthesizer .
Below in the detailed description I will explain my past experience
building a LPC/source-filter based vowel synthesizer in Matlab for my
speech processing class. I'll also explain other ideas I have, both
to further develop this project, and other ideas I have for
incorporating speech recognition abilities into CLAM, such as singing
synthesis, . I will also describe facilities available to me at my
university that may be helpful.