|
BALTHASAR:
|
An integrated multiple-level statistical model for speech pattern processing |
|
|
People
Martin Russell Philip Jackson Boon-Hooi Lo Nick Wilkinson Michael Wong
Related research
|
EPSRC grant reference: GR/M87146The name BALTHASAR stands for the Birmingham Articulatory-layered Linear-Trajectory HMM-based Automatic Speech Recognizer. It is a three year, EPSRC-funded research project which was started in February 2000, and has produced a number of conference and journal publications that are available below. OverviewThe goal is to develop a rigorous theory of multiple-level statistical modelling of acoustic speech patterns. Within this framework, the relationship between the symbolic (state-level) and surface-acoustic representations of speech is regulated by an intermediate representation which is able to capture the inherent constraints of the speech production process. The intermediate representation is referred to as the articulatory, or pseudo-articulatory, layer. Ideally, this intermediate representation will also provide a low-dimensional characterisation of speaker properties, which offers the potential for rapid speaker adaptation. This characterisation may be explicit, as in the case of an actual articulatory model, or implicit. The research will involve the derivation of a suitable mathematical framework, extension of the relevant training and recognition algorithms, and evaluation of the resulting models through off-line experimentation on an international-standard speech corpus. Related publicationsWhile the core technology remains under development, recognition results from tests with our new system are still preliminary. However, we have been publishing our findings as we go:
|
|
|
[
Group |
Research |
School |
Univ
]
© 2002-5, maintained by Philip Jackson, last updated on 10 Feb 2005. |
School |