The core technology is still under development, so recognition results
from tests with our new system remain preliminary.
We have nevertheless been publishing our findings as we go:
MJ Russell, X Zheng, PJB Jackson
(2007).
"Modelling speech signals using formant frequencies as an intermediate representation".
IET Signal Processing,
1 (1):
43-50.
MJ Russell, PJB Jackson
(2005).
"A multiple-level linear/linear segmental HMM with a
formant-based intermediate layer".
Computer Speech and Language,
19 (2):
205-225.
MJ Russell, PJB Jackson
(2004).
"Regularized re-estimation of stochastic duration models".
In J. Acoust. Soc. Am.,
115 (5, Pt. 2): 2429 A,
New York, New York, USA.
MJ Russell, PJB Jackson
(2003).
"The effect of an intermediate articulatory layer on the performance
of a segmental HMM".
In Proc. Eurospeech 2003,
2737-2740,
Geneva.
PJB Jackson
(2003).
"Improvements in phone-classification accuracy from modelling
duration".
In Proc. Int. Cong. of Phon. Sci., ICPhS 2003,
1349-1352,
Barcelona.
MJ Russell, PJB Jackson,
MLP Wong
(2003).
"Development of articulatory-based multi-level segmental HMMs for phonetic
classification in ASR".
In Proc. EURASIP Conf. on Video/Image Proc. & Multimedia Comm.,
EC-VIP-MC 2003,
2: 655-660,
Zagreb, Croatia.
PJB Jackson, MJ Russell
(2002).
"Models of speech dynamics in a segmental-HMM recognizer
using intermediate linear representations".
In Proc. Int. Conf. on Spoken Lang. Proc., ICSLP
2002,
1253-1256,
Denver, Colorado, USA.
N Wilkinson, MJ Russell
(2002).
"Improved phone recognition on TIMIT using formant frequency data
and confidence measures".
In Proc. Int. Conf. on Spoken Lang. Proc., ICSLP
2002,
2121-2124,
Denver, Colorado, USA.
PJB Jackson, B-H Lo,
MJ Russell
(2002).
"Models of speech dynamics for ASR, using intermediate linear
representations".
Presented at NATO Advanced Study Institute on the
Dynamics of Speech Production and Perception,
Il Ciocco, Italy.
PJB Jackson, B-H Lo,
MJ Russell
(2002).
"Data-driven, non-linear, formant-to-acoustic mapping for
ASR".
Electronics Letters,
38 (13): 667-669.
N Wilkinson, MJ Russell
(2001).
"Progress towards improved speech modelling using asynchronous
sub-bands and formant frequencies".
In Proc. Inst. Acoust., WISP 2001,
23 (3): 27-36,
Stratford-upon-Avon, UK.
PJB Jackson
(2001).
"Acoustic cues of voiced and voiceless plosives
for determining place of articulation".
In Proc. Workshop on
Consistent and Reliable Acoustic Cues
for sound analysis, CRAC 2001,
19-22,
Aalborg, Denmark.