|
Computational models of how an
infant learns to speak
There has been the general assumption that
infants learn to speak by "imitation", without considering how the ability to imitate could come about. In most
accounts this is assumed to be achieved by a process of acoustic matching, with the infant comparing his output to that
of his caregiver. There are problems with both of these
assumptions. Firstly it assumes that the correspondence
problem between infant and adult speech can be solved by the
infant. Secondly, several developmental observations
cannot be explained by these mechanisms.
I have been working on a computational model – Elija -
incorporating an articulatory synthesiser and a simple speech
recognizer based on DTW, which does not learn to pronounce
speech sounds this way. Instead,
Elija starts by exploring the sound making capabilities of his
vocal apparatus. This process is formulated as an optimization
problem. The objective function receives a positive contribution
from salience of the sensory consequences of action and of its
diversity, and is penalized by the effort involved. This leads
to the unsupervised discovery of speech like sounds. Clustering
is used to extract a small number of distinct sub-actions
(corresponding to Cs and Vs) and their recombination expands
Elija’s sound repertoire. A second stage of
development makes use of interactions with a caregiver, who
listens to the sounds can respond to them. The natural responses
from the caregiver are often “reformulations” – i.e. imitations
made by the caregiver. These responses are used to reinforce
sounds and bias overall production to those that occur in the
ambient language. In addition, the reformulations are used to
learn equivalence relations between his vocal actions and the
corresponding caregiver’s speech, forming the basis for leaning
by imitation. During a final object labelling task, using this
newly established mechanism of imitation, Elija is able to learn
and reproduce some object names spoken by the caregiver.
A Computational Model of Infant Speech
Development
We have shown that Elija
progresses from a babbling stage to learning the names of
objects. The initial publication was from
Specom2007 . The
first published results in a Motor Control paper are available
here. This
research is rapidly progressing and further publications will
appear soon. Some newer results were be presented at
ESSV
2011, 28-30 September in Aachen, Germany.
Modelling motor pattern generation in the
development of infant speech production
Because embodiment plays an important role in
the development of speech, we have also begun preliminary work
to incorporate the role of speech breathing in our computational
model. See the publication
ISSP2008 for
further details. |