Compensatory coarticulation, /u/-fronting, and sound change in Standard Southern British: an acoustic and perceptual study.*
Jonathan Harrington, Felicitas Kleber, Ulrich Reubold
*
submitted Journal of the Acoustical Society of AmericaGeneral aim of this paper
To establish to what extent a sound-change in
progress, /u/-fronting in Standard Southern British (SSB) – can be linked to diminished perceptual
compensation for coarticulation in Ohala's (1993) model of sound change.
Background
A. Perceptual compensation for coarticulation B. Ohala's model of sound change
C. /u/-fronting in SSB
A. What is perceptual compensation for coarticulation?
si
su
Frequency of fricative noise
ACOUSTICS
1. Anticipatory coarticulatory lip-rounding causes spectral centre of gravity lowering in /s/
PERCEPTION
Listeners reverse the effects of
coarticulation
si su
2. Listeners know this and reverse its effect (=
compensation for coarticulation*)
*e.g. Fujisaki & Kunisaki, 1977; Mann & Repp, 1980
;
If you synthesise a continuum from /s/ to // that can be
done by spectral COG lowering and prepend it to a vowel, then listeners are more likely to perceive the SAME
synthetic token as /s/ before /u/ than before /a/
Perceptual compensation for coarticulation
s
F re q ue nc y o f no is e
+ /u/ + /a/
Listener compensates for
coarticulation (= factors out COG lowering assumed to be
attributable to /u/):
s s s s
s s
s s
s
B. Cental ideas in Ohala's model of sound change
Ohala: "Today's variability is tomorrow's sound change"
The origin of many sound changes is not always in the mouth of the speaker, but in the ear of the listener
Contra sociolinguists: sound change is not teleological = it's not done on purpose or for any reason, it happens by
accident because of an unintended error on the part of the listener (and also for this reason, the origin of sound change is not cognitive nor phonological)
Hypoarticulation-induced sound change: one that
arises out of the natural processes of coarticulation
and in which the listener fails to compensate for
coarticulation…
(the listener thinks: "the speaker meant to say /ci/")
Hypoarticulation-induced sound change in Ohala
Listener as speaker Speaker Listener
/c/ has been phonologised because it is planned,
produced and perceived,
even in contexts that can't be explained by coarticulation plans /ki/
[ci]
produces
acoustic [ci]
signal
compensates for coarticulation
reconstructs /ki/
Sound change:
/k/ -> /c/
plans /ci, cu/
[ci, cu]
reconstructs /ci/
C. /u/-fronting in Standard Southern British
Extensive auditory and some acoustic evidence that SSB /u/ has fronted in the last 50 years e.g., Gimson, 1966, Wells 1982,
Henton 1983, Deterding 1997, Hawkins & Midgley 2005, Roach, 1997).
([i] ‘heed’, [] ‘hoard’, [u], ‘who’d’)
/u/-fronting and possible chain-shifting in the Queen's Christmas broadcasts (Harrington, 2008)*
Harrington (2008), Laboratory Phonology IX, in press
This could be just such an example of a hypoarticulation- induced sound change. Why?
1. Taking into account word-frequency, /u/ frequently (p
≈ 0.7) occurs in a coarticulatory fronting context (e.g.
'you', 'too', 'lose', 'do', 'new').
j
u
F2-locus
target distance plans
/ju/ [j ]
produces
F2
Time
Speaker
2. In an analysis of the Christmas broadcasts at 20 year intervals, the Queen, Harrington (2008)* shows just such a reduction of F2 locus-target distance:
Time F2
locus-target distance
locus-target distance
Decade
* Harrington (2008), Laboratory Phonology 9 in press
Our extension of Ohala's model to these data and age-differences in SSB speakers is as follows:
/u/-fronting and speech perception
Acoustic input: [sn]
compensate for coarticulation
Perceived as: /sun/ /sn/
OLD listeners YOUNG listeners
/u/-fronting, speech perception and production
PRODUCTION PERCEPTION
/fj d/
/fd/
[f d]
sound change Front
Back
Young
Old
For the Old, the allophones diverge in production, but not in perception (this is the trigger for sound change)
For the Young, the allophones are aligned in perception and production and NO compensation for coarticulation
[fjd]
[fud]
/fjud/
/fud/
compensate for coarticulation
(food)
(feud)
[fjd]
[fud]
PRODUCTION PERCEPTION
/fjud/
/fud/
/fjd/
/fd/
[fd]
sound change compensate for coarticulation Front
Back
Young
Old
Predictions about age differences
1. (trivially) /u/ vowels are fronted for the Young.
2. C-on-/u/ coarticulation is greater in the Old (their /u/ allophones show greater divergence).
Production
3. Young and Old differ primarily on the back allophones (if sound change involves a shift of these to the front).
Perception
4. The Old but not the Young compensate perceptually
for coarticulation.
Experimental analysis I:
Production
(Predictions 1 -3)
30 Standard Southern British speakers recruited through University of Cambridge and University College London.
YOUNG: 14 subjects aged 18-20 (11 F, 3 M) OLD: 13 subjects aged 53-88 (7 F, 9 M)
Method: Speakers
Subjects were carefully checked to ensure that they were SSB speakers.
only 1 subject took part in the production study only
Materials
Isolated word production of words each produced 10 times
/u/
C(C) Word
j used 269
fj feud 266
hj hewed 266 kj queued 270
f food 269
s soup 270
k cooed 264
h who'd 270
sw swoop 268
2412
/i/
C(C) Word j yeast 269 f feed 267 h heed 270 k keyed 270 s seep 269 sw sweep 269
1614
//
C(C) Word h hard 270
Recordings made in U.K. with SpeechRecorder
(Draxler & Jänsch, 2004)
Acoustic parameters
Formants calculated and F2 was checked and manually corrected.
Each F2-trajectory was reduced to a single point in a three- dimensional space formed from the first three coefficients of the discrete-cosine-transformation (Watson & Harrington, 1999; Harrington, 2006; Zahorian, and Jagharghi, 1993) We did this because we wanted to assess vowel fronting in the entire F2-trajectory (from onset to offset) rather than just at the vowel's temporal midpoint (which encodes no dynamic information).
Also with the DCT, we avoid having to make an often arbitrary
decision about the location of the vowel target.
Discrete-cosine-transformation
decomposes any signal into a set of ½ cycle cosine waves which, if summed, reconstruct entirely the original signal.
The amplitudes of these cosine waves are the DCT coefficients. Moreover, the cosine waves at the
lowest frequencies encode important properties of the trajectory's shape…
…at frequency (rad/sample)…
…is proportional to the trajectory's:
DCT-coeff
DCT-0 average
mean linear slope curvature
0
DCT-1 1
DCT-2 2
so you can use this technique to smooth formants…
620 660 700 740
140016001800
Raw
Time (ms)
F2 (Hz)
620 660 700 740
140016001800
DCT-smoothed
Time (ms)
A n a l y s i s
DCT coeffs
But the important point for this paper is that each F2
trajectory is reduced to a single point in a 3D-space which encodes a smoothed trajectory, like the one on the right .
F2 (Hz)
620 660 700 740
140016001800
Raw
Time (ms)
F2 (Hz)
620 660 700 740
140016001800
DCT-smoothed
Time (ms)
S y n t h e s i s
F2 (Hz)
For each speaker separately, we quantified the extent of /u/-fronting in this DCT space by
calculating each [u] token's relative distance to the front and back /i/ and // vowel centroids
Quantification of /u/-fronting
d
u= log(E
1/E
2) = log(E
1) – log(E
2) Log Euclidean distance ratio
d
u= 0, [u] equidistant bet. /i/ and //
d
u< 0, [u] nearer // (back) d
u> 0, [u] nearer /i/ (front)
So if following hypothesis 1 /u/ is phonetically
more back in the Old, then d
ushould be lower for
the Old compared with the Young
2. Quantification of C-on-/u/ perseverative coarticulation
We measured separately for each speaker the Inter- Euclidean distance in the DCT space between 'swoop' and 'used'
swoop ju
ju
ju juju
ju ju
ju ju
ju used
wu
wu wu wu wu wuwuwuwu
wu
DCT-0
DCT-1
swoop tokens to used centroid used tokens to swoop centroid