• Keine Ergebnisse gefunden

Articulatory question: Is production similar?

Im Dokument Human Language: (Seite 68-77)

Exploring Baboon Vocalizations with Speech Science Techniques

5. Articulatory question: Is production similar?

The spectral and temporal similarities that we established in the lower range of the spectrum (below 1.5 kHz), did not take into account the P2 peak of {a} around 2.3 kHz (Table 1). To explain the correspondence of this peak with the second formant in [a] we have to invoke an acoustic production model (Boë et al., 2013). This model lets us determine the maximal acoustic space within which the first two formants of all vowels occur for a vocal tract of the given length (Boë et al., 2017). We have estimated 13.4 cm as the length of the vocal tract of a male baboon. Figure 3 shows (in grey) the maximum acoustic space (MAS), which is the area enclosing all paired (F1, F2) values that could theoretically be produced by such a vocal tract, assuming total control of its shape and configurations (see Boë et al., 2017, for discussion). The spectral peaks P1 and P2 of {w}, {a}, and {hoo}, pro-vided in Table 1 and interpreted as formants, are shown as they would be located inside this maximal acoustic space.

Figure 3: F1 and F2 for [w], [a], and [u], as shown in the maximum acoustic space of a 13.4 cm vocal tract, corresponding to that of an adult male baboon.

We observed that, in agreement with our production hypothesis, {a} uttered by baboons would correspond to an open front vowel. The {hoo} is located where we would expect the mid- high back rounded [o]. The fact that it does not seem to correspond really to the high back rounded [u] might be due to the fact that lips are not sufficiently closed and protruded. For {w}, the first peak, P1, is in continuity with P1 of {a} whereas the continuity between P2 of {w} and P2 of {a} is absent. Though this means we do not have an actual trajectory for [wa.u] in F1-F2 formant space, we have indicated an approximate trajectory schematically by arrows (Figure 3).

Figure 4: Distances a and b measure the displacement of the thoracic cage;

distance c measures lip opening; distance d is from the center of the ear canal to the distal extremity of the upper lip and quantifies lip protrusion (images 6 and 15 of the video).

Figure 5: Articulatory analysis of the sample of “wahoo” extracted from the video. Synchronization of sound and image with spectrum (Praat, 50 ms analysis window), lip opening, rib cage variations, and lip protrusion (fitted with a 3rd degree polynomial curve).

We also directly studied the wahoo production on the video clip. We manu-ally estimated and reported three types of measurements from the video, image by image: (a) lip opening, (b) movement of the rib cage, and (c) lip protrusion (Figure 4). These measurements are synchronized with the sound, so as to allow analysis of the production sequence. From anatomical data (Boë et al., 2017), knowing that distance between the center of the ear canal and upper lip in the rest position is about 17 cm, we estimated the conversion factor at 5 pixels/cm. The lip opening gesture takes about 180 ms (5.5 images) and includes {w} and the beginning of {a}, with lips plateauing until its end (image 12), when the lips close to the beginning of the {hoo} (image 15). Ingressive airflow is shown by an increase of thoracic volume synchronized with {wa} and a plateau that ends at the beginning of {hoo} (image 12) together with the closing of the lips. The lip protru-sion measurement is highly variable or noisy, with a trajectory fit (using a 3rd degree polynomial curve) which reaches its maximum when the lips close (image 15).

For the 3 types of measurement we can therefore estimate, relative to the rest position, a range of variation through 1 cm for lip protrusion, 8 cm for lip opening, and 1 cm for the rib cage movement (Figure 5). The {w}

lip opening gesture is accompanied by a rise in F0 as well as a rise in the first formant. This effect is well- known for the [w] (Potter et al., 1947) and for stop consonants in general (Wang and Fillmore, 1961), and reflects the strong coupling between source and vocal tract. The fundamental frequency is correlated with the airflow at the glottis and the transition of the first formant corresponds to the opening of the vocal tract. The measurement of the protrusion supports the hypothesis of lip protrusion for {wa}. This protrusion also seems to involve a forward projection of all the tissue of the muzzle.

Overall, except for the difficulty of showing a clear F2 transition for {w}, our hypothesis of similar production mechanisms for baboon vocalizations and human speech is supported by our observations, with the caveat of course that pulmonic ingression is only paralinguistic for human speech.

6. Conclusions

There is an increasing interest from various research communities in animal vocalizations that are used for communication purposes. This is particularly true for researchers in primatology, due partly to increased interest in lan-guage evolution. We have here compared an understudied baboon vocaliza-tion, the “wahoo”, and its onomatopoetic name from human speech. We have used standard methods, commonplace in speech research, to analyze and compare certain aspects of baboon productions to similar processes in human speech. In particular, we used acoustic analysis of F0 and of spectral characteristics of the baboon wahoo to understand how it is likely perceived by humans, and we showed that those acoustic traits indeed provide support to its onomatopoetic name. We also used a video of a baboon producing a wahoo to extract quantitative articulatory data allow-ing us to understand several interestallow-ing aspects of the baboon’s production mechanism, and we showed that many of them are quite similar to human speech production mechanisms. We also verified that ingressive vocaliza-tion, which is found paralinguistically but is unusual in human speech, is common in the baboon wahoo. We thus transcribe baboon wahoo as [wa↓.

u↑] with the ↓ down arrow indicating an ingressive initial syllable, and ↑ for the egressive second syllable.

We believe we have demonstrated that standard phonetic and acoustic methods developed for speech can be profitably used for the analysis of vocalizations in non- human animals, and we recommend further explora-tory efforts in the same vein.

Acknowledgements

The authors are grateful to: Tom Larimer for the YouTube baboon video;

Thierry Legou, Arnaud Rey, Caralyn Kemp, Yannick Becker for recording and labeling the vocalizations of Papio papio baboons; Guillaume Captier for the anatomical study; and Pierre Badin, Pascal Perrier, and Jean- Luc Schwartz for helpful discussions. This research was funded partly by ANR SkullSpeech grant ANR-13-TECS-0011 “e- SwallHome- Swallowing & Res-piration: Modelling & e- Health at Home” in the “Technologies pour la Santé” program.

Research supported by grants 16-CONV-0002 (ILCB), ANR-11-LABX-0036 (BLRI) and ANR-11-IDEX-0001–02 (A*MIDEX).

Technical support from the staff of the Rousset- sur- Arc primate center is acknowledged.

7. References

Boë, L.J., Badin, P., Ménard, L., Captier, G., Davis, B., MacNeilage, P., Sawallis, T.R., and Schwartz, J.L. (2013). Anatomy and control of the developing human vocal tract: A response to Lieberman. Journal of Pho-netics, 41, 379–392.

Boë, L.J., Berthommier, F., Legou, T., Captier, G., Kemp, C., Sawallis, T.R., Becker, Y., Rey, A., and Fagot, J. (2017). Evidence of a vocalic proto- system in the baboon (Papio papio) suggests pre- hominin speech precur-sors. PLoS One, 12, e0169321.

Boersma, P., and Weenink, D. (2014). Praat: doing phonetics by computer [Computer program]. Version 5.3.63, retrieved 24 January 2014 from http://www.praat.org/

Byrne, R.W. (1981). Distance vocalisations of guinea baboons (Papio papio) in Senegal: an analysis of function. Behaviour, 78, 283–312.

Calliope (1989). La parole et son traitement. Paris: Masson.

Cheney, D.L., Seyfarth, R.M., and Palombit, R. (1996). The function and mechanisms underlying baboon ‘contact’ barks. Animal Behavior, 52, 507–518.

Cheney, D.L., Seyfarth, R.M., and Silk, J.B. (1995). Responses of female baboons (Papio cynocephalus ursinus) to anomalous social interactions:

evidence for causal reasoning? Journal of Comparative Psychology, 109, 134–141.

Delattre, P., Liberman, A.M., Cooper, F.S., and Gerstman, L.J. (1952). An experimental study of the acoustic determinants of vowel color; observa-tions on one- and two- formant vowels synthesized from spectrographic patterns. Word, 8, 195–210.

Delattre, P. (1968). From acoustic cues to distinctive features. Phonetica, 18, 198–230.

Eklund, R. (2004). Pulmonic ingressive speech: a neglected universal?

TMH- QPSR, KTH, 50, 21–24.

Eklund, R. (2008). Pulmonic ingressive phonation: diachronic and syn-chronic characteristics, distribution and function in animal and human sound production and in human speech. Journal of the International Phonetic Association, 38, 235–324.

Eklund, R., Peters, G., Weise, F., and Munro, S. (2010). A comparative acoustic analysis of purring in four cheetahs. FONETIK 2012, Depart-ment of Philosophy, Linguistics and Theory of Science, University of Gothenburg, The XXVth Swedish Phonetics Conference, 41–44.

Fagot, J., Gullstrand, J., Kemp, C., Defilles, C., and Mekaouche, M. (2014).

Effects of freely accessible computerized test systems on the spontaneous behaviors and stress level of guinea baboons (Papio papio). American Journal of Primatology, 76, 56–64.

Fedurek, P., and Slocombe, K.E. (2011). Primate vocal communication: a useful tool for understanding human speech and language evolution?

Human Biology, 83, 153–173.

Fischer, J., Hammerschmidt, K., Cheney, D.L., and Seyfarth, R.M. (2001).

Acoustic features of female chacma baboon barks. Ethology, 107, 33–54.

Fischer, J., Hammerschmidt, K., Cheney, D. L., and Seyfarth, R.M. (2002).

Acoustic features of baboon loud calls: influences of context, age, and individuality. Journal of the Acoustical Society of America, 111, 1465–1474.

Fischer, J., Kitchen, D., Seyfarth, R.M., and Cheney, D.L. (2004). Baboon loud calls advertise male quality: acoustic features and their relation to rank, age, and exhaustion. Behavioral Ecology and Sociobiology, 56, 140–148.

Fischer, J., Metz, M., Cheney, D.L., and Seyfarth, R.M. (2001). Baboon responses to graded bark variants. Animal Behaviour, 61, 925–931.

Fitch, W.T. (2002). Comparative vocal production and the evolution of speech: reinterpreting the descent of the larynx. In A. Wray (Ed.) The transition to language (pp. 21–45). Oxford: Oxford University Press.

Ghazanfar, A., and Rendall, D. (2008). Evolution of human communica-tion. Current Biology, 18, R457–R460.

Grau, S.M., Robb, M.P., and Cacace, A.T. (1995). Acoustic correlates of inspiratory phonation during infant cry. Journal of Speech and Hearing Research, 38, 373–381.

Gustison, M.L., Le Roux, A., and Bergman, T.J. (2012). Derived vocali-zations of geladas (Theropithecus gelada) and the evolution of vocal complexity in primates. Philosophical Transactions of the Royal Society, B367, 1847–1859.

Hall, K.R.L., and DeVore, I. (1965). Baboon social behavior. In I. DeVore (Ed.) Primate Behavior: Field Studies of Monkeys and Apes (pp. 53–110).

New York: Holt, Rinehart and Winston.

Kemp, C., Rey, A., Legou, T., Boë, L.J., Berthommier,F., Becker, Y., and Fagot,J. (2017) Vocal repertoire of captive guinea baboons (Papio pa-pio). In this volume.

Kitchen, D.M., Cheney, D.L., Engh, A.L., Fischer, J., Moscovice, L.R., and Seyfarth, R.M. (2013). Male baboon responses to experimental manipu-lations of loud ‘wahoo calls’: testing an honest signal of fighting ability.

Behavioral Ecology and Sociobiology, 67, 1825–1835.

Ladefoged, P. (2006). A course in phonetics. Belmont: Thomson, Wads-worth.

Larimer, T.S.  (2012). The wa- hu shout. https://www.youtube.com/

watch?v=za839cpwUh0

Maciej, P., Ndao, I., Patzelt, A., Hammerschmidt K., and Fischer, J. (2013).

Vocal communication in a complex multi- level society: constrained acoustic structure and flexible call usage in guinea baboons. Frontiers in Zoology, 10, 58–72.

Maddieson, I. (1986). Patterns of sounds. Cambridge: Cambridge Univer-sity Press.

Orlikoff, R.F., Baken, R.J., and Kraus, D.H (1997). Acoustic and physi-ologic characteristics of inspiratory phonation. Journal of the Acoustical Society of America, 102, 1838–1845.

Owren, M.J., Seyfarth, R.M., and Cheney, D.L. (1997). The acoustic fea-tures of vowel- like grunt calls in chacma baboons (Papio cyncephalus ursinus): implications for production processes and functions. Journal of the Acoustical Society of America, 101, 2951–2963.

Peters, G. (2002). Purring and similar vocalizations in mammals. Mammal Review, 32, 245–271.

Potter, H.C., Kopp, R.K., and Green, G.A. (1947). Visible speech. New York: D. Van Nostrand Company, Inc.

Price, T., Ndiaye, O., Hammerschmidt, K., and Fischer, J. (2014). Limited geographic variation in the acoustic structure of and responses to adult male alarm barks of African monkeys. Behavioral Ecology and Socio-biology, 68, 815–825.

Rendall, D., Cheney, D.L., and Seyfarth, R.M. (2000). Proximate factors mediating ‘contact’ calls in adult female baboons (papio cynocephalus ursinus) and their infants. Journal of Comparative Psychology, 114, 36–46.

Wang, W.S., and Fillmore, C.J. (1961). Intrinsic cues and consonant percep-tion. Journal of Speech and Hearing Research, 4, 130–136.

Zuberbühler, K. (2012). Primate communication. Nature Education Knowl-edge, 3, 83.

Department of Psychology and Neuroscience, St. Andrews University, St. Andrews, Scotland, UK

2Department of Anthropology, Durham University, Durham, UK

Im Dokument Human Language: (Seite 68-77)