• Keine Ergebnisse gefunden

Investigating articulatory differences between upright and supine posture using 3D EMA

N/A
N/A
Protected

Academic year: 2022

Aktie "Investigating articulatory differences between upright and supine posture using 3D EMA"

Copied!
1
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Investigating articulatory differences between upright and supine posture using 3D EMA

Ingmar Steiner 1,2 , Slim Ouni 1,3

1 LORIA Speech Group, 2 INRIA, 3 Université Nancy 2

Firstname.Lastname@loria.fr

Articulation: Positions of EMA coils in mid- sagittal plane for sustained vowels [a, e, u]

Anterior on right, coils shown (from left to right): tongue back, mid, tip center (tongue contour approximated by curve), lower

incisor, upper lip;

Bottom right: traces of mid-sagittal tongue

coils for three CV syllables [la, li, lu] in upright and supine position, under noise condition

Discussion: several phenomena are observible, including:

- formant shifting and tongue position

changes especially during noise conditions

- (over-)compensation for gravity and posture - changes in dynamics, depending on

articulatory relevance

Leightweight gurney made from non- metallic materials

Utterances include sustained vowels [a, e, i, o, u, y, ø, ə], repetitive CV syllables of these vowels and [p, t, k, m, n, ŋ, f, θ, s, ʃ, ç, x, l, ɫ], and 10 German and 10 English sentences.

Speaker in supine condition in AG500

Prompts presented via stethoscope to compensate for lack of display visibility, and to provide white noise in masking

condition

Audio prompts generated using TTS; CV syllable duration and pitch can have priming effect for production

Speaker in upright position in AG500

Measurement coils on tongue tip center, tongue blade left/right, tongue mid

center/ left/right, tongue back center,

lower incisor, upper lip (reference coils on bridge of nose and behind each ear)

Acoustics: discriminant analysis of first two formants of sustained vowels (middle 80% of frames, smoothed by 5-sample rectangular window) in both masking conditions

Note: automatic formant tracking (using Praat) resulted in wide scatter for some vowels

a e

i

u o y

ø ə

200 2500

Dimension2

100 850

Dimension 1 Clear

100 250 400 550 700 850

200 660 1120 1580 2040 2500

a i e

o u y

ø

ə

200 2500

Dimension2

100 850

Dimension 1 Noise

100 250 400 550 700 850

200 660 1120 1580 2040 2500

a i e

o u

y

ø ə

a i e

o u

y

ø ə

a e

i

o u y

ø ə

a

i e o

u y

əø

a i e

o u

y

ø

ə

a i e

o

u y

ø ə

Background: Previous work [e.g., 1-4] has shown that for some speakers, the effects of gravity and posture can have a pronounced effect on the articulators during

speech. With the growing importance of MRI for vocal

tract and speech production analysis, this effect must be accounted for.

Overview: In this pilot study, we investigate the posture effect using 3D EMA, systematically varying the three

parameters posture, masking noise, and presence of EMA coils.

Preliminary analysis of acoustic and articulatory results are presented, further work will focus on kinematics and relevance of individual articulators.

M. K. Tiede, S. Masaki, and E. Vatikiotis-Bateson. Contrasts in speech articulation observed in sitting and supine conditions. Proc. 5th Seminar on Speech Production, 2000.

O. Engwall. Assessing Magnetic Resonance Imaging measurements:

Effects of sustenation, gravitation, and coarticulation. In J. Harrington and M. Tabain, editors, Speech Production: Models, Phonetic Processes, and Techniques, 301–313, 2006.

T. Kitamura, H. Takemoto, K. Honda, Y. Shimada, I. Fujimoto, Y. Syakudo, S. Masaki, K. Kuroda, N. Oku-Uchi, and M. Senda. Difference in vocal tract shape between upright and supine postures: Observations by an open-type MRI scanner. Acoustical Science and Technology, 26(5):465–

468, 2005.

M. Stone, G. Stock, K. Bunin, K. Kumar, M. A. Epstein, C. Kambhamettu, M. Li, V. Parthasarathy, and J. L. Prince. Comparison of speech production in upright and supine position. Journal of the Acoustical Society of

America, 122(1):532–541, 2007.

[1]

[2]

[3]

[4]

Upright EMA Supine EMA Upright no EMA Supine no EMA

Referenzen

ÄHNLICHE DOKUMENTE

Table 9-5: Results of multilevel regression analyses in the falsified data, overview over significant independent variables on the interviewer level; dependent variables:

„Ich, Thomas Schlabs, versichere an Eides statt durch meine eigenhändige Unterschrift, dass ich die vorgelegte Dissertation mit dem Thema: „Comparison of cardiovascular

From these observations we can classify each of the sets of ecological and ecological- economic models into two classes: the first class, termed “simple models” contains the

Aggregate fitted tongue trajectories (including individual points) of the three tongue sensors (first row: T1, second row: T2, third row: T3) for the two groups of speakers in

Japanese vowels, CV sequences, and running speech using upright and supine X-ray mi- crobeam (Tiede et al., 2000); English word sequences using upright and supine ultrasound

However, unlike the facial motion-capture data, which provides 3D movement data for hundreds of points on the speaker’s face, enough to use these points directly as the vertices of

acoustic-visual (AV) speech synthesizer [1], our aim is to integrate a tongue model for improved realism and visual intelligibility. The AV text-to- speech (TTS) synthesizer uses

In particular, the automatic generation of gestural scores at this stage still uses the speaker definition supplied with VTL by its author, which constrains its deploy- ment to