• Keine Ergebnisse gefunden

Progress in animation of an EMA-controlled tongue model for acoustic-visual speech synthesis

N/A
N/A
Protected

Academic year: 2022

Aktie "Progress in animation of an EMA-controlled tongue model for acoustic-visual speech synthesis"

Copied!
1
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Neutral (bind) pose: [ɑː] Animated (deformed) pose: [t]

animation / deformation

Animation example: [ɑː] → [t] transition (from repetitive CV utterance) Abstract: As one component of the

talking head of an acoustic-visual speech synthesizer, we present a technique to

animate a 3D kinematic tongue model,

based on volumetric vocal tract MRI data, using skeletal animation with a flexible

rig, controlled by motion capture data

acquired with EMA, and implemented with off-the-shelf, open-source software.

Volumetric MRI scan of sustained [ɑː]

from mngu0 corpus (http://mngu0.org/) MRI data for tongue model mesh

The tongue model mesh is obtained from the isosurface of the segmented tongue.

TBackC TMidC

TMidL TBladeL TBladeR

TMidR

TTipC

(rendered as cones to visualize orientation)

EMA coil layout for pilot corpus recorded with Carstens AG500

EMA data in

The EMA coils serve as transformation tar- gets for the tongue model rig, which is con- trolled using inverse kinematics and volu-

metric constraints. Articulatory gestures

can be compiled into actions for non-linear animation and coarticulation modeling.

EMA data for tongue model kinematics

Progress in animation of an

EMA-controlled tongue model

for acoustic-visual speech synthesis

Ingmar Steiner1,2, Slim Ouni1,3

1LORIA Speech Group, 2INRIA, 3Université Nancy 2

Firstname.Lastname@loria.fr

Referenzen

ÄHNLICHE DOKUMENTE

Interessant an dieser Beschreibung ist zum einen die verwendete Metaphorik von Lücken und Geistern, die für TOTS verwendet wird. Es wird sich zeigen, daß die Aktivierungs-

Important principles of ethical conduct of clinical studies and the protection of subjects, including special populations, are stated in other ICH guidelines (ICH E6 Good

The nature of expressive and emotional speech has garnered a mounting body of research over the past decade (Scherer, 2003; Schröder, 2009; Schuller et al., 2011, among many others);

We announce the release of the PAVOQUE corpus, a single-speaker, multi-style database of German speech, designed for analysis and synthesis of expressive speech.. The corpus has

Using the articulatory animation framework, static meshes of dental cast scans and the tongue (extracted from the MRI subset of the mngu0 corpus) can be animated using motion

We have presented a technique to animate a kinematic tongue model, based on volumetric vo- cal tract MRI data, using skeletal animation with a flexible rig, controlled by motion

acoustic-visual (AV) speech synthesizer [1], our aim is to integrate a tongue model for improved realism and visual intelligibility. The AV text-to- speech (TTS) synthesizer uses

,.Iah, Leo/ ütleb praegust Walter, oma sigari laua pcale, mis tema ja sobra wahrl seisab, pannes, „ tahtnud ma hulk aoga mitte uskuda, et Gewa mind nii tõelikult saaks ära tonkama.