FOURER, Dominique; SHOCHI, Takaaki; ROUAS, Jean-Luc; RILLIARD, Albert

doi:10.21437/SpeechProsody.2016-203

hal.structure.identifier	Institut de Recherche en Energie Electrique de Nantes Atlantique EA4642 [IREENA]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	FOURER, Dominique
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
hal.structure.identifier	Cognition, Langues, Langage, Ergonomie [CLLE-ERSS]
dc.contributor.author	SHOCHI, Takaaki
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	ROUAS, Jean-Luc
hal.structure.identifier	Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur [LIMSI]
dc.contributor.author	RILLIARD, Albert
dc.date.issued	2016
dc.date.conference	2016-05-31
dc.description.abstractEn	This paper is about the perception of 'genuine' social affects versus 'synthetic' ones. Our ultimate aim is to create a software for self-teaching language learning that includes a tool where learners will be able to hear their own voice producing the social affect correctly. Towards this goal, we study here how we can construct synthetic stimuli using neutral voices and prosodic parameters , and if such stimuli can be well enough recognized by native listeners. At first, we explain how our corpus is built around contextual scenarios and the recording protocol. Then, we explain how the synthetic stimuli are constructed. These stimuli must comply with several constraints: keeping the original speaker identity, preserving the linguistic content, and of course having the best possible quality. Results from a perception experiment with native speakers of Japanese show that the social affects for natural stimuli are quite well recognized although the results show more variation on the synthetic stimuli, depending on the considered social affect. Some social affects may indeed be expressed quite subtly so that they are difficult to synthesize. An investigation based on statistical analysis is proposed showing where the main difficulties lie.
dc.language.iso	en
dc.subject.en	Index Terms: speech processing
dc.subject.en	affective prosody
dc.subject.en	attitudes characterization
dc.title.en	Perception of prosodic transformation for Japanese social affects
dc.type	Communication dans un congrès
dc.identifier.doi	10.21437/SpeechProsody.2016-203
dc.subject.hal	Informatique [cs]/Traitement du signal et de l'image
bordeaux.page	989 - 993
bordeaux.volume	2016
bordeaux.country	US
bordeaux.conference.city	Boston
bordeaux.peerReviewed	oui
hal.identifier	hal-01392309
hal.version	1
hal.invited	non
hal.proceedings	oui
hal.conference.end	2016-06-04
hal.popular	non
hal.audience	Internationale
hal.origin.link	https://hal.archives-ouvertes.fr//hal-01392309v1
bordeaux.COinS	ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.date=2016&rft.volume=2016&rft.spage=989%20-%20993&rft.epage=989%20-%20993&rft.au=FOURER,%20Dominique&SHOCHI,%20Takaaki&ROUAS,%20Jean-Luc&RILLIARD,%20Albert&rft.genre=unknown

Fichier(s) constituant ce document

Fichiers	Taille	Format	Vue
Il n'y a pas de fichiers associés à ce document.

Ce document figure dans la(les) collection(s) suivante(s)

CLLE Montaigne : Cognition, langues, Langages, Ergonomie - UMR 5263

Afficher la notice abrégée

Perception of prosodic transformation for Japanese social affects

Fichier(s) constituant ce document

Ce document figure dans la(les) collection(s) suivante(s)