Afficher la notice abrégée

hal.structure.identifierInstitut de Recherche en Energie Electrique de Nantes Atlantique EA4642 [IREENA]
hal.structure.identifierLaboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.authorFOURER, Dominique
hal.structure.identifierLaboratoire Bordelais de Recherche en Informatique [LaBRI]
hal.structure.identifierCognition, Langues, Langage, Ergonomie [CLLE-ERSS]
dc.contributor.authorSHOCHI, Takaaki
hal.structure.identifierLaboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.authorROUAS, Jean-Luc
hal.structure.identifierLaboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur [LIMSI]
dc.contributor.authorRILLIARD, Albert
dc.date.issued2016
dc.date.conference2016-05-31
dc.description.abstractEnThis paper is about the perception of 'genuine' social affects versus 'synthetic' ones. Our ultimate aim is to create a software for self-teaching language learning that includes a tool where learners will be able to hear their own voice producing the social affect correctly. Towards this goal, we study here how we can construct synthetic stimuli using neutral voices and prosodic parameters , and if such stimuli can be well enough recognized by native listeners. At first, we explain how our corpus is built around contextual scenarios and the recording protocol. Then, we explain how the synthetic stimuli are constructed. These stimuli must comply with several constraints: keeping the original speaker identity, preserving the linguistic content, and of course having the best possible quality. Results from a perception experiment with native speakers of Japanese show that the social affects for natural stimuli are quite well recognized although the results show more variation on the synthetic stimuli, depending on the considered social affect. Some social affects may indeed be expressed quite subtly so that they are difficult to synthesize. An investigation based on statistical analysis is proposed showing where the main difficulties lie.
dc.language.isoen
dc.subject.enIndex Terms: speech processing
dc.subject.enaffective prosody
dc.subject.enattitudes characterization
dc.title.enPerception of prosodic transformation for Japanese social affects
dc.typeCommunication dans un congrès
dc.identifier.doi10.21437/SpeechProsody.2016-203
dc.subject.halInformatique [cs]/Traitement du signal et de l'image
bordeaux.page989 - 993
bordeaux.volume2016
bordeaux.countryUS
bordeaux.conference.cityBoston
bordeaux.peerReviewedoui
hal.identifierhal-01392309
hal.version1
hal.invitednon
hal.proceedingsoui
hal.conference.end2016-06-04
hal.popularnon
hal.audienceInternationale
hal.origin.linkhttps://hal.archives-ouvertes.fr//hal-01392309v1
bordeaux.COinSctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.date=2016&rft.volume=2016&rft.spage=989%20-%20993&rft.epage=989%20-%20993&rft.au=FOURER,%20Dominique&SHOCHI,%20Takaaki&ROUAS,%20Jean-Luc&RILLIARD,%20Albert&rft.genre=unknown


Fichier(s) constituant ce document

FichiersTailleFormatVue

Il n'y a pas de fichiers associés à ce document.

Ce document figure dans la(les) collection(s) suivante(s)

Afficher la notice abrégée