Ai is great, isn’t it? Tone direction and illocutionary force delivery of tag questions in amazon’s ai ntts polly

Rodríguez Fernández-Peña, Alfonso Carlos

doi:10.1344/efe-2023-32-227-242

Repositorio

Cómo publicar

Recursos

FAQs

Ai is great, isn’t it? Tone direction and illocutionary force delivery of tag questions in amazon’s ai ntts polly

Otros títulos:

La inteligencia artificial es genial, ¿verdad? Dirección del tono y vehiculaciónde la fuerza ilocucionaria en las preguntas ratificadas de Polly,el sistema de texto a voz AI de Amazon

Autor(es) y otros:

Rodríguez Fernández-Peña, Alfonso Carlos

Fecha de publicación:

2023

Versión del editor:

http://dx.doi.org/10.1344/efe-2023-32-227-242

Citación:

Estudios de Fonética Experimental, 32, p. 227-242 (2023); doi:10.1344/efe-2023-32-227-242

Descripción física:

p. 227-242

Resumen:

This work provides a descriptive analysis of the tone direction and its inherent illocutionary force in question tags delivered by Amazon’s neural text-to-speech system Polly. We included three types of tag questions (reverse-polarity tags — both positive and negative —, copy tags and command tags) for which 10 sentences were used as input in each case. The data included 600 utterances produced by British and American English voices currently available on Amazon’s NTTS. The audio files were examined with the speech analysis software Praat to identify the tone pattern for each utterance and confirm the intended illocutionary force. The results show that Amazon’s AI speech synthesis technology is not yet fully reliable and produces a high rate of utterances whose pragmatic load is undesired when using natural spontaneous speech traits as question tags.

Este trabajo ofrece un análisis descriptivo de la dirección tonal y la fuerza ilocucionaria en las pregun-tas ratificadas (o tag questions, en inglés) generadas por el sistema de texto a voz Polly de Amazon. Se examinan tres tipos de preguntas (polaridad revertida — positivas y negativas —, polaridad idéntica y orden) utilizando 10 frases como muestra para cada una. Se emplearon las voces sintéticas disponi-bles en inglés británico y americano y se generaron un total de 600 muestras de audio. Estas se anali-zaron con Praat para identificar el patrón tonal y confirmar la fuerza ilocucionaria presente en ellas. Los resultados revelan que la tecnología de síntesis de habla de Amazon aún no es completamente fiable, ya que produce un alto número de frases con una carga pragmática inadecuada para lograr una entonación natural en las preguntas ratificadas en inglés.

Ficheros en el ítem

Repositorio Institucional de la Universidad de Oviedo

Ai is great, isn’t it? Tone direction and illocutionary force delivery of tag questions in amazon’s ai ntts polly

Otros títulos:

Autor(es) y otros:

Fecha de publicación:

Versión del editor:

Citación:

Descripción física:

Resumen:

URI:

ISSN:

DOI:

Colecciones

Ficheros en el ítem

Métricas

Compartir

Estadísticas de uso

Metadatos