Approaches and Best Practices

- Data Collection of Audio Dialogues to Support the Training of Speech-To-Speech Translation Systems

Nist

Bog

Format
Bog, paperback
Engelsk
66 sider

Normalpris: kr. 169,95

Medlemspris: kr. 149,95 For at købe bogen til medlemspris skal du have et medlemskab med Shopping-fordele. Du kan prøve medlemskabet gratis i 7 dage. Medlemskabet fornyes automatisk og kan altid opsiges.

Leveringstid: 2-3 uger (Sendes fra fjernlager)
Forventet levering: 25-06-2024
Kan pakkes ind og sendes som gave
Split betalingen op med

Beskrivelse

The purpose of this document is to describe the best practices that personnel from the National Institute of Standards and Technology (NIST) have developed and implemented to efficiently and effectively capture two-way, free-form speech-to-speech audio dialogues within recording studios. These dialogues, produced to support the development and evaluation of machine translation technologies, are conducted by English and foreign language speakers conversing with one another in their native languages through the mediation of an interpreter. NIST personnel have collected over 500 hours of bilingual audio data sets encompassing more than 1100 dialogues across three unique language pairs (English/Iraqi-Arabic, English/Dari, and English/Pashto) since it became involved in this work in 2007. This document will present the methods the NIST team has designed and employed allowing the successful capture of audio data. In addition to the data collection protocols including personnel training and workflow, data collection scenario generation and speaker recruitment protocols will be discussed. Citation: NIST Interagency/Internal Report

Læs hele beskrivelsen

Detaljer

SprogEngelsk
Sidetal66
Udgivelsesdato12-11-2013
ISBN139781493756230
Forlag Createspace
FormatPaperback
Udgave0

Størrelse og vægt

Vægt176 g

Dybde0,3 cm

10 cm

21,5 cm

27,9 cm

Approaches and Best Practices

- Data Collection of Audio Dialogues to Support the Training of Speech-To-Speech Translation Systems

Findes i disse kategorier...

Se andre, der handler om...