|
![]() |
|||
|
||||
OverviewThe purpose of this document is to describe the best practices that personnel from the National Institute of Standards and Technology (NIST) have developed and implemented to efficiently and effectively capture two-way, free-form speech-to-speech audio dialogues within recording studios. These dialogues, produced to support the development and evaluation of machine translation technologies, are conducted by English and foreign language speakers conversing with one another in their native languages through the mediation of an interpreter. NIST personnel have collected over 500 hours of bilingual audio data sets encompassing more than 1100 dialogues across three unique language pairs (English/Iraqi-Arabic, English/Dari, and English/Pashto) since it became involved in this work in 2007. This document will present the methods the NIST team has designed and employed allowing the successful capture of audio data. In addition to the data collection protocols including personnel training and workflow, data collection scenario generation and speaker recruitment protocols will be discussed. Citation: NIST Interagency/Internal Report Full Product DetailsAuthor: NistPublisher: Createspace Independent Publishing Platform Imprint: Createspace Independent Publishing Platform Dimensions: Width: 21.60cm , Height: 0.40cm , Length: 28.00cm Weight: 0.177kg ISBN: 9781493756230ISBN 10: 1493756230 Pages: 66 Publication Date: 12 November 2013 Audience: General/trade , General Format: Paperback Publisher's Status: Active Availability: Available To Order ![]() We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately. Table of ContentsReviewsAuthor InformationTab Content 6Author Website:Countries AvailableAll regions |