Authors
Matthew Marge, Satanjeev Banerjee, Alexander Rudnicky
Publication date
2010/6
Conference
Proceedings of the NAACL HLT 2010 workshop on creating speech and language data with Amazon’s Mechanical Turk
Pages
99-107
Description
Due to its complexity, meeting speech provides a challenge for both transcription and annotation. While Amazon’s Mechanical Turk (MTurk) has been shown to produce good results for some types of speech, its suitability for transcription and annotation of spontaneous speech has not been established. We find that MTurk can be used to produce highquality transcription and describe two techniques for doing so (voting and corrective). We also show that using a similar approach, high quality annotations useful for summarization systems can also be produced. In both cases, accuracy is comparable to that obtained using trained personnel.
Total citations
2010201120122013201420152016201720182019202020212022202316278643435212
Scholar articles