EEMCS

Home > Publications
Home University of Twente
Education
Research
Prospective Students
Jobs
Publications
Intranet (internal)
 
 Nederlands
 Contact
 Sitemap
 Search
 Organisation

EEMCS EPrints Service


18431 Automated Metadata Extraction for Semantic Access to Spoken Word Archives
Home Policy Brochure Browse Search User Area Contact Help

de Jong, F.M.G. and Heeren, W.F.L. and van Hessen, A.J. and Ordelman, R.J.F. and Nijholt, A. (2011) Automated Metadata Extraction for Semantic Access to Spoken Word Archives. (Invited) In: Proceedings 12th International Symposium on Social Communication, 17-21 September 2010, Santiago de Cuba. pp. 896-905. Centre for Applied Linguistics. ISBN 978-959-7174-19-6

Full text available as:

PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
476 Kb

Official URL: http://www.santiago.cu/hosting/linguistica/descargar.php?d=1713

Exported to Metis

Abstract

Archival practice is shifting from the analogue to the digital world. A specific subset of heritage collections that impose interesting challenges for the field of language and speech technology are spoken word archives. Given the enormous backlog at audiovisual archives of unannotated materials and the generally global level of item description, collection disclosure and item access are both at risk, and (semi-)automated methods for analysis and annotation may help to increase the use and reuse of these rich content collections. In several HMI projects the interplay has been investigated between evolving user scenarios and user requirements for spoken audio collections on the one hand, and the potential of automatic annotation and search technology for the improved accessibility and search paradigms on the other hand. In this paper we will present an overview of the state-of-the-art in metadata generation for audio content and explain the crucial importance of involving user groups in the design of research agendas and road maps for novel applications in this domain.

Item Type:Conference or Workshop Paper (Full Paper, Invited/Keynote Talk)
Research Group:EWI-HMI: Human Media Interaction
Research Program:CTIT-NICE: Natural Interaction in Computer-mediated Environments
Research Project:CHoral: access to oral history
Additional Information:cultural heritage, spoken audio collection, automatic annotation, speech technology, information retrieval
ID Code:18431
Status:Published
Deposited On:26 January 2011
Refereed:No
International:Yes
More Information:statisticsmetis

Export this item as:

To correct this item please ask your editor

Repository Staff Only: edit this item