Transcription and Linguistic Data Management

The Lab
Level 2 of the Digital Studio
West Wing of Arts West
(access via the rear lift)

This workshop aims to introduce concepts in language data management, in particular for language documentation projects. It will present two popular current tools, one for transcription of media (Elan) and the other for annotation of text and lexicon building (FLEx). We will also introduce our new tool, Digame, for collection management.

This workshop is supported by the Research Unit for Indigenous Language and the ARC Centre for Excellence for the Dynamics of Language.

This workshop is part of the Digital Studio's Digital Research in Action series.

Agenda

9am - 10am: Linguistic Data Management (Nick Thieberger)

Break

10.30am - 12.30pm: Elan intro and practicum (Nick Thieberger)

Break

1.30pm - 3.30pm: Flex intro and practicum (Rosey Billington)

Presenter

Nick Thieberger is an Associate Professor in Linguistics. He is particularly interested in developing methods for making better records of all of the world's many languages. This involves training new students in concepts of linguistic data management, the creation of new tools, and the use of existing records for new research.

He has worked with the Pacific and Regional Archive for Digital Sources in Endangered Cultures (PARADISEC) since its inception in 2003, it is an archive that holds 12,000 hours of audio records in 1229 languages. He built the Digital Daisy Bates pages using TEI XML to display 23,000 pages of manuscript material in Australian Indigenous languages. He wrote a grammar of Nafsan (central Vanuatu) and continues to work on a dictionary of that language.