Goran Nenadić
(National Centre for Text Mining, Manchester, and Faculty of Mathematics, Belgrade)
CREATING DIGITAL LANGUAGE RESOURCES
Abstract: We discuss the building of digital language resources (such as annotated corpora, lexicons, ontologies, terminologies and tools), which are the main prerequisite for successful communication and information management in the e-society of the 21st century. We give an overview of the main requirements and best practices, and point to the steps necessary for the creation and maintenance of standards-based, reusable language resources for written language. The notions of basic and extended language resource kits are discussed, along with other international initiatives, including the Declaration on open access to language resources. We also analyse the challenges and responsibilities in creating digital language resources, and identify the need for wider national and international coordination and cooperation.