Show simple item record Wijesiri, I Gallage, M Gunathilaka, B Lakjeewa, M Wimalasuriya, DC Dias, G Paranavithana, R De Silva, N 2017-01-16T04:01:19Z 2017-01-16T04:01:19Z
dc.description.abstract Sinhala is one of the official languages of Sri Lanka and is used by over 19 million people. It belongs to the Indo-Aryan branch of the In-do-European languages and its origins date back to at least 2000 years. It has developed into its current form over a long period of time with influences from a wide variety of lan-guages including Tamil, Portuguese and Eng-lish. As for any other language, a WordNet is extremely important for Sinhala to take it into the digital era. This paper is based on the pro-ject to develop a WordNet for Sinhala based on the English (Princeton) WordNet. It de-scribes how we overcame the challenges in adding Sinhala specific characteristics which were deemed important by Sinhala language experts to the WordNet while keeping the structure of the original English WordNet. It also presents the details of the crowdsourcing system we developed as a part of the project - consisting of a NoSQL database in the backend and a web-based frontend. We con-clude by discussing the possibility of adapting this architecture for other languages and the road ahead for the Sinhala WordNet and Sin-hala NLP. en_US
dc.relation.uri en_US
dc.source.uri en_US
dc.title Building a WordNet for Sinhala en_US
dc.type Article-Abstract en_US
dc.identifier.year 2014 en_US
dc.identifier.journal Volume editors en_US
dc.identifier.pgnos 100 en_US en_US en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record