Language Technology Research Laboratory University of Colombo School of Computing 35, Reid Avenue Colombo07 Sri Lanka
+94 11 2581245 ext. 532
+94 11 2587239 ATTN: LTRL
ltrl(AT)ucsc.cmb.ac.lk
.
Downloads
Sinhala Enabling Pack (15.86 MB) Installation of this pack will enable the Unicode support for Sinhala, for Microsoft® Windows® XP(sp2) and Microsoft® Office 2003® (sp1).
Sinhala OCR (6.79 MB) Optical Character Recognizer software for Sinhala
akaradi - Trilingual Lexicon (Linux) (1.40 MB) This application expects Python (>=v2.5) and relevant wxGTK libraries (usually called libwxgtk) to be available in your system.
Word List (291 KB) This word list consists of 70142 distinct Sinhala words extracted from the UCSC/LTRL Sinhala Corpus Beta Version April 2005.
UCSC/LTRL Sinhala Corpus Beta Version - April 2005 (2.2 MB) UCSC/LTRL Sinhala Corpus is one of the main outputs of the PAN Localization project. Present beta release of the corpus contains around 650 000 words; 70 000 distinct words.