|
Researchspace >
General science, engineering & technology >
General science, engineering & technology >
General science, engineering & technology >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10204/5329
|
| Title: | Lucene stemmer for MXit lingo |
| Authors: | Butgereit, LL Botha, RA |
| Keywords: | MXit Dr Math Lucene stemmer Mxit conversations Mxit lingo Linguistic conventions World wide web applications |
| Issue Date: | Sep-2011 |
| Publisher: | Cape Peninsula University of Technology |
| Citation: | Butgereit, LL and Botha, RA. 2011. Lucene stemmer for MXit lingo. The 13th Annual Conference on World Wide Web Applications, Johannesburg, 14-16 September 2011 |
| Series/Report no.: | Workflow request;7545 |
| Abstract: | MXit lingo is an abbreviated form of written English used by children, teenagers and young adults when communicating using MXit as a medium over cell phones. A stemmer for MXit lingo would enable a search engine such as Lucene to index stored MXit conversations for later searching. A MXit stemmer would have to cater for the new grammatical and linguistic conventions which have developed in MXit lingo. For example, a word which contains a trailing -er may have the -er changed to an -a. Thus the word “ova” can be used in place of “over” and “unda” can be used in place of “under”. This paper describes the creation of a Lucene stemmer for MXit lingo. It also itemizes the conventions which have been noted in MXit lingo. |
| Description: | The 13th Annual Conference on World Wide Web Applications, Johannesburg, 14-16 September 2011 |
| URI: | http://www.zaw3.co.za http://hdl.handle.net/10204/5329 |
| ISBN: | 978-0-620-51918-2 |
| Appears in Collections: | ICT in education, youth, gender General science, engineering & technology
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|