ResearchSpace

Lucene stemmer for MXit lingo

Show simple item record

dc.contributor.author Butgereit, LL
dc.contributor.author Botha, RA
dc.date.accessioned 2011-11-24T08:01:07Z
dc.date.available 2011-11-24T08:01:07Z
dc.date.issued 2011-09
dc.identifier.citation Butgereit, LL and Botha, RA. 2011. Lucene stemmer for MXit lingo. The 13th Annual Conference on World Wide Web Applications, Johannesburg, 14-16 September 2011 en_US
dc.identifier.isbn 978-0-620-51918-2
dc.identifier.uri http://www.zaw3.co.za
dc.identifier.uri http://hdl.handle.net/10204/5329
dc.description The 13th Annual Conference on World Wide Web Applications, Johannesburg, 14-16 September 2011 en_US
dc.description.abstract MXit lingo is an abbreviated form of written English used by children, teenagers and young adults when communicating using MXit as a medium over cell phones. A stemmer for MXit lingo would enable a search engine such as Lucene to index stored MXit conversations for later searching. A MXit stemmer would have to cater for the new grammatical and linguistic conventions which have developed in MXit lingo. For example, a word which contains a trailing -er may have the -er changed to an -a. Thus the word “ova” can be used in place of “over” and “unda” can be used in place of “under”. This paper describes the creation of a Lucene stemmer for MXit lingo. It also itemizes the conventions which have been noted in MXit lingo. en_US
dc.language.iso en en_US
dc.publisher Cape Peninsula University of Technology en_US
dc.relation.ispartofseries Workflow request;7545
dc.subject MXit en_US
dc.subject Dr Math en_US
dc.subject Lucene stemmer en_US
dc.subject Mxit conversations en_US
dc.subject Mxit lingo en_US
dc.subject Linguistic conventions en_US
dc.subject World wide web applications en_US
dc.title Lucene stemmer for MXit lingo en_US
dc.type Conference Presentation en_US
dc.identifier.apacitation Butgereit, L., & Botha, R. (2011). Lucene stemmer for MXit lingo. Cape Peninsula University of Technology. http://hdl.handle.net/10204/5329 en_ZA
dc.identifier.chicagocitation Butgereit, LL, and RA Botha. "Lucene stemmer for MXit lingo." (2011): http://hdl.handle.net/10204/5329 en_ZA
dc.identifier.vancouvercitation Butgereit L, Botha R, Lucene stemmer for MXit lingo; Cape Peninsula University of Technology; 2011. http://hdl.handle.net/10204/5329 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Butgereit, LL AU - Botha, RA AB - MXit lingo is an abbreviated form of written English used by children, teenagers and young adults when communicating using MXit as a medium over cell phones. A stemmer for MXit lingo would enable a search engine such as Lucene to index stored MXit conversations for later searching. A MXit stemmer would have to cater for the new grammatical and linguistic conventions which have developed in MXit lingo. For example, a word which contains a trailing -er may have the -er changed to an -a. Thus the word “ova” can be used in place of “over” and “unda” can be used in place of “under”. This paper describes the creation of a Lucene stemmer for MXit lingo. It also itemizes the conventions which have been noted in MXit lingo. DA - 2011-09 DB - ResearchSpace DP - CSIR KW - MXit KW - Dr Math KW - Lucene stemmer KW - Mxit conversations KW - Mxit lingo KW - Linguistic conventions KW - World wide web applications LK - https://researchspace.csir.co.za PY - 2011 SM - 978-0-620-51918-2 T1 - Lucene stemmer for MXit lingo TI - Lucene stemmer for MXit lingo UR - http://hdl.handle.net/10204/5329 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record