Please use this identifier to cite or link to this item:
http://hdl.handle.net/10204/5814
| Title: | Using N-grams to identify mathematical topics in MXit lingo |
| Authors: | Butgereit, LL; Botha, RA |
| Keywords: | MXit lingo; MXit spelling; N-grams; Dr Math; C³TO |
| Issue Date: | Sep-2011 |
| Publisher: | ACM |
| Citation: | Butgereit, LL and Botha, RA. Using N-grams to identify mathematical topics in MXit lingo. Annual Conference of the South African Institute of Computer Scientists and Information Technologists (SAICSIT 2011), Cape Town, South Africa, 3-5 October 2011, pp 40-48 |
| Series/Report no.: | Workflow;7410 |
| Abstract: | N-grams are used to quantify the similarity between two documents or between two collections of words. This paper shows how N-grams of length 3 and 4, coupled with text processing (including stop word removal and stemming according to MXit spelling conventions), can be used to categorize very short mathematical conversations conducted in MXit lingo into broad mathematical groups such as algebra, geometry, trigonometry, and calculus. MXit lingo is an abbreviated form of written English which children, teenagers and young adults use when communicating via the popular MXit chat mechanism over cell phones. Conversations from the "Dr Math" project were used for this analysis. "Dr Math" is a mathematical tutoring service which links primary and secondary school pupils to tutors from local universities. The tutors assist the pupils with their mathematics homework. |
| Description: | Copyright: 2011 ACM. This is an ABSTRACT ONLY. |
| URI: | http://dl.acm.org/citation.cfm?id=2072221.2072227&coll=DL&dl=ACM&CFID=76903344&CFTOKEN=61329174 http://hdl.handle.net/10204/5814 |
| ISBN: | 978-1-4503-0878-6 |
| Appears in Collections: | ICT in education, youth, gender; General science, engineering & technology |
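
The abstract describes matching short chat messages to mathematical topics by N-gram similarity after stop word removal. A minimal sketch of that idea follows; it is not the authors' implementation. The stop word list, the topic vocabularies, the use of character N-grams, and the choice of Jaccard similarity are all assumptions made for illustration, and the MXit-specific stemming mentioned in the abstract is omitted because its rules are not given here.

```python
import re

# Hypothetical stop words (including a few chat abbreviations); the paper's
# actual list is not given in the abstract.
STOP_WORDS = {"the", "a", "of", "is", "i", "do", "how", "u", "hw", "pls"}

# Hypothetical topic vocabularies, standing in for whatever reference
# collections of words the paper used.
TOPIC_TERMS = {
    "algebra": ["factorise", "equation", "solve", "simplify"],
    "geometry": ["triangle", "circle", "angle", "parallel"],
    "trigonometry": ["sine", "cosine", "tangent", "theta"],
    "calculus": ["derivative", "differentiate", "limit", "integrate"],
}


def tokenize(text):
    """Lowercase the text, keep alphabetic words, and drop stop words."""
    words = re.findall(r"[a-z]+", text.lower())
    return [w for w in words if w not in STOP_WORDS]


def ngram_set(words, n):
    """Collect the character n-grams of every word into one set."""
    grams = set()
    for word in words:
        for i in range(len(word) - n + 1):
            grams.add(word[i:i + n])
    return grams


def jaccard(a, b):
    """Jaccard similarity of two sets: size of intersection over size of union."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def classify(message, n=3):
    """Return (best_topic, scores) for a message using n-gram set similarity."""
    message_grams = ngram_set(tokenize(message), n)
    scores = {
        topic: jaccard(message_grams, ngram_set(terms, n))
        for topic, terms in TOPIC_TERMS.items()
    }
    best_topic = max(scores, key=scores.get)
    return best_topic, scores
```

With these sample lists, `classify("hw do u factrise dis equatn")` selects `"algebra"`: the misspelled words still share trigrams such as `fac`, `ise`, and `equ` with the reference vocabulary, which is why N-grams suit abbreviated spelling better than exact word matching.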