ResearchSpace

PAM-lite: Fast and accurate k-medoids clustering for massive datasets

Show simple item record

dc.contributor.author Olukanmi, PO
dc.contributor.author Nelwamondo, Fulufhelo V
dc.contributor.author Marwala, T
dc.date.accessioned 2019-09-25T06:43:36Z
dc.date.available 2019-09-25T06:43:36Z
dc.date.issued 2019-01
dc.identifier.citation Olukanmi, P.O., Nelwamondo, F.V. and Marwala, T. 2019. PAM-lite: Fast and accurate k-medoids clustering for massive datasets. SAUPEC/RobMech/PRASA Conference, Bloemfontein, South Africa, 28-30 January 2019, pp 200-204. en_US
dc.identifier.isbn 978-1-7281-0369-3
dc.identifier.uri https://ieeexplore.ieee.org/document/8704767
dc.identifier.uri DOI: 10.1109/RoboMech.2019.8704767
dc.identifier.uri http://hdl.handle.net/10204/11120
dc.description Copyright: 2019 IEEE. Due to copyright restrictions, the attached PDF file only contains the abstract of the full text item. For access to the full text item, kindly consult the publisher's website. en_US
dc.description.abstract The Partitioning Around Medoids (PAM) clustering algorithm is well-known for its robustness and accuracy, but it is computationally expensive. This paper proposes a fast and accurate version, named PAM-lite. Like CLARA which also addresses PAM's inefficiency, PAM-lite applies PAM to random samples. However, unlike CLARA, it does not choose one of the obtained medoid sets (which would involve evaluating each set), but simply applies PAM again to the combination of all the obtained medoids. This simple change yields accuracy and speed improvement. We discuss the rationale behind PAM-lite's approach and evaluate the algorithm on benchmark datasets. In all cases tested, PAM-lite achieves better speed-up and clustering quality than CLARA; the speed-up margin increasing with problem size. PAM-lite competes so closely with the clustering quality produced by the full PAM algorithm, that in one high cluster variance case, it beats PAM's clustering quality slightly. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.relation.ispartofseries Worklist;22601
dc.subject Clustering LARge Applications en_US
dc.subject CLARA en_US
dc.subject K-medoids en_US
dc.subject Partitioning Around Medoids en_US
dc.subject PAM en_US
dc.title PAM-lite: Fast and accurate k-medoids clustering for massive datasets en_US
dc.type Conference Presentation en_US
dc.identifier.apacitation Olukanmi, P., Nelwamondo, F. V., & Marwala, T. (2019). PAM-lite: Fast and accurate k-medoids clustering for massive datasets. IEEE. http://hdl.handle.net/10204/11120 en_ZA
dc.identifier.chicagocitation Olukanmi, PO, Fulufhelo V Nelwamondo, and T Marwala. "PAM-lite: Fast and accurate k-medoids clustering for massive datasets." (2019): http://hdl.handle.net/10204/11120 en_ZA
dc.identifier.vancouvercitation Olukanmi P, Nelwamondo FV, Marwala T, PAM-lite: Fast and accurate k-medoids clustering for massive datasets; IEEE; 2019. http://hdl.handle.net/10204/11120 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Olukanmi, PO AU - Nelwamondo, Fulufhelo V AU - Marwala, T AB - The Partitioning Around Medoids (PAM) clustering algorithm is well-known for its robustness and accuracy, but it is computationally expensive. This paper proposes a fast and accurate version, named PAM-lite. Like CLARA which also addresses PAM's inefficiency, PAM-lite applies PAM to random samples. However, unlike CLARA, it does not choose one of the obtained medoid sets (which would involve evaluating each set), but simply applies PAM again to the combination of all the obtained medoids. This simple change yields accuracy and speed improvement. We discuss the rationale behind PAM-lite's approach and evaluate the algorithm on benchmark datasets. In all cases tested, PAM-lite achieves better speed-up and clustering quality than CLARA; the speed-up margin increasing with problem size. PAM-lite competes so closely with the clustering quality produced by the full PAM algorithm, that in one high cluster variance case, it beats PAM's clustering quality slightly. DA - 2019-01 DB - ResearchSpace DP - CSIR KW - Clustering LARge Applications KW - CLARA KW - K-medoids KW - Partitioning Around Medoids KW - PAM LK - https://researchspace.csir.co.za PY - 2019 SM - 978-1-7281-0369-3 T1 - PAM-lite: Fast and accurate k-medoids clustering for massive datasets TI - PAM-lite: Fast and accurate k-medoids clustering for massive datasets UR - http://hdl.handle.net/10204/11120 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record