Digitální knihovna UHK

Examining the Impact of Distance-Based Similarity Metrics on the Performance of Projected Clustering Algorithm for Fingerprint Database Clustering

Zobrazit minimální záznam

dc.rights.license CC BY eng
dc.contributor.author Yaro, Abdulmalik Shehu cze
dc.contributor.author Malý, Filip cze
dc.contributor.author Pražák, Pavel cze
dc.contributor.author Malý, Karel cze
dc.date.accessioned 2025-12-05T16:08:29Z
dc.date.available 2025-12-05T16:08:29Z
dc.date.issued 2025 eng
dc.identifier.issn 2644-1268 eng
dc.identifier.uri http://hdl.handle.net/20.500.12603/2450
dc.description.abstract The projected clustering (PROCLUS) algorithm is a subspace clustering algorithm based on the k-medoids clustering approach. It is designed to address the challenges of irrelevant received signal strength (RSS) measurements in fingerprint vectors by focusing on meaningful subsets of RSS measurements from wireless APs, known as subspaces. Despite its robustness, its performance heavily depends on the chosen similarity metric, with Euclidean and Manhattan distances being the most common. While many researchers focus on modifying the algorithm to enhance performance, the impact of similarity metrics on clustering performance is often overlooked, despite its critical role in determining accuracy. As such, this article evaluates the clustering performance of the PROCLUS algorithm using five distance-based similarity metrics-Euclidean, Manhattan, Cosine similarity, Canberra, and Chebyshev-across six experimentally generated fingerprint databases. It aims to identify the best similarity metric for maximizing clustering performance for each of the six fingerprint databases, with silhouette scores used as the clustering performance metric. Simulation results show that Cosine similarity is the most effective metric with the PROCLUS algorithm. It consistently produces clusters with the highest silhouette scores, all well above the 0.25 threshold and were 24% to 136% higher than the scores achieved using the other distance-based metrics across the tested fingerprint databases. The Canberra distance performed variably, while Euclidean and Manhattan distances were less reliable. The Chebyshev distance consistently underperformed in all the databases considered. The findings in this article highlight the importance of choosing the appropriate similarity metric to perform clustering operations with the PROCLUS algorithm. eng
dc.format p. 1248-1259 eng
dc.language.iso eng eng
dc.publisher IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC eng
dc.relation.ispartof IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, volume 6, issue: July eng
dc.subject Examining eng
dc.subject Impact eng
dc.subject Distance-Based eng
dc.subject Similarity eng
dc.subject Metrics eng
dc.subject the eng
dc.subject Performance eng
dc.subject Projected eng
dc.subject Algorithm eng
dc.subject for eng
dc.subject Fingerprint eng
dc.subject Database eng
dc.subject Clustering eng
dc.title Examining the Impact of Distance-Based Similarity Metrics on the Performance of Projected Clustering Algorithm for Fingerprint Database Clustering eng
dc.type article eng
dc.identifier.obd 43882189 eng
dc.identifier.wos 001547282700002 eng
dc.identifier.doi 10.1109/OJCS.2025.3591192 eng
dc.publicationstatus postprint eng
dc.peerreviewed yes eng
dc.source.url https://ieeexplore.ieee.org/document/11087686 cze
dc.relation.publisherversion https://ieeexplore.ieee.org/document/11087686 eng
dc.rights.access Open Access eng


Soubory tohoto záznamu

Tento záznam se objevuje v následujících kolekcích

Zobrazit minimální záznam

Prohledat DSpace


Procházet

Můj účet