DSpace Repository

Examining the Impact of Fingerprint Vector Size on Similarity Determination and Clustering Performance of a Pattern-Based Similarity Metric

Show simple item record

dc.rights.license CC BY eng
dc.contributor.author Yaro, Abdulmalik Shehu cze
dc.contributor.author Malý, Filip cze
dc.contributor.author Pražák, Pavel cze
dc.date.accessioned 2025-12-05T15:39:56Z
dc.date.available 2025-12-05T15:39:56Z
dc.date.issued 2025 eng
dc.identifier.issn 2169-3536 eng
dc.identifier.uri http://hdl.handle.net/20.500.12603/2375
dc.description.abstract The correlation-based context similarity coefficient (CSC) metric is a pattern-based fingerprint similarity metric that has gained interest in fingerprint database clustering operations. The performance of the traditional distance-based metric in fingerprint vector similarity determination has been known to be influenced by the size of the fingerprint vector. However, as for the correlation-based CSC metric, there is no comprehensive research on how fingerprint vector size affects its performance in similarity determination and subsequently clustering performance. As such, this paper examines the impact of fingerprint vector size on the similarity determination performance of the correlation-based CSC metric with the k-medoids algorithm employed for clustering. The analysis is performed across four synthetic and two experimentally generated fingerprint databases with varying fingerprint vector sizes. The impact analysis is carried out with the clustering algorithm set to generate K is an element of [3, 5] clusters. Additionally, the results are compared against three distance-based metrics: squared Euclidean, Manhattan, and cosine. With silhouette score as the clustering performance metric, the simulation result shows that the size of the fingerprint vector influences the similarity determination performance of the correlation-based CSC metric. Additionally, the number of clusters in which the clustering algorithm is set to generate also contributes to how the correlation-based CSC metric performs in similarity determination. The similarity determination time complexity of the correlation-based CSC metric increases with fingerprint vector size, making efficient clustering more challenging as fingerprint vector sizes increase. For optimal performance, the correlation-based CSC metric is recommended for us as a similarity metric on a database with a fingerprint vector size of N <= 4 and a clustering algorithm configured to generate no more than 3 clusters. eng
dc.format p. 51660-51668 eng
dc.language.iso eng eng
dc.publisher IEEE eng
dc.relation.ispartof IEEE Access, volume 13, issue: March eng
dc.subject Examining eng
dc.subject the eng
dc.subject Impact eng
dc.subject Fingerprint eng
dc.subject Vector eng
dc.subject Size eng
dc.subject Determination eng
dc.subject and eng
dc.subject Clustering eng
dc.subject Performance eng
dc.subject Pattern-Based eng
dc.subject Similarity eng
dc.subject Metric eng
dc.title Examining the Impact of Fingerprint Vector Size on Similarity Determination and Clustering Performance of a Pattern-Based Similarity Metric eng
dc.type article eng
dc.identifier.obd 43881968 eng
dc.identifier.wos 001455525600016 eng
dc.identifier.doi 10.1109/ACCESS.2025.3553510 eng
dc.publicationstatus postprint eng
dc.peerreviewed yes eng
dc.source.url https://ieeexplore.ieee.org/document/10937090 cze
dc.relation.publisherversion https://ieeexplore.ieee.org/document/10937090 eng
dc.rights.access Open Access eng


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account