Documents

  • paperID636

    Accepted author manuscript, 367 KB, PDF-document

DOI

In computing, remote devices may be identified by means of device fingerprinting, which works by collecting a myriad of clientside attributes such as the device’s browser and operating system version, installed plugins, screen resolution, hardware artifacts, Wi-Fi settings, and anything else available to the server, and then merging these attributes into uniquely identifying fingerprints. This technique is used in practice to present personalized content to repeat website visitors, detect fraudulent users, and stop masquerading attacks on local networks. However, device fingerprints are seldom uniquely identifying. They are better viewed as partial device fingerprints, which do have some discriminatory power but not enough to uniquely identify users. How can we infer from partial fingerprints whether different observations belong to the same device?We present a mathematical formulation of this problem that enables probabilistic inference of the correspondence of observations. We set out to estimate a correspondence probability for every pair of observations that reflects the plausibility that they are made by the same user. By extending probabilistic data association techniques previously used in object tracking, traffic surveillance and citation matching, we develop a general-purpose probabilistic method for estimating correspondence probabilities with partial fingerprints. Our approach exploits the natural variation in fingerprints and allows for use of situation-specific knowledge through the specification of a generative probability model. Experiments with a real-world dataset show that our approach gives calibrated correspondence probabilities. Moreover, we demonstrate that improved results can be obtained by combining device fingerprints with behavioral models
Original languageEnglish
Title of host publicationproceedings of Joint European Conference on Machine Learning and Knowledge Discovery in Databases
PublisherSpringer
Pages222-237
Number of pages16
ISBN (Electronic)978-3-319-71246-8
ISBN (Print)978-3-319-71245-1
DOIs
Publication statusPublished - 2017
EventJoint European Conference on Machine Learning and Knowledge Discovery in Databases: Machine Learning and Knowledge Discovery in Databases - Skopje, Macedonia, The Former Yugoslav Republic of
Duration: 18 Sep 201722 Sep 2017

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume10535
ISSN (Print)0302-9743

Conference

ConferenceJoint European Conference on Machine Learning and Knowledge Discovery in Databases
Abbreviated titleECML PKDD 2017
CountryMacedonia, The Former Yugoslav Republic of
CitySkopje
Period18/09/1722/09/17

ID: 40692138