TRANCO: A Research-Oriented Top Sites Ranking Hardened Against Manipulation

Victor Le Pochat; Tom Van Goethem; Samaneh Tajalizadehkhoob; Wouter Joosen

doi:10.14722/ndss.2019.23386

Organisation & Governance

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

232 Citations (Scopus)

377 Downloads (Pure)

Abstract

In order to evaluate the prevalence of security and privacy practices on a representative sample of the Web, researchers rely on website popularity rankings such as the Alexa list. While the validity and representativeness of these rankings are rarely questioned, our findings show the contrary: we show for four main rankings how their inherent properties (similarity, stability, representativeness, responsiveness and benignness) affect their composition and therefore potentially skew the conclusions made in studies. Moreover, we find that it is trivial for an adversary to manipulate the composition of these lists. We are the first to empirically validate that the ranks of domains in each of the lists are easily altered, in the case of Alexa through as little as a single HTTP request. This allows adversaries to manipulate rankings on a large scale and insert malicious domains into whitelists or bend the outcome of research studies to their will. To overcome the limitations of such rankings, we propose improvements to reduce the fluctuations in list composition and guarantee better defenses against manipulation. To allow the research community to work with reliable and reproducible rankings, we provide TRANCO, an improved ranking that we offer through an online service available at https://tranco-list.eu.

Original language	English
Title of host publication	Network and Distributed Systems Security (NDSS) Symposium 2019
Number of pages	15
ISBN (Electronic)	189156255X, 9781891562556
DOIs	https://doi.org/10.14722/ndss.2019.23386
Publication status	Published - 2019
Event	Network and Distributed Systems Security Symposium 2019 - San Diego, United States
Duration	24 Feb 2019 → 27 Feb 2019

Publication series

Name	26th Annual Network and Distributed System Security Symposium, NDSS 2019

Conference

Conference	Network and Distributed Systems Security Symposium 2019
Abbreviated title	NDSS 2019
Country/Territory	United States
City	San Diego
Period	24/02/19 → 27/02/19

Access to Document

10.14722/ndss.2019.23386

ndss2019_01B-3 (Final published version, 646 KB)

Cite this

@inproceedings{fa008ea342b04673838e88d830eb389c,

title = "TRANCO: A Research-Oriented Top Sites Ranking Hardened Against Manipulation",

abstract = "In order to evaluate the prevalence of security and privacy practices on a representative sample of the Web, researchers rely on website popularity rankings such as the Alexa list. While the validity and representativeness of these rankings are rarely questioned, our findings show the contrary: we show for four main rankings how their inherent properties (similarity, stability, representativeness, responsiveness and benignness) affect their composition and therefore potentially skew the conclusions made in studies. Moreover, we find that it is trivial for an adversary to manipulate the composition of these lists. We are the first to empirically validate that the ranks of domains in each of the lists are easily altered, in the case of Alexa through as little as a single HTTP request. This allows adversaries to manipulate rankings on a large scale and insert malicious domains into whitelists or bend the outcome of research studies to their will. To overcome the limitations of such rankings, we propose improvements to reduce the fluctuations in list composition and guarantee better defenses against manipulation. To allow the research community to work with reliable and reproducible rankings, we provide TRANCO, an improved ranking that we offer through an online service available at https://tranco-list.eu. ",

author = "{Le Pochat}, Victor and {Van Goethem}, Tom and Tajalizadehkhoob, Samaneh and Joosen, Wouter",

year = "2019",

doi = "10.14722/ndss.2019.23386",

language = "English",

isbn = "1-891562-55-X",

series = "26th Annual Network and Distributed System Security Symposium, NDSS 2019",

booktitle = "Network and Distributed Systems Security (NDSS) Symposium 2019",

note = "Network and Distributed Systems Security Symposium 2019, NDSS 2019 ; Conference date: 24-02-2019 Through 27-02-2019",

}

Le Pochat, V, Van Goethem, T, Tajalizadehkhoob, S & Joosen, W 2019, TRANCO: A Research-Oriented Top Sites Ranking Hardened Against Manipulation. in Network and Distributed Systems Security (NDSS) Symposium 2019. 26th Annual Network and Distributed System Security Symposium, NDSS 2019, Network and Distributed Systems Security Symposium 2019, San Diego, United States, 24/02/19. https://doi.org/10.14722/ndss.2019.23386

TRANCO: A Research-Oriented Top Sites Ranking Hardened Against Manipulation. / Le Pochat, Victor; Van Goethem, Tom ; Tajalizadehkhoob, Samaneh et al.
Network and Distributed Systems Security (NDSS) Symposium 2019. 2019. (26th Annual Network and Distributed System Security Symposium, NDSS 2019).

TY - GEN

T1 - TRANCO: A Research-Oriented Top Sites Ranking Hardened Against Manipulation

AU - Le Pochat, Victor

AU - Van Goethem, Tom

AU - Tajalizadehkhoob, Samaneh

AU - Joosen, Wouter

PY - 2019

Y1 - 2019

AB - In order to evaluate the prevalence of security and privacy practices on a representative sample of the Web, researchers rely on website popularity rankings such as the Alexa list. While the validity and representativeness of these rankings are rarely questioned, our findings show the contrary: we show for four main rankings how their inherent properties (similarity, stability, representativeness, responsiveness and benignness) affect their composition and therefore potentially skew the conclusions made in studies. Moreover, we find that it is trivial for an adversary to manipulate the composition of these lists. We are the first to empirically validate that the ranks of domains in each of the lists are easily altered, in the case of Alexa through as little as a single HTTP request. This allows adversaries to manipulate rankings on a large scale and insert malicious domains into whitelists or bend the outcome of research studies to their will. To overcome the limitations of such rankings, we propose improvements to reduce the fluctuations in list composition and guarantee better defenses against manipulation. To allow the research community to work with reliable and reproducible rankings, we provide TRANCO, an improved ranking that we offer through an online service available at https://tranco-list.eu.

UR - http://www.scopus.com/inward/record.url?scp=85170646912&partnerID=8YFLogxK

U2 - 10.14722/ndss.2019.23386

DO - 10.14722/ndss.2019.23386

M3 - Conference contribution

SN - 1-891562-55-X

T3 - 26th Annual Network and Distributed System Security Symposium, NDSS 2019

BT - Network and Distributed Systems Security (NDSS) Symposium 2019

T2 - Network and Distributed Systems Security Symposium 2019

Y2 - 24 February 2019 through 27 February 2019

ER -
