Recognizing what makes an image aesthetically pleasing is crucial to the effectiveness of many multimedia systems. Several works have attempted to build image aesthetic appeal predictors, creating their own sets of ground truth data for the purpose, either by using rated images from photo sharing websites, or by asking a pool of users to rate images in lab or crowdsourcing experiments. The literature has shown that the way these experiments are conducted can influence their results: a poor experimental setup can produce unreliable outcomes (i.e., highly imprecise aesthetic appeal measures). The question then arises whether the different choices made to collect aesthetic appeal ground truth data are appropriate. In this paper, we propose a systematic study of how the experimental environment and the rating scale used to collect image aesthetic appeal ground truth data influence the reliability and repeatability of aesthetic appeal assessments. Our findings show that discrete and continuous scales with five-point absolute category rating labels yield more reliable results, with the continuous scale being more reliable for abstract images. We also show that image aesthetic appeal assessments can be repeatable across different experimental environments (i.e., lab and crowdsourcing). We finally formulate concrete recommendations to guide the collection of large sets of ground truth data for training models of aesthetic appeal appreciation.
Original language: English
Pages (from-to): 1338-1350
Number of pages: 13
Journal: IEEE Transactions on Multimedia
Volume: 18
Issue number: 7
Publication status: Published - 27 Apr 2016

Research areas

  • computational aesthetics, image aesthetic appeal, Quality of Experience (QoE), crowdsourcing