Modelling the effects of inter-observer variation on colour rendition

Similar documents
Application of CIE with Associated CRI-based Colour Rendition Properties

Color Appearance in Image Displays. O Canada!

Assessing Colour Rendering Properties of Daylight Sources Part II: A New Colour Rendering Index: CRI-CAM02UCS

Module 3. Illumination Systems. Version 2 EE IIT, Kharagpur 1

Published online: 12 Jan 2015.

Reading. 2. Color. Emission spectra. The radiant energy spectrum. Watt, Chapter 15.

Spectral Adaptation. Chromatic Adaptation

Visual Evaluation and Evolution of the RLAB Color Space

COLOR FIDELITY OF CHROMATIC DISTRIBUTIONS BY TRIAD ILLUMINANT COMPARISON. Marcel P. Lucassen, Theo Gevers, Arjan Gijsenij

Color Vision. Spectral Distributions Various Light Sources

Design & Use of the Perceptual Rendering Intent for v4 Profiles

Colour rendering open questions and possible solutions

ams AG TAOS Inc. is now The technical content of this TAOS document is still valid. Contact information:

Colour appearance modelling between physical samples and their representation on large liquid crystal display

Consistent Colour Appearance: proposed work at the NTNU ColourLab, Gjøvik, Norway

Why does a visual system need color? Color. Why does a visual system need color? (an incomplete list ) Lecture outline. Reading: Optional reading:

Color. Reading: Optional reading: Chapter 6, Forsyth & Ponce. Chapter 4 of Wandell, Foundations of Vision, Sinauer, 1995 has a good treatment of this.

The ZLAB Color Appearance Model for Practical Image Reproduction Applications

Analysis of spectrophotometer specular performance using goniometric information

CS681 Computational Colorimetry

Colour Reading: Chapter 6. Black body radiators

Introduction to color science

Brightness, Lightness, and Specifying Color in High-Dynamic-Range Scenes and Images

Colour computer vision: fundamentals, applications and challenges. Dr. Ignacio Molina-Conde Depto. Tecnología Electrónica Univ.

Reprint (R30) Accurate Chromaticity Measurements of Lighting Components. Reprinted with permission from Craig J. Coley The Communications Repair depot

The Elements of Colour

Using surface markings to enhance accuracy and stability of object perception in graphic displays

UNEP-lites.asia Laboratory Training Workshop

Diagonal versus affine transformations for color correction

Automated Control of The Color Rendering Index for LED RGBW Modules in Industrial Lighting

A Data Flow Approach to Color Gamut Visualization

this is processed giving us: perceived color that we actually experience and base judgments upon.

CS635 Spring Department of Computer Science Purdue University

CSE 167: Lecture #6: Color. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2012

Digital Image Processing

3 Graphical Displays of Data

CSE 167: Lecture #7: Color and Shading. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2011

Estimating the wavelength composition of scene illumination from image data is an

An Algorithm to Determine the Chromaticity Under Non-uniform Illuminant

3 Graphical Displays of Data

CSE 167: Lecture #6: Color. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2011

Reading. 4. Color. Outline. The radiant energy spectrum. Suggested: w Watt (2 nd ed.), Chapter 14. Further reading:

Colour Gamut Mapping for Ultra-HD TV

Attention modulates spatial priority maps in human occipital, parietal, and frontal cortex

MODELING LED LIGHTING COLOR EFFECTS IN MODERN OPTICAL ANALYSIS SOFTWARE LED Professional Magazine Webinar 10/27/2015

Digital Image Processing COSC 6380/4393. Lecture 19 Mar 26 th, 2019 Pranav Mantini

A Geostatistical and Flow Simulation Study on a Real Training Image

Spectral Images and the Retinex Model

VISUAL COLOUR-RENDERING EXPERIMENTS N

Thank you for choosing NCS Colour Services, annually we help hundreds of companies to manage their colours. We hope this Colour Definition Report

Visualizing Metaclusters

Characterizing and Controlling the. Spectral Output of an HDR Display

When this experiment is performed, subjects find that they can always. test field. adjustable field

Lecture 1 Image Formation.

Meet icam: A Next-Generation Color Appearance Model

CSE 167: Introduction to Computer Graphics Lecture #6: Colors. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2013

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Preset Functions for Color Vision Deficient Viewers. The use of color is an important aspect of computer interfaces. Designers are keen to use

Refinement of the RLAB Color Space

Standard method of assessing the spectral quality of daylight simulators for visual appraisal and measurement of colour

Tips and Guidance for Analyzing Data. Executive Summary

VISUAL SIMULATING DICHROMATIC VISION IN CIE SPACE

Chapter 5. Track Geometry Data Analysis

The exam begins at 2:40pm and ends at 4:00pm. You must turn your exam in when time is announced or risk not having it accepted.

Black generation using lightness scaling

A Silicon Graphics CRT monitor was characterized so that multispectral images could be

Comparing the Usability of RoboFlag Interface Alternatives*

Color. Computational Photography MIT Feb. 14, 2006 Bill Freeman and Fredo Durand

What is Color and How is It Measured?

How Strong Metamerism Disturbs Color Spaces

Descriptive Statistics Descriptive statistics & pictorial representations of experimental data.

Chapter 2 CIECAM02 and Its Recent Developments

PEER Report Addendum.

Scientific imaging of Cultural Heritage: Minimizing Visual Editing and Relighting

ELEC Dr Reji Mathew Electrical Engineering UNSW

HEURISTIC OPTIMIZATION USING COMPUTER SIMULATION: A STUDY OF STAFFING LEVELS IN A PHARMACEUTICAL MANUFACTURING LABORATORY

(0, 1, 1) (0, 1, 1) (0, 1, 0) What is light? What is color? Terminology

Fluorescent Excitation from White LEDs

Big Mathematical Ideas and Understandings

WEB ANALYTICS A REPORTING APPROACH

Using Excel for Graphical Analysis of Data

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

Case Study: Attempts at Parametric Reduction

Statistics: Interpreting Data and Making Predictions. Visual Displays of Data 1/31

Herding CATs: A Comparison of Linear Chromatic-Adaptation Transforms for CIECAM97s

Low Cost Colour Measurements with Improved Accuracy

Supplementary text S6 Comparison studies on simulated data

Local Linear Models for Improved von Kries Adaptation

Table of Contents (As covered from textbook)

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

Overview for Families

Adrián Álvarez, Miguel A. Pérez I. INTRODUCTION

2003 Steve Marschner 7 Light detection discrete approx. Cone Responses S,M,L cones have broadband spectral sensitivity This sum is very clearly a dot

Simultaneous surface texture classification and illumination tilt angle prediction

Graphical Analysis of Data using Microsoft Excel [2016 Version]

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.

CIE Colour Chart-based Video Monitoring

Supplementary Figure 1. Decoding results broken down for different ROIs

Paper SAS Taming the Rule. Charlotte Crain, Chris Upton, SAS Institute Inc.

Measurement of display transfer characteristic (gamma, )

Transcription:

Lighting Res. Technol. 17; : 1 18 Modelling the effects of inter-observer variation on colour rendition MJ Murdoch PhD and MD Fairchild PhD Munsell Color Science Laboratory, Rochester Institute of Technology, Rochester, NY, USA Received 7 July 17; Revised 14 October 17; Accepted 5 October 17 The colour rendition characteristics of light sources are quantified with measures based on CIE standard observers, which are reasonable representations of population averages. However, even among people with normal colour vision, the natural range of variation in colour sensitivity means any individual may see something different than the standard observer. Modelling results quantify the effects of these inter-observer differences on colour rendition measures defined by IES TM-3-15. In general, inter-observer differences tend to be smaller for light sources with high colour fidelity values, and they are affected by spectral characteristics of different lighting technologies. The magnitude of variation in colour rendition measures, up to 5 1 units in IES TM-3-15 (R f, R g ), measures is compared with other sources of variability and ambiguity. 1. Introduction Colour rendition is one of the most important characteristics of a light source, while at the same time being one of the most difficult to describe simply. Throughout the history of modern electric lighting, researchers and practitioners have sought to provide objective and relevant descriptors of colour performance. In recent years, an ongoing conversation on colour fidelity, naturalness, gamut area and preference, along with the evolution of colour difference computations, has played out. So far, this conversation has left out another maturing topic in colour science: the role of inter-observer variability. This paper outlines how the human population distributions that affect colour sensitivity translate to differences in colour rendition, and puts them in context with other sources of variation. Address for correspondence: Michael J Murdoch, Munsell Color Science Laboratory, Rochester Institute of Technology, 54 Lomb Memorial Drive, Rochester, NY 1463, USA. E-mail: michael.murdoch@mail.rit.edu. Method.1. Colour rendition of light sources The spectral characteristics of an illuminant have a direct and readily apparent effect on the reflected spectral power distribution of objects being illuminated, and as such on the colour stimuli reaching the human eye. As lighting technologies have developed, and as our understanding of human perception has improved, many different measures and indices have been used to describe colour rendition. A good review of progress since 1965 is provided by Houser et al., 1 in which they compare measures that use different classes of colour rendition: fidelity, or the accuracy with which a light source renders object colours as compared to a familiar reference; preference, or the pleasantness or flattery with which a light source renders objects; and discrimination, or the ability of a light source to increase perceived differences between object colours. The legacy industry standard for colour rendition, which addresses colour fidelity, is the ß The Chartered Institution of Building Services Engineers 17 1.1177/1477153517744387

MJ Murdoch and MD Fairchild Commission Internationale de l Eclairage (CIE) General Colour Rendering Index R a, which describes the average computed accuracy in colour reproduction over a small set of standard colours, relative to a reference illuminant of the same correlated colour temperature (CCT). CIE R a was developed in part to quantify visual differences people noticed in the colour rendition of objects and human skin under gas discharge and fluorescent lighting compared to incandescent sources or natural daylight. 3 Rules-of-thumb and prescriptive norms have followed, for example ENERGY STAR Õ requires a CIE R a of 8 or higher. 4 Lighting manufacturers have responded by producing products carefully designed to meet the different levels of CIE R a values that the market demands. However, because CIE R a addresses only colour fidelity, and it does so with a small number of test colours using outdated colorimetry, it is being eclipsed by the Illuminating Engineering Society of North America s (IESNA) standard TM-3-15 (TM-3, herein), 5 presently under consideration internationally by the CIE. As described by David et al., 6 TM-3 uses a set of 99 colour evaluation samples (CES) chosen for spectral and colour space uniformity and computes colour differences for these colours between test and reference sources in CAM-UCS. 7 The results provided by TM-3 are two summary scores a colour fidelity index R f and a colour gamut index R g along with additional graphics and values to clarify huebinned colour distortions. The need for a twomeasure system follows an interesting line of research into discrimination, naturalness, and preference including the work of Rea and Freyssinnier, 8,9 Smet et al. 1 and Houser et al. 1 Importantly, TM-3 makes excellent use of plots, especially colour vector graphics, which grew out of the work of van der Burgt et al., 11,1 to further explain colour rendition differences that the two average indices may obscure. The IESNA provides an Excel spreadsheet that performs the computations described in the TM-3 document. The application of TM-3 to a test light source includes these steps: 1) Determine the CCT of the test source using a standard -degree observer. ) Define a reference illuminant of the same CCT that is Planckian if below 45 K, using the CIE daylight model if above 55 K, and a linear mix in between. 3) Compute the coordinates of the 99 spectral colour evaluation samples (CES) in CAM-UCS using a standard 1-degree observer for both test and reference light sources. 4) Compute colour differences between test and reference as an Euclidean distance in CAM UCS, and report a fidelity metric R f based on the mean colour difference. 5) Compute a polygon in the opponentcolour plane of CAM UCS based on 16 binned hues of the 99 CES for both test and reference sources, and report a gamut R g based on the ratio between the areas of the test and reference polygons. 6) Provide bar charts of hue-binned fidelity and relative saturation along with a huecircle distortion colour vector graphic. It is important to note that neither CIE R a nor IESNA TM-3 explicitly provides a quality or preference metric. Many of the references mentioned do discuss preference and in general the pleasing enhancement of colour saturation in certain parts of the hue circle, depending on context. 1,13 15 Recently, Royer et al. 16,17 have shown that the relative saturation increase of red hues, as quantified by the TM-3 colour saturation score for hue bin 16, R cs,h16, is a good predictor for perceived saturation and preference. However, the present study makes use of the TM-3 measures explicitly and thus does not comment on preference. To quantify the effects of normal variation in visual sensitivity in the population, we modelled realistic

Inter-observer variation of colour rendition 3 individual observer dependencies and how they cascade through computations to colour rendition measures. This paper presents a complete story based on preliminary findings published previously. 18.. Inter-observer variability Colorimetry has functioned successfully for nearly a century using sets of colour matching functions designed to represent the visual sensitivity of an average human observer. As with any average, the mean sensitivities do not necessarily represent any individual observer. Within any given population of people with normal colour vision (i.e. excluding colour vision deficiencies), there is a natural range in spectral sensitivities caused by variations in cone spectral absorptivity, ocular media density, and other anatomical and physiological parameters. Some of these variations are age dependent, others are genetic, and some have other causes such as diet or environment. It is not practical to measure many individuals visual sensitivities, but using known, measured distributions of the variables that affect sensitivity, population distributions can be estimated. CIE TC 1-36 standardised a representative set of physiological observers in 6 (CIE6) from which cone responsivities can be computed for observers of various ages (from to 8 years) and for matching stimuli of various field sizes (from 1 8 to 1 8). 19 However, each of these computed CIE6 function sets represent an average observer for that particular age and field size. They do not capture the added variability amongst all of the observers for the specified age and field size combination. Sarkar et al. provide both an excellent review of the development of the CIE6 observers and an analysis of the effect of physiological perturbations to the average CIE6 observers on colour reproduction. They showed that observer differences caused both differences in computed colorimetric values and changes in direction of error for pairs of stimuli that were metameric to the average observer. Recently, Asano et al. 1 created an individual colorimetric observer model that takes into account the natural distributions of anatomical and physiological features such as physical densities of ocular components and wavelength and density shifts of cone photopigments based on age, size of visual field and random population variability. Using Monte Carlo simulations, their model is able to create populations of individual sets of cones sensitive to long-, medium- and short-wavelength (LMS) light with realistic physiological distributions for observers of specified age and field size. These simulated populations of observers have been verified using population statistics and on an individual basis by predicting observed experimental colour matches. A simulated set of 1 individual observers 1-degree LMS sensitivities is shown as shaded areas in Figure 1(a), along with the mean LMS sensitivities as black lines. Each individual s sensitivities are a smooth set of curves plotted with transparency so that the shaded regions are denser where more observers happen to overlap. The relative sensitivities of human LMS cone cells are often transformed to XYZ-like colour matching functions (which will simply be referred to as CMFs in this paper) that can then be used to integrate a stimulus spectral power distribution to colorimetric values. There is not a standard conversion matrix between LMS and CMF curves, so in this work a best-fit 3 3 matrix is computed between the mean of a population of LMS curves and the CIE 1-degree standard observer CMFs. The population CMFs corresponding to the LMS curves in Figure 1(a) are shown as shaded in Figure 1(b) with the 1-degree standard observer overlaid as black lines. The specific matrix conversion from mean LMS to mean XYZ sensitivities follows

4 MJ Murdoch and MD Fairchild (a) 1 Rel. sensitivity 8 6 4 (b) Rel. sensitivity 1.5 1.5 3 x 6 7 4 y z 51 ffi 4 45 5 55 6 65 7 4 45 5 55 6 65 7 Wavelength (nm) Wavelength (nm) Figure 1 Population sensitivities. Plot (a) shows LMS long- (in red), medium- (green) and short-wavelength (blue) cone sensitivities, and plot (b) shows CMF x (red), y (green) and z (blue), of 1 simulated observers for 1-degree field of view. Solid black lines show average LMS sensitivity in plot (a) and CIE 1964 standard observer in plot (b). (Available in colour in online version). 3 :453 :631 :461 6 7 4 :1617 :731 :13 5 :3 :48 :35 3 l 6 7 4 m 5 s US:1 The CMFs in Figure 1(b) behave, on average, like the CIE 1-degree observer, but individual curves include variation typical of the US population. Each individual s set of three curves can be used in place of the standard CMFs for any colorimetric computation. This affects everything from XYZ to CIELAB values. In this work, two simulated populations of observers, comprised of realistically-distributed individual colour matching functions, are used to analyse how individual differences affect colour rendition measures with selected light sources..3. Modelling methodology Colour rendition computations essentially mimic a visual comparison between a set of coloured objects illuminated by a test light source and the same objects illuminated by a reference illuminant. One could physically set up this comparison and have a look, but usually, this is done computationally using standard colorimetric observers (e.g. CIE 1964 1-degree observer, in the case of TM-3), giving numbers that describe the colour rendition that the standard observer would see. The situation modelled herein is slightly different: the same comparison between test and reference is set up (implying that the choice of reference illuminant, based on CCT, was made using the standard colorimetric observer CIE 1931 -degree observer, in the case of TM-3). Then, a simulated but realistic population of observers with normal colour vision queued up to make the visual comparison, and they each reported their description of colour rendition based on what they saw. Because of interobserver variation, each has some probability of seeing something different than the standard observer, while on average the population behaves similarly to the standard observer. MATLAB implementations of the colour rendition measures discussed above were written based on the IES TM-3 document 5 and verified against its Excel spreadsheet (IES TM-3-15 Basic Calculation Tool v1.1.xlsm). The MATLAB re-implementation was necessary both for rapid scriptability and to allow the flexibility to vary the colour matching functions (and thus diverge from the TM-3 recommendation). In Step 3 of the TM-3 computation, the CMFs used in the

Inter-observer variation of colour rendition 5 computation of colorimetric values for both test and reference sources were made variable so that the individual colorimetric observers could be used. The CIE 1931 -degree standard colorimetric observer was used to calculate CCT and select the reference illuminant for all of the present modelling (Steps 1 and ). Thus, this is a simulation of realistic populations of observers assessing the comparison of test and reference illuminants and potentially seeing colour rendition characteristics that differ from standard measures thanks to their individual sensitivities..4. Observer populations In the present work, we simulated two different populations at two different visual field sizes. Because age has a strong effect on the physiological factors that affect visual sensitivity, the populations differ in age distribution. The first population, a.k.a. the US Age population, comprised 1 observers sampled from an age distribution taken from the most recent (1) U.S. census. The age distribution is fairly uniform up to about age 6, then it tails off steeply. A subset of this distribution was made, limited to ages between 1 and 7, inclusive, the same as that described by Asano et al. 1 Their paper gives further details on the distributions of physiological characteristics used in the simulation. Our supplementary data file USAgePopulation.xlsx includes all 1 observers physiological parameters and LMS sensitivities these are the sensitivities shown in Figure 1. The US Age sensitivities are repeated in Figure, with the sensitivities of two example observers, arbitrarily named Pat and Sam, highlighted. Much of the variation visible can be interpreted as a wavelength shift in LMS: peak sensitivity wavelength shift for each cone type are explicit parameters in Asano s model, and one effect of increased macular density is similar to a shift of the S curve to longer wavelengths. The highlighted observers were selected from the 1 because they are both of the mean and median age of 38 years, and because they are approximately equidistant, on average, in wavelength shift, from the average observer. Their LMS values are shifted about 3 nm batho (Pat) and hypso (Sam) from the average LMS curves, which puts them at the 1 th and 88 th percentiles of the population. Their names are intentionally gender-neutral to remind the reader that sex is not a physiological characteristic included in the model. (a) 1 Rel. Sensitivity 8 6 4 (b) Mean LMS Pat CIE 1964 Pat Sam Sam Rel. Sensitivity 1.5 1.5 4 45 5 55 6 65 7 4 45 5 55 6 65 7 Wavelength (nm) Wavelength (nm) Figure US Age population sensitivities. Plot (a) shows LMS long- (in red), medium- (green) and short wavelength (blue) cone sensitivities, and plot (b) shows CMF x (red), y (green) and z (blue), of 1 simulated observers drawn from an age distribution matching the US population, for 1-degree field of view. Solid black lines show average LMS sensitivity in plot (a) and CIE 1964 standard observer in plot (b). Dashed lines show example observers Pat and Sam, explained in the text. (Available in colour in online version).

6 MJ Murdoch and MD Fairchild Their results are included in subsequent figures to help show individual and population differences. The second population, the Age Group population, includes a bimodal distribution, chosen both because age is one of the few physiological factors that can be practically targeted in the market, and to emphasise that age is not the only factor in inter-observer variation. The Age Group population includes 5 observers of age 5 years and 5 observers of age 65 years, and their LMS and CMF sensitivities can be seen in Figure 3. With the younger observers plotted in transparent red and the older in transparent blue, biases are clearest at short wavelengths, while in regions where they are similar they blend together as purple. Note that while age is a strong contributor to visual differences, it is not the only one: there are differences in sensitivity visible both within and between the two ages present in the Age Group population. All 1 of the Age Group observers physiological parameters and LMS sensitivities are included in the supplementary data file AgeGroupPopulation.xlsx. As was done with the US Age population, the simulated Age Group LMS sensitivities were transformed to CMFs using a best-fit matrix. However, separate matrices were fit for the average of the younger and the average of the older groups of the population. Our intent behind this choice was to be conservative and not exaggerate the effect of age difference beyond their inherent CMF shape differences. The matrix equations are shown below, respectively 3 3 6 4 x 7 y z 3 x 6 7 4 y z 51 ffi 51 ffi :4641 :784 :47 6 7 4 :1746 :616 :3 5 :86 :119 :3 3 l 6 7 4 m 5 s 5yr:1 3 :4165 :63 :445 6 7 4 :136 :1 :15 5 :11 :134 :31 3 l 6 7 4 m 5 s 65yr:1.5. Light sources Nine contemporary light sources with differing spectral characteristics and CCTs were (a) 1 Rel. sensitivity 8 6 4 (b) Rel. sensitivity 1.5 1.5 4 45 5 55 6 65 7 4 45 5 55 6 65 7 Wavelength (nm) Wavelength (nm) Figure 3 Age group sensitivities: plot (a) shows LMS cone sensitivities, and plot (b) shows CMF xyz of 1 simulated observers, 5 of whom are 5 years old (plotted in red) and 5 of whom are 65 (blue). Black lines in (a) indicate the mean LMS of each group of 5 (solid for age 5, dashed for age 65), while solid black lines in (b) show the CIE 1964 1- degree x-bar, y-bar and z-bar standard CMFs. (Available in colour in online version).

Inter-observer variation of colour rendition 7 Table 1 Selected light sources: Code numbers and descriptions from the TM-3 spreadsheet, with relevant measures. CCT (in Kelvin), D uv, and R a are computed as standardised by CIE; R f, R g and R cs,h16 according to TM-3-15 Code Source type CCT D uv R a R f R g R cs,h16 8 Incandescent (6WA19) 81.1 1 1 1 % 114 RGB (455/547/63) 33.1 73 73 113 17% 13 RGB (465/535/59) 53.4 8 77 9 13% 146 RGBA (455/53/59/635) 3K-A 943.14 74 83 11 13% 16 RGBA (455/53/59/635) 4K-G 3959.11 9 88 14 4% 169 LED Phosphor Blue Pump (1) 88.8 9 87 91 4% 55 LED Phosphor Blue Pump (87) 63.6 97 96 11 1% 76 LED Phosphor Blue Pump (18) 381.5 84 8 11 % 79 LED Phosphor Blue Pump (111) 6811.39 8 74 85 13% selected, including tungsten, RGB and RGBA mixed-narrowband LEDs, and blue-pumped phosphor-converted white LEDs. The nine sources, with code numbers corresponding to the light source numbers in the TM-3 spreadsheet, 5 are listed in Table 1 and their relative spectral power distributions are shown in Figure 4. They were selected both because they include a variety of relevant LED light sources and because they cover a wide range of gamut versus fidelity values. These nine are not meant to be a representative sampling of all light sources, but rather illustrative examples; later in the paper a larger set of light sources is considered. 3. Results and discussion 3.1. Variation in colour rendition measures The modelling results show how the natural variations in an observer population manifest themselves in terms of colour rendition measures for the selected light sources. Looking first at the variation present in the US Age population, we can see what degree of variation can be expected among observers in the US. Figure 5 shows TM-3 gamut versus fidelity plots, with clouds of dots indicating the computed measures for each individual, and the labeled open circles showing the standard observer s measures. The trend with these nine light sources is for Rel. Power Rel. Power Rel. Power Rel. Power 114 33K 13 53K 146 943K 169 88K 76 381K 16 3959K 55 63K 79 6811K 4 5 6 7 4 5 6 7 Wavelength (nm) Wavelength (nm) Figure 4 Selected spectral power distributions: the eight LED light sources are shown. Tungsten (8) is not shown smaller clouds near the maximum-fidelity cusp: at high levels of fidelity, a light source is spectrally so similar to its reference that little inter-observer variation is possible: what this means is that even if different observers see a light source s colour rendition differently, the difference between its colour

8 MJ Murdoch and MD Fairchild Rg gamut 1 115 11 15 1 95 9 85 CIE 1964 Pat Sam 8 7 75 8 85 9 95 1 Rf fidelity Figure 5 R g vs. R f clouds: red clouds of points for each of the selected light sources indicate the modeled variability in observer colour sensitivity. The labeled open circles indicate the (R f, R g ) values computed using the CIE 1964 Standard Observer, according to TM-3; and symbols þ and show (R f, R g ) values for example observers Pat and Sam, respectively rendition and that of the reference is consistently small over observers. Results for the highest-fidelity light source, tungsten (8), are essentially pinpoint. For other sources throughout the (R f, R g ) space, the cloud shapes are not consistently patterned. A few clouds are essentially elliptical with various orientations, while others appear to have an underlying curl to their shapes. Further information about the differences in colour rendition seen by the population of observers is shown in the colour vector graphics in Figure 6. These plots show relative distortions of a hue circle on normalised CAM-UCS axes approximating bluishyellowish versus greenish-reddish. The gray polygon represents the reference hue circle, made up of the 16 hue bins of TM-3, and the black solid ring shows the location of the hue bins under the test light source computed using the CIE 1964 1-degree standard observer. The short line segments connecting the vertices help indicate the direction of both saturation (radial) and hue (circumferential) colour distortions. Also shown on these plots are the rings for each of the 1 observers (red) and example observers Pat (dash) and Sam (dot-dash). A few values are presented in the centre of each graphic, the R cs,h16 overall mean and standard deviation, as well as the values for Pat and Sam. Looking over the colour vector graphics, in some cases the inter-observer variation is greater in regions where the distortion is greater: for example, light sources 8 and 55 show very little distortion and almost no variation, while light sources 114 and 79 show lots of both. However, light source 13 shows lots of variation, even in regions with low distortion, while light sources 169 and 76 show very low variation even in regions of high distortion. The lack of pattern in these variations seems to reinforce the need for a multi-measure, graphical approach to communicating colour rendition and variability. 3.. Age-related variation Turning to the Age Group distribution, results in R g versus R f are shown in Figure 7, with the 5-year-old observers clouds plotted in red and the 65-year-old observers clouds plotted in blue. It is apparent that not only are there large differences between observers within these subgroups, there are even larger differences between them. For most plotted light sources, there is a clear difference between the age-based clouds. In extreme examples such as 146 and 169, there is little to no overlap between the subgroups. This means that observers of different ages are unlikely to perceive the colour rendition of these light sources in the same way. Colour vector graphics for the Age Group population are shown in Figure 8, in which

Inter-observer variation of colour rendition 9 bluish - yellowish Code: 8 Mean: % Std:.% Pat: % Sam: % Code: 114 Mean: 17% Std: 1.6% Pat: 18% Sam: 15% Code: 13 Mean: 13% Std: 3.% Pat: 8% Sam: 17% bluish - yellowish Code: 146 Mean: 13% Std: 1.3% Pat: 14% Sam: 11% Code: 16 Mean: 4% Std: 1.1% Pat: 3% Sam: 4% Code: 169 Mean: 4% Std:.8% Pat: % Sam: 5% bluish - yellowish Code: Mean: Std: Pat: Sam: 55 1%.7% % % Code: 76 Mean: % Std:.8% Pat: % Sam: 4% Code: 79 Mean: 13% Std:.8% Pat: 13% Sam: 13% Greenish - reddish Greenish - reddish Greenish - reddish Figure 6 US Age colour vector graphics: each of the nine subplots indicates the relative distortion of hues around a hue circle, plotted on normalised axes approximating bluish-yellowish vs. greenish-reddish. On each plot, the gray polygon is an undistorted circle, the red polygons show the distortions per hue for observers in the simulated population, and the solid black polygon indicates the distortion computed with the standard observer. Short vectors connecting the gray and black polygons show the direction of distortion, and dashed and dot-dashed polygons represent the relative distortion seen by observers Pat and Sam, respectively. Text within each graphic lists the relative colour saturation for hue bin 16 (R cs,h16 ): overall mean and standard deviation, and values for Pat and Sam. (Available in colour in online version). the red polygons indicate the younger group (age 5) and the blue indicate the older group (age 65). These vector graphics can be compared to those in Figure 6, which highlights the example observers Pat and Sam, both aged 38. While not totally consistent, it may be observed that in many hue regions, for most light sources shown, Pat s results look like those typical of the younger age group, and Sam s look like those of the older age group. This is because the LMS sensitivities of Sam and Pat are shifted batho and hypso with respect to the standard observer, which is the same trend seen on average in younger and older observers, respectively. Figure 5 may be compared with Figure 7 as

1 MJ Murdoch and MD Fairchild Rg gamut 1 115 11 15 1 95 9 85 CIE 1964 5 years 65 years 8 7 75 8 85 9 95 1 Rf fidelity Figure 7 R g vs. R f clouds by age: distributions of colour rendition measures for 5-year-old observers (red) and 65-year-old observers (blue). The labelled open circles indicate the (R f, R g ) values computed using the CIE 1964 Standard Observer, according to TM-3. (Available in colour in online version). well, and a similar relationship can be observed in TM-3 (R f, R g ). It should be emphasised, however, that averages do not tell the whole story, because while age is a strong factor, the distribution of wavelength shifts and other characteristics reaches nearly the same range for individuals of any given age, as do the distributions seen in the resulting colour rendition measures. 3.3. Differences between light sources Returning to the US Age population and looking closely at the results for Pat and Sam in Figure 5, it can be observed that they do not consistently appear on the same sides of the clouds. Specifically comparing light sources 16 and 76 near the middle of the diagram, Pat (þ) sees a much smaller difference in R f, on average, than does Sam (), because they appear to diverge in different directions from the standard observer. Differences in the R f and R g metrics for all observers including Pat and Sam are shown in Figure 9(a). It is worth pointing out the very different spectral characteristics of these two light sources (shown in Figure 4), which is likely the cause of the divergence between Pat and Sam. Is it significant if one observer sees an R f difference of 6 and another sees an R f difference of 1? An easier question to answer is, what if two observers disagree on the direction of a difference (meaning, one thinks A4B while the other thinks B4A)? We found several pairs of light sources in the TM- 3 spreadsheet that show this situation, with similar (R f, R g ) values and reasonably high R f. To illustrate, this, the light sources 141 (RGBA LED of 994 K, D uv.41, R f 86.9, R g 99.5) and 19 (Phosphor-converted LED of 3551 K, D uv.4, R f 87., R g 99.1) are shown in Figure 9(b). In this figure, it appears that about half the population will find the first source higher in R f and R g and the second source lower, and the other half of the population will see the opposite. Figure 1 clarifies the positions of these four illuminants in TM-3 (R f, R g ) space and shows how Pat and Sam relate to the standard observer. We do not mean to exaggerate the magnitude of these differences Pat and Sam are about four units of R f and.5 units of R g apart but our point is that it is quite possible for observers with normal colour vision to disagree on the direction of a difference in colour fidelity and/or colour saturation between two light sources. 3.4. Susceptibility of light sources to inter-observer variation One goal based on this work might be to design or specify light sources that are less susceptible to inter-observer variation in their colour rendition. It is interesting to observe that some light sources are indeed more

Inter-observer variation of colour rendition 11 bluish - yellowish Code: 8 Mean 5: % Std 5:.% Mean 65: % Std 65:.% Code: 114 Mean 5: Std 5: Mean 65: Std 65: 19%.% 15% 1.% Code: 13 Mean 5: 13% Std 5: 3.% Mean 65: 14% Std 65: 3.4% bluish - yellowish Code: 146 Mean 5: 14% Std 5: 1.4% Mean 65: 11% Std 65:.6% Code: 16 Mean 5: 4% Std 5: 1.% Mean 65: 3% Std 65:.8% Code: 169 Mean 5: 4% Std 5:.8% Mean 65: 4% Std 65:.5% bluish - yellowish Code: Mean 5: Std 5: Mean 65: Std 65: 55 %.6% 1%.5% Code: 76 Mean 5: 1% Std 5: 1.% Mean 65: 4% Std 65:.6% Code: 79 Mean 5: 13% Std 5:.6% Mean 65: 1% Std 65:.6% Greenish - reddish Greenish - reddish Greenish - reddish Figure 8 Age Group colour vector graphics: each of the nine subplots indicates the relative distortion of hues around a hue circle, plotted on normalised axes approximating bluish-yellowish vs. greenish-reddish. On each plot, the gray polygon is an undistorted circle and the solid black polygon indicates the distortion computed with the standard observer. Short vectors connecting the gray and black polygons show the direction of distortion. Red and blue polygons show the distortions per hue for observers in the simulated age groups of 5 years and 65 years, respectively. Text within each graphic lists the mean and standard deviation of relative colour saturation for hue bin 16 (R cs,h16 ) for each age group. (Available in colour in online version). susceptible than others, meaning the clouds of observer results for the US Age population are larger. On the TM-3 R g versus R f graphic, even though the inter-observer clouds are not strictly elliptical in shape, a first-order assessment of their two-dimensional variance and orientation may be made with fitted ellipses. Ellipses were fitted to the US Age distributions clouds of R g versus R f values for all light sources in the TM-3 data set and are shown in Figures 11 and 1. Both plots show the general trend of smaller ellipses closer to the maximum R f point, but also clear differences in the characteristics of different light source technologies, at least in this set of light source spectra: in Figure 11, the

1 MJ Murdoch and MD Fairchild (a) Difference in Rg 8 7 6 5 4 3 1 (b) Difference in Rg 4 3 1 1 3 4 4 6 8 1 1 6 4 4 6 Difference in Rf Difference in Rf Figure 9 Differences in R f and R g : two pairs of light sources are compared. Plot (a) shows the difference in R f and R g for light sources 16 and 76 for all 1 observers (red dots), the standard observer (o), Pat (þ), and Sam (). Plot (b) shows the same for light sources 141 and 19 1 11 CIE 1964 Pat Sam 115 Blue pump RGB/RGBA 11 15 15 Rg gamut 1 Rf gamut 1 95 9 95 85 8 85 Rf fidelity 9 8 7 8 Rf fidelity 9 1 Figure 1 R g vs. R f close up: the light source pairs 16 and 176 along with 141 and 19, shown for all 1 observers (red dots), the standard observer (o), Pat (þ), and Sam (). Black connecting lines are included to show which Pat and Sam points belong with which light sources Figure 11 Ellipse fits part I: ellipses fitted to the US Age distributions in R g vs. R f for two categories of LED light sources taken from the TM-3 data set. Ellipses are shown for blue-pump (dotted) and for RGB/RGBA (solid) LEDs. Higher eccentricity and more vertical orientation are characteristic of the RGB/RGBA LEDs

Inter-observer variation of colour rendition 13 1 (a) 115 11 15 Fluoro LEDHybrid Incand Ellipse area 15 1 5 Blue pump RGB/RGBA Fluoro Incand Rf gamut 1 95 9 85 (b) 1 Ellipse eccentricity.8.6.4. 7 75 8 85 9 95 1 8 7 8 9 1 Rf fidelity Figure 1 Ellipse fits part II: ellipses fitted to the US Age distributions in R g vs. R f for three categories of LED light sources taken from the TM-3 data set. Ellipses are shown for fluorescent (solid), LED Hybrid (dotted), and incandescent (dashed) LEDs ellipses of the blue pump LEDs (blue) appear more circular and more horizontally-oriented, and those of the RGB/RGBA LEDs (red) appear more eccentric and more verticallyoriented, meaning the average gamut, or relative colour saturation, of RGB/RGBA LEDs relative to a reference illuminant can be quite different for different observers. Figure 1 shows three additional light source categories: fluorescent (green solid), whose ellipses are generally moderately eccentric and horizontally oriented; LED hybrid (magenta dot), whose ellipses do not show a clear pattern; and incandescent (black dash), whose ellipses are mostly too small to see at very high R f values, with the exception of the neodymium-filtered tungsten sources visible at about (86, 19). The observed ellipse characteristics may be related directly to the R f fidelity metric, as Rf fidelity Figure 13 Ellipse characteristics: As a function of fidelity R f, the area (in plot (a)) and eccentricity (in plot (b)) are shown for light sources taken from the TM-3 data set, split into four categories based on technology. Ellipse area is computed in R f R g units, and eccentricity is defined such that a corresponds to a perfect circle and 1 to an infinite-aspect ratio ellipse shown in Figure 13. Figure 13(a) shows clear correlations between R f and ellipse area, and a distinct difference between the blue pump LED (þ), RGB/RGBA LED () and incandescent (o) types. All trend to zero as R f nears its maximum value of 1, and with few exceptions the RGB/RGBA LEDs () result in the largest ellipses. Incandescent sources cluster near the bottom-right of the plot, with very high fidelity and very small interobserver variation (again, the exceptions at R f 86 are neodymium-filtered tungsten). Figure 13(b) shows no clear relationship between ellipse eccentricity and R f, but indicates that RGB/RGBA ellipses () are overall more eccentric than those for blue pump LEDs (þ) and fluorescent light sources (). Note that eccentricity is not meaningful

14 MJ Murdoch and MD Fairchild for the incandescent sources with near-zero ellipse size. 4. Future work 4.1. An inter-observer variation index This paper is based on Monte Carlo simulations used to uncover how statistical variations in visual sensitivities correspond to variations in colour rendition measures. It would be much more convenient if the variations in measures could be predicted directly from characteristics of a light source spectral power distribution, or at least computed directly, rather than via thousands of simulations. It could be very valuable to directly compute an indicator of variability, such as an inter-observer variation index. The value of such an index would be to distill this complexity to a manageable number that could be used in communicating the colour quality of lighting products, and especially in the design of better light sources. As with most aspects of colour rendition, the effect of inter-observer variation varies considerably with the spectral characteristics of light sources. With the goal of directly relating spectral characteristics of the source directly to a measure of inter-observer variation, we computed various predictors based on light source and observer characteristics. For sources, we considered aspects including spectral slopes, near-zero spectral regions, and peak-to-valley power contrasts; and for sensitivity variation over the population, the standard deviations of both sensitivity and wavelength shift per wavelength. It seems that the amount of overlap between source spectral patterns and observer variation should predict the size of clouds in colour rendition measures. However, we could not find a solution with satisfactory correlation for all types of lighting technologies, which implies there is another underlying aspect of source spectral characteristics. Note that the current study highlights nine light sources for illustration, and includes ellipses for the 318 TM-3 sources, neither of which comprises a comprehensive nor statistically-defensible selection. We maintain hope that the relationship between source spectral characteristics and the computed ellipses and non-elliptical cloud shapes can be elucidated in future work. 4.. Variation in context With the observer-based distributions of colour rendition measures quantified for some example light sources, the broader question is, how important is inter-observer variation in context with other sources of variation or confusion? One important point to reinforce is that the indices R f and R g are averages over many colour evaluation samples, which means that it is quite possible for light sources that plot at the same (R f, R g ) point to render colours differently for the same observer, not even considering interobserver variation. This means that there can be no easy rule of thumb about what is a meaningful R f or R g index difference, because even no difference can be visible. In our simulation, we have shown that different observers may see the (R f, R g ) indices to vary by 5 1 units (judging by the size of clouds in Figures 5 and 7) between observers, not between light sources. Further, the relative difference between light sources isn t simply shifted for different observers, but can be inverted, meaning that some observers see one light source as of higher fidelity than another, and other observers see the opposite, easily reaching differences of five units (Figure 9). The magnitude of these variations is not huge, but we argue that it is large enough not to be easily dismissed. Inter-observer variation affects everything about colorimetry, so it is useful to compare the reported colour rendition variation to other performance measures such as illuminance and CCT. Another simulation was computed with the US Age distribution to create 1 -degree observers for the

Inter-observer variation of colour rendition 15 purpose of computing illuminance and CCT. Using these individual sensitivities in place of the 1931 Standard Observer, the computed illuminance and CCT of the 318 TM-3 light sources was found to vary slightly. The standard deviation over observers for these light sources was on average 1.9% of the illuminance, with a maximum standard deviation of 4.6%. In terms of CCT variation, the average standard deviation was 13 K and maximum was 454 K (for a 6811 K source). As with the colour rendition measures, these inter-observer differences would be most interesting if they affect direct comparisons, in which observers may disagree about which of a pair of light sources is higher in illuminance or in CCT. In all cases, these are rather small differences. A point of reference might be found in another source of variation that has been well studied and quantified, the effect of visual field size. 19, To test this, using the CIE 6 model, 19 we computed CMFs for field sizes 1 8 1 8 and age 3 years, and used them in the TM-3 computations in the same way we used the population CMFs described earlier. The results are plotted in Figure 14 as connected black dots coded by size, together with the Age Group population clouds (red), the CIE 1964 1-degree (open circle) and 1931 -degree (blue diamond) standard observers. For several light sources (8, 16, 55), the field size series dots are hardly visible, coincident with the 1-degree standard; however, for some (114, 13, 146, 79), they result in trails wandering from the standard outward with smaller field sizes, in unpredictable directions. In all cases, however, the length of the wandering path is much smaller than the size of the cloud resulting from interobserver differences. For all readers who have seen for themselves that colours matched in a 1-degree field appear to not match when changed to a -degree field, consider that the difference any two observers might be seeing could be much larger. Rg gamut 1 115 11 15 1 95 9 85 8 7 75 8 85 Rf fidelity CIE 1964 1 CIE 1931 CIE 6 1 CIE 6 5 CIE 6 1 Finally, for practical analysis of colour rendition, the scores for index measures are computed for a standardised set of object colours, which will undoubtedly vary from an observer s experience with real objects in a real lighted room. Royer and Wei noted that objects used in published visual colour rendition experiments range from natural skin and fruit to a variety of colour targets. In their computational comparison, they found that the R f and R g indices covered ranges of tens of units, 3 meaning that the strongest source of variation in observed colour rendition may well be the objects used in the evaluation. Overall, we find that the magnitude of variation due to interobserver differences, whether judged by the 9 95 1 Figure 14 Field size comparison: in R g vs. R f, the clouds of points in red are the Age Group population, on top of which are highlighted the 1964 1-degree observer (open circle), 1931 -degree observer (blue diamond), and a series of CIE 6 observers with field sizes from 1 to 1 degrees (connected black dots, sized accordingly)

16 MJ Murdoch and MD Fairchild typical US population range or age categories, are larger than the range caused by field size differences, but smaller than what may be observed with different selections of test colours. SPD 6 4 Ref Test Rg 1 11 1 4.3. Communicating variability It is important to consider how to appropriately communicate that inter-observer variability is likely, and that it affects the resolution of colour rendition measures for specific light sources. The graphical summary of TM-3 could easily be updated with ellipses and error bars, showing the size of a standard deviation over a relevant population of observers on each value shown. A proposal is shown in Figure 15, which adds a one standard deviation ellipse to the R g versus R f plot and one standard deviation error bars to the bar charts. Additionally, it shows the spread of colour vector graphic polygons that was shown in Figure 6, all based on the 1 observers in the US Age population (and implying that the TM-3 computations were run 1 times to produce this graphic). A similar alternative might be to show an additional two values for younger and older observers, based on the CIE 6 CMFs, 19 which would allow individuals a slightly more tailored look at colour rendition measures. The TM-3 graphics with these additions clearly expose the inter-observer variability that is to be expected from a light source. We realise that adding yet more information to the already complex TM-3 graphical presentation may either clarify or confuse, depending on the levels of understanding and interest of the reader, but we think it is worth considering how to convey such data. In general, the professional lighting community might like to know if different light sources are more or less susceptible to inter-observer differences in colour rendition. Rf/Hue Rel sat/hue 4 6 wl 1 8 6 4 1 1 5 1 Hue Bin 5. Conclusions 15 A main concluding message of this study is that inter-observer variation, even among people with normal colour vision, has a significant effect on colour rendition measures. Communicating this clearly adds to the already-difficult burden of conveying the many aspects of colour rendition, but it may aid in developing light sources that minimise inter-observer variation. We have focused on the measures and graphics of IES TM-3-15, but it is important to point out that all colour rendition measures, because they rely on standard observers, will show the impact of inter-observer variability when it is included by methods such as our Monte Carlo approach. We have shown that there is wide variability in TM-3 R f and R g values, when the 9 8 bluish-yellowish 7 8 9 1 Rf CCT = 33 Rf = 73.3 Rg = 11.8 greenish - reddish Figure 15 TM-3 graphics: for TM-3 RGB LED light source 114, upper left plot shows test (red) and reference (black) SPDs vs. wavelength, and upper right shows R g vs. R f with US Age 1-std ellipse. Lower left bar charts show fidelity and relative colour saturation per hue bin, with 1-std error bars indicated in black. Lower right shows the colour vector graphic with the inter-observer variability shown in red behind the main circle (black)

Inter-observer variation of colour rendition 17 standard colour matching functions are replaced by those of individuals simulating the variation in the US population, especially far from the fidelity peak of R f ¼ 1. In many cases, the clouds of values resulting from inter-observer differences reach 5 1 units in size. Differences are visible in the colour vector graphics and bar charts of TM-3, which show the extent to which specific hue bins are affected. Further, we identified situations in which the index value differences for pairs of light sources invert for different observers. Through simulation of a bimodal age distribution of observers 5 and 65 years old, we have quantified large agerelated differences in colour rendition, showing not only the categorical difference that could have been predicted from CIE 6 observers, but also the expected distributions in each population. For some light sources, the clouds of values observed for the two age groups hardly overlap, which means that observers of different ages are unlikely to see the same colour rendition characteristics. This may present an opportunity for agetargeted products and marketing, if visual experiments verify the differences. We have observed large differences in variability between light sources, between and within technology types. There are general trends including lower variability with higher colour fidelity, but the implication is that different spectral characteristics vary in susceptibility to inter-observer variation. It would be an interesting engineering goal to create light sources to minimise inter-observer variation. We foresee future work to uncover the relationships between spectral differences and variation, which would allow the development of an observer variability index and enable direct computation, bypassing the Monte Carlo simulation employed herein. The payoff would be easier computation of measures to both communicate product specifications and inform the engineering of better products. To put the magnitude of variation caused by observer differences into context, we have compared the observed clouds of variation in TM-3 R f and R g indices of 5 1 units to that observed with field size in the CIE6 physiological observers ( 5 units) and that which might be expected using different object colours for assessment (tens of units). Clearly, the amount of ambiguity between different observers is large enough to not be ignored, and should be communicated along with the already complex specifications of light source colour rendition. Declaration of conflicting interests The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article. Funding The authors received no financial support for the research, authorship, and/or publication of this article. References 1 Houser KW, Wei M, David A, Krames MR, Shen XS. Review of measures for light-source color rendition and considerations for a twomeasure system for characterizing color rendition. Optics Express 13; 1: 1393 1411. Commission Internationale de l Eclairage. Method of Measuring and Specifying Colour Rendering Properties of Light Sources CIE 13.3-1995. Vienna: CIE, 1995. 3 Nickerson D. Light sources and color rendering. Journal of the Optical Society of America 196; 5: 57 69. 4 U.S. Environmental Protection Agency and U.S. Department of Energy. ENERGY STAR Õ program requirements for luminaires. Retrieved 7 June 17, from https://www. energystar.gov/ia/partners/product_specs/program_reqs/final_luminaires_program_ Requirements.pdf

18 MJ Murdoch and MD Fairchild 5 Illuminating Engineering Society of North America. IES Method for Evaluating Light Source Color Rendition IES TM-3-15. New York: IESNA, 15. 6 David A, Fini PT, Houser KW, Ohno Y, Royer MP, Smet KAG, Wei M, Whitehead L. Development of the IES method for evaluating the color rendition of light sources. Optics Express 15; 3: 15888 1596. 7 Luo MR, Cui G, Li C. Uniform color spaces based on CIECAM color appearance model. Color Research and Application 6; 31: 3 33. 8 Rea MS, Freyssinier-Nova JP. Color rendering: A tale of two metrics. Color Research and Application 8; 33: 19. 9 Rea MS, Freyssinier JP. Color rendering: Beyond pride and prejudice. Color Research and Application 1; 35: 41 49. 1 Smet K, Ryckaert WR, Pointer MR, Deconinck G, Hanselaer P. Correlation between color quality metric predictions and visual appreciation of light sources. Optics Express 11; 19: 8151 8166. 11 van der Burgt PJM, van Kemenade J. About color rendition of light sources: The balance between simplicity and accuracy. Color Research and Application 1; 35: 85 93. 1 de Beer E, van der Burgt P, van Kemenade J. Another color rendering metric: Do we really need it, can we live without it? Leukos 15; 1: 51 59. 13 Smet KA, Hanselaer P. Memory and preferred colours and the colour rendition of white light sources. Lighting Research and Technology 16; 48: 393 411. 14 Ohno Y, Miller CC, Fein M. Vision experiment on chroma saturation for colour quality preference: Proceedings of the 8th Session of the CIE, Manchester, UK, 8 June to 4 July 15. Vienna: CIE. 15 Lin Y, Wei M, Smet KAG, Tsukitani A, Bodrogi P, Khanh TQ. Colour preference varies with lighting application. Lighting Research and. Technology 17; 49: 316 38. 16 Royer MP, Wilkerson A, Wei M, Houser K, Davis R. Human perceptions of colour rendition vary with average fidelity, average gamut, and gamut shape. Lighting Research and Technology 17; 49: 963 988. 17 Royer MP, Wilkerson A, Wei M. Human perceptions of colour rendition at different chromaticities. Lighting Research and Technology 17. First published 4 August 17. DOI: 147715351775974. 18 Murdoch MJ, Fairchild MD. Effects of interobserver variation on color rendition measures: Proceedings of the IS&T 4th Color and Imaging Conference, San Diego, CA, USA, 7-11 November: 16: 187 191. 19 Commission Internationale de l Eclairage. Fundamental Chromaticity Diagram with Physiological Axes Part 1 CIE 17.1-6. Vienna: CIE, 6. Sarkar A, Autrusseau F, Viénot F, Le Callet P, Blonde L. From CIE 6 physiological model to improved age-dependent and average colorimetric observers. Journal of the Optical Society of America A 11; 8: 33 48. 1 Asano Y, Fairchild MD, Blonde L. Individual colorimetric observer model. PLoS ONE 16; 11: e145671. Hu X, Houser KW. Large field color matching function. Color Research and Application 5; 31: 18 9. 3 Royer M, Wei M. The role of presented objects in deriving color preference criteria from psychophysical studies. Leukos 17; 13: 143 157.