TagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Auto-Annotation

Similar documents
TagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation

Comparison of Feature Sets using Multimedia Translation

Tree structured CRF models for interactive image labeling

Large-scale Image Annotation by Efficient and Robust Kernel Metric Learning

EUROPEAN KANGOUROU LINGUISTICS ENGLISH-LEVELS 3-4. Linguistic ENGLISH. LEVEL: 3 4 (Γ - Δ Δημοτικού)

easy english readers The Little Mermaid By Hans Christian Andersen

American Pride #32018 / 50 Designs

To obtain an editable Word version of this resource: upload and share your own resources at

Isa Genzken 519 West 19th Street September 16 - October 31, 2015

Image Tag Clarity: In Search of Visual- Representative Tags for Social Images

Best Witches by Sue Hillis

Stealing Objects With Computer Vision

Welcome to the class of Web Information Retrieval. Min ZHANG

Commonly used keys to write text

Emerald Cubicle System. Spec Sheet. Emerald 39,52, 65 H. skutchi.com cubicles.cc. Vol DESIGNS. inc.

2 Nasreen bought 3 m 20 cm cloth for her shirt and 2 m 5 cm cloth for her trouser. Find the total Length of cloth bought by her.

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem

24 hours of Photo Sharing. installation by Erik Kessels

For the authors, *****************************************

Grand Tour. The Travels in Rome, London and Paris. Size mm: 69.34w x 98.30h. Stitch Count: 20,019. Size Inches: 3.70w x 3.87h

In this lesson you will learn: To use various tools of Paint activity. (See the level I lesson on paint for a revision.)

The Fraunhofer IDMT at ImageCLEF 2011 Photo Annotation Task

Metric learning approaches! for image annotation! and face recognition!

5.5 Right Triangles. 1. For an acute angle A in right triangle ABC, the trigonometric functions are as follow:

Small particles scatter light

AQR UNIT 7. Circuits, Paths, and Graph Structures. Packet #

Image Annotation in Presence of Noisy Labels

Skills Practice Skills Practice for Lesson 7.1

Performance Assessment Spring 2001

4 All Seasons. by Linda Samuels

3D Modeling Course Outline

Packet Unit 5 Right Triangles Honors Common Core Math 2 1

Lab for Media Search, National University of Singapore 1

The cosine ratio is a ratio involving the hypotenuse and one leg (adjacent to angle) of the right triangle Find the cosine ratio for. below.

CS 1674: Intro to Computer Vision. Attributes. Prof. Adriana Kovashka University of Pittsburgh November 2, 2016

Prime2 Peter Hallahan Operations Manager, Data Strategy & Development. National Mapping Agency

Object Recognition I. Linda Shapiro EE/CSE 576

Math A Regents Exam 0807 Page 1

Region-based Image Annotation using Asymmetrical Support Vector Machine-based Multiple-Instance Learning

DRAFT CHAPTER. Surface Area GET READY. xxx. Math Link. 5.1 Warm Up xxx. 5.1 Views of Three-Dimensional Objects xxx. 5.

Answer Key: Let s Go Phonics Book 1

Hang a sign OMG. So many emotions! We feel a great party is in the future. Send reminder (sent by Corporate) and post event on Facebook.

The little mermaid lives in a beautiful castle in the deep blue sea. She lives with her five sisters and her father, the Merking.

Kamaole Sands 2695 South Kihei Road, Kihei, Hawaii, Association of Apartment Owners (808) facsimile: (808)

Name (s) Class Date ERROR ANALYSIS GEOMETRY WORD PROBLEMS

Projectors Product Code Price ( ) Ex.VAT

FLAMINGO CHEAT SHEET FOR ES 305 CLASS 5 Pages of Fresh Flamingo Goodness

Attributes and More Crowdsourcing

Studio Library 1. A-4.News Studio 4. A-8.News Studio 8 A-9.News Studio 9. A-7.News Studio 7. A-10.Information Program 1. A-11.Infomation Program 2

Welcome to the Turtle World of Logo

Installation. 9mm. Rear view showing cable header pins

Interaction with an autonomous agent

Coherent Image Annotation by Learning Semantic Distance

Camera Shots. Lesson Plan

Math 7 Accelerated Summer Review

7.1. Multiplying and Dividing Monomials. Explore Multiplying and Dividing Monomials. Focus on. Reflect and Check

Terrain Tile Texture Production: Overview of the Process

PRISM: Concept-preserving Social Image Search Results Summarization

Leveraging flickr images for object detection

Ch 8: Right Triangles and Trigonometry 8-1 The Pythagorean Theorem and Its Converse 8-2 Special Right Triangles 8-3 The Tangent Ratio

Computer vision is the analysis of digital images by a computer for such applications as:

Quilted in Honor #32030 / 30 Designs

St. Vincent s H. & T. School, Asansol SYLLABUS FOR CLASS III ENGLISH LANGUAGE

Welcome to Unit 5~Answer Key

Be sure to label all answers and leave answers in exact simplified form.

Modeling semantic aspects for cross-media image indexing

Product Catalogue. Provencale Range Madison Dark Range Classic Chic Range Mon Chic Range... Junior Archipelago Range...

Going 3D with Blender: A room with toys

COMPUTER VISION Introduction

Finally, at the bottom right of the screen the word Fyrsta fades in.

This is a sample of what you get after it is processed. The yellow colors in this view come from the Classify Vegetation Height.tif background map.

MADISON GREEN COLOR CHART (Approved by MGMA 4/25/2013)

Preparation: Get to know the Java coordinate system. 3. How is the background color of the window specified? Copy the exact code:

Madison Dark Range Junior Archipelago Range Sofas & Chairs

Engineering Geology. Engineering Geology is backbone of civil engineering. Topographic Maps. Eng. Iqbal Marie

The Alchemist Yes Yes Yes. All the Light We Cannot See Yes Yes Yes. Alone in the Classroom No Yes Yes. American Wife Yes No No.

Assignment 3. Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Number Sense Workbook 3, Part 2

Summer Math Learning Packet for Students Entering. Grade 6. SFTitle I

±» ²¼ ª² ª³ ±» ² ª±² ( 80 ±«) ± ³» ³ ª² ; ±² ²¼ ³² ³±² ± ¹ ³ ³ ³ ³ ² «³ ± ««¾±² - ± ³«³»µ «; ² ª±² ( ±², ±, ±±³ ), ± ±² µ ±² ³ª²³».» µ³ ±²» ² ª±² ( «¾

5th Grade Mathematics Essential Standards

Fine-tuning Pre-trained Large Scaled ImageNet model on smaller dataset for Detection task

Unit 5.A Properties of Light Essential Fundamentals of Light 1. Electromagnetic radiation has oscillating magnetic and electric components.

AWM 11 UNIT 4 TRIGONOMETRY OF RIGHT TRIANGLES

Archdiocese of Washington Catholic Schools Academic Standards Mathematics

Activity using Paint. Lesson6. In this lesson you will learn: To use various tools of Paint activity.

Step-by-Step Photorealistic Renderings in Autodesk Land Desktop By: Harry Ward, PE - Of: OutSource Inc.

MATRIXED

AW Math 10 UNIT 7 RIGHT ANGLE TRIANGLES

Analog Line-in.. 2. Canon Legendary Optics Digital Effects DIGIC DV Dual Image Processor. 5

Area is the amount of space a shape covers. It is a 2D measurement. We measure area in square units. For small areas we use square centimetres.

GRADE. Nevada DEPARTMENT OF EDUCATION. Mathematics HIGH SCHOOL PROFICIENCY EXAM. Nevada

Welcome to your step by step guide to organising a successful activity event. This pack has been designed to save you time, hassle and frustration.

REINFORCEMENT WORKSHEETS ANSWER KEY

NMC Sample Problems: Grade 8

Catalog Carrara Glass COMPLIANT

Neston High School Mathematics Faculty Homework Booklet

Solve the problem. 1) Given that AB DC & AD BC, find the measure of angle x. 2) Find the supplement of 38. 3) Find the complement of 45.

Inverses of Trigonometric. Who uses this? Hikers can use inverse trigonometric functions to navigate in the wilderness. (See Example 3.

Understanding Tracking and StroMotion of Soccer Ball

Transcription:

TagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Auto-Annotation M. Guillaumin, T. Mensink, J. Verbeek and C. Schmid LEAR team, INRIA Rhône-Alpes, France Supplementary Material

Summary We first present qualitative results on all three data sets (Corel 5k, ESP-Game and IAPR TC12) for the task of keyword-based image retrieval in Chapter 1 and next for the task of image auto-annotation in Chapter 2. 1

Contents 1 Keyword-based Image Retrieval 3 1.1 Corel Database............................ 4 1.1.1 Keyword based retrieval good results........... 4 1.1.2 Keyword based retrieval mean results.......... 5 1.1.3 Keyword based retrieval bad results........... 6 1.1.4 Multiword queries retrieval good results......... 7 1.1.5 Multiword queries retrieval mean results......... 8 1.1.6 Multiword queries retrieval bad results.......... 9 1.2 ESP-Game Database......................... 10 1.2.1 Keyword based retrieval good results........... 10 1.2.2 Keyword based retrieval mean results.......... 11 1.2.3 Keyword based retrieval bad results........... 12 1.3 IAPR Database............................ 13 1.3.1 Keyword based retrieval good results........... 13 1.3.2 Keyword based retrieval mean results.......... 14 1.3.3 Keyword based retrieval bad results........... 15 2 Image Annotation 16 2.1 Corel Database............................ 17 2.1.1 Best Predictions....................... 17 2.1.2 Median Predictions...................... 18 2.1.3 Worst Predictions...................... 19 2.2 ESP-Game Database......................... 20 2.2.1 Best Predictions....................... 20 2.2.2 Median Predictions...................... 21 2.2.3 Worst Predictions...................... 22 2.3 IAPR Database............................ 23 2.3.1 Best Predictions....................... 23 2.3.2 Median Predictions...................... 24 2.3.3 Worst Predictions...................... 25 2

Chapter 1 Keyword-based Image Retrieval In our paper we compare performance of keyword based retrieval, i.e. given a keyword or several, rank the images by relevance. In these sections we show some example queries from all three databases. We have restricted ourselves to keywords where at least 8 images should have been returned. Because we also show 8 images of each query, in the perfect case all shown images are correct. Images whom are wrongly assigned to a keyword are marked with a red border. For each of the databases we show examples where our performance is good, median and bad. These scores are based on the Break Even Point (BEP) precision evaluation. For each query we show: query BEP (nr. of relevant images). For the Corel database we also show multi-word queries. 3

1.1 Corel Database 1.1.1 Keyword based retrieval good results tiger 100.00 (10) pool 90.91 (11) cat 90.91 (11) tracks 90.91 (11) pillar 88.89 (9) mare 88.89 (9) foals 88.89 (9) 4

1.1.2 Keyword based retrieval mean results street 61.54 (26) grass 60.78 (51) sky 60.00 (105) plants 60.00 (15) garden 60.00 (10) tree 59.14 (93) field 58.82 (17) 5

1.1.3 Keyword based retrieval bad results shore 37.50 (8) rocks 36.36 (22) clouds 34.62 (26) close-up 30.00 (10) window 25.00 (8) hills 22.22 (18) town 22.22 (9) 6

1.1.4 Multiword queries retrieval good results cars, tracks 100.00 (10) cat, tiger 100.00 (10) stone, pillar 100.00 (8) water, pool 90.00 (10) horses, foals 88.89 (9) horses, mare 88.89 (9) train, locomotive 88.89 (9) 7

1.1.5 Multiword queries retrieval mean results sky, jet, plane 66.67 (12) buildings, street 66.67 (15) bear, snow 63.64 (11) water, people 60.00 (20) mountain, sky 55.56 (18) tree, grass 54.55 (11) mountain, tree 54.55 (11) 8

1.1.6 Multiword queries retrieval bad results tree, house 37.50 (8) water, beach, people 37.50 (8) water, bridge 36.36 (11) sky, tree 34.62 (26) sky, clouds 33.33 (15) people, street 25.00 (8) beach, sand 25.00 (8) 9

1.2 ESP-Game Database 1.2.1 Keyword based retrieval good results coin 81.48 (27) money 72.00 (25) chart 68.42 (38) map 67.39 (46) graph 66.67 (33) necklace 66.67 (15) sky 63.30 (188) 10

1.2.2 Keyword based retrieval mean results castle 33.33 (12) comic 33.33 (12) globe 33.33 (12) plant 33.33 (48) road 33.33 (30) singer 33.33 (9) horse 32.43 (37) 11

1.2.3 Keyword based retrieval bad results statue 8.33 (12) letter 7.69 (13) ear 7.14 (28) arm 5.56 (18) album 0.00 (12) guy 0.00 (13) ice 0.00 (8) 12

1.3 IAPR Database 1.3.1 Keyword based retrieval good results court 100.00 (18) tennis 94.12 (17) racetrack 90.91 (11) player 87.50 (24) grandstand 84.62 (26) cyclist 80.00 (45) grave 80.00 (10) 13

1.3.2 Keyword based retrieval mean results wood 43.48 (46) sand 43.33 (30) column 42.86 (42) bay 41.94 (31) woman 41.94 (248) cliff 41.86 (43) car 41.77 (79) 14

1.3.3 Keyword based retrieval bad results shoe 9.09 (11) area 8.33 (12) body 8.33 (12) backpack 5.56 (18) balcony 0.00 (11) dog 0.00 (10) paving 0.00 (10) 15

Chapter 2 Image Annotation To illustrate the performance of our model on the image annotation task, we show word predictions for a selection of images of the three data sets. Each data set is presented in three sections: Best predictions, Median predictions, and Worst predictions. These sections also relate to the BEP scores of our predictions compared to the ground-truth annotations, but here these BEP scores are computed by ranking keywords for a given image. When ties between BEP scores are observed, the images with the largest annotation sizes are kept. For each image, we show its BEP score, its ground-truth annotations, and the predictions of our model. The ground-truth words are sorted with our model s output score (shown in brackets), and are bold when the word is correctly predicted by our model. Predictions are also sorted with respect to their scores, and are bold if they appear in the ground-truth. We show as many predicted words as there are words in the ground-truth annotation for an image. 16

2.1 Corel Database 2.1.1 Best Predictions Image BEP, Ground Truth, Prediction Ground Truth: coral (1.00), ocean (1.00), reefs (1.00), fan (1.00), sea (0.99) Predictions: coral (1.00), ocean (1.00), reefs (1.00), fan (1.00), sea (0.99) Ground Truth: field (1.00), foals (1.00), mare (1.00), horses (1.00) Predictions: field (1.00), foals (1.00), mare (1.00), horses (1.00) Ground Truth: sun (1.00), sky (1.00), tree (1.00), clouds (0.99) Predictions: sun (1.00), sky (1.00), tree (1.00), clouds (0.99) Ground Truth: mosque (1.00), temple (1.00), stone (1.00), pillar (1.00) Predictions: mosque (1.00), temple (1.00), stone (1.00), pillar (1.00) Ground Truth: plane (1.00), jet (1.00), sky (1.00), smoke (0.99) Predictions: plane (1.00), jet (1.00), sky (1.00), smoke (0.99) Ground Truth: canyon (1.00), valley (1.00), sand (1.00), rocks (1.00) Predictions: canyon (1.00), valley (1.00), sand (1.00), rocks (1.00) Ground Truth: bengal (1.00), tiger (1.00), cat (1.00), forest (1.00) Predictions: bengal (1.00), tiger (1.00), cat (1.00), forest (1.00) Ground Truth: black (1.00), bear (1.00), river (1.00), water (1.00) Predictions: black (1.00), bear (1.00), river (1.00), water (1.00) 17

2.1.2 Median Predictions Image BEP, Ground Truth, Prediction Ground Truth: buildings (0.99), tree (0.98), hills (0.46), town (0.35) Predictions: village (1.00), buildings (0.99), house (0.99), tree (0.98) Ground Truth: relief (1.00), ruins (1.00), sculpture (0.76), stone (0.73) Predictions: relief (1.00), ruins (1.00), wall (1.00), costume (0.92) Ground Truth: light (1.00), buildings (1.00), city (0.72), street (0.56) Predictions: harbor (1.00), light (1.00), skyline (1.00), buildings (1.00) Ground Truth: clouds (1.00), sky (0.99), desert (0.93), church (0.17) Predictions: clouds (1.00), valley (0.99), sky (0.99), sand (0.97) Ground Truth: mountain (1.00), tree (0.99), sky (0.98), clouds (0.94) Predictions: hillside (1.00), mountain (1.00), valley (0.99), tree (0.99) Ground Truth: grass (0.98), tree (0.98), bush (0.54), truck (0.05) Predictions: flowers (1.00), grass (0.98), tree (0.98), moose (0.95) Ground Truth: herd (0.99), grass (0.98), tundra (0.96), caribou (0.13) Predictions: sky (0.99), herd (0.99), grass (0.98), hills (0.97) Ground Truth: mountain (1.00), grass (0.99), landscape (0.79), road (0.32) Predictions: hillside (1.00), mountain (1.00), bush (1.00), grass (0.99) 18

2.1.3 Worst Predictions Image BEP, Ground Truth, Prediction Ground Truth: tree (0.89), street (0.89), house (0.82), tower (0.79) Predictions: bridge (0.98), water (0.97), sky (0.95), people (0.94) Ground Truth: water (1.00), sky (1.00), house (0.54), bridge (0.43) Predictions: buildings (1.00), boats (1.00), courtyard (1.00), canal (1.00) Ground Truth: snow (0.86), field (0.52), elk (0.14), bulls (0.13) Predictions: ice (1.00), frost (1.00), water (0.99), prototype (0.96) Ground Truth: flowers (0.87), wall (0.52), gate (0.43), lawn (0.01) Predictions: window (0.97), tree (0.96), interior (0.92), leaf (0.92) Ground Truth: water (0.92), coast (0.81), bridge (0.80), hills (0.68) Predictions: valley (0.98), tree (0.98), mountain (0.96), sky (0.95) Ground Truth: beach (0.98), water (0.98), people (0.96), sand (0.86) Predictions: snow (0.99), polar (0.99), tracks (0.99), bear (0.99) Ground Truth: horizon (0.99), water (0.98), mountain (0.76), sunrise (0.08) Predictions: sun (1.00), clouds (1.00), city (1.00), sky (1.00) Ground Truth: buildings (0.77), house (0.53), village (0.22), hillside (0.12) Predictions: tree (0.99), kauai (0.91), rocks (0.89), grass (0.89) 19

2.2 ESP-Game Database 2.2.1 Best Predictions Image BEP, Ground Truth, Prediction Ground Truth: coin (1.00), old (1.00), metal (1.00), money (1.00), silver (1.00), round (1.00), circle (1.00), head (1.00), face (0.99) Predictions: coin (1.00), old (1.00), metal (1.00), money (1.00), silver (1.00), round (1.00), circle (1.00), head (1.00), face (0.99) Ground Truth: book (1.00), glasses (1.00), shirt (1.00), hand (1.00), old (1.00), green (1.00), man (1.00) Predictions: book (1.00), glasses (1.00), shirt (1.00), hand (1.00), old (1.00), green (1.00), man (1.00) Ground Truth: snow (1.00), fence (1.00), sky (1.00), mountain (1.00), white (0.98), tree (0.96) Predictions: snow (1.00), fence (1.00), sky (1.00), mountain (1.00), white (0.98), tree (0.96) Ground Truth: ocean (1.00), cloud (1.00), sea (1.00), sky (1.00), water (1.00), blue (1.00) Predictions: ocean (1.00), cloud (1.00), sea (1.00), sky (1.00), water (1.00), blue (1.00) Ground Truth: grass (1.00), tree (0.99), house (0.99), green (0.99), sky (0.98) Predictions: grass (1.00), tree (0.99), house (0.99), green (0.99), sky (0.98) Ground Truth: silver (1.00), coin (1.00), old (1.00), money (1.00), boat (1.00) Predictions: silver (1.00), coin (1.00), old (1.00), money (1.00), boat (1.00) Ground Truth: pot (1.00), tree (1.00), plant (1.00), green (1.00), leaf (1.00) Predictions: pot (1.00), tree (1.00), plant (1.00), green (1.00), leaf (1.00) Ground Truth: window (1.00), computer (1.00), gray (1.00), screen (1.00), blue (0.96) Predictions: window (1.00), computer (1.00), gray (1.00), screen (1.00), blue (0.96) 20

2.2.2 Median Predictions Image BEP, Ground Truth, Prediction BEP: 40% Ground Truth: map (0.99), red (0.89), yellow (0.79), black (0.79), numbers (0.23) Predictions: map (0.99), white (0.97), red (0.89), chart (0.85), blue (0.85) BEP: 40% Ground Truth: tv (0.98), comic (0.94), show (0.93), blue (0.90), yellow (0.69) Predictions: tv (0.98), movie (0.98), man (0.96), red (0.95), comic (0.94) BEP: 40% Ground Truth: man (0.99), woman (0.98), white (0.80), dog (0.26), show (0.14) Predictions: man (0.99), people (0.99), woman (0.98), red (0.96), painting (0.93) BEP: 40% Ground Truth: wave (0.99), girl (0.99), flower (0.97), black (0.93), america (0.11) Predictions: people (1.00), woman (1.00), wave (0.99), group (0.99), girl (0.99) BEP: 40% Ground Truth: black (0.99), picture (0.97), people (0.97), painting (0.90), group (0.59) Predictions: old (1.00), black (0.99), gray (0.99), man (0.99), picture (0.97) BEP: 40% Ground Truth: drawing (1.00), cartoon (1.00), kid (0.75), dog (0.72), brown (0.54) Predictions: drawing (1.00), cartoon (1.00), child (0.96), red (0.94), white (0.89) BEP: 40% Ground Truth: hair (0.98), woman (0.98), blonde (0.73), yellow (0.51), legs (0.34) Predictions: hair (0.98), woman (0.98), man (0.98), music (0.97), black (0.95) BEP: 40% Ground Truth: girl (0.99), woman (0.98), black (0.86), dance (0.73), ball (0.30) Predictions: man (0.99), girl (0.99), woman (0.98), people (0.97), pink (0.90) 21

2.2.3 Worst Predictions Image BEP, Ground Truth, Prediction Ground Truth: woman (0.83), cover (0.60), movie (0.53), magazine (0.36), fly (0.17), film (0.13) Predictions: sky (0.97), red (0.97), man (0.88), white (0.87), water (0.85), green (0.84) Ground Truth: water (0.85), blue (0.84), sea (0.47), plane (0.31), boat (0.28), army (0.18), war (0.15) Predictions: man (0.97), green (0.92), wall (0.90), white (0.90), grass (0.89), sky (0.89), black (0.88) Ground Truth: white (0.89), water (0.46), room (0.26), fire (0.25), wall (0.24), house (0.22), smoke (0.11) Predictions: suit (1.00), man (1.00), tie (0.99), door (0.99), people (0.99), woman (0.98), group (0.98) Ground Truth: people (0.75), red (0.70), city (0.66), building (0.56), car (0.51), mountain (0.33), street (0.17) Predictions: green (0.99), forest (0.98), leaf (0.98), tree (0.97), flower (0.97), yellow (0.95), grass (0.95) Ground Truth: snow (0.86), people (0.80), red (0.72), tower (0.68), yellow (0.53), mountain (0.37), ice (0.30) Predictions: sky (0.99), blue (0.99), building (0.95), house (0.94), man (0.91), water (0.90), tree (0.89) Ground Truth: water (0.87), boat (0.79), house (0.75), woman (0.71), game (0.66), lady (0.29), logo (0.28) Predictions: man (0.95), sky (0.94), white (0.91), building (0.90), ship (0.90), people (0.89), black (0.89) Ground Truth: blue (0.79), car (0.61), light (0.58), store (0.47), building (0.37), sign (0.36), moon (0.12), web (0.11) Predictions: man (0.99), table (0.97), people (0.96), tree (0.91), white (0.90), woman (0.90), black (0.89), red (0.85) Ground Truth: wood (0.62), face (0.59), eye (0.58), statue (0.36), nose (0.29), gray (0.27), person (0.25), mouth (0.14) Predictions: man (0.98), red (0.96), people (0.95), woman (0.95), brown (0.92), white (0.88), hat (0.84), picture (0.83) 22

2.3 IAPR Database 2.3.1 Best Predictions Image BEP, Ground Truth, Prediction Ground Truth: bike (1.00), country (1.00), cycling (1.00), cyclist (1.00), helmet (1.00), jersey (1.00), short (1.00), hand (1.00), racing (1.00), road (1.00), side (1.00), desert (1.00), hill (1.00), landscape (1.00), sky (1.00) Predictions: bike (1.00), country (1.00), cycling (1.00), cyclist (1.00), helmet (1.00), jersey (1.00), short (1.00), hand (1.00), racing (1.00), road (1.00), side (1.00), desert (1.00), hill (1.00), landscape (1.00), sky (1.00) Ground Truth: bike (1.00), cycling (1.00), cyclist (1.00), helmet (1.00), jersey (1.00), line (1.00), power (1.00), short (1.00), hand (1.00), road (1.00), bottle (1.00), car (1.00), sky (1.00) Predictions: bike (1.00), cycling (1.00), cyclist (1.00), helmet (1.00), jersey (1.00), line (1.00), power (1.00), short (1.00), hand (1.00), road (1.00), bottle (1.00), car (1.00), sky (1.00) Ground Truth: bike (1.00), cycling (1.00), cyclist (1.00), helmet (1.00), jersey (1.00), short (1.00), shrub (1.00), road (1.00), desert (1.00), cloud (1.00), mountain (1.00), sky (1.00) Predictions: bike (1.00), cycling (1.00), cyclist (1.00), helmet (1.00), jersey (1.00), short (1.00), shrub (1.00), road (1.00), desert (1.00), cloud (1.00), mountain (1.00), sky (1.00) Ground Truth: bike (1.00), cycling (1.00), cyclist (1.00), helmet (1.00), jersey (1.00), short (1.00), cliff (1.00), meadow (1.00), road (1.00), landscape (1.00), cloud (1.00), sky (1.00) Predictions: bike (1.00), cycling (1.00), cyclist (1.00), helmet (1.00), jersey (1.00), short (1.00), cliff (1.00), meadow (1.00), road (1.00), landscape (1.00), cloud (1.00), sky (1.00) Ground Truth: bike (1.00), cycling (1.00), cyclist (1.00), helmet (1.00), jersey (1.00), short (1.00), shrub (1.00), road (1.00), desert (1.00), cloud (1.00), mountain (1.00), sky (1.00) Predictions: bike (1.00), cycling (1.00), cyclist (1.00), helmet (1.00), jersey (1.00), short (1.00), shrub (1.00), road (1.00), desert (1.00), cloud (1.00), mountain (1.00), sky (1.00) Ground Truth: cycling (1.00), cyclist (1.00), jersey (1.00), helmet (1.00), bike (1.00), highway (1.00), country (1.00), short (1.00), hand (1.00), sand (1.00), side (1.00), bush (1.00) Predictions: cycling (1.00), cyclist (1.00), jersey (1.00), helmet (1.00), bike (1.00), highway (1.00), country (1.00), short (1.00), hand (1.00), sand (1.00), side (1.00), bush (1.00) Ground Truth: cycling (1.00), power (1.00), cyclist (1.00), pole (1.00), bike (1.00), helmet (1.00), jersey (1.00), short (1.00), sun (1.00), road (1.00), shrub (1.00), sky (0.99) Predictions: cycling (1.00), power (1.00), cyclist (1.00), pole (1.00), bike (1.00), helmet (1.00), jersey (1.00), short (1.00), sun (1.00), road (1.00), shrub (1.00), sky (0.99) Ground Truth: cycling (1.00), cyclist (1.00), sky (1.00), road (1.00), cloud (1.00), bike (1.00), helmet (1.00), jersey (1.00), shrub (1.00), desert (1.00), short (1.00), mountain (1.00) Predictions: cycling (1.00), cyclist (1.00), sky (1.00), road (1.00), cloud (1.00), bike (1.00), helmet (1.00), jersey (1.00), shrub (1.00), desert (1.00), short (1.00), mountain (1.00) 23

2.3.2 Median Predictions Image BEP, Ground Truth, Prediction Ground Truth: pullover (1.00), boy (1.00) Predictions: cap (1.00), pullover (1.00) Ground Truth: sky (0.99), terrain (0.03) Predictions: cloud (1.00), sky (0.99) Ground Truth: cloud (0.99), meadow (0.99) Predictions: horizon (0.99), cloud (0.99) Ground Truth: sky (1.00), forest (0.90) Predictions: sky (1.00), slope (1.00) Ground Truth: salt (1.00), lake (0.42) Predictions: salt (1.00), desert (1.00) Ground Truth: woman (0.97), desert (0.70) Predictions: rock (0.99), woman (0.97) Ground Truth: woman (1.00), stand (0.99) Predictions: carpet (1.00), woman (1.00) Ground Truth: front (0.98), building (0.58) Predictions: front (0.98), child (0.98) 24

2.3.3 Worst Predictions Image BEP, Ground Truth, Prediction Ground Truth: woman (0.81), man (0.69), penguin (0.64), people (0.64), hill (0.43), jacket (0.36), trouser (0.22) Predictions: llama (0.99), rock (0.95), meadow (0.95), cliff (0.92), mountain (0.89), slope (0.89), waterfall (0.88) Ground Truth: road (0.57), bike (0.46), cyclist (0.40), short (0.31), cycling (0.20), shoe (0.09), sock (0.02) Predictions: canyon (0.99), man (0.95), branch (0.94), rock (0.91), side (0.89), slope (0.88), woman (0.86) Ground Truth: hand (0.68), roof (0.58), dress (0.27), head (0.23), girl (0.23), hair (0.19), hut (0.16) Predictions: man (0.98), woman (0.97), wall (0.97), tourist (0.95), front (0.93), people (0.89), hall (0.86) Ground Truth: bed (0.80), window (0.69), lamp (0.68), curtain (0.61), bedside (0.52), room (0.44), side (0.41) Predictions: man (0.94), woman (0.92), front (0.90), reed (0.90), column (0.89), sky (0.88), roof (0.88) Ground Truth: racetrack (1.00), meadow (0.99), people (0.64), hand (0.23), tent (0.21), flag (0.17), lane (0.05) Predictions: grandstand (1.00), uniform (1.00), jersey (1.00), team (1.00), short (1.00), field (1.00), stadium (1.00) Ground Truth: tourist (0.94), front (0.94), brick (0.93), door (0.84), girl (0.68), boy (0.50), building (0.49) Predictions: wall (1.00), room (1.00), window (0.99), man (0.96), table (0.96), woman (0.96), chair (0.95) Ground Truth: bed (0.80), wood (0.42), room (0.41), lamp (0.38), table (0.33), night (0.17), clock (0.06), bedcover (0.06) Predictions: field (0.99), landscape (0.98), house (0.96), terrace (0.94), tree (0.94), mountain (0.94), slope (0.92), meadow (0.88) Ground Truth: lawn (0.76), chair (0.67), table (0.60), shade (0.42), sun (0.41), shore (0.33), lake (0.22), sign (0.18), cliff (0.17) Predictions: tree (1.00), garden (0.94), man (0.92), house (0.91), sky (0.89), woman (0.88), people (0.88), front (0.86), plant (0.82) 25