Coreference Resolution with Knowledge. Haoruo Peng March 20, 2015

Size: px

Start display at page:

Download "Coreference Resolution with Knowledge. Haoruo Peng March 20, 2015"

Rodney Young
6 years ago
Views:

1 Coreference Resolution with Knowledge Haoruo Peng March 20,

2 Coreference Problem 2

3 Coreference Problem 3

4 Online Demo Please check outhttp://cogcomp.cs.illinois.edu/page/demo_view/coref 4

5 General Framework Jack threw the bags of Mary into the water since he is angry with her. 5

6 General Framework Jack threw the bags of Mary into the water since he is angry with her. Mention Detection 5

7 General Framework Jack threw the bags of Mary into the water since he is angry with her. Mention Detection 5

8 General Framework Jack threw the bags of Mary into the water since he is angry with her. Mention Detection Pairwise Mention Scoring 5

9 General Framework Jack threw the bags of Mary into the water since he is angry with her. Mention Detection Pairwise Mention Scoring Goal: Coreferent Mentions have higher scores 5

10 General Framework Jack threw the bags of Mary into the water since he is angry with her. Mention Detection Pairwise Mention Scoring Goal: Coreferent Mentions have higher scores 5

11 General Framework Jack threw the bags of Mary into the water since he is angry with her. Mention Detection Pairwise Mention Scoring Goal: Coreferent Mentions have higher scores Inference (ILP) Best-Link All-Link 5

12 ILP formulation of CR Best-Link All-Link 6

13 General Framework Learning (for Pairwise Mention Scores) Structural SVM Features: Mention Types, String Relations, Semantic, Relative Location, Anaphoricity(Learned), Aligned Modifiers, Memorization, etc. 7

14 General Framework Learning (for Pairwise Mention Scores) Structural SVM Features: Mention Types, String Relations, Semantic, Relative Location, Anaphoricity(Learned), Aligned Modifiers, Memorization, etc. Evaluation MUC BCUB CEAF_e 7

15 General Framework Learning (for Pairwise Mention Scores) Structural SVM Features: Mention Types, String Relations, Semantic, Relative Location, Anaphoricity(Learned), Aligned Modifiers, Memorization, etc. Evaluation MUC BCUB CEAF_e IllinoisCoref VS. Stanford Muti-pass Sieve System VS. Berkeley CR System 7

16 Difficulties in CR Hard Coreference Problems [A bird] perched on the [limb] and [it] bent. [Robert] is robbed by [Kevin], and [he] is arrested by police. Gender / Plurality information cannot help -> Requires Knowledge 8

17 Part 1 Solving Hard Coreference Problems 9

18 Hard Coreference Problems Motivating Examples 10

19 Hard Coreference Problems Motivating Examples Category 1 [A bird] perched on the [limb] and [it] bent. [The bee] landed on [the flower] because [it] had pollen. 10

20 Hard Coreference Problems Motivating Examples Category 1 [A bird] perched on the [limb] and [it] bent. [The bee] landed on [the flower] because [it] had pollen. Category 2 [Jack] is robbed by [Kevin], and [he] is arrested by police. [Jim] was afraid of [Robert] because [he] gets scared around new people. 10

21 Hard Coreference Problems Motivating Examples Category 1 [A bird] perched on the [limb] and [it] bent. [The bee] landed on [the flower] because [it] had pollen. Category 2 [Jack] is robbed by [Kevin], and [he] is arrested by police. [Jim] was afraid of [Robert] because [he] gets scared around new people. C [Lakshman] asked [Vivan] to get him some ice cream because [he] was hot. 10

22 Predicate Schemas Type 1 Type 2 11

23 Predicate Schemas Type 1 Type 2 11

24 Predicate Schemas Sub Obj Type 1 Type 2 11

25 Predicate Schemas Sub Obj Type 1 (Cat1) [The bee] landed on [the flower] because [it] had pollen. Type 2 11

26 Predicate Schemas Type 1 Sub Obj (Cat1) [The bee] landed on [the flower] because [it] had pollen. S(have(m=[the flower], a=[pollen])) > S(have(m=[the bee], a=[pollen])) Type 2 11

27 Predicate Schemas Type 1 Sub Obj (Cat1) [The bee] landed on [the flower] because [it] had pollen. S(have(m=[the flower], a=[pollen])) > S(have(m=[the bee], a=[pollen])) Type 2 11

28 Predicate Schemas Type 1 Sub Obj (Cat1) [The bee] landed on [the flower] because [it] had pollen. S(have(m=[the flower], a=[pollen])) > S(have(m=[the bee], a=[pollen])) Type 2 11

29 Predicate Schemas Type 1 Sub Obj (Cat1) [The bee] landed on [the flower] because [it] had pollen. S(have(m=[the flower], a=[pollen])) > S(have(m=[the bee], a=[pollen])) Type 2 Shared Mention 11

30 Predicate Schemas Type 1 Sub Obj (Cat1) [The bee] landed on [the flower] because [it] had pollen. S(have(m=[the flower], a=[pollen])) > S(have(m=[the bee], a=[pollen])) Type 2 Shared Mention (Cat2) [Jim] was afraid of [Robert] because [he] gets scared around new people. 11

31 Predicate Schemas Sub Obj Type 1 (Cat1) [The bee] landed on [the flower] because [it] had pollen. S(have(m=[the flower], a=[pollen])) > S(have(m=[the bee], a=[pollen])) Shared Mention Type 2 (Cat2) [Jim] was afraid of [Robert] because [he] gets scared around new people. S S(be afraid of(a=*, m=*) get scared around(m=*, a=*), because) 11 ^ ^

32 Predicate Schemas Possible variations for scoring function statistics. 12

33 Predicate Schemas in Coreference 13

34 Predicate Schemas in Coreference Pairwise Mention Scoring Function 13

35 Predicate Schemas in Coreference Pairwise Mention Scoring Function Scoring Function for Predicate Schemas 13

36 Predicate Schemas in Coreference Pairwise Mention Scoring Function Scoring Function for Predicate Schemas We can add scores of Predicate Schemas as Features 13

37 Ways of Using Knowledge Major Disadvantages of Using Knowledge as Features Noise in Knowledge Inexplicit Textual Inference Alternative way Using Knowledge as Constraints 14

38 Using Knowledge as Constraints 15

39 Using Knowledge as Constraints Generating Constraints 15

40 Using Knowledge as Constraints Generating Constraints ILP inference (Best-Link) 15

41 Scores for Predicate Schemas Multiple Sources Gigaword Wikipedia Web Search Polarity Information 16

42 Scores for Predicate Schemas Gigaword Chunking + Dependency Parsing => predicate(subject, object) => Type 1 Predicate Schema Heuristic Coreference => Type 2 Predicate Schema 17

43 Scores for Predicate Schemas Gigaword Chunking + Dependency Parsing => predicate(subject, object) => Type 1 Predicate Schema Heuristic Coreference => Type 2 Predicate Schema Wikipedia Entity Linking to ground on Wikipedia Entries (Disambiguation) Gather Simple Statistics for 1) immediately after 2) immediately before 3) after 4) before => Type 1 Predicate Schema ) (approximation) 17

44 Scores for Predicate Schemas Web Search Google Query with Quote (counts) m predicate, m a, a m, m predicate a => Type 1 Predicate Schema 18

45 Scores for Predicate Schemas Web Search Google Query with Quote (counts) m predicate, m a, a m, m predicate a => Type 1 Predicate Schema Polarity Information Polarity on predicates => Polarity on mentions Negate polarity if mention is object Negate polarity for polarity-reversing connective +1 if polarities for mentions are the same -1 if polarities for mentions are different => Type 2 Predicate Schema 18

46 Recap Things to consider for using knowledge in NLP Knowledge Representation Predicate Schema Knowledge Inference Features VS. Inference Knowledge Acquisition Multiple Sources 19

47 Datasets Winograd dataset 1 [The bee] landed on [the flower] because [it] had pollen. [The bee] landed on [the flower] because [it] wanted pollen

48 Datasets Winograd dataset 1 [The bee] landed on [the flower] because [it] had pollen. [The bee] landed on [the flower] because [it] wanted pollen

49 Datasets Winograd dataset 1 [The bee] landed on [the flower] because [it] had pollen. [The bee] landed on [the flower] because [it] wanted pollen. [Jack] threw the bags of [John] into the water since [he] mistakenly asked him to carry his bags

50 Datasets Winograd dataset 1 [The bee] landed on [the flower] because [it] had pollen. [The bee] landed on [the flower] because [it] wanted pollen. [Jack] threw the bags of [John] into the water since [he] mistakenly asked him to carry his bags. W inocoref dataset

51 Evaluation on Hard Coreference Dataset Winograd WinoCoref Metric Precision AntePre Illinois IlliCons Rahman and Ng (2012) KnowFeat KnowCons KnowComb

52 Evaluation on Standard Coreference 22

53 Analysis on Effects of Schemas 23

54 Part 2 Profiler: Knowledge Schemas at Scale 24

55 Goal How to enlarge the Knowledge acquired from text Data Volume Schema Richness Profiler Demo: 25

56 Motivation 26

57 Enriched Schemas 27

58 Enriched Schemas 28

59 Enriched Schemas 29

60 Implementation 30

61 Effect of Wikification (Entity-Linking) 31

62 Effect of Wikification (Entity-Linking) 32

63 Knowledge Visualization 33

64 Knowledge Visualization 34

65 Publications [1] Solving Hard Coreference Problems. Haoruo Peng*, Daniel Khashabi* and Dan Roth. NAACL [2] A Joint Framework for Mention Head Detection and Coreference Resolution. Submitted to ACL [3] Profiler: Knowledge Schemas at Scale. Submitted to Transactions of ACL

66 Future Directions The use of world knowledge in NLP tasks Knowledge Representation (schemas) Is co-occurrence information enough? Knowledge Inference Sparsity Issues Knowledge Acquisition Which sources to choose? Interpolation / Tasks beyond CR (CR can be seen as a subset of AI-complete problems) Outlier Detection for Singleton Mentions 36

67 Collaborators Daniel Khashabi Zhiye Fei Kaiwei Chang Prof. Dan Roth 37

68 Thank You! 38

69 Difficulties in CR Hard Coreference Problems Ex.1 [A bird] perched on the [limb] and [it] bent. Ex.2 [Robert] is robbed by [Kevin], and [he] is arrested by police. Gender / Plurality information cannot help -> Requires Knowledge 39

Difficulties in CR Hard Coreference Problems Ex.1 [A bird] perched on the [limb] and [it] bent. Ex.2 [Robert] is robbed by [Kevin], and [he] is arrested by police.

70 Difficulties in CR Hard Coreference Problems Ex.1 [A bird] perched on the [limb] and [it] bent. Ex.2 [Robert] is robbed by [Kevin], and [he] is arrested by police. Gender / Plurality information cannot help -> Requires Knowledge 39

71 Difficulties in CR Hard Coreference Problems Ex.1 [A bird] perched on the [limb] and [it] bent. Ex.2 [Robert] is robbed by [Kevin], and [he] is arrested by police. Gender / Plurality information cannot help -> Requires Knowledge 39

72 Difficulties in CR Hard Coreference Problems Ex.1 [A bird] perched on the [limb] and [it] bent. Ex.2 [Robert] is robbed by [Kevin], and [he] is arrested by police. Gender / Plurality information cannot help -> Requires Knowledge Performance Gaps System Dataset Gold Predicted Gap Illinois CoNLL Berkeley CoNLL Stanford ACE > Requires Better Mention Detection 39

73 A Joint Framework for Mention Head Detection and Coreference Resolution Goal: Improve CR on predicted mentions (End-to-End) Solution: [Multinational companies investing in [China]] had become so angry that [they] recently set up an anti-piracy league to pressure [the [Chinese] government] to take action. [Domestic manufacturers, [who] are also suffering], launched a similar body this month. Traditional: MD -> Coref Our paper: Mention Head -> Joint Coref -> Head to Mention Joint Learning / Inference Step Results Add decision variables to decide whether to choose a head or not Joint Coref is able to reject some mention head candidates Dataset Illinois Baseline Our Paper ACE CoNLL

74 ILP formulation of CR Best-Link with Knowledge Constraints Best-Link with Joint Mention Detection 41

Narrative Schema as World Knowledge for Coreference Resolution

Narrative Schema as World Knowledge for Coreference Resolution Joseph Irwin Nara Institute of Science and Technology Nara Prefecture, Japan joseph-i@is.naist.jp Mamoru Komachi Nara Institute of Science