Title: Fall 2005
 1EECS 595 / LING 541 / SI 661
Natural Language Processing
- Fall 2005 
- Lecture Notes 7
2Natural Language Generation 
 3What is NLG?
- Mapping meaning to text 
- Stages 
- Content selection 
- Lexical selection 
- Sentence structure aggregation, referring 
 expressions
- Discourse structure
4Systemic grammars
- Language is viewed as a resource for expressing 
 meaning in context (Halliday, 1985)
- Layers mood, transitivity, theme
The system will save the document
Mood subject finite predicator object
Transitivity actor process process goal
Theme theme rheme rheme rheme 
 5Example
- (process save-1actor system-1goal 
 document-1speechact assertiontense future
- ) 
- ? Input is underspecified
6The Functional Unification Formalism (FUF)
- Based on Kays (83) formalism 
- partial information, declarative, uniform, 
 compact
- same framework used for all stages syntactic 
 realization, lexicalization, and text planning
7Functional analysis
- Functional vs. structured analysis 
- John eats an apple 
- actor (John), affected (apple), process (eat) 
- NP VP NP 
- suitable for generation
8Partial vs. complete specification
action  eat
actor  John
object  apple
- Voice An apple is eaten by John 
- Tense John ate an apple 
- Mode Did John ear an apple? 
- Modality John must eat an apple 
- prolog p(X,b,c)
9Unification
- Target sentence 
- input FD 
- grammar 
- unification process 
- linearization process
10Sample input
((cat s) (prot ((n ((lex john))))) (verb ((v 
((lex like))))) (goal ((n ((lex mary)))))) 
 11Sample grammar
((alt top (((cat s) (prot ((cat 
np))) (goal ((cat np))) 
(verb ((cat vp) (number prot 
number))) (pattern (prot verb 
goal))) ((cat np) (n ((cat 
noun) (number   number))) 
 (alt (((proper yes) 
(pattern (n))) ((proper no) 
 (pattern (det n)) 
 (det ((cat article) 
(lex the))))))) ((cat vp) 
 (pattern (v)) (v ((cat verb)))) 
 ((cat noun)) ((cat verb)) 
 ((cat article))))) 
 12Sample output
((cat s) (goal ((cat np) (n ((cat noun) 
 (lex mary) (number goal 
number))) (pattern (n)) (proper 
yes))) (pattern (prot verb goal)) (prot ((cat 
np) (n ((cat noun) (lex 
john) (number verb number))) 
 (number verb number) (pattern (n)) 
 (proper yes))) (verb ((cat vp) 
(pattern (v)) (v ((cat verb) 
(lex like)))))) 
 13Comparison with Prolog
- Similarities 
- both have unification at the core 
- Prolog program  FUF grammar 
- Prolog query  FUF input 
- Differences 
- Prolog first order term unification 
- FUF arbitrarily rooted directed graphs are 
 unified
14The SURGE grammar
- Syntactic realization front-end 
- variable level of abstraction 
- 5600 branches and 1600 alts 
Lexicalized FD
Syntactic FD
LinearizerMorphology
Lexicalchooser
SURGE
Text 
 15Systems developed using FUF/SURGE
- COMET 
- MAGIC 
- ZEDDOC 
- PLANDOC 
- FLOWDOC 
- SUMMONS
16CFUF
- Fast implementation by Mark Kharitonov (C) 
- Up to 100 times faster than Lisp/FUF 
- Speedup higher for larger inputs 
17References
- Cole, Mariani, Uszkoreit, Zaenen, Zue (eds.) 
 Survey of the State of the Art in Human Language
 Technology, 1995
- Elhadad, Using Argumentation to Control Lexical 
 Choice A Functional Unification Implementation,
 1993
- Elhadad, FUF the Universal Unifier, User Manual, 
 1993
- Elhadad and Robin, SURGE a Comprehensive Plug-in 
 Syntactic Realization Component for Text
 Generation, 1999
- Kharitonov, CFUF A Fast Interpreter for the 
 Functional Unification Formalism, 1999
- Radev, Language Reuse and Regeneration 
 Generating Natural Language Summaries from
 Multiple On-Line Sources, Department of Computer
 Science, Columbia University, October 1998
18Path notation
- You can view a FD as a tree 
- To specify features, you can use a path 
- feature feature  feature value 
- e.g. prot number 
- You can also use relative paths 
-  number value gt the feature number for the 
 current node
-   number value gt the feature number for the 
 node above the current node
19Sample grammar
((alt top (((cat s) (prot ((cat 
np))) (goal ((cat np))) 
(verb ((cat vp) (number prot 
number))) (pattern (prot verb 
goal))) ((cat np) (n ((cat 
noun) (number   number))) 
 (alt (((proper yes) 
(pattern (n))) ((proper no) 
 (pattern (det n)) 
 (det ((cat article) 
(lex the))))))) ((cat vp) 
 (pattern (v)) (v ((cat verb)))) 
 ((cat noun)) ((cat verb)) 
 ((cat article))))) 
 20Unification Example 
 21Unify Prot 
 22Unify Goal 
 23Unify vp 
 24Unify verb 
 25Finish 
 26Discourse Analysis 
 27The problem
- Discourse 
- Monologue and Dialogue (dialog) 
- Human-computer interaction 
- Example John went to Bills car dealership to 
 check out an Acura Integra. He looked at it for
 about half an hour.
- Example Id like to get from Boston to San 
 Francisco, on either December 5th or December
 6th. Its okay if it stops in another city along
 the way.
28Information extraction and discourse analysis
- Example First Union Corp. is continuing to 
 wrestle with severe problems unleashed by a
 botched merger and a troubled business strategy.
 According to industry insiders at Paine Webber,
 their president, John R. Georgius, is planning to
 retire by the end of the year.
- Problems with summarization and generation 
29Reference resolution
- The process of reference (associating John with 
 he).
- Referring expressions and referents. 
- Needed discourse models 
- Problem many types of reference!
30Example (from Webber 91)
- According to John, Bob bought Sue an Integra, and 
 Sue bough Fred a legend.
- But that turned out to be a lie. - referent is a 
 speech act.
- But that was false. - proposition 
- That struck me as a funny way to describe the 
 situation. - manner of description
- That caused Sue to become rather poor. - event 
- That caused them both to become rather poor. - 
 combination of several events.
31Reference phenomena
- Indefinite noun phrases I saw an Acura Integra 
 today.
- Definite noun phrases The Integra was white. 
- Pronouns It was white. 
- Demonstratives this Acura. 
- Inferrables I almost bought an Acura Integra 
 today, but a door had a dent and the engine
 seemed noisy.
- Mix the flour, butter, and water. Kneed the dough 
 until smooth and shiny.
32Constraints on coreference
- Number agreement John has an Acura. It is red. 
- Person and case agreement () John and Mary have 
 Acuras. We love them (where WeJohn and Mary)
- Gender agreement John has an Acura. He/it/she is 
 attractive.
- Syntactic constraints 
- John bought himself a new Acura. 
- John bought him a new Acura. 
- John told Bill to buy him a new Acura. 
- John told Bill to buy himself a new Acura 
- He told Bill to buy John a new Acura.
33Preferences in pronoun interpretation
- Recency John has an Integra. Bill has a Legend. 
 Mary likes to drive it.
- Grammatical role John went to the Acura 
 dealership with Bill. He bought an Integra.
- (?) John and Bill went to the Acura dealership. 
 He bought an Integra.
- Repeated mention John needed a car to go to his 
 new job. He decided that he wanted something
 sporty. Bill went to the Acura dealership with
 him. He bought an Integra.
34Preferences in pronoun interpretation
- Parallelism Mary went with Sue to the Acura 
 dealership. Sally went with her to the Mazda
 dealership.
- ??? Mary went with Sue to the Acura dealership. 
 Sally told her not to buy anything.
- Verb semantics John telephoned Bill. He lost his 
 pamphlet on Acuras. John criticized Bill. He lost
 his pamphlet on Acuras.
35An algorithm for pronoun resolution
- Two steps discourse model update and pronoun 
 resolution.
- Salience values are introduced when a noun phrase 
 that evokes a new entity is encountered.
- Salience factors set empirically. 
36Salience weights in Lappin and Leass
Sentence recency 100
Subject emphasis 80
Existential emphasis 70
Accusative emphasis 50
Indirect object and oblique complement emphasis 40
Non-adverbial emphasis 50
Head noun emphasis 80 
 37Lappin and Leass (contd)
- Recency weights are cut in half after each 
 sentence is processed.
- Examples 
- An Acura Integra is parked in the lot. (subject) 
- There is an Acura Integra parked in the lot. 
 (existential predicate nominal)
- John parked an Acura Integra in the lot. (object) 
- John gave Susan an Acura Integra. (indirect 
 object)
- In his Acura Integra, John showed Susan his new 
 CD player. (demarcated adverbial PP)
38Algorithm
- Collect the potential referents (up to four 
 sentences back).
- Remove potential referents that do not agree in 
 number or gender with the pronoun.
- Remove potential referents that do not pass 
 intrasentential syntactic coreference
 constraints.
- Compute the total salience value of the referent 
 by adding any applicable values for role
 parallelism (35) or cataphora (-175).
- Select the referent with the highest salience 
 value. In case of a tie, select the closest
 referent in terms of string position.
39Example
- John saw a beautiful Acura Integra at the 
 dealership last week. He showed it to Bill. He
 bought it.
Rec Subj Exist Obj IndObj NonAdv HeadN Total
John 100 80 50 80 310
Integra 100 50 50 80 280
dealership 100 50 80 230 
 40Example (contd)
Referent Phrases Value
John John 155
Integra a beautiful Acura Integra 140
dealership the dealership 115 
 41Example (contd)
Referent Phrases Value
John John, he1 465
Integra a beautiful Acura Integra 140
dealership the dealership 115 
 42Example (contd)
Referent Phrases Value
John John, he1 465
Integra a beautiful Acura Integra, it 420
dealership the dealership 115 
 43Example (contd)
Referent Phrases Value
John John, he1 465
Integra a beautiful Acura Integra, it 420
Bill Bill 270
dealership the dealership 115 
 44Example (contd)
Referent Phrases Value
John John, he1 232.5
Integra a beautiful Acura Integra, it1 210
Bill Bill 135
dealership the dealership 57.5 
 45Observations
- Lappin  Leass - tested on computer manuals - 86 
 accuracy on unseen data.
- Centering (Grosz, Josh, Weinstein) additional 
 concept of a center  at any time in discourse,
 an entity is centered.
- Backwards looking center forward looking centers 
 (a set).
- Centering has not been automatically tested on 
 actual data.
46Discourse structure
- () Bill went to see his mother. The trunk is 
 what makes the bonsai, it gives it both its grace
 and power.
- Coherence principle 
- John hid Bills car keys. He was drunk 
- ?? John hid Bills car keys. He likes spinach 
- Rhetorical Structure Theory (Mann, Matthiessen, 
 and Thompson)
47Sample rhetorical relations
Relation Nucleus Satellite
Antithesis ideas favored by the author ideas disfavored by the author
Background text whose understanding is being facilitated text for facilitating understanding
Concession situation affirmed by author situation which is apparently inconsistent but also affirmed by author
Elaboration basic information additional information
Purpose an intended situation the intent behind the situation
Restatement a situation a reexpression of the situation
Summary Text a short summary of that text 
 48Example (from MMT)
1) Title Bouquets in a basket - with living 
flowers 2) There is a gardening revolution going 
on. 3) People are planting flower baskets with 
living plants, 4) mixing many types in one 
container for a full summer of floral beauty. 5) 
To create your own "Victorian" bouquet of 
flowers, 6) choose varying shapes, sizes and 
forms, besides a variety of complementary colors. 
 7) Plants that grow tall should be surrounded by 
smaller ones and filled with others that tumble 
over the side of a hanging basket. 8) Leaf 
textures and colors will also be important. 9) 
There is the silver-white foliage of dusty 
miller, the feathery threads of lotus vine 
floating down from above, the deep greens, or 
chartreuse, even the widely varied foliage colors 
of the coleus. Christian Science Monitor, 
April, 1983 
 49Example (contd) 
 50Cross-document structure 
 51(No Transcript) 
 52(No Transcript)