236x Filetype PDF File size 0.42 MB Source: stp.lingfil.uu.se
AComputational Grammar for Swedish
Noun Phrases in a Natural Language
Interface to Databases
Master’s thesis
Department of Linguistics
University of Uppsala
Sweden
Karl Lundstedt
Academic supervisor: Anna S˚agvall Hein
Industrial supervisor: Gregor Jonsson,
Dialogue Technologies AB
Uppsala, 10th December 2002.
Abstract
This thesis describes a computational grammar that analyzes Swedish noun phrases, and maps
them to semantic representations. The noun phrase grammar is a component of the Swedish
version of the Phoenix natural language interface to databases system which maps natural language
questions to corresponding SQL queries. The coverage of the grammar is determined by the noun
phrase part of a grammar specification for Swedish questions. Beside grammatical literature, a
Wizard of Oz dialogue corpus has been used to investigate the grammatical phenomena supported
by the grammar specification. An evaluation corpus has been compiled by randomly selecting
questions from Internet FAQ:s about cellular phones, Internet banking and broadband connection.
From these questions, noun phrases were extracted and supplied to the grammar. After manually
examining the resulting semantic representations, it was found that 83% of a total of 254 noun
phrases were given all the intended semantic representations, and only the intended semantic
representations.
Keywords: Natural language interfaces to databases, Computational grammars, Noun phrases,
Question–Answering systems, Swedish grammar
i
Acknowledgment
I would like to thank Gregor Jonsson, Joakim Ingers, Christopher Dahr´en and Bjorn Als´en at
¨
Dialogue Technologies for making this thesis possible; professor Anna S˚agvall–Hein for guidance
and fruitful discussions; Hubert Lehmann and Thomas Haag at linguatec Entwicklung & Services
GmbHfor providing professional computational linguistic knowledge and skills; Peter Cedermark
whohasdevelopedtheSwedishnounandadjectivemorphologyusedinthenounphrasegrammar1;
Jorg Tiedemann for giving me valuable comments on the evaluation chapter; and Arne Jonsson
¨ ¨
at NLPLAB, Linkoping University for letting me use their dialogue corpus. I would also like to
¨
thank my wonderful girlfriend Elin for putting up with me in the not always so smooth course of
writing.
1Cedermark, P. Swedish noun and adjective morphology in a natural language interface to databases. Depart-
ment of Linguistics, Uppsala. (in progress)
ii
Contents
1 Introduction 1
2 Background 3
2.1 Question Answering systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2.1.1 Natural Language Interfaces to highly organized data . . . . . . . . . . . . 3
2.1.2 Q/Asystems finding answers in large collections of free text . . . . . . . . . 5
2.1.3 Advantages and disadvantages with NLIDBs . . . . . . . . . . . . . . . . . 6
2.2 The Phoenix NLIDB . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.2.1 System overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.2.2 The Phoenix parser and the XLDG grammar formalism . . . . . . . . . . . 8
3 Method 10
3.1 The Swedish grammar specification . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.2 Empirical material – Wizard of Oz corpus . . . . . . . . . . . . . . . . . . . . . . . 10
3.3 Theoretical material – grammatical literature . . . . . . . . . . . . . . . . . . . . . 11
3.3.1 Traditional Swedish grammars . . . . . . . . . . . . . . . . . . . . . . . . . 11
3.3.2 Swedish computational grammars . . . . . . . . . . . . . . . . . . . . . . . . 11
4 The NP grammar 12
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
4.2 Pre-posed attributes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
4.2.1 Adjectival phrases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
4.2.2 Measure phrases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
4.3 Post-posed attributes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
4.3.1 Prepositional phrases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
4.3.2 Coordination of prepositional phrases . . . . . . . . . . . . . . . . . . . . . 17
4.4 NPlevel rules – common nouns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
4.4.1 Syntactic categories of the NP . . . . . . . . . . . . . . . . . . . . . . . . . 19
4.4.2 Selecting correct specifier . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
4.4.3 Prepositional attributes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
4.4.4 Adjectival attributes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
4.4.5 Quantity attributes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
4.4.6 Definite attributes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
4.4.7 Naked nouns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
4.5 Proper nouns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
4.6 Coordination of NP:s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
4.6.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
4.6.2 Restrictions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
4.6.3 Scope . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
iii
no reviews yet
Please Login to review.