English grammar

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(Common features)
Line 2: Line 2:
  
  
== Common features==
+
== Requisites ==
The following features
+
The grammars here presented depend heavily on the structure of the dictionary presented at [[English dictionary]]. You have to be acquainted with the formalism described at the [[UNL Dictionary Specs]] and the [[Tagset]] in order to fully understand the use of the dictionary entry structure used by the grammars.
  
;SHEAD = beggining of the sentence (system-defined feature assigned automatically by the machine)
+
== Features ==
;CHEAD = beginning of a scope (system-defined feature assigned automatically by the machine)
+
The grammars play with a set of features that come from three different origins:
;STAIL = end of the sentence (system-defined feature assigned automatically by the machine)
+
*'''Dictionary features''' are the features ascribed to the entries in the dictionary, and appear either as simple attributes (LEX,GEN,NUM), as simple values (N,MCL,SNG) or attribute-value pairs (LEX=N,GEN=MCL,NUM=SNG).
;CTAIL = end of a scope (system-defined feature assigned automatically by the machine)
+
*'''System-defined features''' are features automatically assigned by EUGENE and IAN during the processing. They are the following:
;BLK = blank space (feature assigned by the dictionary - the entry [ ]{}""(BLK)<;;>; must be included in the dictionary)
+
**SHEAD = beggining of the sentence (system-defined feature assigned automatically by the machine)
;DIGIT = UW is digit (feature assigned by the dictionary - this feature should be associated to the entries whose UW is a digit)
+
**CHEAD = beginning of a scope (system-defined feature assigned automatically by the machine)
;PUT = punctuation sign (feature assigned by the dictionary - this feature should
+
**STAIL = end of the sentence (system-defined feature assigned automatically by the machine)
 +
**CTAIL = end of a scope (system-defined feature assigned automatically by the machine)
 +
**TEMP = temporary entry (system-defined feature assigned to the strings that are not present in the dictionary)
 +
*'''Grammar features''' are features created inside the grammar in any of its intermediate states between the input and the output. They are the following:
  
 
== EN-UNL (Analysis) Grammar ==
 
== EN-UNL (Analysis) Grammar ==

Revision as of 00:59, 27 July 2012

The English grammars here presented are used for representing English sentences into UNL (UNLization) and for generating English sentences from UNL graphs (NLization). They follow the syntax defined at the UNL Grammar Specs and have been used for IAN and EUGENE, i.e., they work on the sentence level. As a still ongoing work, they cover yet a rather small set of constructions, but may be used as strategy to address the constructions of the Corpus500.


Contents

Requisites

The grammars here presented depend heavily on the structure of the dictionary presented at English dictionary. You have to be acquainted with the formalism described at the UNL Dictionary Specs and the Tagset in order to fully understand the use of the dictionary entry structure used by the grammars.

Features

The grammars play with a set of features that come from three different origins:

  • Dictionary features are the features ascribed to the entries in the dictionary, and appear either as simple attributes (LEX,GEN,NUM), as simple values (N,MCL,SNG) or attribute-value pairs (LEX=N,GEN=MCL,NUM=SNG).
  • System-defined features are features automatically assigned by EUGENE and IAN during the processing. They are the following:
    • SHEAD = beggining of the sentence (system-defined feature assigned automatically by the machine)
    • CHEAD = beginning of a scope (system-defined feature assigned automatically by the machine)
    • STAIL = end of the sentence (system-defined feature assigned automatically by the machine)
    • CTAIL = end of a scope (system-defined feature assigned automatically by the machine)
    • TEMP = temporary entry (system-defined feature assigned to the strings that are not present in the dictionary)
  • Grammar features are features created inside the grammar in any of its intermediate states between the input and the output. They are the following:

EN-UNL (Analysis) Grammar

EN-UNL (Analysis) Transformation Grammar

EN-UNL (Analysis) Disambiguation Grammar

UNL-EN (Generation) Grammar

UNL-EN (Generation) Transformation Grammar

UNL-EN (Generation) Disambiguation Grammar

Software