Distribution
(→Examples) |
(d) |
||
Line 3: | Line 3: | ||
== Natural Language == | == Natural Language == | ||
In the UNLarium framework, the distribution must be informed through [[s-rule]]s, except in case of exceptional (non-default) cases, to be registered in the dictionary by the following values: | In the UNLarium framework, the distribution must be informed through [[s-rule]]s, except in case of exceptional (non-default) cases, to be registered in the dictionary by the following values: | ||
+ | |||
+ | === Representing distribution in the dictionary === | ||
{{#tree:id=DIS|openlevels=0|root=Distribution (DIS)| | {{#tree:id=DIS|openlevels=0|root=Distribution (DIS)| | ||
Line 13: | Line 15: | ||
**free (FRE): coming in any position | **free (FRE): coming in any position | ||
*adjacency (PXM) | *adjacency (PXM) | ||
− | **immediate (IMM): right after or right before | + | **immediate (IMM): right after or right before (priority = 0) |
− | **near (NEA): precedence over other constituents (except IMM) | + | **near (NEA): precedence over other constituents (except IMM) (priority = 1) |
− | **distant (FAR): no precedence over other constituents | + | **distant (FAR): no precedence over other constituents (priority > 1) |
}} | }} | ||
− | === Examples === | + | ==== Examples ==== |
*Order | *Order | ||
**very = BEF - In English, the intensifier "very" is a premodifier: ''He is very rich'' (<strike>''He is rich very''</strike>) | **very = BEF - In English, the intensifier "very" is a premodifier: ''He is very rich'' (<strike>''He is rich very''</strike>) | ||
Line 28: | Line 30: | ||
**down (in "put down") = NEA (In English, the adverb "down" must come right after the base form "put" in order to form the compound "put down", except for the complement: ''Put down that'' (<strike>''Put down it''</strike>) | **down (in "put down") = NEA (In English, the adverb "down" must come right after the base form "put" in order to form the compound "put down", except for the complement: ''Put down that'' (<strike>''Put down it''</strike>) | ||
− | === Observations === | + | ==== Observations ==== |
;The distribution of words must be informed in the dictionary only if variable. | ;The distribution of words must be informed in the dictionary only if variable. | ||
:The field "distribution" must be filled in only if different words of the same category may occur in different positions | :The field "distribution" must be filled in only if different words of the same category may occur in different positions | ||
Line 46: | Line 48: | ||
::English intensifiers must come right before the intensified ("very well"), therefore BEF&IMM; | ::English intensifiers must come right before the intensified ("very well"), therefore BEF&IMM; | ||
::Adverbs of manner normally comes after the complements ("She read the letter slowly"), therefore "AFT&FAR"; | ::Adverbs of manner normally comes after the complements ("She read the letter slowly"), therefore "AFT&FAR"; | ||
+ | |||
+ | === Representing distribution in the grammar === | ||
+ | In the grammar, distribution is defined through [[S-rule]]s in the following format: | ||
+ | |||
+ | <SYNTACTIC ROLE>(<ORDER>,<ADJACENCY>); | ||
+ | |||
+ | Where:<br /> | ||
+ | *<SYNTACTIC ROLE> is the [[Syntax#Syntactic_Roles|syntactic role]] (VA, VC, VS, VH, etc) of the constituent in relation to the head; and | ||
+ | *<ORDER> is the position of the constituent in relation to the head. It may assume one of the following values: | ||
+ | **'''FNT''' in the beginning of the sentence | ||
+ | **'''END''' in the end of the sentence | ||
+ | **'''BEF''' or '''<<''' to the left before a blank space | ||
+ | **'''AFT''' or '''>>''' to the right after a blank space | ||
+ | **'''>''' immediately to the right (i.e., without any blank space) | ||
+ | **'''<''' immediately to the left (i.e., without any blank space) | ||
+ | *<ADJACENCY> is the precedence of the constituent in relation to other constituents of the same phrase. It may assume one of the following values: | ||
+ | **'''IMM''' immediately: right after or right before | ||
+ | **'''NEA''' precedence over other constituents (except IMM) | ||
+ | **'''FAR''' distant: no precedence over other constituents | ||
+ | |||
+ | ==== Examples ==== | ||
+ | ;VS(<<,IMM); | ||
+ | :the specifier must be generated to the left of the verb before a blank space with precedence over any other constituent | ||
+ | ;VC(>>,FAR); | ||
+ | :the complement must be generated to the right of the verb after a blank space without any precedence over over other constituents | ||
+ | |||
+ | ==== Observations ==== | ||
+ | ;Order and adjacency may be represented in different rules: | ||
+ | :VS(<<); (the specifier must be generated to the left of the verb before a blank space) | ||
+ | :VS(IMM); (the specifier must be generated with precedence over any other constituent) | ||
+ | ;Complex distribution | ||
+ | :A single distribution rule may contain several distribution operations: | ||
+ | :VS(<<)VS(IMM); (the same as "VS(<<,IMM);") | ||
+ | *VS(<<)VC(>>); (the specifier will be generated to the left and the complement to the right of the head) | ||
+ | *VS(FAR)VC(FAR); (both the specifier and the complement of the verb have no precedence over other constituents) | ||
+ | ;Conditional distribution | ||
+ | :Conditional case-marking may be stated by defining the left side of the s-rule and coindexing it to the right side: | ||
+ | *VC(>>); (unconditional distribution: the complement will be always generated to the right of the verb); | ||
+ | *VC(PPR):=VC(<<); (conditional distribution: the complement will be generated to the left of the verb if a personal pronoun (PPR); | ||
+ | ;Adjacency | ||
+ | :Adjacency must be informed when two constituents are to be generated in the same direction (otherwise, the system will simply follow the order of application of rules defined in the grammar) | ||
+ | *VC(>>)VA(>>,FAR); (or "VC(>>)VA(>>)VA(FAR);", i.e., the complement comes nearer the head than the adjunct) | ||
+ | Adjacency states a gradient of proximity and should be assigned only to differentiate the priority of generation | ||
+ | *VC(>>,NEA)VA(>>); or VC(>>)VA(>>,FAR); but there's no need for <strike>VC(>>,IMM)VA(>>,FAR);</strike> | ||
+ | *VS(>>,IMM)VC(>>,NEA)VA(>>); or VS(>>,IMM)VC(>>)VA(>>,FAR); but there's no need for <strike>VS(>>,IMM)VC(>>,NEA)VA(>>,FAR);</strike> | ||
+ | Adjacency is limited to three values (IMM, NEA, FAR) because of the binary nature of branching in the [[Syntax|X-bar approach]]. More complex structures should be reorganized as intermediary projections and only then related one another. See [[projection]] for further information. | ||
+ | ;Reordering | ||
+ | :Reordering can be done in two different ways: | ||
+ | *By [[Ph-rule]]s, if the process involves neighbour items and affects only the surface structure of the phrase; | ||
+ | *By attribute change (i.e., deleting and adding distribution features), such as in "VC(->>,<<);" (i.e.,delete the "after" attribute and add the "before" attribute) | ||
+ | ;The symbol '''^''' is used for negation and to control infinite recursion | ||
+ | *VC(^>>):=VC(>>); (assign the "after" attribute to the complement of the verb if it does not have it yet) | ||
== UNL == | == UNL == | ||
Word order is not informed in UNL. | Word order is not informed in UNL. |
Revision as of 11:02, 24 March 2010
Distribution (or word order) refers to the study of the order of the syntactic constituents of a language.
Contents |
Natural Language
In the UNLarium framework, the distribution must be informed through s-rules, except in case of exceptional (non-default) cases, to be registered in the dictionary by the following values:
Representing distribution in the dictionary
Examples
- Order
- very = BEF - In English, the intensifier "very" is a premodifier: He is very rich (
He is rich very) - well = AFT - In English, the adverb of manner "well" is a postmodifier: He speaks well (
He well speaks) - yesterday = FRE - In English, the adverb of time "yesterday" may come either before or after the modified: Now I go or I go now.
- very = BEF - In English, the intensifier "very" is a premodifier: He is very rich (
- Adjacency
- the = FAR (In English, the article "the" has no precedence over other modifiers: the small round black leather handbag (
small the round black leather handbag). - after (in "look after") = IMM (In English, the preposition "after" must come right after the base form "come" in order to form the compound "look after": We look after them (
We look them after) - down (in "put down") = NEA (In English, the adverb "down" must come right after the base form "put" in order to form the compound "put down", except for the complement: Put down that (
Put down it)
- the = FAR (In English, the article "the" has no precedence over other modifiers: the small round black leather handbag (
Observations
- The distribution of words must be informed in the dictionary only if variable.
- The field "distribution" must be filled in only if different words of the same category may occur in different positions
- Adverbs, in English, may be premodifiers or postmodifiers. Therefore, distribution of adverbs must be informed in the dictionary.
- Articles, in English, are always premodifiers. Therefore, distribution of articles must not be informed in the dictionary.
- The distribution of words must be informed in the dictionary only if not the default one.
- The grammar brings the normal (default) distribution of the words in a language. Only exceptions to the rule must be informed in the dictionary.
- Adjectives, in English, are normally premodifiers. Therefore, distribution of premodifier adjectives (such as "beautiful") must not be informed in the dictionary. On the other hand, free order adjectives (such as "possible": "it is the only solution possible" or "it is the only possible solution") must be tagged, in the dictionary, with the corresponding feature (FRE).
- Middle position should be used only for words to be inserted inside others (i.e., between the prefix and the root, or the root and the suffix).
- Adverbs coming between auxiliaries and verbs must be defined as premodifiers.
- Distribution values are not exclusive
- BEF&AFT means that the word may occur both as a premodifier or as postmodifier;
- BEF&MID means that the word may occur both as a premodifier or as a middle modifier.
- Order and adjacency may be combined to express specific distributions
- BEF&IMM means that the word occurs right before the modified (as with English intensifiers)
- Order must be informed only when required
-
- English intensifiers must come right before the intensified ("very well"), therefore BEF&IMM;
- Adverbs of manner normally comes after the complements ("She read the letter slowly"), therefore "AFT&FAR";
Representing distribution in the grammar
In the grammar, distribution is defined through S-rules in the following format:
<SYNTACTIC ROLE>(<ORDER>,<ADJACENCY>);
Where:
- <SYNTACTIC ROLE> is the syntactic role (VA, VC, VS, VH, etc) of the constituent in relation to the head; and
- <ORDER> is the position of the constituent in relation to the head. It may assume one of the following values:
- FNT in the beginning of the sentence
- END in the end of the sentence
- BEF or << to the left before a blank space
- AFT or >> to the right after a blank space
- > immediately to the right (i.e., without any blank space)
- < immediately to the left (i.e., without any blank space)
- <ADJACENCY> is the precedence of the constituent in relation to other constituents of the same phrase. It may assume one of the following values:
- IMM immediately: right after or right before
- NEA precedence over other constituents (except IMM)
- FAR distant: no precedence over other constituents
Examples
- VS(<<,IMM);
- the specifier must be generated to the left of the verb before a blank space with precedence over any other constituent
- VC(>>,FAR);
- the complement must be generated to the right of the verb after a blank space without any precedence over over other constituents
Observations
- Order and adjacency may be represented in different rules
- VS(<<); (the specifier must be generated to the left of the verb before a blank space)
- VS(IMM); (the specifier must be generated with precedence over any other constituent)
- Complex distribution
- A single distribution rule may contain several distribution operations:
- VS(<<)VS(IMM); (the same as "VS(<<,IMM);")
- VS(<<)VC(>>); (the specifier will be generated to the left and the complement to the right of the head)
- VS(FAR)VC(FAR); (both the specifier and the complement of the verb have no precedence over other constituents)
- Conditional distribution
- Conditional case-marking may be stated by defining the left side of the s-rule and coindexing it to the right side:
- VC(>>); (unconditional distribution: the complement will be always generated to the right of the verb);
- VC(PPR):=VC(<<); (conditional distribution: the complement will be generated to the left of the verb if a personal pronoun (PPR);
- Adjacency
- Adjacency must be informed when two constituents are to be generated in the same direction (otherwise, the system will simply follow the order of application of rules defined in the grammar)
- VC(>>)VA(>>,FAR); (or "VC(>>)VA(>>)VA(FAR);", i.e., the complement comes nearer the head than the adjunct)
Adjacency states a gradient of proximity and should be assigned only to differentiate the priority of generation
- VC(>>,NEA)VA(>>); or VC(>>)VA(>>,FAR); but there's no need for
VC(>>,IMM)VA(>>,FAR); - VS(>>,IMM)VC(>>,NEA)VA(>>); or VS(>>,IMM)VC(>>)VA(>>,FAR); but there's no need for
VS(>>,IMM)VC(>>,NEA)VA(>>,FAR);
Adjacency is limited to three values (IMM, NEA, FAR) because of the binary nature of branching in the X-bar approach. More complex structures should be reorganized as intermediary projections and only then related one another. See projection for further information.
- Reordering
- Reordering can be done in two different ways:
- By Ph-rules, if the process involves neighbour items and affects only the surface structure of the phrase;
- By attribute change (i.e., deleting and adding distribution features), such as in "VC(->>,<<);" (i.e.,delete the "after" attribute and add the "before" attribute)
- The symbol ^ is used for negation and to control infinite recursion
- VC(^>>):=VC(>>); (assign the "after" attribute to the complement of the verb if it does not have it yet)
UNL
Word order is not informed in UNL.