C-rule
From UNL Wiki
Revision as of 18:05, 8 December 2011
Compounding, or composition, is the word-formation process that creates compounds by combining lexemes.
Syntax
In the UNLarium framework, compounds are treated as ordinary simple words, except for discontinuous multi-word expressions and cases of infixation (such as "give in" or "take into account"). In these cases, the lemma is different from the base form, and the compound-formation process is expected to be defined through S-rules such as the following:
<SYNTACTIC ROLE>(<ADDED>);
Where:
<SYNTACTIC ROLE> is the syntactic role (VA, VC, VS, VH, etc.) of the term to be added to the base form; and
<ADDED> is the term to be added to the base form to form the compound. It can be a string between "quotes" or a lemma between [brackets].
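The syntax above can be illustrated with a small parser. This is only a sketch under stated assumptions: the names `Term` and `parse_rule` are hypothetical and not part of the UNLarium, the role is assumed to be a run of capital letters, and features such as order or adjacency are not handled.

```python
import re
from typing import List, NamedTuple


class Term(NamedTuple):
    role: str  # syntactic role, e.g. "VH", "VA", "VC"
    kind: str  # "lemma" for [brackets], "string" for "quotes"
    text: str  # the added material without its delimiters


# One term: ROLE([lemma]) or ROLE("string"). Assumption: roles are capital letters only.
_TERM = re.compile(r'([A-Z]+)\(\s*(?:\[([^\]]+)\]|"([^"]+)")\s*\)')


def parse_rule(rule: str) -> List[Term]:
    """Split a rule such as 'VH([up])VC("the ghost");' into its terms."""
    body = rule.strip()
    if not body.endswith(";"):
        raise ValueError("a rule must end with ';'")
    body = body[:-1]
    terms = []
    pos = 0
    for match in _TERM.finditer(body):
        if match.start() != pos:
            raise ValueError("unexpected text at position %d" % pos)
        role, lemma, string = match.groups()
        if lemma is not None:
            terms.append(Term(role, "lemma", lemma))
        else:
            terms.append(Term(role, "string", string))
        pos = match.end()
    if not terms or pos != len(body):
        raise ValueError("malformed rule: %r" % rule)
    return terms


print(parse_rule('VH([in]);'))
print(parse_rule('VA("into account");'))
```

Each term keeps its role together with the information on whether the added material was written as a lemma or as a string.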
Examples
Lemma | Base Form | Compound | Description |
---|---|---|---|
give in | give | VH([in]) | the lemma "in" is to be added to the base form as part of the head of the verb (VH) |
take into account | take | VA("into account") | the string "into account" is to be added to the base form as an adjunct to the verb (VA) |
throw <person> to the lions | throw | VA("to the lions") | the string "to the lions" is to be added to the base form as an adjunct to the verb (VA) |
Observations
- Phrasal verbs
- Particles of phrasal verbs must be represented as part of the head, if non-separable, or as adjuncts, if separable:
- give in = VH([in]); ("give in something" but not "give something in")
- give back = VA([back]); ("give back something" or "give something back")
- General syntactic roles (NP, PP, XP) must not be defined in composition rules but inside the subcategorization frame
- throw <person> to the lions = +VA("to the lions"); (and not VA("to the lions")VC(NP); the lemma should be associated with the transitive frame instead)
- "Quotes" or [brackets]?
- In the compound-formation process, the UNLarium distinguishes between strings (to be represented between "") and lemmas (to be represented between [ ]). The difference between strings and lemmas has to do with the dictionary status: lemmas (but not strings) are expected to be dictionary entries.
- VA("into account"); (the string "into account" is not expected to be a dictionary entry)
- VC([sense]); (the term "sense" is expected to be a dictionary entry).
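The distinction between strings and lemmas lends itself to a simple consistency check: every bracketed term should be found in the dictionary, while quoted strings are not looked up. The sketch below assumes a hypothetical helper `missing_lemmas` and represents the dictionary as a plain set of lemmas, which is a simplification and not the UNLarium data model.

```python
import re

# Bracketed material, e.g. [sense]; quoted strings are deliberately ignored.
BRACKETED = re.compile(r'\[([^\]]+)\]')


def missing_lemmas(rule, dictionary):
    """Return the bracketed lemmas of a rule that are not dictionary entries."""
    return [lemma for lemma in BRACKETED.findall(rule) if lemma not in dictionary]


dictionary = {"give", "sense"}  # toy dictionary, for illustration only
print(missing_lemmas('VC([sense]);', dictionary))
print(missing_lemmas('VA("into account");', dictionary))
print(missing_lemmas('VH([up])VC("the ghost");', dictionary))
```

A non-empty result points to a term that either needs its own dictionary entry or should have been written as a string between quotes.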
- Complex compounds
- Compounds must include as many terms as different syntactic roles. One single "+" must be provided at the beginning of the rule:
- give up the ghost = VH([up])VC("the ghost"); (not +VH("up the ghost") or +VC("up the ghost"))
- Order is to be represented by the distribution features (">", ">>", "<", "<<", ...), if not default
- VC([love]); (order must not be specified, because in English complements come at the right side by default: make > make love)
- NS([the]); (order must not be specified, because in English specifiers come at the left side by default: Netherlands > the Netherlands)
- NA(>>,[available]); (order must be specified, because in English nominal adjuncts come at the left side by default: table > new table)
- Adjacency is to be represented by the adjacency features (AJ0,AJ1,AJ2,...), if not default
- VC([love]); (adjacency must not be specified, because in English complements come after the head by default: make > make love)
- VH([up])VC("the ghost"); (adjacency must not be specified, because in English head particles come before complements by default: give > give up the ghost)
- VA([home],AJ1)VC("the bacon",AJ2); (adjacency must be specified, because in English the complement is normally generated before the adjunct, which would yield bring the bacon home)
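Rules that carry distribution or adjacency features can be inspected by separating each term's arguments. The following is a minimal sketch, not a reference implementation: the function `describe` and the dictionary keys are hypothetical, distribution features are assumed to consist only of "<" and ">" characters, adjacency features are assumed to have the form AJ plus a number, and the added material is assumed to contain no parentheses. The sketch only labels the features; it does not interpret them.

```python
import re

# ROLE(arguments); assumption: the arguments contain no parentheses.
TERM = re.compile(r'([A-Z]+)\(([^()]*)\)')
# An argument is a [lemma], a "string", or a bare feature such as >> or AJ1.
ARG = re.compile(r'\[[^\]]+\]|"[^"]*"|[^,\s]+')


def describe(rule):
    """List role, added material, order and adjacency for each term of a rule."""
    result = []
    for role, args in TERM.findall(rule):
        info = {"role": role, "added": None, "order": None, "adjacency": None}
        for arg in ARG.findall(args):
            if arg[0] in '["':
                info["added"] = arg          # lemma or string, delimiters kept
            elif re.fullmatch(r'AJ\d+', arg):
                info["adjacency"] = arg      # adjacency feature, e.g. AJ1
            elif re.fullmatch(r'[<>]+', arg):
                info["order"] = arg          # distribution feature, e.g. >>
            else:
                raise ValueError("unknown argument: %r" % arg)
        result.append(info)
    return result


for term in describe('VA([home],AJ1)VC("the bacon",AJ2);'):
    print(term)
for term in describe('NA(>>,[available]);'):
    print(term)
```

Terms that rely on the default order and adjacency simply come back with `None` in the corresponding fields.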