Anchor

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
Line 3: Line 3:
 
For instance, given the document  
 
For instance, given the document  
  
{|
+
{| class="wikitable"
 
|-
 
|-
 
!Original
 
!Original

Revision as of 18:54, 11 July 2013

In the scope of the project LACE, an anchor is an element that may facilitate word alignment at the document level.

For instance, given the document

Original Anchors
'''Geneva''' ({{IPAc-en|dʒ|ɨ|ˈ|n|iː|v|ə}}; {{lang-fr|Genève}}, {{IPA-fr|ʒə.nɛv|IPA}}; {{lang-frp|Genèva}}, {{IPA-frp|dzəˈnɛva|IPA}} and {{lang-de|Genf}}; {{IPA-de|gɛnf|IPA}}<ref>In the [[Languages of Switzerland|national languages]] of Switzerland the city is known as ''Genf'' (German), ''Ginevra'' (Italian) and ''Genevra'' ([[Romansh language|Romansh]]).</ref>) is the second most populous city in [[Switzerland]] (after [[Zurich]]) and is the most populous city of [[Romandy]], the French-speaking part of Switzerland. Situated where the [[Rhone]] exits [[Lake Geneva]], it is the capital of the [[Canton of Geneva|Republic and Canton of Geneva]]. The municipality ''(ville de Genève)'' has a population ({{as of|{{Swiss populations YM|CH-GE}}|lc=on}}) of {{Swiss populations|CH-GE|6621}}, and the canton (''République et Canton de Genève'', which includes the city) has {{Swiss populations|CH-GE|CH-GE}} residents.{{Swiss populations ref|CH-GE}} In 2007, the urban area, or ''agglomération franco-valdo-genevoise'' (''Great Geneva or Grand Genève in French'') had 1,240,000<ref>[http://www.are.admin.ch/themen/agglomeration/00641/03333/index.html?lang=fr Office fédéral du développement territorial ARE, DETEC, Étude thématique A1: l'évolution des villes et des agglomérations suisses, 19 décembre 2006]</ref> inhabitants in 189 municipalities in both Switzerland and France.<ref name=international>{{Fr icon}} {{cite web| url=http://www.are.admin.ch/themen/agglomeration/00694/index.html?lang=de |publisher=Office fédéral du développement territorial ARE |title=Agglomérations transfrontalières |date=22 February 2012}}</ref>


HTML elements

The following HTML elements are used to define the set of anchors in the project LACEhpc. They are said to involve smaller texts and, therefore, are more likely to provide lexical mappings.

Tag Description
<a> Defines a hyperlink
<b> Defines bold text
<caption> Defines a table caption
<dt> Defines a term (an item) in a definition list
<em> Defines emphasized text 
<figcaption> Defines a caption for a <figure> element
<h1> to <h6> Defines HTML headings
<i> Defines a part of text in an alternate voice or mood
<legend> Defines a caption for a <fieldset>, < figure>, or <details> element
<li> Defines a list item
<mark> Defines marked/highlighted text
<nav> Defines navigation links
<q> Defines a short quotation
<small> Defines smaller text
<strike> Not supported in HTML5. Deprecated in HTML 4.01. Defines strike-through text
<strong> Defines important text
<sub> Defines subscripted text
<sup> Defines superscripted text
<td> Defines a cell in a table
<th> Defines a header cell in a table

Observations

Nesting
Anchors must ignore nesting. For instance, given <b><i>ABC</i>DEF</b>, there will be two anchors: ABC and ABCDEF.
Software