TUT

From UNL Wiki
Revision as of 19:18, 13 December 2010 by Martins (Talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

TUT (Text-to-Text through UNL) is a digital library of texts represented in the Universal Networking Language (UNL). It comprises links to the integral version of more than 30,000 titles and, whenever available, the UNL version of the text, along with three possible realizations (summarized, simplified and rephrased), in any of the languages available in the UNL System.

Goals

The main goal of TUT is to increase and to extend the semantic accessibility to texts. In that sense, TUT contributes to education and to the diffusion of knowledge as:

  • it renders the text understandable for those who do not speak the language of the original (by generating a version of it in any of the languages available in the UNL system);
  • it renders the text comprehensible for those who have reading difficulties (by generating a simplified version of the text in any of the languages available in the UNL system); and
  • it identifies the most important information in a text (by generating a summarized version of the text in any of the languages available in the UNL system).

Corpus

TUT comprises links to more than 30,000 titles, written by more than 10,000 different authors, in more than 30 different languages, most of which hosted by the Project Gutenberg. As we are targeting reading material under public domain, the repository comprises mainly books published before 1923. For the time being, the collection of UNL documents is still very small.

Results

Partial results of TUT are available at http://www.unlweb.net/tut.

Software