Powered by <TEI:TOK>
TEITOK Help Pages
A TEITOK corpus consists of a collection of XML files. You can search through the corpus, or look at each XML file individually, either just viewing it, or editing it as administrator of the corpus. In TEITOK version 2, there are various different ways to view an XML file, depending on what information is available in the file. You can switch between these views (those that are available for the XML field), and you can define which of those you want the system to use by default.
The most basic view is the text view, which in TEITOK 1 was the only XML view. In the text view, the XML file is display as-is, stylized by CSS, accompanied by any additional information in the XML it can display: any facsimile images in the XML will be displayed next to the text, and any audio/video file on top. All stored information for each token is display upon moving the mouse over it, and as an administrator, you can modify each token by clicking on it. Every project always includes the text view, and it is the default view used for everything (unless otherwise specified).
Manuscript line view
The manuscript line view demonstrates each line in the transcription of a manuscript directly below a cut-out of the manuscript showing the line it corresponds to. For administrators, it is possible to edit directly from this view, which makes it very easy to correct mistakes since the visual evidence is directly above the transcription (although for transcription, the page-by-page transcription module is a more dedicated tool). A small demonstration project demonstrating this view can be found here.
The facsimile view is a view comparable to what is common for searchable PDF files: a visible version of the facsimile image, with a hidden layer behind it showing the words in the text. Moving your mouse over a word in the image will access the underlying token and show all the information related to it. The facsimile image depends on each token having information about the bounding box in the image, which can only feasibly be obtained when using OCR or HWR systems. There is a small demonstration project here that demonstrates the facsimile view in action.
For time-aligned audio transcriptions, TEITOK provides a waveform view, which shows the waveform analysis of the audio file on type, with the transcription below it. Clicking on the transcription will play the corresponding sound, and playing the audio will show the corresponding transcription. There is even a full editing mode, allowing you to create a transcription directly within TEITOK. More info about the module can be found here, and an example of a project using the waveform view is can be seen here (in the MADISON project)
The Pageflow view presents a transcribed facsimile document in a fashion similar to TEI visualization tools like EVT, with a left column showing the image, and the right column showing the text, showing one page at a time. More info about the module can be found here, and an example of a project using the pageflow view is can be seen here.
Interlinear Glossed Text
Typically, the lowest level in TEITOK is the token. But it is possible to add morpheme-based information below the token. XML files having that information can be displayed as interlinear glossed text (IGT), as in this example with a file converted from the Typecraft tool. The IGT view mimicks the view of for instance the Linguist Toolbox, with information vertically aligned below the word.
Back to index