The .wrd format

Files of type .wrd (found in /data/annot/text/wrd/ on the annotation DVD) contain a manually verified word segmentation in which the words occurring in the orthographic transcription have been linked to the audio signal. The files are in ShortTextGrid format and can be created, edited or viewed with the PRAAT software. For a description of the ShortTextGrid format, see the description of the .ort format. Two tiers are provided for every speaker. The name of the first tier is the speaker ID; this name is identical to that of the corresponding tier in the .ort file. The second tier has the same name with the suffix _FON (for example, N98765 and N98765_FON) and contains the phonetic transcription that can also be found in the .fon file. The time markers are the same in both tiers.
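The tier conventions described above can be checked mechanically. The following is a minimal sketch, not part of the corpus tooling: it assumes the tiers have already been read from the .wrd file into a dictionary that maps each tier name to a list of (start, end, label) tuples. That in-memory layout and the function name check_tier_pairs are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

# Assumed in-memory layout (not defined by the .wrd format itself):
# one interval is (start time, end time, label).
Interval = Tuple[float, float, str]

FON_SUFFIX = "_FON"


def check_tier_pairs(tiers: Dict[str, List[Interval]]) -> List[str]:
    """Return a list of problems found; an empty list means the tiers are consistent."""
    problems = []
    for name, intervals in tiers.items():
        if name.endswith(FON_SUFFIX):
            # A phonetic tier must belong to an existing speaker tier.
            if name[: -len(FON_SUFFIX)] not in tiers:
                problems.append(f"{name}: no matching speaker tier")
            continue
        fon_name = name + FON_SUFFIX
        if fon_name not in tiers:
            problems.append(f"{name}: missing {fon_name} tier")
            continue
        # Both tiers must share exactly the same interval boundaries.
        ort_times = [(start, end) for start, end, _ in intervals]
        fon_times = [(start, end) for start, end, _ in tiers[fon_name]]
        if ort_times != fon_times:
            problems.append(f"{name}: time markers differ from {fon_name}")
    return problems


# Illustrative data only; the labels are made up.
example = {
    "N98765": [(0.0, 0.5, ""), (0.5, 1.0, "hallo")],
    "N98765_FON": [(0.0, 0.5, ""), (0.5, 1.0, "hAlo")],
}
print(check_tier_pairs(example))
```

The boundaries are compared for exact equality because both tiers are expected to carry the same time markers within a single file; if the values were recomputed or rounded elsewhere, a small tolerance might be more appropriate.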

An interval in the tier with the orthographic transcription is filled with exactly one word (with or without underscores), a single underscore ("_"), or a pause (empty interval).
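The rule for the orthographic tier can be expressed as a small classifier. This is a sketch under one stated assumption: a "word" is taken to be any non-empty label without whitespace, which the description above implies but does not state explicitly. The function name classify_ort_label and the returned category names are hypothetical.

```python
def classify_ort_label(label: str) -> str:
    """Classify the label of one interval in the orthographic tier."""
    if label == "":
        return "pause"        # empty interval
    if label == "_":
        return "underscore"   # a single underscore on its own
    # Assumption: a label containing whitespace holds more than one word
    # (or stray blanks) and therefore violates the one-word rule.
    if any(ch.isspace() for ch in label):
        return "invalid"
    return "word"             # exactly one word, with or without underscores


# Illustrative labels only.
for label in ["", "_", "hallo", "op_zich", "twee woorden"]:
    print(repr(label), classify_ort_label(label))
```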

In the tier with the phonetic transcription the following phenomena can occur:

For an overview of the phonetic symbols that were used, we refer to the description of the .fon format. As with the .fon format, the .wrd file does not contain a BACKGROUND and/or COMMENT tier.