@@ -4,15 +4,17 @@ The Multi Column Files (mcf) format is the text format used to represent text an
The list of labels is the following:
***FORM**
***CPOS**
***POS**
***LEMMA**
***FEATS**
***GOV**
***LABEL**
***SENT_SEG**
***A** to **Z**
***FORM** form of the word
***CPOS** coarse part of speech
***POS** part of speech
***LEMMA** lemma
***FEATS** other linguistic features (usually morphological)
***GOV** relative position of the governor (-n indicates that the governor is n words to the left, n indicates that it is n words to the right)
***LABEL** label of the syntactic dependency
***SENT_SEG** indicates that the word is the last word in the sentence
***A** to **Z** other labels used to represent other useful information (word duration, speaker, ...)
Here is an example of two sentences represented as an mcf. The first column corresponds to **FORM**, the second to **POS**, the third to **LEMMA** the fourth to **GOV** the fifth to **LABEL** and the last to *SENT_SEG**