Update Corpus format description authored by Agata Savary
@@ -17,7 +17,7 @@ Similarly to _PARSEME:MWE_, the information in the 11th column called _PARSEME-F
* **pos-category-criteria labels** are strings corresponding to information about the MWE/NE. These labels are composed of three fields separated by a pipe '|' character (i.e. POS|CATEGORY|CRITERION1,CRITERION2...):
1. **POS** is a tag representing the part of speech of the whole MWE/NE. The tags were inferred automatically using heuristics, or defined manually for irregular constructions. **[MARIE add link to POS details here if relevant](XXX)**.
2. **CATEGORY** is a tag corresponding to a category that depends on the type of entity being annotated. It contains a prefix and a suffix, separated by a dash.
- * The prefix of the tag indicates whether this is a MWE (EP for _expression polylexicale_) or NE (EN for _entité nommée_).
+ * The prefix of the tag indicates whether this is a MWE (EP for _expression polylexicale_) or a NE (EN for _entité nommée_).
* The suffix depends on prefix as follows:
1. For verbal MWEs (POS is VERB, prefix is EP), the suffix corresponds to the [PARSEME 1.1 verbal MWE categories](http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.1/?page=categ).
2. For non-verbal MWEs (POS is not VERB, prefix is EP), the suffix is unspecified (_).
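
As an illustration of the label layout described above, here is a minimal Python sketch that splits a pos-category-criteria label into its parts. The names `ParsedLabel` and `parse_label` are hypothetical and not part of any PARSEME tooling, and the example labels are made up for illustration. The sketch assumes the dash is a plain ASCII hyphen, that only the first dash separates prefix from suffix, and that the criteria field is a simple comma-separated list; the suffix rules for NEs are not shown in this excerpt, so the suffix is returned uninterpreted.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ParsedLabel:
    """Hypothetical container for one POS|CATEGORY|CRITERIA label."""
    pos: str                 # part of speech of the whole MWE/NE
    prefix: str              # "EP" (MWE) or "EN" (NE)
    suffix: Optional[str]    # None when the suffix is unspecified ("_")
    criteria: List[str]      # criteria names, in the order given


def parse_label(label: str) -> ParsedLabel:
    """Split a label of the form POS|CATEGORY|CRITERION1,CRITERION2..."""
    fields = label.split("|")
    if len(fields) != 3:
        raise ValueError(f"expected 3 pipe-separated fields, got {len(fields)}: {label!r}")
    pos, category, criteria_field = fields

    # CATEGORY is "prefix-suffix"; assumption: split on the first dash only.
    prefix, dash, suffix = category.partition("-")
    if not dash:
        raise ValueError(f"CATEGORY has no dash: {category!r}")
    if prefix not in ("EP", "EN"):
        raise ValueError(f"unknown CATEGORY prefix: {prefix!r}")

    # Assumption: criteria are comma-separated; empty items are dropped.
    criteria = [c for c in criteria_field.split(",") if c]

    return ParsedLabel(
        pos=pos,
        prefix=prefix,
        suffix=None if suffix == "_" else suffix,
        criteria=criteria,
    )


if __name__ == "__main__":
    # Made-up labels: a verbal MWE and a non-verbal MWE.
    print(parse_label("VERB|EP-VID|CRITERION1,CRITERION2"))
    print(parse_label("NOUN|EP-_|CRITERION1"))
```

A verbal MWE label keeps its PARSEME 1.1 category as the suffix (for example `VID`), while a non-verbal MWE label with `_` yields `suffix=None`.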