... | ... | @@ -14,7 +14,7 @@ Similarly to _PARSEME:MWE_, the information in the 11th column called _PARSEME-F |
2. an underscore '_' if the MWE/NE annotation is unspecified or missing
3. a list of semicolon-separated **CODES** if the current word is part of one or more MWEs/NEs. Codes are only assigned to the lexicalized components of a MWE/NE (see [Lexicalized components and open slots](http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.1/?page=lexicalized) in the PARSEME annotation guidelines).
- for all the components of a MWE/NE except the (linearly) first one), the CODE is simply an **IDENTIFIER**:
- for all the components of a MWE/NE except the (linearly) first one, the CODE is simply an **IDENTIFIER**:
* the **IDENTIFIER** of a MWE/NE is an integer greater than or equal to 1 that is unique within the sentence: the only requirement for IDENTIFIERS is that all the components of a MWE/NE have CODES starting with the same IDENTIFIER and that no other MWE/NE in the sentence uses it.
- for the (linearly) first component of a MWE/NE, the code consists of an identifier followed by a colon ':' and a **LABEL**:
* **LABELS** provide information about the MWE/NE and are composed of a **POS** field, a **CATEGORY** field and a **CRITERIA** field, separated by a pipe '|' character (i.e. POS|CATEGORY|CRITERION1,CRITERION2...; for instance, ADP|MWE|IRREG describes a MWE (not a NE) whose part of speech is ADP and for which the criterion IRREG has been used):
... | ... | |
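The CODE syntax described above can be illustrated with a small parser. This is only a sketch under stated assumptions, not part of any official PARSEME tooling: the names `Code`, `parse_code` and `parse_column` are hypothetical, the sketch handles only the underscore value and the semicolon-separated list of CODES, and any other column value is rejected with a `ValueError`.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Code:
    """One CODE attached to a word (hypothetical container for this sketch)."""
    identifier: int                  # integer >= 1, unique within the sentence
    pos: Optional[str] = None        # only set on the first component
    category: Optional[str] = None   # only set on the first component
    criteria: Tuple[str, ...] = ()   # only set on the first component


def parse_code(code: str) -> Code:
    """Parse 'IDENTIFIER' or 'IDENTIFIER:POS|CATEGORY|CRIT1,CRIT2...'."""
    ident, colon, label = code.partition(":")
    identifier = int(ident)  # raises ValueError if not an integer
    if identifier < 1:
        raise ValueError(f"identifier must be >= 1, got {identifier}")
    if not colon:
        # Non-first component: the CODE is just the IDENTIFIER.
        return Code(identifier)
    # First component: the LABEL has exactly three pipe-separated fields.
    pos, category, criteria = label.split("|")
    return Code(identifier, pos, category,
                tuple(c for c in criteria.split(",") if c))


def parse_column(value: str) -> Optional[List[Code]]:
    """Return None for '_' (unspecified), else the list of CODES."""
    if value == "_":
        return None
    return [parse_code(c) for c in value.split(";")]


if __name__ == "__main__":
    # A word that continues MWE 1 and starts MWE 2.
    print(parse_column("1;2:ADP|MWE|IRREG"))
```

Checking that each IDENTIFIER is used by only one MWE/NE requires the whole sentence, so that constraint is deliberately left out of this per-word sketch.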