... | ... | @@ -45,7 +45,6 @@ Similarly to _PARSEME:MWE_, the information in the 11th column called _PARSEME-F |
|
|
|
|
|
### Example
|
|
|
|
|
|
**(marie: je trouve qu'un exemple fabriqué permet de couvrir plus de phénomènes, ?)**
|
|
|
|
|
|
Here is an example of sentence using the PARSEME-FR _cupt_ format described above,
|
|
|
showing only columns 1 (ID), 2 (FORM) and 11 (MWE / NE annotation).
|
... | ... | @@ -54,37 +53,40 @@ E.g. "Peugeot" is annotated as a final ORG named entity (NE-ORG.final), with ide |
|
|
|
|
|
"tout au plus" is annotated as a MWE, more precisely tokens "tout", "à", "le" and "plus" are annotated with identifier 3 ("au" is a multi-word token which is not annotated). It has "ADV" as part-of-speech, meaning it behaves as an adverb, but it is considered as irregular from the syntactic point of view. The criterion that was used to annotate it is "IRREG".
|
|
|
|
|
|
The sentence contains an example of embedded MWEs: tokens 18 to 22 form a named entity of type ORG, itself containing a MWE ("conseil général") and a LOC (Essonne).
|
|
|
The sentence contains an example of a word (support verb "effectuait") belonging to two LVCs (tokens 21+23 and tokens 21+26 each form a LVC).
|
|
|
|
|
|
```
|
|
|
# text = Chez Peugeot tout au plus on savait que Jean Gapé ne faisait plus partie du conseil général de l'Essonne.
|
|
|
# text = Chez Peugeot tout au plus on savait que Jean Gapé ne faisait plus partie du conseil général mais effectuait divers travaux et divers tâches.
|
|
|
# sent_id = 123
|
|
|
# source_sent_id = http://deep-sequoia.inria.fr/download/sequoia-8.2.tgz sequoia-8.2/sequoia.surf.conll 123
|
|
|
1 Chez ... *
|
|
|
2 Peugeot ... 1:PROPN|NE-PERS.prim|_;2:PROPN|NE-ORG.final|_
|
|
|
3 tout ... 3:ADV|MWE|IRREG
|
|
|
4-5 à ... *
|
|
|
4 à ... 3
|
|
|
5 le ... 3
|
|
|
6 plus ... 3
|
|
|
7 on ... *
|
|
|
8 savait ... *
|
|
|
9 que ... *
|
|
|
10 Jean ... 4:PROPN|EN-PERS.final|_
|
|
|
11 Gapé ... 4
|
|
|
12 ne ... *
|
|
|
13 faisait ... 8:_|MWE-VID|_
|
|
|
14 plus ... *
|
|
|
15 partie ... 8
|
|
|
16-17 du ... *
|
|
|
16 de ... *
|
|
|
17 le ... *
|
|
|
18 conseil ... 5:PROPN|EN-ORG.final|_;6:_|MWE|LEX
|
|
|
19 général ... 5;6
|
|
|
20 de ... 5
|
|
|
21 l' ... 5
|
|
|
22 Essonne ... 5;7:PROPN|EN-LOC.final|_
|
|
|
23 . ... *
|
|
|
1 Chez ... *
|
|
|
2 Peugeot ... 1:PROPN|NE-PERS.prim|_;2:PROPN|NE-ORG.final|_
|
|
|
3 tout ... 3:ADV|MWE|IRREG
|
|
|
4-5 à ... *
|
|
|
4 à ... 3
|
|
|
5 le ... 3
|
|
|
6 plus ... 3
|
|
|
7 on ... *
|
|
|
8 savait ... *
|
|
|
9 que ... *
|
|
|
10 Jean ... 4:PROPN|EN-PERS.final|_
|
|
|
11 Gapé ... 4
|
|
|
12 ne ... *
|
|
|
13 faisait ... 8:_|MWE-VID|_
|
|
|
14 plus ... *
|
|
|
15 partie ... 8
|
|
|
16-17 du ... *
|
|
|
16 de ... *
|
|
|
17 le ... *
|
|
|
18 conseil ... 5:_|MWE|LEX
|
|
|
19 général ... 5
|
|
|
20 mais ... *
|
|
|
21 effectuait ... 6:_|LVC|_;7:_|LVC|_
|
|
|
22 divers ... *
|
|
|
23 travaux ... 6
|
|
|
24 et ... *
|
|
|
25 diverses ... *
|
|
|
26 tâches ... 6
|
|
|
27 . ... *
|
|
|
```
|
|
|
|
|
|
### Morpho-syntactic annotation schemes, and link to the MWE / NE annotations
|
... | ... | |