... | ... | @@ -164,32 +164,27 @@ What is the **final type** of the candidate sequence _c_? |
|
|
<!--------------------------------------------------------------------------------------------->
|
|
|
<!--------------------------------------------------------------------------------------------->
|
|
|
<!---------------------------------------------------------------------------------------------
|
|
|
<!--
|
|
|
|
|
|
## Color code
|
|
|
In the following, different colors are used to display examples:
|
|
|
- <font color="red">Red</font> is used for counter-examples, that is, expressions which look like VMWEs but are not one, whatever the language.
|
|
|
-->
|
|
|
|
|
|
## Step 1 - choosing a candidate sequence
|
|
|
choose a candidate sequence _c_ such that _c_ is a nominal group and _c_ names an entity of one of the relevant [types](ep_et_en#-2-les-types-den) ([PERS](ep_et_en#-21-noms-de-personne-pers), [LOC](ep_et_en#-22-noms-de-lieu-loc), [ORG](ep_et_en#-23-noms-dorganisation-org-incluant-les-humains-collectifs), [PROD](ep_et_en#-24-produits-humains-prod) or [EVE](ep_et_en#-25-ev%C3%A9nements-nomm%C3%A9s-eve))
|
|
|
apply [UniqueRef](#test-1-uniqueref-unique-referent)(_c_)
|
|
|
**NO** => _c_ is **not a NE**
|
|
|
**YES** => go to [Step 2](#step-2-identifying-a-named-entity).
|
|
|
--------------------------------------------------------------------------------------------->
|
|
|
|
|
|
|
|
|
<!--
|
|
|
**NO** => apply [DefDesc](#DefDesc)(_c_,_t_)
|
|
|
**NO** => _c_ is not a NE
|
|
|
-->
|
|
|
|
|
|
|
|
|
|
|
|
<!------------------------------------------------
|
|
|
### Test 1 [UniqueRef] - unique referent
|
|
|
Does the sequence name a unique object in the discourse world?
|
|
|
<!--- in an autonomous manner, i.e. without having to take the linguistic context (other than the place and date of the utterance) into account? -->
|
|
|
<!-- but indentifying the referent of the latter expression requires the linguistic context. Test passed for _François Ruffin_ but not for _Le désormais célèbre réalisateur de Merci patron!_.-->
|
|
|
<!-- but indentifying the referent of the latter expression requires the linguistic context. Test passed for _François Ruffin_ but not for _Le désormais célèbre réalisateur de Merci patron!_.
|
|
|
|
|
|
Examples:
|
|
|
* "C’est un véritable petit exploit qu’a accompli _François Ruffin_. _Le désormais célèbre réalisateur du documentaire Merci patron !_ a réussi à rattraper un retard de presque dix points" - both _François Ruffin_ (PERS) and _Le désormais célèbre réalisateur de Merci patron!_ (no NE) refer to a unique referent in the discourse world. Test passed.
|
... | ... | @@ -200,9 +195,8 @@ Examples: |
|
|
* _Angiox est un médicament suédois_ - Angiox is considered here as the name of an invention (molecule) or of a trade mark, it refers to a unique instance. Test passed.
|
|
|
* _J'ai pris 2 Angiox avant de dormir_ - Angiox refers to several products of this mark. Test not passed.
|
|
|
* _A Maisons-Alfort il y a plusieurs Pierres Martins_ - _Pierres Martins_ (no NE) refers to several persons. Test not passed.
|
|
|
------------------------------------------------>
|
|
|
|
|
|
<!-- obsolete: il s'agit bien de l'interpretation en contexte
|
|
|
obsolete: il s'agit bien de l'interpretation en contexte
|
|
|
**AJOUT PENDANT ADJUDICATION MATHIEU/MARIE**:
|
|
|
Does the reference to a unique referent need to hold without context?
|
|
|
It seems we need to add this precision.
|
... | ... | @@ -217,12 +211,9 @@ The test is complicated by the fact that a given name may well be ambiguous ("Pi |
|
|
* Pour "Commission", on a un peu plus l'impression qu'hors contexte il réfère bien à la commission européenne, mais en toute rigueur on peut construire un texte ou "Commission" tout seul est le diminutif d'une autre commission...
|
|
|
|
|
|
**Solution 2**: on prend aussi des diminutifs d'EN, même s'il faut un peu de contexte pour avoir la référence précise. Terrain glissant !!!!!
|
|
|
-->
|
|
|
|
|
|
|
|
|
|
|
|
<!--
|
|
|
------------------------------------------------
|
|
|
### Test 3 [DefDesc] - definite description
|
|
|
|
|
|
Is the candidate sequence a definite description? A sequence is a definite description if its referent cannot be identified on the basis of the sole sequence, but requires empirical (i.e. extra-liguistic) knowledge instead.
|
... | ... | @@ -231,11 +222,8 @@ Examples: |
|
|
* _Le désormais célèbre réalisateur du documentaire Merci patron ! a réussi à rattraper un retard de presque dix points_ - empirical knowledge is understand that a film has a director and to know the director of this precise film (Merci patron !). Test passed.
|
|
|
* _Le président de la République exerce la plus haute fonction du pouvoir exécutif de la République française._ - _le président de la République_ (no NE) has a unique referent but the sentence defines it, so no previous knowledge is needed to identify it. Test not passed.
|
|
|
* _Le Conseil de l'Union européenne a été informé des économies réalisées par le FMI._ - empirical knowledge is needed to understand that a, institution like EU is led by a counsel - _Conseil de l'Union européenne_ (ORG) - consisting of all prime ministers. Test passed.
|
|
|
-->
|
|
|
|
|
|
|
|
|
<!--
|
|
|
------------------------------------------------
|
|
|
### Test 5 [InitUpper] - initial uppercase
|
|
|
|
|
|
Is the candidate sequence spelled with an initial uppercase letter?
|
... | ... | @@ -270,6 +258,5 @@ Does the candidate sequence occur at the middle of the sentence? |
|
|
Examples:
|
|
|
* _Affaire des disparus du Beach: 100 morts_ - test passed for _Affaire des disparus du Beach_ (EVE/no NE)
|
|
|
* _Il a évoqué l'Affaire des disparus du Beach_ - test not passed for _Affaire des disparus du Beach_ (NE)
|
|
|
-->
|
|
|
|
|
|
|