README
JSON Structure
The JSON object contains the following fields:
-
lu
: The lexical unit (LU)/trigger in the sentence. -
pos_lu
: The part of speech tag of the LU. (corresponding to ftrigger). -
lemma_lu
: The lemma or root form of the LU. -
frame
: The semantic frame associated with the LU. -
question
: The text of the question. -
id
: A unique identifier for the question-answer pair. -
answers
: A list of dictionaries containing the reference answers with the following fields:-
text
: The text of the reference answer. -
role
: The semantic role of the answer. -
answer_start
: The starting character offset of the answer in the context. -
answer_end
: The ending character offset of the answer in the context. -
coref
: A dictionary for coreference information, withanchor
andmentions
fields. -
wrong_answer
: The incorrect reference answer if there was a correction made.
-
-
predictions
: A dictionary containing model predictions and corresponding ROUGE-L scores. Each model has an entry with:-
answer_pred
: The predicted answer by the model. -
rougeL
: The ROUGE-L score of the prediction. -
HScore
: The HScore of the prediction as computed in the paper : "Correct" as human annotation → 1, "Partiellement correct" → 0.5 and 0 otherwise. In the case of mutliple annotation for the same question, the average HScore of all the annotation is taken.
-
-
human_annot
: A dictionary containing human annotations for each model's output. Each model has an entry which is a list of annotations:-
annot
: The annotation identifier. -
rating
: The rating given by the human annotator (e.g., "Correct").
-
-
lu_in_question
: Boolean corresponding to whether the trigger is found in the question for this example (corresponding to fLU in q). -
nb_fe_frame
: Number of Frame Elements in the frame that triggered the question. (corresponding to fnb FEs). -
list_dep_lu_ans
: Detail of dependencies crossed between response and frame trigger. -
nb_arc_lu_ans
: Number of dependency arcs between the answer and the trigger of the question's frame. (corresponding to fdist). -
entropy_frame
: Entropy of the question's frame, common to all the examples of this frame. (corresponding to fentropy). -
complexity_vector
: Each element corresponds to a complexity factor, 1 if it's "active" and the example therefore corresponds to the difficult group, 0 otherwise. Indexes correspond to the following complexity factors:-
0
: fLU in q -
1
: ftrigger -
2
: fdist -
3
: fentropy -
4
: fnb FEs
-
Example
Here is an example of the JSON structure:
{
"lu": "devient",
"pos_lu": "AUXE",
"lemma_lu": "devenir",
"frame": "Becoming",
"question": "Quel type d'État devient l'Irlande ?",
"id": "8abee7c1-e632-4168-8a9c-225eb7e15f43",
"answers": [
{
"text": "un état souverain et indépendant",
"role": "Final_category",
"answer_start": 279,
"answer_end": 311,
"coref": {
"anchor": {},
"mentions": []
},
"wrong_answer": ""
}
],
"predictions": {
"MT5-large_260_AP0": {
"answer_pred": "état souverain et indépendant",
"rougeL": 1.0
}
},
"human_annot": {
"MT5-large": [
{
"annot": "annot_1",
"rating": "Correct"
}
]
},
"lu_in_question": true,
"nb_fe_frame": 2,
"list_dep_lu_ans": [
"conj",
"obj"
],
"nb_arc_lu_ans": 1,
"complexity_vector": [
0,
1,
0,
0,
1
]
}