Skip to content
Snippets Groups Projects
Commit 07795c9d authored by Franck Dary's avatar Franck Dary
Browse files

Added more SPLIT transitions because arabic needs it

parent ac4b5d0a
No related branches found
No related tags found
No related merge requests found
......@@ -27,6 +27,11 @@ tokenizer.ts: all_no_test.conllu $(MCD)
echo "SPLIT 0" >> $@
echo "SPLIT 1" >> $@
echo "SPLIT 2" >> $@
echo "SPLIT 3" >> $@
echo "SPLIT 4" >> $@
echo "SPLIT 5" >> $@
echo "SPLIT 6" >> $@
echo "SPLIT 7" >> $@
echo "ADDCHARTOWORD" >> $@
echo "IGNORECHAR" >> $@
sed -i -e 's/^/<tokenizer> /' $@
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment