... | @@ -9,3 +9,9 @@ The Multi Column Description Format (mcd) associates [labels](column_labels) to |
... | @@ -9,3 +9,9 @@ The Multi Column Description Format (mcd) associates [labels](column_labels) to |
|
* **INT** indicates that the values are integers
|
|
* **INT** indicates that the values are integers
|
|
* **EMB** indicates that the values are real valued vectors (embeddings)
|
|
* **EMB** indicates that the values are real valued vectors (embeddings)
|
|
* The fourth column is used for embeddings, it is the name of a file containing the embeddings
|
|
* The fourth column is used for embeddings, it is the name of a file containing the embeddings
|
|
|
|
|
|
|
|
Here is an example of an mcd file
|
|
|
|
|
|
|
|
1 | FORM | VOCAB | _
|
|
|
|
---- | ---- | ---- | ---- | ----
|
|
|
|
2 | POS | VOCAB | _ |