Description
|
Values of linguistic features in individual text chunks. Each row of the table corresponds to a text chunk in the Koditex corpus. Columns represent linguistic features, and additionally text chunk ID, classification metadata (MODE, DIVISION, SUPERCLASS, CLASS) and length (_LEN).
The abbreviations for the classification categories (= values in the MODE, DIVISION, SUPERCLASS and CLASS columns) and linguistic feature names (= remaining column names) are explained in 00_README.pdf.
|