Text-Fabric dataset of the Greek New Testament, based on the Nestle 1904 (7th printing) edition.
About this datasetFeature group | Feature type | Data type | Available for node types | Used by viewtypes |
---|---|---|---|---|
Orthograpic |
Node |
String |
word subphrase phrase |
syntax-view wg-view |
This feature contains all material after the end of the word (excluding critical signs). This includes punctuations and trailing space.
For word
nodes (used in syntax-view
and wg-view
):
Value | Description | Unicode codepoint(s) | Frequency |
---|---|---|---|
|
space |   |
119270 |
, |
Comma & space | , &   |
9462 |
. |
Full Stop & space | . &   |
5717 |
· |
Midle Dot & space | · &   |
2359 |
; |
Semicolon & space | ; &   |
971 |
For phrase
nodes (used in syntax-view
):
Value | Description | Unicode codepoint(s) | Frequency |
---|---|---|---|
|
space |   |
37665 |
, |
Comma & space | , &   |
3903 |
. |
Full Stop & space | . &   |
2731 |
· |
Midle Dot & space | · &   |
1189 |
; |
Semicolon & space | ; &   |
589 |
For subphrase
nodes (used in syntax-view
):
Value | Description | Unicode codepoint(s) | Frequency |
---|---|---|---|
|
space |   |
119270 |
, |
Comma | ,   |
9462 |
. |
Full Stop & space | . &   |
5717 |
· |
Midle Dot & space | · &   |
2359 |
; |
Semicolon & space | ; &   |
971 |
The following image shows the features describing the material found after a word.
The following set of features describe the full surface text:
The following image shows the relation between these features.
The following text-formating options are defined in this dataset using this feature:
A.showFormats() format level template lex-orig-plain word {lemma}{trailer} lex-translit-plain word {lextranslit}{trailer} text-orig-full word {before}{text}{after} text-orig-plain word {text}{trailer} text-translit-plain word {translit}{trailer} text-unaccent-plain word {unaccent}{trailer}
Calculated from the from XML attribute after
of tag w
(word).