N1904-TF

Text-Fabric dataset of the Greek New Testament, based on the Nestle 1904 (7th printing) edition.

About this dataset
Transcription
Featureset
Optional features
Viewtypes
Textformats
Syntaxtrees
Tutorial
Latest release

Nestle 1904 GNT - Feature: after

Feature group Feature type Data type Available for node types Used by viewtypes
Orthograpic Node String word subphrase phrase syntax-view wg-view

Feature description

The after feature includes all material found after a word, such as regular space characters, punctuation marks followed by a space, and text-critical markers. This feature is essential for preserving the context and formatting of the text. This feature is coded in Unicode using polytonic accents over the vowels (oxia, varia, and perispomeni).

This feature is also populated for phrase or subphrase, but only if they consist of just one word node.

Feature values

All material found after a word. The frequency is provided by the table below.

For word nodes (used in syntax-view and wg-view):

Value Description Unicode codepoint Frequency
Space &#32 119261
, Comma & space &#44 & &#32 9439
. Full stop & space &#46 & &#32 5704
· Midle dot & space &#183 & &#32 2355
; Semicolon & space &#59 & &#32 969
,— Comma, em dash & space &#44 & &#8212 & &#32 18
Em dash & space &#8212 & &#32 7
). Closing round bracket, full stop & space &#41 & &#46 & &#32 6
.]] Full stop & 2 Right Square Bracket &#46 & 2x &#93 4
etc.. Various various  

For phrase nodes (used in syntax-view):

Value Description Unicode codepoint Frequency
Space &#32 37661
, Comma & space &#44 & &#32 3892
. Full stop & space &#46 & &#32 2724
· Midle dot & space &#183 & &#32 1187
; Semicolon & space &#59 & &#32 588
,— Comma, em dash & space &#44 & &#8212 & &#32 8
). Closing round bracket, full stop & space &#41 & &#46 & &#32 4
etc.. Various various  

For subphrase nodes (used in syntax-view):

Value Description Unicode codepoint Frequency
Space &#32 119261
, Comma & space &#44 & &#32 9439
. Full stop & space &#46 & &#32 5704
· Midle dot & space &#183 & &#32 2355
; Semicolon & space &#59 & &#32 969
,— Comma, em dash & space &#44 & &#8212 & &#32 18
Em dash & space &#8212 & &#32 7
). Closing round bracket, full stop & space &#41 & &#46 & &#32 6
.]] Full stop & 2 Right Square Bracket &#46 & 2x &#93 4
etc.. Various various  

Notes

The following image shows the features describing the material found after a word.

The following features describe the full surface text:

The following image shows the relation between these features.

The following text-formating options are defined in this dataset using this feature:

  A.showFormats()
     format              level    template
     lex-orig-plain      word     {lemma}{trailer}
     lex-translit-plain  word     {lextranslit}{trailer}
     text-orig-full      word     {before}{text}{after}
     text-orig-plain     word     {text}{trailer}
     text-translit-plain word     {translit}{trailer}
     text-unaccent-plain word     {unaccent}{trailer}

Source description

The after feature is based on the XML attribute after of the w (word) tag.


Browse all features by name, node type, data type, feature group or feature type.