Nestle 1904 GNT - Feature: after

Feature group	Feature type	Data type	Available for node types	Used by viewtypes
`Orthograpic`	`Node`	`String`	`word` `subphrase` `phrase`	`syntax-view` `wg-view`

Feature description

The after feature includes all material found after a word, such as regular space characters, punctuation marks followed by a space, and text-critical markers. This feature is essential for preserving the context and formatting of the text. This feature is coded in Unicode using polytonic accents over the vowels (oxia, varia, and perispomeni).

This feature is also populated for phrase or subphrase, but only if they consist of just one word node.

Feature values

All material found after a word. The frequency is provided by the table below.

For word nodes (used in syntax-view and wg-view):

Value	Description	Unicode codepoint	Frequency
	Space	`&#32`	119261
`,`	Comma & space	`&#44` & `&#32`	9439
`.`	Full stop & space	`&#46` & `&#32`	5704
`·`	Midle dot & space	`&#183` & `&#32`	2355
`;`	Semicolon & space	`&#59` & `&#32`	969
`,—`	Comma, em dash & space	`&#44` & `&#8212` & `&#32`	18
`—`	Em dash & space	`&#8212` & `&#32`	7
`).`	Closing round bracket, full stop & space	`&#41` & `&#46` & `&#32`	6
`.]]`	Full stop & 2 Right Square Bracket	`&#46` & 2x `&#93`	4
etc..	Various	various

For phrase nodes (used in syntax-view):

Value	Description	Unicode codepoint	Frequency
	Space	`&#32`	37661
`,`	Comma & space	`&#44` & `&#32`	3892
`.`	Full stop & space	`&#46` & `&#32`	2724
`·`	Midle dot & space	`&#183` & `&#32`	1187
`;`	Semicolon & space	`&#59` & `&#32`	588
`,—`	Comma, em dash & space	`&#44` & `&#8212` & `&#32`	8
`).`	Closing round bracket, full stop & space	`&#41` & `&#46` & `&#32`	4
etc..	Various	various

For subphrase nodes (used in syntax-view):

Value	Description	Unicode codepoint	Frequency
	Space	`&#32`	119261
`,`	Comma & space	`&#44` & `&#32`	9439
`.`	Full stop & space	`&#46` & `&#32`	5704
`·`	Midle dot & space	`&#183` & `&#32`	2355
`;`	Semicolon & space	`&#59` & `&#32`	969
`,—`	Comma, em dash & space	`&#44` & `&#8212` & `&#32`	18
`—`	Em dash & space	`&#8212` & `&#32`	7
`).`	Closing round bracket, full stop & space	`&#41` & `&#46` & `&#32`	6
`.]]`	Full stop & 2 Right Square Bracket	`&#46` & 2x `&#93`	4
etc..	Various	various

Notes

The following image shows the features describing the material found after a word.

The following features describe the full surface text:

after (this feature): All material found after a word (including critical signs).
before: All material found before a word.
criticalsign: Text-critical signs.
punctuation: Punctuations found after a word.
normalized: Normalized Greek text.
text: Word without punctuations and text-critical signs.
trailer: All material found after a word (excluding text-critical signs).
translit: Transliteration of the word surface texts.
unaccent: Word without accents and diacritical markers.
unicode: Unicode presentation including all material before and after word.

The following image shows the relation between these features.

The following text-formating options are defined in this dataset using this feature:

  A.showFormats()
     format              level    template
     lex-orig-plain      word     {lemma}{trailer}
     lex-translit-plain  word     {lextranslit}{trailer}
     text-orig-full      word     {before}{text}{after}
     text-orig-plain     word     {text}{trailer}
     text-translit-plain word     {translit}{trailer}
     text-unaccent-plain word     {unaccent}{trailer}

Source description

The after feature is based on the XML attribute after of the w (word) tag.

N1904-TF

Nestle 1904 GNT - Feature: after

Feature description

Feature values

Notes

Source description

Browse all features by name, node type, data type, feature group or feature type.