Text-Fabric dataset of the Greek New Testament, based on the Nestle 1904 (7th printing) edition.
Feature group | Feature type | Data type | Available for node types | Used by viewtypes |
---|---|---|---|---|
Syntactic | Node | String | book | syntax-view wg-view |
The `lang` feature indicates the language of the text, using ISO 639 language codes.
Value | Description | Frequency |
---|---|---|
el | Greek | 27 |
The current data source contains the value `el` (from Ελληνικά), which is the ISO 639-1 code for Modern Greek (after 1453). The more accurate language indicator would be `grc`, the ISO 639-2 code for Ancient Greek (to 1453).
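Consumers who need the more accurate code can remap the stored value when reading it. The following is a minimal sketch, not part of the dataset or of Text-Fabric; the name `normalize_lang` and the mapping table `LANG_CORRECTIONS` are assumptions made for illustration.

```python
# Hypothetical helper: remap the language value stored in the dataset
# to the code for Ancient Greek. Not part of Text-Fabric or the dataset.
LANG_CORRECTIONS = {
    "el": "grc",  # dataset value (Modern Greek) -> Ancient Greek
}


def normalize_lang(value):
    """Return the corrected language code, or the value unchanged."""
    return LANG_CORRECTIONS.get(value, value)


if __name__ == "__main__":
    print(normalize_lang("el"))
    print(normalize_lang("grc"))
```

Values without an entry in the table pass through untouched, so the helper stays safe to apply if the data source is corrected later.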
Since this feature is only available for `book` nodes, it cannot be used to identify the language of individual words. However, querying the feature `morph` identifies a limited set of (transliterated) Hebrew words in the Greek text, as in the following snippet:
Query = '''
phrase
morph~HEBR*
'''
Results = A.search(Query)
To select Aramaic words, replace ‘HEBR’ with ‘ARAM’ in the above query.
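In Text-Fabric search templates, the `~` operator matches a feature value against a regular expression rather than a shell-style wildcard, so in `HEBR*` the `*` applies only to the preceding `R`. The sketch below illustrates that behaviour with Python's `re` module alone. The sample tag values and the helper `matching_tags` are assumptions chosen for illustration; they are not taken from the dataset.

```python
import re

# Illustrative tag values only; the real values of `morph` may differ.
sample_tags = ["HEB", "HEBR", "ARAM", "N-NSM", "V-PAI-3S"]


def matching_tags(pattern, tags):
    """Return the tags in which the regular expression is found."""
    regex = re.compile(pattern)
    return [tag for tag in tags if regex.search(tag)]


# "HEBR*" means "HEB" followed by zero or more "R" characters.
print(matching_tags("HEBR*", sample_tags))
# "ARAM*" means "ARA" followed by zero or more "M" characters.
print(matching_tags("ARAM*", sample_tags))
```

If an exact tag is wanted instead of a pattern, a plain equality constraint in the search template would be the stricter choice.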
This feature is derived from the XML attribute `lang` of the tag `book`.
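As a rough illustration of that derivation, the sketch below reads a `lang` attribute from a `book` element using Python's standard library. The XML fragment and the function `book_lang` are hypothetical; the real source files contain more structure, and the actual conversion code may work differently.

```python
import xml.etree.ElementTree as ET

# Hypothetical fragment; the real source XML has more attributes and content.
SAMPLE_XML = '<book lang="el"></book>'


def book_lang(xml_text):
    """Return the value of the lang attribute on a <book> element, if any."""
    root = ET.fromstring(xml_text)
    # The <book> element may be the root itself or nested deeper.
    book = root if root.tag == "book" else root.find(".//book")
    if book is None:
        return None
    return book.get("lang")


print(book_lang(SAMPLE_XML))
```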