N1904-TF

Text-Fabric dataset of the Greek New Testament, based on the Nestle 1904 (7th printing) edition.

About this dataset
Transcription
Featureset
Optional features
Viewtypes
Textformats
Syntaxtrees
Tutorial
Latest release

Nestle 1904 GNT - About Text-Fabric

Text-Fabric is a powerful Python library and framework designed to facilitate the analysis and manipulation of large-scale textual data, particularly in the context of ancient languages and biblical texts. It provides a comprehensive set of tools for processing and querying structured text data efficiently. Text-Fabric was developed by Dirk Roorda. The software package is accessible at https://github.com/annotation/text-fabric.

Data model

Text-Fabric, as its name suggests, employs a ‘warp and weft’ concept inspired by textile weaving. The ‘warp’ represents foundational structured data, like words, phrases, and clauses, depicted as nodes, while the ‘weft’ adds layers of information—features like lexical, morphological, or syntactic data. By weaving these features with the nodes, TF maintains a clear separation between structure and content.

TF’s data model is structured as a graph of nodes, each identified by a node type (a string) and a corpus wide unique sequence number (an integer). Nodes support two feature types: ‘node features’ providing information about individual nodes, while ‘edge features’ define relationships or links between nodes. Each feature’s data is stored in separate plain text files, resulting in a flat data structure. This design enables multiple formats for displaying corpus text and allows unlimited annotation depth.

The TF Python package provides an Application Programming Interface (API) for accessing and manipulating corpus-specific data, making it easy to integrate with other analytical tools. Given Python’s prominence in Biblical and Digital Humanities research, TF is a fitting choice. Users can load additional feature sets, provided they align at the node level, allowing for queries beyond existing annotations. For example, this TF dataset was enhanced with features like Bible Online Learner (BibleOL) data and Aland’s synoptic parallel data, enabling more comprehensive analysis.

Functions

The main functionalities of Text-Fabric include:

Detailed information regarding Text-Fabric can be found at: