Introduction to Formats and Standards

One important factor in providing a tagged Bible translation for free use in Bible apps and programs in this modern computer age, is that the data must be readily accessible in convenient forms. Hence, Freely-Given.org has invested in designing Bible databases and file formats that themselves are offered to the Bible software world as a gift to use in whatever ways meets your needs.

Putting that a different way, the OET attempts to lead and set new directions not just in the ways the Bible text is translated and how it’s presented to the user, but also how it’s saved on computers and linked to helpful databases. And we are 'splitters' rather than 'lumpers' because we try to provide detailed information, even if it’s only discarded later.

Most existing Bible translations in the world are developed using the USFM file format. USFM includes features for marking titles and introductions, chapters and verses, section headings and paragraphs, footnotes and cross-references, and even for marking words that the translators have added, and for marking the words of Jesus (for things like red-letter Bibles). However, we want to go further to do things like:

easily annotating footnotes into classes, e.g., text-critical comments, historical comments, comments on figurative language, etc.
not just marking added words, but putting the reason they were added or changed, e.g., required by English grammar, implied by the previous sentence or context, implied by knowledge of historical context, etc.
not just marking Jesus' words, but annotating every speech segment, including the narrator or editor.

ESFM Bible files

Enhanced Standard Format Marker (ESFM) files are based on that USFM 3 format used by most Bible translators but add the ability to include metadata (such as names of translators for that particular file) and plentiful datasets linked to each Bible word by means of associated tables.

Note that we also provide a script to convert our ESFM back to USFM (by stripping out most of the additional information).

Note also (while we're discussing file formats for Bible software), most current Bible programs and apps weren't designed to handle a Readers’ Version and a very Literal Version that are intended to always be used together. We have a dedicated cross-platform Bible app in early development that includes a JSON version of our ESFM files.

OET extensions

Add character fields

ESFM and USFM both have a \add character field. To that, we insert (after the space) a single character (that would not normally be expected to be part of the start of an English word) inside the added field, to indicate the reason why a word or words were added or changed in the English translation.

+ We had to add an article (like 'a' or 'the' or 'some') for the English to make grammatical sense
= We had to add a copula (like 'is') for the English to make grammatical sense
< We had to add a direct object (like 'it' after 'he hit') for the English to make grammatical sense
> We had to add an implied person or object (like 'thing' after 'he took one', or 'person' after 'one' in 'one took the bread') for the English to make sense
≡ We chose to repeat something that was elided (like replacing 'pursued King A and King B' with the longer 'pursued King A and pursued King B' where doing so makes the English easier to read and understand
& We had to add an owner (like replacing 'the' with the possessive 'his' in 'raised the hands') for the English to make sense or to be more natural (like in 'his only son' in John 3:16)
@ We changed a pronoun to a name (like replacing pronoun 'he' with the name 'Adam') for English fluency, or following a section heading in order to help readers who jump into the text at that point (OET-RV only)
* We changed a name to a pronoun (like replacing 'Adam' with the pronoun 'he') for English fluency
# We changed the number, mostly changing singular sayings to plural (like replacing 'The poor man, he cries…' with 'Poor people, they cry…') in order to generalise the saying for modern readers, and often to remove any wrong suggestion that it might only apply to males (OET-RV only)
% We changed the person, mostly changing direct speech to indirect (like replacing 'God said, I will…' with 'God said that he will…'), often in order to reduce the number of nested levels of direct speech for modern readers (OET-RV only)
^ We changed a saying to its opposite (like replacing 'the doorway is not closed' with 'the door is always open') in order to make something more natural for modern readers (OET-RV only)
≈ We reworded (single word, or phrase, or clause) (like replacing 'said' with 'answered', or replacing 'let them go' with 'release them') to make the English style more modern and easy-to-understand (OET-RV only)
Note: This is different from when this symbol is used for Hebrew parallelism (see separate section below).
? We were unable to determine what the author intended to communicate,
PLUS, if any of the above are preceded by a question-mark '?', then it means that a considerable amount of doubt is involved, i.e., the intent of the original author is unclear and our best attempt might actually be wrong. (Often in those cases, we'll simply leave the word-for-word literal text in the Readers’ Version where it’s unclear, but in some cases, even that isn't at all understandable.)

Note: There are four special characters (≈, ^, →, and ⇔) that can be used at the very beginning of a line, i.e., these are OUTSIDE of \add markers so are quite different from the above (see below).

Footnotes and cross-references

We are planning to annotate the type of all of our notes to enable some notes to be added or removed for specific online or printed OET editions. Plus we plan/hope to get specialists to write additional footnote and cross-reference sets, e.g., for a Muslim or a Jew reading this English translation, or for a specialist archeological edition, etc.

For annotating footnote types, we are currently testing the addition of a code at the beginning of the initial \ft field, e.g., 'TC: ' to indicate a note on textual criticism. More to come…

Hebrew parallelism

Hebrew poetry doesn't use rhyming like a lot of English poetry does, but (like English), it does often use shortish lines and often these come in pairs or doublets.

The OET-RV marks the second line of doublets with synonymous parallelism with the ≈ character (commonly used in mathematics for 'approximately equal') to alert the reader when we consider that the author seems to be repeating a similar thought in the second line of a doublet. (This character will usually follow the space after a \q1 or \q2 field.)

Similarly, we use the ^ character for antithetic parallelism (saying the opposite in the second line of the doublet), and the → character for synthetic parallelism (coming to the result or conclusion in the second line of the doublet.)

For example, we might code something like:
\q1 God is my hope.
\q1 ≈Yahweh gives me comfort.
\q1 Judah will prosper,
\q1 ^but Israel will be taken into exile.
\q1 I called to Yahweh for help
\q1 →and he answered my prayer.

Note: This is different from when the ≈ and ^ characters are used with the \add character field (see above).

Other

Automated software is used to 'connect' words in the OET-RV with words from the OET-LV that it’s translated from, e.g., we might change 'Speak to' to 'Tell', or the verb phrase 'stored up' to the noun 'savings'. Usually that software assumes that we roughly translated the verse from beginning to end as it tries to match up words.

However, sometimes it’s more natural in English to completely reverse the ordering, e.g., a simplified example from Num 36:10: "Just as YHWH commanded, so the girls did." becomes "The girls did just as Yahweh had commanded." In order to inform our software that we swapped the first and second halves of the verse, we place a ⇔ character at the beginning of the OET-RV verse line. (Of course, this character should be removed/ignored by most presentation software and will be removed in our exported USFM files.)

Bible Books Codes

When writing any Bible software, one of the first requirements is for a list of Bible books codes . It's common to use GEN or Gen or Gn or gen for Genesis, but what about Exodus? EXO or Exo or exo or EX or Ex or ex or EXOD or Exod or exod? The BibleOrgSys Bible books codes:

are always exactly three characters and always UPPERCASE, e.g., GEN, EXO
always start with a letter, e.g., KI1, PE2
include codes for 'books' beyond the sixty-six in the modern Western protestant tradition, e.g., TOB, LAO
include codes for unusual cases like SAM (which would be SA1 and SA2 since we're no longer restricted by how big or heavy a single scroll can be—the revolutionary, new OET always considers traditions and whether or not they still need to apply)

Note that the OET Explorers’ Edition changes the traditional protestant Christian 'book' order and number of 'books' (unlike the Conventional Edition). We don't feel at all bad about that, because Christians already changed both the number of 'books' and the traditional Jewish order from the Hebrew scriptures.