DTA

German text archive

Info

Applicant: Professor Dr. Wolfgang Klein
Subject Area: Allgemeine und Vergleichende Sprachwissenschaft, Experimentelle Linguistik, Typologie, Außereuropäische Sprachen
Term: 2007 to 2017
Project identifier: Deutsche Forschungsgemeinschaft (DFG) - Projektnummer 37149321
Institution: Berlin-Brandenburgische Akademie der Wissenschaften (BBAW)

Description

The German Text Archive indexes, stores and provides a cross-disciplinary and cross-genre collection of German-language texts. At the center is the core corpus with around 1500 titles, which forms the basis for a reference corpus of Modern High German.

The special feature of the core corpus is:

  • the balanced selection of texts,
  • the publication period spans from the 17th century to the early 20th century,
  • the full-text digitization of first editions,
  • the transcription is carried out while preserving the language status,
  • text structuring on the basis of DTABf (TEI-XML).

Examples

Examples from the holdings of the German Text Archive