Closing the Gap

in Non-Latin-Script Data

187 items

A Digital Synopsis of the Mishnah and Tosefta

2015
2018
eng

The aim of this project is to provide scholars and students of early rabbinic literature with digital tools to study and determine relationships between two central texts of rabbinic Judaism: the Mishnah (200 CE) and the Tosefta (250 CE) which share the same overall structure, the parallel organization, similar language and terminology and yet their relationship is tantalizingly problematic. The Synopsis Project will build upon and test computational methods for identifying “text reuse”; it will develop strategies and tools for presentation and visualization of overlapping texts which will provide access to the parallel and interrelated traditions of both texts.

computational_text_analysisjewish_studiesmachine_learningnlpreligion_studiestext_reuse

Last edit: 2024-10-31

A Digital Typology of Arabic Documents

2003
eng

The latest phase of the project Arabic Papyrology Database (APD). Its aim is to create digital diplomatic charts for structuring formulaic and layout features of Arabic documents.

arabic_studiescorpus_outputdigital_preservationlexiconmanuscripttranslation

Last edit: 2024-10-31

A Literary History of Medicine Online

2013
2017
eng

A Literary History of Medicine Online offers a digital edition of The ‘Uyūn al-anbā’ fī ṭabaqāt al-aṭibbā’ written by famous Syrian physician Ibn Abī Uṣaybi‘ah (d. 1270) along with its annotated English translation.

annotationarabic_studiesclassical_textdigital_editiondigital_preservationhistory_studiesliterary_studiesmanuscriptmedicine_historyscience_historytranslationvre

Last edit: 2024-10-31

Ada Lovelace Center for Digital Humanities

2021
deu ,
eng

The main objective of Ada Lovelace Center for Digital Humanities is to connect and promote all activities in the Digital Humanities taking place not only at the FU but also at the national and international level. By providing a social and communicative infrastructure for collaboration (co-working hours), developing interdisciplinary lab culture within the humanities (ADA.Labs) and bringing together all projects, teaching and training programs in the Digital Humanities at the FU it enhances and coordinates the transfer of project-based, personal expertise into a sustainable, interdisciplinary knowledge management.

collaboration_platform

Last edit: 2024-10-31

Afrikanische Stimmen in islamischen Manuskripten aus Mali: Erschließung und Erforschung afrikanischer Sprachen in arabischer Schrift (Ajami)

2017
2029
deu ,
eng ,
fra

This project is dedicated to research on manuscripts produced in African languages written in Arabic script (Ajami). The main focus of the study is the collection of rescued manuscripts form Timbuktu currently hosted in Bamako. The project aims at their historical and linguistic analyses, in-depth cataloguing, and establishment of a foundation for the interdisciplinary field of Ajami Studies.

african_studiescatalogdigital_preservationislamic_studieslinguisticsmanuscript

Last edit: 2024-10-31

Al-Furqan E-Database

2013
eng ,
ara

The Al-Furqan Digital Library offers extensive access to a rich collection of Islamic manuscripts and scholarly works. It serves as a vital resource for researchers in the field of digital humanities by providing online access to rare texts, fostering the preservation of cultural heritage, and supporting academic study through searchable digital content.

catalogdigital_preservationgeographyislamic_studieslanguage_documentationmanuscript

Last edit: 2024-10-31

al-Ṯurayyā Project

2013
eng

This project aims to reconstruct the geospatial characteristics of the early Islamic world by creating a digital gazetteer with includes over 2,000 toponyms, their descriptions from Arabic sources and technical information which is used for placing them on the map. The visualization is enriched by a geospatial model which plots routes and itineraries as well as networks of reachable places from selected centers, allowing to find paths between two or more locations.

geodatageographyislamic_studiesvisualization

Last edit: 2024-10-31

Altägyptisches Wörterbuch

1992
2012
deu

Altägyptisches Wörterbuch is a digital corpus of religious, juristic and administrative texts form ancient Egypt which were originally found in temples, tombs or on papyrus documents and which cover over 3000 years of Egyptian language history. The corpus together with a vast documentation of vocabulary and the ways it was used has been made available by open access and is updated regularly.

annotationcorpus_resourceegyptologylexicontranslation

Last edit: 2024-10-31

APSearch

2021
ara ,
eng ,
deu ,
fra ,
heb ,
spa

APSearch is a cataloge of musical recordings from the MENA region dating from the beginning of the 20th century until present. It provides a sophisticated search engine which enables to filter the recordings by a variety of categories.

arabic_studiescatalogdigital_preservationislamic_studiesmusic_studies

Last edit: 2024-10-31

Arabic and Latin Corpus

2016
eng

The project is a digital collection of medieval Arabic-Latin translations to offer a deeper insight into the Arabic influence on Europe in the 10th to 14th centuries AD.

arabic_studiescorpus_outputdigital_preservationmanuscriptscience_historytranslationtranslation_studies

Last edit: 2024-10-31

Arabic and Latin Glossary

2005
2013
2013
eng

The Arabic and Latin Glossary is a dictionary of medieval Arabic–Latin translations and corresponds to the project “Arabic and Latin Corpus” in terms of examining the Arabic influence on medieval Europe.

arabic_studiescorpus_resourcelexiconscience_historytranslationtranslation_studies

Last edit: 2024-10-31

Arabic Literature Cosmopolitan

2020
2027
deu ,
eng

Arabic Literature Cosmopolitan and AnonymClassic jointly are the first ever comprehensive study of Kalila and Dimna, one of the most important text of premodern world literature which passed from Hinduism and Buddhism via Islam to Christianity. Its Arabic versions constitute a source of all further translations into European languages and are also the main focus of the project, which analyzes the transmission processes of the work from India to Europe with Arabic as cultural bridge.

arabic_studiesclassical_textdigital_editiondigital_preservationliterary_studiesmanuscripttranslationvre

Last edit: 2024-10-31

Arabic Papyrology Database

2003
eng

The Arabic Papyrology database is a digital compilation of pre-modern Arabic manuscripts written between 7th and 16th century CE. It comprises a total of 13286 documents which can be search by their metadata and content.

arabic_studiescorpus_outputdigital_preservationlexiconmanuscripttranslation

Last edit: 2024-10-31

Arbeitsstelle Kleine Fächer

2005
deu

This project focuses on “small subjects” or less known fields of study in German universities. Its main objective it to collect data about the current situation of small subjects in Germany as well as its documentation.

research_infrastructure

Last edit: 2024-10-31

Archaeological Atlas of Coptic Literature by PATHs

2015
eng

The project aims to map and contextualize the distribution of Coptic literary texts across Egypt, focusing on their archaeological and manuscript traditions.

digital_preservationvisualizationgeodatageographycatalogmanuscript

Last edit: 2024-10-31

ARCHE

2017

ARCHE is a digital archive designed specifically to preserve digital research data from the humanities created in or related to Austria. It also promotes the use of open access and open data policies.

research_data_managementresearch_infrastructure

Last edit: 2024-11-15

ARCHES

2019
2024
eng

The main objective of this project is the identification and cataloguing of Arabic and Judaeo-Arabic manuscripts related to Bible exegesis and translation, known as the Firkovitch Collection. The series contains approximately 2700 manuscripts dating back to the Golden Age of Arabic lands (between 9th and 13th centuries), which provide an extraordinary window to the intellectual and religious life of the Jewish communities living in those areas in that time. The metadata of the manuscripts will be made accessible for further research.

arabic_studiescatalogdigital_preservationjewish_studiesmanuscriptreligion_studiesreligious_text

Last edit: 2024-10-31

Atlas of the Languages of Iran

2015
2023
eng ,
fas

The overall goal of the ALI research programme is to enable work toward a systematic understanding of the language situation in Iran. This initiative is guided by a set of interrelated themes which are: language distribution, linguistic and areal typology, language classification and language documentation. The output of the project can be found online in form of interactive maps and visualizations.

geodatairanian_studieslanguage_documentationlinguisticsmapothervisualization

Last edit: 2024-11-15

Audition Certificates Platform

2019
2023
eng ,
ara

The ACP transcribes audition certificates in Arabic manuscripts and links them to the digital reproductions as far as they are available. These certificates, found in manuscript books, provide rich historical data on authorized text transmissions, and the ACP's digital efforts aim to systematically explore libraries like Staatsbibliothek Berlin, Forschungsbibliothek Gotha, The Bibliothèque nationale de France, and the Syrian National Library to create a searchable corpus, offering unique insights into Middle Eastern social, economic, and urban history, as well as biographical studies.

annotationcorpus_outputdigital_preservationgeodatahistory_studiesislamic_studiesmanuscriptmap

Last edit: 2024-10-31

Automatic Collation for Diversifying Corpora

2021
2023
eng

This project aims to significantly improve the accuracy of handwritten text recognition (HTR) for Arabic-script manuscripts. It will develop a collation tool to automatically create large amounts of training data from existing digital texts and manuscript images without time-consuming human annotation of individual manuscripts.

htrislamic_studiesmachine_learningocrvre

Last edit: 2024-10-31

Averroes Edition

2016
2041
deu ,
eng

The project’s main goal is to generate a trilingual critical edition of all commentaries by Ibn Rušd (Averroes) on Aristotle’s Physics, which are the synthesis of the late antique and Arabic reception of Greek philosophy. Next to the Arabic commentaries the edition will also contain their Latin and Hebrew translations.

arabic_studiesdigital_editiondigital_preservationjewish_studiesmanuscriptphilosophyscience_history

Last edit: 2024-10-31

Avestan Digital Archive

2006
eng

The ADA project seeks, on the one side, to find, to collect and to digitalize all the extant Avestan manuscripts. On the other hand, the ADA Project is developing a tool to provide all these manuscripts with indexes of the passages and to make them thus available on the web for researchers and for the general public. The electronic tool will allow an easy checking of all the manuscripts containing a concrete passage.

digital_editiondigital_preservationhistory_studiesiranian_studiesmanuscriptreligious_textvre

Last edit: 2024-10-31

BABMED - Babylonische Medizin

2013
2018
deu ,
eng

BabMed represents the first comprehensive study of ancient Babylonian medical science since the decipherment of the cuneiform writing system. It aims to make Babylonian medical texts available not only for the specialist, but for the wider public as well by publishing them in form of thematic online corpora. Its second objective is to study medicine in the Babylonian Talmud, in order to explore comparisons with later medical traditions from the same region in Aramaic, another language of ancient Mesopotamia.

corpus_outputdigital_preservationepigraphyhistory_studiesinscriptionmedicine_historyscience_historysemitic_studies

Last edit: 2024-10-31

Bamberg Lexical Database for Contemporary Iranian Languages

2016
eng

The aim of this project is the sub-classification of Iranian languages which has proven to be a particularly recalcitrant problem in historical linguistics. BLDCIL will complement and extend existing scholarship by applying a novel phylogenetic methodology to this old problem. The research will be based on a compilation of standardized lexical data sets, together with sound files, from a representative set of Iranian languages, focusing initially on the West Iranian languages.

audiocorpus_resourceiranian_studieslanguage_documentationlinguistics

Last edit: 2024-11-15

Barlaam and Josaphat in Arabic Literature and Culture

2017
eng

The focus of this project lies on Arabic manuscripts of Barlaam and Josaphat, an outstanding work of world literature, and the rich history of its transmission in Arabic sources. In the course of the project the manuscripts will be published online in an open-access research platform ediarum developed at the Berlin Brandenburg Academy of Sciences and Humanities (Berlin, Germany). Along with the photocopies of the manuscripts, the platform will feature full transcriptions of the Arabic texts, an English translation of selected manuscripts as well as analytical metadata and bibliography. This will result in the creation of a comprehensive research tool on the history of Barlaam and Josaphat in Arabic literature.

arabic_studiesclassical_textdigital_editiondigital_preservationliterary_studiesmanuscripttranslation

Last edit: 2024-10-31

Belegwörterbuch der sabäischen Sprache

2012
2024
deu ,
eng

This long-term project is dedicated to Sabaic language and aims to create a comprehensive dictionary based on a complete database of all known Sabaic inscriptions covering the period from 14th century BC to 6th century AD. The results of the project are available in an open-access online database which is fully searchable for Sabaic roots, words and forms.

corpus_resourcelexiconsemitic_studiestranslation

Last edit: 2024-10-31

Beta maṣāḥǝft Manuscripts of Ethiopia and Eritrea

2016
2040
deu ,
eng

The objective of this long-term project is to create a digital XML-based platform on Ethiopian and Eritrean manuscripts. It will provide its users not only with the textual content of the documents but also with all the available information on the documents themselves: authors, owners, scribes, provenance, various physical elements, colophons, notes, etc. The interoperability of the platform will allow search across all existing repositories. The incorporated manuscripts come both from western libraries as well as form Ethiopia and Eritrea.

african_studiesdigital_editiondigital_preservationmanuscriptteivre

Last edit: 2024-10-31

Biblia Arabica: The Bible in Arabic among Jews, Christians and Muslims

2012
2018
eng

Biblia Arabica serves as a research consortium with focus on Arabic translations of Hebrew, Aramaic, Syriac, Greek and Coptic Bible versions. It examines their stylistic and didactic features, vocabulary, scripts and strategies, as well as the mutual religious and cultural influences between the different religious communities, which were reinforced by the translation movements. The project attempts to set methodological standards on this field of study, charts countless manuscripts from across the globe and analyzes different method of translation form the various source languages.

arabic_studiesreligion_studiesreligious_textteitranslationtranslation_studies

Last edit: 2024-11-15

Bibliotheca Arabica

2018
2035
deu ,
eng

This project is dedicated to research on Arabic literatures dating from 1150 to 1850, a period perceived as a steady cultural decline of the region. The project counters this narrative by focusing on manuscripts that were produced and read throughout this time. Next to publication of 16 volumes and numerous articles and working papers, the project will provide a bio-bibliographical research platform that integrates digitized catalogue data and collections of manuscript notes, as well as a curated compilation of bio-bibliographical sources and Linked Data collections.

arabic_studiescatalogdigital_preservationliterary_studiesmanuscript

Last edit: 2024-10-31

Borno and Old Kanembu Islamic Manuscripts

2005
2007
2009
2011
2012
2015
eng

This digital collection provides access to one of the earliest written sub-Saharan languages in manuscript form - Old Kanembu written in Arabic script was a language of Qur’anic interpretation in the ancient Borno Sultanate. The digitalized corpus contains more than 5000 folios of Old Kanembu Islamic manuscripts from Nigeria, Niger and Republic of Chad and is an outcome of three research projects conducted in SOAS University of London and University Hamburg.

african_studiescorpus_outputdigital_preservationmanuscript

Last edit: 2024-10-31

Change Projekt - Wandel@FU-Bib

2019
2021
deu

The Change Project’s objective was to reorganize and optimize library infrastructure at FU Berlin. Within the two years of its activity the project’s team worked at improving connection and communication between university libraries and CeDiS (Centre for Digital Systems), simplifying structures, enhancing user orientation and service quality as well as increasing general efficiency and ability to manage future challenges.

research_infrastructure

Last edit: 2024-10-31

CLARIAH-DE

2019
2021
deu ,
eng

CLARIAH-DE is the merger of the two research infrastructure networks CLARIN-D and DARIAH-DE. Over a period of two years, the project will merge the two established research infrastructures. In this way, researchers from the Humanities and cultural sciences will be able to work with complex digital tools and special data sets in a much easier manner.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

CLARIN - European Research Infrastructure for Language Resources and Technology

2012
eng

CLARIN (Common Language Resources and Technology Infrastructure) is a pan-European infrastructure which provides access to digital language data and advanced tools to discover, annotate, analyze or combine this data via a network of European repositories. The infrastructure mainly serves for purposes of social sciences and humanities.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

CLARIN-D - European Research Infrastructure for Language Resources and Technology

2012
deu ,
eng

CLARIN-D is a research-oriented infrastructure for the Humanities and Social Sciences which supports researchers with accessing, preparing and analyzing of research data based on all language-related resources. It offers information on a wide range of topics, including teaching material, help on data management plans and other, discipline-specific support and makes available tools for Humanities, Cultural and Social Sciences scholars to analyze their research data in different ways. It is a German center of a pan-European research infrastructure CLARIN.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

Colophons in Sumerian and Akkadian Literary Manuscripts from 3rd and 2nd Millennium BCE Mesopotamia

2019
2025
eng

The focus of this project are colophons of Sumerian and Akkadian manuscripts dating back to 3rd and 2nd millennium BC, which until now have been neglected in manuscript studies. The objective is to create a database containing a catalogue of respective manuscripts and an edition of the extant colophons. It should serve as solid base for the study of scribal practices in an ancient Mesopotamia and the relevance of colophons in its cultural history.

corpus_outputdigital_preservationepigraphyinscriptionsemitic_studies

Last edit: 2024-10-31

Communities of Knowledge: Interreligious Networks of Scholars in Ibn Abi Usaybi‘a’s “History of the Physicians”

2018
2021
deu ,
eng

This project’s objective is to digitally examine the social encounters of Christian, Muslim, and Jewish scholars in the Abbasid Near East (750–1258) engaged in exchange of medical knowledge as described by Ibn Abi Usaybi‘a (d. 1270), a Muslim physician from Syria in his history of medicine. The project will digitally record these interactions and use network analysis in order to identify the persons most frequently involved in scholar exchange and compare the results with representations from other sources.

arabic_studiesbiographical_datageodatamapnetwork_analysisscience_historyteivisualization

Last edit: 2024-10-31

Compilation and critical edition of pre-19th century Kurmanji Kurdish

2014
2016
eng

The main purpose of this project is to lay a foundation for serious academic research on the history of Kurdish, which until now is severely hampered due to the lack of written attestation prior to the 15th century. The project will thus compile a selection of 10 Kurdish texts from prior to 1800, transliterated in a standardized format and supplied with English translations and an authoritative critical apparatus. The texts will also be made fully accessible as a digital corpus, accompanied by a concordance, and the resulting two volumes will be published on the open-access portal of the University of Bamberg.

corpus_outputdigital_preservationiranian_studieslinguisticsmanuscripttranslation

Last edit: 2024-10-31

Computational Literary Studies Infrastructure (CLS INFRA)

2021
2025
eng

This project is a partnership of various European research institutions to build a shared resource of high-quality data, tools and knowledge to aid new approaches to studying literature in the digital age. It aims to create an infrastructure which will align the diverse resources of Europe’s multilingual and interconnected literary heritage with each other, with the tools needed to interrogate them, and with a widened base of users.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

Coptic OCR

2018
eng

The aim of this project is to improve the quality and accessibility of OCR software for Coptic texts and their different fonds using Machine Learning technologies.

egyptologymachine_learningocr

Last edit: 2024-10-31

Coranica

2011
2014
deu ,
eng ,
fra

The focus of Coranica Project is to understand the early development and canonization of the Qur’an by giving priority to the empirical approach (analyses of material evidence like manuscripts), rather than to the data of the Arabo-Islamic tradition. The project consists of two components: the first, “Manuscripta et testimonia coranica,” aims at producing catalogued inventories and studies of the earliest manuscripts of the Qur’an. The second part, Glossarium Coranicum, will provide focused analysis and documentation on the languages (spoken and written) and religions that were practiced in Arabia at the time of Muhammad, and on their impact on the Qur’anic lexicon.

collaboration_platformislamic_studieslexiconreligion_studiesreligious_text

Last edit: 2024-10-31

Corpus Avesticum Berolinense

2018
2030
eng

The CAB project represents the first attempt to edit the entire Avestan Zoroastrian textual corpus within its ritual context, and to produce and offer both printed and digital editions of the texts. It will also develop a set of tools for an assisted reading of these texts, including translations, grammatical and semantic analysis of words, etc., which will be accessible also for non-specialist readers. Amongst these tools, the project will offer a digital version of the standard dictionary of the Avestan language (Bartholomae’s Altiranisches Woerterbuch).

digital_editiondigital_preservationiranian_studiesmanuscriptreligious_textvre

Last edit: 2024-10-31

Corpus Coranicum

2007
2024
deu ,
eng ,
fra

Corpus Coranicum develops a catalogue of early Qur’anic manuscripts and texts of the Islamic scholarly tradition, which are all published online (open access) by using TEI standards. It offers not only transcriptions of the manuscripts in original languages and their German translations, but also images of originals, variant readings and commentaries explaining their literary, cultural and historical context. The project focuses on the systematic documentation of sources related to the Qur’an with the objective to reconstruct the development of the first Muslim community as the interaction between the Prophet and the first addressees in Mecca and Medina.

catalogdigital_preservationislamic_studiesmanuscriptreligion_studiesreligious_textteitranslation

Last edit: 2024-10-31

Corpus dei Manoscritti Copti Letterari

1968
1980
1980
1990
2011
eng

The CMCL project digitally reconstructs manuscripts, critically edits Coptic texts, and maintains a comprehensive electronic database encompassing authors, works, manuscripts, electronic editions, computational grammar, and a bibliography in Egyptology, Coptology, and Christian Orient.

annotationcorpus_outputdigital_preservationegyptologylinguisticsmanuscript

Last edit: 2024-10-31

Corpus der arabischen und syrischen Gnomologien

2009
2011
2023
deu ,
eng

The corpus represents an assemblage of proverbs, aphorisms, and sagacious declarations sourced from the literary traditions of Arabic and Syriac. Additionally, it encompasses Greek texts, facilitating the examination of knowledge transmission across these three distinct cultural spheres.

corpus_outputdigital_preservationliterary_studiessemitic_studiestei

Last edit: 2024-10-31

Corpus Masoreticum

2018
2021
2018
2030
deu ,
eng

The long-term project Corups Masoreticum is dedicated to the Masorah of the Hebrew Bible – a corpus of annotations regarding rules of orthography, syntax, vocalization, cantillation etc. of the Biblical Text. This project is the first ever scientific engagement with the Western European (Ashkenazic) Masorah tradition between the 11th and 13th centuries. The philological work is integrated into a digital work environment BIMA, which was develop for the need of the project, and will result in an openly accessible digital edition of Ashkenazic Masorah.

annotationdigital_editiondigital_preservationjewish_studiesmanuscriptreligion_studiesreligious_text

Last edit: 2024-11-08

Corpus Musicae Ottomanicae

2015
2027
deu ,
eng ,
tur

This long-term project is dedicated to the transcription and critical editing of nineteenth-century sources of Ottoman music written in Hampartsum and staff notations, as well as the accompanying song texts. The end results of the project will be made available through an online platform which will provide access to both music and text editions, as well as the source catalogue, which is a comprehensive database of printed, manuscript, and online sources. The materials of the edited corpus represent the multi-ethnic and multi-religious environment of the region and period.

arabic_studiescorpus_outputdigital_preservationmanuscriptmusic_studiesvre

Last edit: 2024-10-31

Corpus of Arabic Legal Documents

2022
2024
eng

CALD is a research tool developed for the study of Arabic legal documents from the 7th to 16th centuries C.E. It provides access to over 3,000 historical documents, facilitating the study of Islamic law. The platform offers a sample of these documents along with analytical tools, and users can create accounts to access images and resources.

arabic_studiescorpus_outputdigital_editiondigital_preservationhistory_studiesislamic_studiesmanuscriptvre

Last edit: 2024-10-31

Corpus of South Arabian Inscriptions

2011
2016
eng

The CSAI (Corpus of South Arabian Inscriptions) is a significant collection of over 15,000 epigraphic inscriptions documenting South Arabian culture from the late 2nd millennium BC to the 6th century AD.

archaeologycorpus_outputdigital_preservationepigraphygeodatainscriptionmappre_islamic_arabia

Last edit: 2024-11-15

CorpusBuilder

2017
2020

The objective of this project is to develop a robust and user-friendly OCR pipeline for Islamic texts.

islamic_studiesmachine_learningocr

Last edit: 2024-10-31

Country of Words: Reading and Reception of Palestinian Literature from 1948 to the Present

2018
2023

PalREAD is a project dedicated to Palestinian literature from 1948 to the present which aims to find new ways to account for the literary production of a displaced population. It elaborates a new methodology of literary analysis that challenges and lies outside conventional conceptions of the nation-state and adopts an interdisciplinary and transmedial approach that embeds literary production in a wider cultural web of intersecting media and genres. The output of the project will be made accessible in the form of an online platform.

arabic_studiesliterary_studies

Last edit: 2024-10-31

Critical Edition of Ibn Bāğğa’s Commentary on Aristotle’s De generatione et corruptione

2016
2021
eng

This project focuses on Ibn Bāğğa’s Commentary on Aristotle’s “De generatione et corruption” which, eclipsed by the subsequent commentaries of Averroes, received scant attention, even though the importance of Ibn Bāğğa’s influence on the thought of the latter is undeniable. The objective of the project is a critical edition of Ibn Bāğğa’s commentaries, who not only follows Aristotle but also presents different examples and develops ideas of his own.

arabic_studiesdigital_editiondigital_preservationjewish_studiesmanuscriptphilosophyscience_history

Last edit: 2024-10-31

Critical Edition of Ibn Bāǧǧa’s Commentary on Aristotle’s Physics

2016
eng

Ibn Bāǧǧa’s commentaries provide an exposition of the content of Aristotle’s “Physics,” which is partly incomplete. They also had a decisive impact on Averroes’s commentaries on the same work. The project’s objective is a critical edition of Ibn Bāǧǧa’s commentaries, the study of their genesis and structure as well as their first ever presentation in form of a complete and reliable text.

arabic_studiesdigital_editiondigital_preservationjewish_studiesmanuscriptphilosophyscience_history

Last edit: 2024-10-31

Critical Edition of the Fragmentary Latin Translation of Averroes’s Commentary on De Partibus Animalium and De Generatione Animalium

2016
eng

The project aims at proving inside in Averroes’s lost commentary on Aristotle’s De partibus animalium and De generatione animalium by editing its Latin translation together with a study of work’s transmission into Hebrew.

arabic_studiesdigital_editiondigital_preservationjewish_studiesmanuscriptphilosophyscience_history

Last edit: 2024-11-15

Cuneiform Digital Library Initiative

2000
2003
2004
2006
2009
2011
2011
2013
eng

The main goal of this project it to make available through the internet the form and content of cuneiform inscriptions dating from the beginning of writing, ca. 3350 BC, until the end of the pre-Christian era. It already catalogued 320,00 of approximately more than 500,000 artefacts kept in public and private collections.

corpus_outputdigital_preservationepigraphyinscriptionsemitic_studiesvre

Last edit: 2024-10-31

Datenbankgestützte Erfassung, Erforschung und digitale Präsentation der Familienbibliothek Refaiya

2008
2013
ara ,
deu ,
eng

This project is dedicated to digital presentation, cataloguing, and research on the private Arabic-Islamic library of the Damascene Rifā‘ī family. The Refaiya library contains 488 carefully preserved volumes, which can be seen as an archive of a traditional Islamic culture of scholarship, knowledge, and books handed down over several centuries—until its purchase by Prussian Consul and Arabist Johann Gottfried Wetzstein in the 19th century. A trilingual database (English, German, and Arabic) will make the manuscripts accessible internationally, including in the Islamic world.

arabic_studiescatalogdigital_preservationmanuscript

Last edit: 2024-10-31

Datenrepositorium Manuskriptkulturen

2015
2019
deu ,
eng

This project was dedicated to the systematic securing and long-term use of all research data relevant for SFB (Sonderforschungsbereich for Manuscript Cultures in Asia, Africa and Europe). As result a centralized data repository was created for all the sub-projects of SBF synchronizing them with the administration and freeing them of the tasks of securing data and of developing specific tools. Next to the collection of all the data of SFB the created infrastructure helps to optimize manuscript research by making all manuscripts available in one place, offering the possibility to annotate documents, to write glossaries and to make results available to the public as well as by supporting the graduate college on using e-learning.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

Demarginalizing Medieval Africa: Images, Texts, and Identity in Early Solomonic Ethiopia

2021
2024
eng

The research object of the ITIESE project are manuscripts belonging to the literary tradition of the Ethiopian Orthodox Church, which until now unjustly remain at the periphery of academic interest. The project’s intention is to shed new light on the cultural heritage of Ethiopian Empire by examining manuscripts illuminations made by medieval Ethiopian artists with focus on the cultural, religious and socio-political significance of their activities. The project will furthermore set up a platform for exchange between scholars working on the history of manuscripts illumination.

african_studiesart_studiescollaboration_platformimage

Last edit: 2024-10-31

Der Bibeltext der Handschriften mit babylonisch-jemenitischer Punktation: Erfassung der Textvarianten

2015
2020
deu ,
eng

This project is dedicated to the complete collection and computerization of textual variants of the Biblical manuscripts of the Babylonian and Yemenite tradition. The variants will be compered with other ancient textual witnesses and classified in their historical context. The analyses of the hitherto unexplored material will open a new perspective for the reconstruction of the biblical text history.

digital_preservationjewish_studiesmanuscriptreligion_studiesreligious_textvre

Last edit: 2024-11-15

Dhimmis and Muslims - Analysing Multi-Religious Spaces in the Medieval Muslim World

2017
2021
deu ,
eng

This project is dedicated to the study of religious diversity in medieval Muslim cities. The main purpose is to create a comprehensive database containing geographical information about religious communities compiled from various medieval sources and to make the data accessible via interactive visualization as well as a pilot web-based geo-temporal multi-view system.

geodatahistory_studiesislamic_studiesmapreligion_studiesvisualization

Last edit: 2024-10-31

Die Großbotschaften Damian Hugo von Virmonts und Ibrahim Paschas (1719/20)

2020
deu ,
eng

The aim of this project is the edition of all Habsburg and Ottoman sources documenting the diplomatic mission of Damian Hugo of Virmont and Ibrahim Pasha 1719/20, which are letters reports, travelogues, invoices, pictures, maps, artifacts, leaflets, pamphlets and newspapers. Sources written in Ottoman Turkish are translated into English and the editing work is based on a dynamic indexing process, according to which sources and metadata are put online even before the historical-critical editing is fully completed.

annotationdigital_editiondigital_preservationmanuscripttranslationturkologyvre

Last edit: 2024-10-31

Die Schreiben Selims II. an Maximilian II. und Mustafa Paşa (1566–1574)

2020
deu ,
eng

The purpose if this project was to edit documents from the MA thesis “Die Schreiben Selims II. aus dem Haus-, Hof- und Staatsarchiv zu Wien und die habsburgisch-osmanischen Beziehungen zwischen 1566 und 1574,” Vienna 2020, written by Halit Temiz and supervised by Univ.-Prof. Dr. Claudia Römer. They were converted into TEI-XML within the framework of the project Digital Edition of Sources on Habsburg-Ottoman Diplomacy 1500-1918 (QhoD), including all apparatus, and are available here under free license as an XML corpus.

annotationdigital_editiondigital_preservationmanuscripttranslationturkologyvre

Last edit: 2024-10-31

Digital Archive for the Study of pre-Islamic Arabian Inscriptions

2011
2016
eng

DASI seeks to gather all known pre-Islamic Arabian epigraphic material into a comprehensive online database, with the aim to make available to specialists and to the broader public a wide array of documents often underestimated because of their difficulty of access.

archaeologycorpus_outputdigital_preservationepigraphygeodatainscriptionmappre_islamic_arabia

Last edit: 2024-11-15

Digital Corpus for Greco-Arabic Studies

2010
2013
eng

The Digital Corpus for Graeco-Arabic Studies was created in the course of a collaborative project at Harvard and Tufts University and contains ca. 230 texts, three fifths of which are Greek originals and the rest are Arabic translations. In addition, the platform offers a list of authors and works included in the corpus that can be filtered by language, subject and text type and a search interface to search for Arabic or Greek words and phrases.

arabic_studiescorpus_outputdigital_preservationscience_historytranslationtranslation_studiesvre

Last edit: 2024-10-31

Digital Research Infrastructure for the Arts and Humanities (DARIAH)

2006
eng

DARIAH is a pan-European research infrastructure with 20 Members, 1 Observer and several Cooperating Partners in six non-member countries. It aims to enhance and support digitally-enabled research and teaching across the arts and humanities. DARIAH is a network of people, expertise, information, knowledge, content, methods, tools and technologies from its member countries. It develops, maintains and operates an infrastructure in support of ICT-based research practices and sustains researchers in using them to build, analyze and interpret digital resources.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

Digital Research Infrastructure for the Arts and Humanities (DARIAH-DE)

2011
deu ,
eng

DARIAH-DE is a member of the pan-European research infrastructure for the arts and humanities. It supports research in the humanities and cultural studies that work with digital methods and procedures with a research infrastructure consisting of four pillars: teaching, research, research data and technical components.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

Digital Sirah Project

2019
2021
eng

This project combines traditional methods of textual discovery and machine learning technologies with the goal of developing algorithms that are capable to locate reuse of Sirah (Biography) of the Prophet Muhammad, written by Muhammad b. Ishaq (d. 150/767) in a 1.5 billion word corpus of classical Arabic texts.

classical_textcomputational_text_analysisislamic_studiesmachine_learningnlpreligion_studiestext_reuse

Last edit: 2024-10-31

Digitale Gesamtedition und Übersetzung des koptisch-sahidischen Alten Testaments

2015
deu ,
eng

In this long-term project, the objective is to digitally reconstruct the Coptic Old Testament composed in an extinct Coptic dialect, Sahidic. The vast majority of manuscripts of this text are scattered across institutions in Europe and North America, which until now made a complete edition impossible. Equipped with new technologies of digital scholarship, the project will aim at overcoming this challenge by recreating the Coptic Old Testament in the form of an online database, which will contain photos and descriptions of catalogued manuscripts or their fragments, as well as English, German, and Arabic translations. It will also develop a system of digital tools for research on Coptic literature.

digital_editiondigital_preservationegyptologymanuscriptreligious_textvre

Last edit: 2024-10-31

Digitalisierungsprojekt Translatio

2013
2014
eng ,
deu ,
fra

arabic_studiescatalogdigital_preservationhistory_studiesiranian_studiesturkology

Last edit: 2024-11-15

Documentation of Endangered Languages / Dokumentation Bedrohter Sprachen

2000
2011
eng ,
fra ,
ind ,
por ,
rus ,
spa

The DoBeS program was initiated in 2000 by the Volkswagen Foundation in order to document languages that are potentially in danger of becoming extinct within a few years' time. It consisted of one archiving team and several documentation teams recording language data through fieldwork. The teams carried out 67 documentation projects collecting audio and video data in a close cooperation with the speech communities. The data are described with a set of standardized metadata categories and digitally archived according to open standards and made accessible. In addition, the archive ensures the long-term persistency of the digital material.

audiolanguage_documentationlinguisticsvideovre

Last edit: 2024-10-31

Documentation of Gorani, an endangered language of West Iran

2007
2012
eng

This project aims at documenting the endangered language and cultural practices of the Gorani speech community in western Iran. It will provide a collection of film recordings, highlighting the importance of musical performances in all areas of Gorani folklore and culture, annotated collections of religious texts, traditional folklore, oral narratives (mainly from recent history), and everyday speech, descriptive grammatical sketch of the Gawraju’i variety of Gorani and an electronic lexicon with multimedia applications.

annotationaudiocorpus_resourcedialectologyiranian_studieslanguage_documentationlexiconlinguisticsvideovisualizationvre

Last edit: 2024-10-31

Doha Historical Dictionary (معجم الدوحة التأريخي للغة العربية)

2013
2028
ara

This project aims to create a historical dictionary of Arabic language. The entries will follow changes in usage and meaning of each word from its first documented record until approximately 200 years AH / 820 AD (first phase of the project).

arabic_studieslexicographylexicon

Last edit: 2024-10-31

Edition des Gesamtwerkes von Ibn Nubātah al-Miṣrī

2020
2032
deu

Against the concept of the decline of Islamic cultures after the 11th century it becomes apparent that this period witnessed vast literal and cultural developments. One of these is a major impact of the most important poet of Mamluk era, Ibn Nubātah, on the Arabic literature for the centuries after him. The aim of this project is a complete digital edition of his diwan and twelve other works, most of with are available only in bad prints. Due to many revisions and rewritings made by the author the editing of his works is a great challenge. In order to represent his poetry correctly the edition will contain may versions of the poems enabling their comparison.

arabic_studiesdigital_editiondigital_preservationlexiconliterary_studiesmanuscriptvre

Last edit: 2024-10-31

eScriptorium

2018
eng

This project provides digital recognition of handwritten documents using machine learning techniques. It offers an integrated set of tools to transcribe, annotate, translate and publish historical documents written in Arabic script.

annotationhtrislamic_studiesmachine_learningnlpocrtranslationvre

Last edit: 2024-10-31

Ethio-SPaRe: Cultural Heritage of Christian Ethiopia. Salvation, Preservation, Research

2009
2015
eng

The Ethio-SPaRe project focused on preserving and analyzing manuscripts in Ethiopian churches and monasteries, particularly in the Tigray region. With a mission to safeguard Ethiopia's rich Christian heritage, the project aimed to identify, protect, digitize, and catalog manuscripts, creating searchable databases and virtual libraries while conducting extensive research into Ethiopian literature and oral traditions. The urgency stemmed from the imminent extinction of a substantial part of this unique cultural heritage.

african_studiescorpus_outputdigital_preservationmanuscript

Last edit: 2024-10-31

European Lexicographic Infrastructure

2018
2022
eng

ELEXIS is a cooperation project carried out by 17 European partner institutions that aims to develop an infrastructure which will foster cooperation and knowledge exchange between different research communities in lexicography in order to bridge the gap between lesser-resourced languages and those for which advanced experience in e-lexicography exists. ELEXIS will introduce common standards of data management for lexicographic resources and dictionary databases and develop tools and infrastructure which will enhance and harmonize cooperation between European institutions, publishers, universities and communities have been developing dictionaries and/or dictionary data.

lexicographyresearch_data_managementresearch_infrastructure

Last edit: 2024-10-31

FDNext

2020
2023
deu ,
eng

In this project six universities in Berlin and Brandenburg cooperate and exchange their expertise to optimize and improve Research Data Management (RDM) systems in their institutions. The focus of the project lies on the following aspects: 1) establishment of a contact point for legal questions concerning RDM (First-Level-Support), 2) fostering the qualifications of researchers by extending the offer on RDM training, 3) development of subject-specific RDM strategies and project-based RDM policies, 4) analyses and establishment of suitable services and central facilities as well as 5) enhancing the interoperability with external infrastructures and international cooperation.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

From Translation to Creation: Changes in Ethiopic Style and Lexicon from Late Antiquity to the Middle Ages

2014
2019
eng

The TraCes meticulously analyzed lexical, morphological, and stylistic features in Ethiopic texts, creating a groundbreaking digital corpus. The project's highlight is a comprehensive digital lexicon of the Gəʿəz language, departing from traditional dictionaries, providing complete lemmas, full translations, and contextual insights, resulting in the first-ever thesaurus for Gəʿəz.

african_studiesannotationcorpus_outputdigital_preservationlexicographylexiconlinguisticsteivre

Last edit: 2024-10-31

GenderVarianten — Revisionen von Genderkonstruktionen in Textüberlieferungen

2023
2026
deu ,
grc ,
eng

The project combines New Testament textual criticism, analyses of the reception of gender constructions, information science research, and systematic-ethical investigations. The central aim of the project is to analyze how digitally supported research can provide interdisciplinary gender studies with new opportunities for analyzing the transmission of texts and uncovering associated understandings of gender.

computational_text_analysiscorpus_outputcultural_studiesliterary_studiesmachine_learningnlpreligious_texttei

Last edit: 2024-10-31

Genesis Rabbah Loanwords

2018
2022
deu ,
eng

This project investigates the predominantly Greek borrowed vocabulary in the Midrash Genesis Rabbah, the earliest rabbinic commentary on the Book of Genesis written in Rabbinic Hebrew and Aramaic. It aims to compile a dictionary in TEI format and the annotation of the loanwords in the corpus for the investigation of issues pertaining to sociolinguistics as well as corpus linguistics.

annotationcorpus_resourcejewish_studieslexiconlinguisticstei

Last edit: 2024-11-15

Glossarion

2020
2024
deu ,
eng

This project’s objective is to develop an integrative database for the results of the research project “Verses and Sayings” (Verse und Sprüche).

research_infrastructure

Last edit: 2024-10-31

Glossarium Graeco-Arabicum

1994
2006
2010
2015
eng

Glossarium Græco-Arabicum serves as a database which makes available the files of a lexical project, intended to open up the lexicon of the medieval Arabic translations from the Greek in all the major disciplines of philosophy and the sciences. It provides information on vocabulary and syntax of Classical and Middle Arabic and Greek, on the development of scientific and technical vocabulary and on lexical consequences of the translation movement form Greek to Arabic. The sources of the projects are mainly complied form earlier glossaries, printed editions and manuscripts.

arabic_studiescorpus_resourcelexicographylexiconscience_historyteitranslationtranslation_studiesvre

Last edit: 2024-10-31

Grasping the World in Ottoman Istanbul: Visual Negotiation and Reading in Practices in Geographic Literature from the Sixteenth Century

2019
2022
eng

This project examines how geographical literature in sixteenth-century Istanbul was impacted by the vast cultural and political expiation of the Ottoman Empire in that time. As the city became a new center for scholarship and art, the elites began circulating Arabic and Persian geographical manuscripts. The project asks how this exchange was received by artists and cartographers who in turn negotiated and transformed geographic knowledge through images. Besides that, the project will also build a comprehensive online database of geographic manuscripts and their illustrations.

corpus_outputdigital_preservationgeographyislamic_studiesmanuscript

Last edit: 2024-10-31

Gumar Corpus

2018
eng

The aim of the GUMAR project is to automatically annotate a 100M-word Gulf Arabic corpus and develop gold standard data annotations for 200,000 words from eight novels, improving tools for Gulf Arabic automatic analysis.

annotationarabic_studiescorpus_outputdialectologymachine_learningnlp

Last edit: 2024-10-31

Handschriftenportal

2018
2021
deu ,
eng

This collaborative project carried out by four German libraries (see parent projects) will develop an online platform to provide metadata, descriptive information and images of all manuscripts held at German institutions, as well as a user-friendly workspace for manuscript research. It will enable researchers to access and work on data presented in a standardized and coherent way from one central access point.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

Hannah Arendt Digital

2020
deu ,
eng

The objective of this long-term project is a first ever complete hybrid (print and online) edition of all Arendt’s published and unpublished works in their original languages. The digital edition offers commentaries and additional information about each text in the corpus as well as diverse search options. The project enables a new approach to the development of stile and thought of the author and her practice of writing between and across languages and audiences.

digital_editiondigital_preservationliterary_studiesvre

Last edit: 2024-10-31

Hethitologie Portal Mainz

2001
2007
deu ,
eng

This concluded project is an infrastructure for digital publication of cuneiform documents. Its aim is to modernize and foster the development of hittitological research by building an openly accessible database for its founding and elaborating new standardized way of their documentation. Currently the HPM infrastructure hosts data of many external researchers and projects form the field of Hittitology.

corpus_outputdigital_preservationdigital_publishingepigraphyinscription

Last edit: 2024-10-31

Hill Museum & Manuscript Library

1965
eng

HMML is an enterprise of saving and cataloguing manuscripts from all across the globe. It’s history reaches back to the 1960s when first repository of manuscripts’ photographs in form of microfilms was initiated at the Saint John’s University. Nowadays HMML archives approximately 300,000 manuscripts containing more than 70 languages. In 2015 the library launched its online platform of digitalized manuscripts to make them available for public and further research.

catalogdigital_preservationmanuscript

Last edit: 2024-11-15

Historia de los Autores y Transmisores Andalusíes

1988
eng ,
spa

The project focus lies in the intellectual production of al-Andalus. It aims to compile a bio-bibliographical database of over 5,000 Andalusian scholars, detailing their works and contributions in chronological order.

arabic_studiesbiographical_datacatalogdigital_preservationislamic_studiesmanuscript

Last edit: 2024-10-31

Historical Middle East Data Alliance

2020

An informal alliance for exchanging research data related to the historical Middle East.

collaboration_platformresearch_data_management

Last edit: 2024-11-15

HUNAYNNET

2016
2021
deu ,
eng

This project is a first ever attempt to compile a digital trilingual and linguistically annotated parallel corpus of Greek classical scientific and philosophical literature and the Syriac and Arabic translations thereof. Besides an online lexicography the database is enhanced by a linguistic corpus management system providing advanced search tools which help to compare and analyze the texts in all three languages.

annotationarabic_studiescorpus_outputdigital_preservationmanuscriptscience_historysemitic_studiestranslationtranslation_studiesvre

Last edit: 2024-10-31

Irankoran

2017
2020
deu ,
eng ,
fra

This project is dedicated to the study of early Qur’anic manuscripts from Iran with the central question of which kind of deviations in the text of the Quran, as it was fixed in the 10th century, are historically verifiable. The project will develop an online catalogue of all Qur’anic manuscripts held in the National Museum of Iran and other collections, create transliterations of Arabic text from selected pages and date the parchment with radiocarbon measurement. All these steps will be followed by analysis of the selected texts in order to verify the deviation from the canonical readings.

digital_editiondigital_preservationislamic_studiesmanuscriptreligion_studiesreligious_texttranslation

Last edit: 2024-10-31

Islamic Painted Page

2013
eng

Islamic Painted Page is an online database of Islamicate arts of the book, which originated from a personal project of Stephen Serpell and now turned to major international recourse with over 42,000 references to paintings, illuminations and bindings from over 270 collections around the globe. Its functions are twofold: firstly, it enables users to locate and compare works worldwide using a single database, displaying images wherever possible; and secondly, it signposts users onward to more authoritative sources, with hotlinks direct to the specific image pages of collection websites where available, and page-specific references for printed publications. The database is currently hosted by the University Hamburg.

art_studiescatalogdigital_preservationimageislamic_studiesmanuscript

Last edit: 2024-10-31

Islamic Scientific Manuscripts Initiative

2006
2019
eng ,
ara

ISMI is a catalogue that includes an object-relational database storing information on the rational scienes of scientific, philosophical and theological traditions of Islam before 1900 CE. The database aims to provide access to Islamicate authors, their works, and extant manuscript witnesses of mathematical sciences

arabic_studiesbiographical_datacatalogdigital_preservationimageislamic_studiesmanuscriptscience_history

Last edit: 2024-10-31

Isnalyser

2019
eng

For scholars studying ḥadīth texts, drawing an isnād tree with more than 40 transmitters is a tedious task, and finding the right medium to display it fully can even prove to be impossible. isnalyser is a simple program for the automation of drawing isnād trees and for their customizable display in handy formats.

hadithislamic_studiesisnadreligion_studies

Last edit: 2024-10-31

Jewish Book Culture in the Islamicate World

2020
2023
eng

The aim of the project is to study Jewish books, their makers and their readers in the medieval eastern Mediterranean. JBC draws on a large corpus of manuscripts produced in Egypt and the Near East in the ninth to thirteenth centuries and enrols multidisciplinary methods of codicology, palaeography and general book history to study books as objects and as essential actors in the transmission of texts, practices and ideas.

corpus_outputdigital_preservationhistory_studiesjewish_studiesmanuscriptreligious_texttranslation

Last edit: 2024-11-15

Kalila and Dimna – AnonymClassic

2018
2022
deu ,
eng

AnonymClassic is the first ever comprehensive study of Kalila and Dimna, one of the most important text of premodern world literature which passed from Hinduism and Buddhism via Islam to Christianity. Its Arabic versions constitute a source of all further translations into European languages and are also the main focus of the project, which analyzes the transmission processes of the work from India to Europe with Arabic as cultural bridge.

arabic_studiesclassical_textdigital_editiondigital_preservationliterary_studiesmanuscripttranslationvre

Last edit: 2024-10-31

Kallimachos

2014
2017
2017
2019
deu

Kallimachos is a Digital Humanities center at the University of Würzburg. It was developed via a project by the same name funded between 2014 and 2019. The center supports humanities scholars in applying digital methods in their research by providing technical and social infrastructure, software, and prototypical workflows. The center also establishes an integral structure for research data management.

collaboration_platformresearch_data_managementresearch_infrastructure

Last edit: 2024-10-31

Keywords for the Study of the Arabic Literary and Manuscript Tradition

2024
2027
ara ,
eng

KALiMaT is the first bilingual, open-access Arabic-English online living handbook of short articles on historical terms from the pre-modern and early modern literary tradition. Each article addresses a lemma, providing analysis and contextualization that makes the historical term accessible. With its nuanced approach focused on the analysis of concepts from the Arabic tradition and the critical examination of Western and modern literary categories, as well as with its bilingual publication strategy in both English and Arabic, KALiMaT contributes to the decolonization of knowledge. The project will be hosted on the Open Encyclopedia System infrastructure offered by Freie Universität Berlin.

arabic_studiesdigital_publishingencyclopedialiterary_studies

Last edit: 2024-10-31

Knowledge, Information Technology, and the Arabic Book

2018
2023
eng

KITAB is a platform which enables exploration of Arabic texts in digital form. It also provides a digital toolbox and a forum for discussions about Arabic literature. The main research focus of the project lies in the question of how authors copied from previous works. The research is conducted on a corpus of texts created for the needs of the project and powered by technologies at the cutting edge of computer science. All documentation is openly accessible on GitHub.

annotationarabic_studiescollaboration_platformcomputational_text_analysiscorpus_resourceisnadliterary_studiesnetwork_analysisnlpocrtext_reusevisualizationvre

Last edit: 2024-10-31

KOHD Coptica

2016
2022
eng ,
deu

The KOHD Coptica project, part of the Cataloging of Oriental Manuscripts in Germany initiative, catalogs and digitizes Coptic manuscripts in the Berlin Papyrus Collection. The ongoing database development facilitates efficient searches and standardized presentations of metadata, with records gradually released as cataloging and digitization progress.

catalogdigital_preservationegyptologymanuscript

Last edit: 2024-10-31

KOHD Digital

2016
2022
deu ,
eng

KOHD Digital is a databank of KODH (Die Katalogisierung der Orientalischen Handschriften in Deutschland) – a research project that aims at systematizing not yet cataloged manuscripts from various German libraries. Previously all texts were published in printed catalogues but since 2016 they appear mostly online in KHOD Digital databank.

arabic_studiescatalogdigital_preservationiranian_studiesmanuscriptturkology

Last edit: 2024-10-31

KompetenzwerkD

2020
deu

KompetenzwerkD is a research center for the Digital Humanities for various extramural research institutes in Saxony (see the project’s website). The objectives is to enhance the cooperation and exchange between the local institutes, implementation of Open Data Policy promotion and further development of digital methods in humanistic research.

collaboration_platformresearch_data_managementresearch_infrastructure

Last edit: 2024-10-31

Linguistic dynamics in the Greater Tunis Area: a corpus-based approach (TUNICO)

2013
2016
eng

The focus of this project lies on the Arabic dialect spoken in the Tunisian capital. It investigates the linguistic dynamics caused by recent demographic changes in the metropolitan area of Tunis and its intention is to gather data through field recordings made with young speakers who were born in Tunis but descended from parents who arrived to the capital from other regions. Based on this data two digital language resources were created: a corpus of unmonitored speech that contains both conversations and narratives and a dictionary based on this corpus and on previously published resources.

annotationarabic_studiesaudiocorpus_resourcedialectologylanguage_documentationlexiconlinguisticsteitranslation

Last edit: 2024-11-15

Localizing 4000 Years of Cultural History. Texts and Scripts from Elephantine Island in Egypt

2015
2022
eng

The Elephantine Manuscripts Project employs cutting-edge digital methodologies, integrating disciplines like digital humanities, physics, and mathematics. A central element is the development of an open-access online research database, housing 80% previously unpublished manuscripts in ten different languages and scripts. The database provides unprecedented insights into the medical, religious, legal, administrative, and literary aspects of Elephantine's history spanning 4000 years.

annotationcorpus_outputdigital_preservationmanuscriptteitranslation

Last edit: 2024-10-31

Machine Translation and Automated Analysis of Cuneiform Language

2017
2021
eng

Combining the newest developments in machine learning with statistical and neural machine translation technologies, this project formulates, tests and evaluates methodologies to translate and analyze the contents of some 67,000 highly standardized administrative documents from southern Mesopotamia. All outcomes of the project, the methodologies, the translations and the data extracted from them, will be offered to the public in open access.

annotationcomputational_text_analysiscorpus_resourcelinguisticsmachine_learningnlpsemitic_studiestranslationtranslation_studies

Last edit: 2024-10-31

MAJLIS. The Transformation of Jewish Literature in Arabic in the Islamicate World

2021
2026
eng

This project is dedicated to the hitherto under-researched Judaeo-Arabic literature from the period between the 9th and the 11th century. The overall aim is to explore how the adoption of Arabic language and Arabic intellectual tradition influenced the Jewish literary creativity in this period. The project will proceed by applying state-of-the-art digital tools to analyze manuscripts, trace changes in the Jewish literature, identify the dominant scholars and to compare the Judaeo-Arabic literature with non-Jewish literary traditions of the time.

arabic_studiesliterary_studiesdigital_preservationjewish_studiescatalog

Last edit: 2024-11-15

Maparabia

2018
2023
eng

The objective of the project is to produce a set of tools with a common aim: to model territorial and cultural dynamics by cross-referencing the various available sources. These analyses yield an updated regional history from the beginning of the 1st millennium BCE to the 7th cent. CE.

arabic_studiesarchaeologyepigraphygeodatageographyinscription

Last edit: 2024-11-05

Memory Spaces: Mapping Oral History in Mosul

2023
2024
deu

The project aims to establish an online accessible digital platform where former and current residents of the Iraqi city of Mosul can contribute their personal and individual memories of their hometown. These memories can be inserted in the form of stories, images, and audio recordings, all placed on a digital and visually appealing map of Mosul and its surroundings (prior to 2014). After editorial processing and approval by a dedicated editorial team, these contributions will be accessible worldwide.

arabic_studiescultural_studiesdigital_preservationimagevideo

Last edit: 2024-11-15

Mesopotamian Royal Inscriptions

2000
eng

In this sub-project of the Cuneiform Digital Library Initiative (CDLI), the focus lies on the digitization of royal and monumental texts, which despite representing a small share of the overall CDLI catalogue (20,000 of 350 000 texts) assume a disproportionately large role in history, literature, and linguistics in the several sub-fields of cuneiform studies.

corpus_outputdigital_preservationepigraphyinscriptionsemitic_studiesvre

Last edit: 2024-10-31

Mesopotamian Seals

2000
eng

This sub-project of the Cuneiform Digital Library Initiative (CDLI) focuses explicitly on inscriptions on the Mesopotamian stamp and cylinder seals for which, despite their substantial role in the development of writing, the online resources are still difficult to be found. This project is offered to bring attention to the admittedly limited text annotation files of the CDLI as one of several avenues of research available in this field.

corpus_outputdigital_preservationepigraphyinscriptionsemitic_studiesvre

Last edit: 2024-10-31

Methoden der Bildverarbeitung zur Bestimmung von visuellen Manuskript- und Zeichenmerkmalen

2015
2019
deu ,
eng

This project is intended to support the research program of the SFB (Sonderforschungsbereich for Manuscript Cultures in Asia, Africa and Europe) by piloting and further development of the Advanced Manuscript Analysis Portal (AMAP), a web-based platform offering self-usable toolbox for humanists and facilitating communication between experts, as well as by development of innovative image processing methods.

imageresearch_infrastructure

Last edit: 2024-10-31

Methoden der Manuskriptanalyse zur Wiedergewinnung verlorener Schrift

2011
2019
deu ,
eng

This project supports the research program of the SFB (Sonderforschungsbereich for Manuscript Cultures in Asia, Africa and Europe) by making available and further development of modern technologies that help to recover scripts that are not longer readable for the human eye due to organic deterioration or conscious interventions.

research_infrastructure

Last edit: 2024-10-31

Middle East Librarians Association

1972
eng

MELA is an association of Middle East library specialists. Its mission is to facilitate communication and encourage cooperation among its members and Middle East libraries. It aims at improving area librarianship through the development of standards for the profession and education of Middle East librarians. It also compiles and disseminates information concerning Middle East libraries through various meeting and publications.

collaboration_platform

Last edit: 2024-10-31

Mouse & Manuscript

2020
eng ,
ara

The aim of the 'Mouse & Manuscript' project is to enhance teaching on codicology and Oriental manuscripts by utilizing digital tools to make Leiden University's manuscript collections more accessible. The project provides interactive lessons, leveraging digital humanities technology, such as an IIIF viewer, to offer detailed, zoomable access to manuscripts, while integrating questions and assignments to engage students and support further scholarly research.

arabic_studiese_learningislamic_studiesmanuscript

Last edit: 2024-10-31

Moving frontiers

2010
2010
eng

This project aims to create a dynamic reconstruction of historical events that revolved around the conflict between Tang China and the Arab Caliphate around 750, ultimately establishing a balance of power between the two empires in Central Asia. The methodology employed is multidisciplinary, combining historical and archaeological data related to frontier dynamics and mobility. These data will be integrated into theoretical network analysis models and Geographic Information Systems (GIS).

geodatahistory_studiesislamic_studiesmapnetwork_analysis

Last edit: 2024-10-31

Multi-Arabic Dialect Applications and Resources

2016
2019
eng

The project aimed at improving dialectal Arabic processing by developing resources for Arabic dialect modeling, including a 25-city multi-dialect lexicon and parallel corpus, as well as creating machine translation systems between dialects, English, and Standard Arabic, and advancing dialect identification systems capable of operating at different levels of granularity.

arabic_studiescorpus_outputdialectologylexiconmachine_learningnlptranslationtranslation_studies

Last edit: 2024-10-31

Multilingual Corpus of Annotated Spoken Texts

2000
2021
deu ,
eng

Multi-CAST is a collection of annotated texts from a typologically diverse selection of 15 languages including endangered ones. The corpus is freely accessible via an online platform which provides multiple levels of parallel annotations, time-aligned with audio recordings, chiefly monologic, natural narrative texts from 15 languages, encompassing roughly 25,000 clause units available in multiple file formats, including as EAF files for the linguistic annotation software ELAN, as XML and TSV files, and via the multicastR package for R.

audiocorpus_resourcelanguage_documentationlinguisticstranslation

Last edit: 2024-11-15

New Languages for NLP

2021
2022
eng

The practical aim of this project is to add so-called under-represented or low-resource languages to the library of spaCy. Ultimately, the more general goal consists in increasing the diversity of languages and trained models to help natural language processing reflect the linguistic diversity of our world.

literary_studiesmachine_learningnlp

Last edit: 2024-10-31

Nimrud NW Palace

2000
eng

This project is a collaboration between Cuneiform Digital Library Initiative (CDLI) and a German researcher Klaudia Englund and is dedicated to the digital preservation of the heritage of the Nimrud palace, which suffered severe damage at the hands of ISIS. The main goal of the project lies in preparing for display the image and text files that the researcher gathered in the course of her work on the Northwest Palace and the over 600 relief slabs mounted along the walls of the building’s main rooms.

archaeologycorpus_outputdigital_preservationepigraphyinscriptionsemitic_studiesvre

Last edit: 2024-10-31

NoMansLand: Nomads’ Manuscripts Landscape

2020
2026
deu ,
eng

The focus of this projects lies on Iranian and Central Asian societies in the period between the 13th and 15th century which witnessed nomadic conquest after the rise of Chinggis Khan. The process of transculturation and interaction between the sedentary Muslim conquered and nomadic conqueror is documented in largely unstudied manuscripts from that period, which will be explored and analyzed in the course of this project. The technical side of the project will be responsible for developing and implementing of an independent Analytical Database of Islamic Manuscripts which will serve both as a digital repository of manuscripts and an analytical tool that will allow users to do wider research on Islamic and Eurasian studies.

corpus_outputdigital_preservationislamic_studiesmanuscript

Last edit: 2024-11-15

Online Corpus of the Inscriptions of Ancient North Arabia

2013
2017
2024
eng

The Online Corpus of the Inscriptions of Ancient North Arabia aims to help transform our knowledge of the history, languages and cultures of ancient Arabia by creating a digital corpus of all known Ancient North Arabian inscriptions in North and Central Arabia, and elsewhere.

corpus_outputdigital_preservationepigraphygeodatainscriptionmappre_islamic_arabia

Last edit: 2024-11-15

Onomasticon Arabicum

1966
ara ,
eng

Onomasticon Arabicum (OA) is an Arabic biographical database created in the course of a long-living project reaching back to the 1960s. It contains more than 27000 entries about scholars and celebrities from the first Muslim millenary. Its searching tools are designed to allow the user separate interrogation on any of the different elements of the Arabo-Muslim names, dates and places, reconstructing the identity of a person, trace ways of knowledge transmission and frame historical contexts.

arabic_studiesbiographical_dataencyclopediageodatahistory_studiesmapnetwork_analysisvre

Last edit: 2024-10-31

Open Arabic Periodical Editions: A framework for bootstrapped digital scholarly editions outside the global north

2015
eng

Corpus of TEI/XML editions of Arabic periodicals, a static webview, and a framework of workflows for digital scholarly editions following the principles of minimal computing to address the affordances of the digital for cultural artefacts from the Global South

arabic_studiescorpus_outputdigital_preservationislamic_studies

Last edit: 2024-11-15

Open Encyclopedia System

2016
2020
deu ,
eng

OES is a standardized publishing platform for humanities and social sciences. Contrary to existing academic encyclopedias the OES taps into the full potential of e-publishing by adding new solutions to the already available options: it offers flexible workflows, configurable participation models, various options for customizing the peer review process and features that facilitate the engagement and management of a community.

digital_publishingresearch_infrastructure

Last edit: 2024-10-31

Open Islamicate Texts Initiative

2019
2021
eng

Open ITI is a multi-institutional project which aims to construct the first machine-actionable scholarly corpus of premodern Islamicate texts in Arabic, Persian and other languages of the Islamic world. This new textual infrastructure will enhance the implementation of new forms of macro textual and digital scholarship in the inter-related fields of Islamic, Persian, and Arabic Studies

annotationcorpus_outputdigital_preservationislamic_studiesmachine_learningmanuscriptnlpocr

Last edit: 2024-10-31

Open Islamicate Texts Initiative Arabic-script OCR Catalyst Project

2019
2021
eng

This project is led by an interdisciplinary team of humanities, computer science, and digital humanities co-principal investigators with the scope to challenge the technical and organizational barriers that historically have stymied the development of Arabic-script OCR and digital text production for Islamicate Studies. Its primary goal is to achieve approximately 97% character accuracy rates (CARs) for OCR on the most used Persian and Arabic print typefaces.

corpus_resourceislamic_studiesmachine_learningnlpocr

Last edit: 2024-10-31

OpenITI_NgramReader

2023
eng

The NgramReader analyzes and visualizes the chronological occurrence of words and phrases based on the OpenITI corpus data. It enables the combination of various morphological forms and the exploration of object classes.

arabic_studiescomputational_text_analysiscorpus_resourcehistory_studieslinguisticsvisualization

Last edit: 2024-10-31

Oracc: The Open Richly Annotated Cuneiform Corpus

2012
eng

Oracc emerged from the CDLI project. It is a is a collaborative effort to develop a complete multilingual corpus of cuneiform whose rich annotation and open licensing support the next generation of scholarly research.

corpus_outputdigital_preservationepigraphyhistory_studiesinscriptionsemitic_studiesvre

Last edit: 2024-11-15

Orient-Digital

2020
2023
deu

Orient-Digital is a union catalogue and a portal for oriental manuscripts kept in Germany. It will contain the metadata of Arabic, Persian, and Ottoman-Turkish manuscripts from more than 25 cooperating institutions and will provide links to the digitized manuscripts that are available online.

catalogdigital_preservationmanuscript

Last edit: 2024-10-31

Oxford Cambridge Islamic Manuscript Catalogue Online

2009
2011
eng

This project aims to improve electronic access to the UK's significant Islamic manuscript collections at Oxford and Cambridge universities. It plans to create 10,000 summary records using a TEI P5 schema for future detailed descriptions, making these resources accessible through Oxford University Library Services and potentially contributing to the European Manuscriptorum. Collaboration with the Islamic Manuscript Association and other initiatives is a key focus.

african_studiesarabic_studiescatalogdigital_preservationiranian_studiesislamic_studiesjewish_studiesmanuscriptsemitic_studiesteiturkology

Last edit: 2024-10-31

Paleocoran

2015
2018
deu ,
eng ,
fra

PALEOCORAN examines Qur’anic manuscripts that once belonged to the ‘Amr ibn al-‘Āṣ Mosque in al-Fusṭāṭ (Old Cairo) – one of the most significant Qur’anic Libraries – and are now scattered across the globe. The main objective of the project is to reconstruct the library and to make the manuscripts available digitally.

digital_editiondigital_preservationislamic_studiesmanuscriptreligious_texttranslationvre

Last edit: 2024-10-31

Patterns of Knowledge Circulation: The Transmission and Reception of Jewish Esoteric Knowledge in Manuscript and Print in Early Modern East-Central Europe (1500-1750)

2019
2024
deu ,
eng

This project aims to explore how Jewish esoteric knowledge circulated throughout history in multiple moves of transmission and reception that took place in various social, cultural, and material contexts in early modern East-Central Europe. To this end it will provide a systematic digital catalogue and quantitative analysis of esoteric Jewish texts produced and in circulation in East-Central Europe from 1500 to 1750. The catalogue will provide grounds for further analysis of transmission, reception, and movement of these texts and is designed to facilitate further qualitative studies of patterns of knowledge transmission based on quantitative analysis of relevant sources.

catalogdigital_preservationjewish_studiesmanuscriptreligion_studiesreligious_textvre

Last edit: 2024-10-31

Pilotprojekt zur datenbankgestützten Erschließung und digitalen Bereitstellung der neu erworbenen arabischen, persischen und türkischen Handschriften der Universitätsbibliothek Leipzig

2006
2008
ara ,
deu ,
eng

In this project 55 Arabic, Persian and Ottoman-Turkish manuscripts, that has been purchased by Leipzig University Library in 1995 and 1996, will be catalogued and digitalized. It is a pioneer project, as it will for the first time integrate Arabic script into a database that also features German and American transliteration systems. The manuscripts contain texts that cover an amazingly broad spectrum, which comprises almost all traditional Islamic fields of knowledge and the main scope of this project to provide scholars of Oriental Studies with access to this hitherto unknown pool of information.

catalogdigital_preservationislamic_studiesmanuscriptvre

Last edit: 2024-10-31

Post-predicate Elements in Iranian: Inheritance, Contact, and Information Structure

2019
2022
eng

This project is dedicated to the study of the syntax of Iranian languages. It will investigate selected Middle Iranian corpora, and unrelated contact languages, in order to reconstruct the genesis and historical development of Iranian syntax with focus on largely ignored OVX (object-verb-oblique) word order type. The project’s results will be made available on two platforms: The Word Order in Western Asia Corpus and The HamBam Corpus (see project websites).

annotationcorpus_resourceiranian_studieslanguage_documentationlinguisticstranslation

Last edit: 2024-11-15

Project for the digitisation and online presentation of Arabic, Persian, and Turkish manuscripts at Leipzig University Library

2012
2015
ara ,
deu ,
eng

In this project all the remaining Arabic, Persian and Turkish manuscripts at Leipzig University Library that have not been yet catalogued and digitized in previous projects (Refaiya and Pilotprojekt zur datenbankgestützten Erschließung und digitalen Bereitstellung der neu erworbenen arabischen, persischen und türkischen Handschriften der Universitätsbibliothek Leipzig) will be made accessible online until 2015.

catalogdigital_preservationislamic_studiesmanuscriptvre

Last edit: 2024-10-31

Prosopografía de los ulemas de al-Andalus

2006
2008
2010
2014
eng ,
spa

This project is dedicated to the prosopography and biography of Andalusi scholars. It sourced information about approximately 11,600 scholars from Arabic biographical dictionaries and organized into a database, in a way that enables taking advantage of possibilities offered by electronic data processing. It also offers a setup of the necessary tools to organize and filter information, handle the data with quantitative criteria and present the results of the analysis with a variety of visualizations.

arabic_studiesbiographical_dataislamic_studiesgeodatamapvisualization

Last edit: 2024-10-31

Ptolemaeus Arabus et Latinus

2013
2037
eng

Ptolemaeus Arabus et Latinus (PAL) is a long-term project dedicated to the edition and study of the Arabic and Latin versions of Ptolemy’s astronomical and astrological texts and related material. The texts are made available in an online database containing catalogues of Arabic and Latin works, images of manuscripts, searchable transcriptions, critically edited texts and a Greek-Arabic-Latin-English glossary of astronomical and astrological terms.

arabic_studiesdigital_editiondigital_preservationlexiconmanuscriptphilosophyscience_history

Last edit: 2024-10-31

Qalamos

2023
2026
deu ,
eng

Qalamos is the portal for manuscripts and block prints from Asian and African script traditions. It provides access to metadata and digitized copies of all of all collections kept in German memory institutions. In addition, it will create a research infrastructure that provides the international community with direct access to all relevant Oriental manuscript collections in Germany.

catalogdigital_preservationmanuscript

Last edit: 2024-10-31

QhoD – Digitale Edition von Quellen zur habsburgisch-osmanischen Diplomatie 1500–1918

2020
deu ,
eng

This project aims to provide a digital infrastructure to edit sources on Habsburg-Ottoman diplomacy from the beginning of diplomatic relations between the two empires at the turn of the 15th and 16th centuries to their demise at the end of World War I. The sources will be digitally processed on a historical-critical basis and made available in open access (CC-BY license). All forms of diplomatic relations and a large variety of source types are included.

annotationdigital_editiondigital_preservationmanuscripttranslationturkologyvre

Last edit: 2024-10-31

Qumran Digital: Text und Lexikon

2021
2033
deu

The main objective of this project it to elaborate a philological dictionary of a complete vocabulary the Dead Sea Scrolls, famous manuscripts contacting over 1000 biblical and non-biblical text in Hebrew and Aramaic languages dating back to period between 3 c. BC and 2 AD. The materials will be edited morphologically, etymologically and semantically and made openly accessible for public.

annotationcorpus_resourcejewish_studieslexicographylexiconsemitic_studies

Last edit: 2024-10-31

Reallexikon der Assyriologie und Vorderasiatischen Archäologie

1928
2018
deu ,
eng ,
fra

This project represents the most extensive publication in Ancient Near Eastern Studies, with its first fascicle published in 1928 and the last one in 2018. It is an encyclopedia whose entries cover names of locations, deities and kings, history, religion, literature, sciences, flora and fauna, material culture as well as art and architecture of the Ancient Near East. It is available as printed and digital edition, accessible either as PDF files or via an online List of Lemmata.

encyclopediahistory_studiessemitic_studies

Last edit: 2024-10-31

Romantic Period Poetry Archive

2020
2025
eng

The Romantic Period Poetry Archive, or RPPA for short, is an open access digital platform of global Romantic-period poetry. It comprises a full-text poetry database and an open scholarship platform. Over 600 poets from 70 countries will be included in the database by the end of the alpha phase of the project (2020-2025).

corpus_outputdigital_preservationliterary_studiestranslation

Last edit: 2024-10-31

Safeguarding the Manuscripts of Timbuktu

2013
2021
deu ,
eng ,
fra

The project aims to save the Timbuktu manuscripts by perserving, restoring, and researching the manuscripts. The manuscripts are considered to be the most important collections of the written heritage of the West African literary tradition.

african_studiescatalogdigital_preservationislamic_studiesmanuscript

Last edit: 2024-10-31

SaxFDM-Fokusprojekt: Publikationsdienst für wissenschaftliche Datenmodelle und Vokabulare

2022
2023
deu

The purpose of this project is to standardize and optimize Research Data Management (RDM) in Humanities. The expected output is a new RDM service, which will allow to publish and document different data models in a uniform way. The is a part of SaxFDM initiative dedicated to RDM optimization on a level of Saxony state.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

Scripta Qumranica Electronica: Eine Verbunddatenbank und virtuelle Forschungsumgebung für die Texte vom Toten Meer

2016
2021
eng

This project is dedicated to enhancing and standardizing the research on the Dead Sea Scrolls, famous manuscripts contacting over 1000 biblical and non-biblical text in Hebrew and Aramaic languages dating back to period between 3 c. BC and 2 AD. Next to the preparation of the pioneer Digital Scholarly Edition of Qumran Texts the main focus of the project lies on the creation of a standardized environment for collaborative production of Scrolls editions, advanced research tools and a robust dynamic interlinked database of texts, images and annotations.

annotationcatalogdigital_preservationjewish_studiesmanuscriptsemitic_studiesvre

Last edit: 2024-11-15

Scripta-PSL

2016
2019

Scripta-PSL aims to integrate the fundamental sciences of the written word (paleography, codicology, epigraphy, history of the book, etc.) and other social and human sciences (linguistics, history, anthropology, etc.) into the digital and computational humanities. It will develop machine vision tools (for handwriting recognition and document analysis, for example) and tools for (semi)-automatic classification of symbols and texts, with a focus on pre-typographical and/or non-alphabetic cultures. It will also provide additional inquiry into written traditions of rare languages and writings, which are currently underrepresented at the University.

annotationcomputational_text_analysiscorpus_resourcehtrmachine_learningocrvre

Last edit: 2024-10-31

Sefaria

2011
eng ,
heb

Sefaria in a non-profit organization which assembles a free-living library of Jewish texts and their interconnections, in Hebrew and in translation. The texts are available in a digital form with interactive interfaces allowing more people to engage with textual treasures and tradition.

corpus_outputdigital_preservationjewish_studiesreligious_textvre

Last edit: 2024-10-31

SfarData

1965
eng ,
heb

SfarData is a bilingual (Hebrew/English) codicological database of the dated medieval manuscripts written in Hebrew script documented in 250 libraries. It has a 50 years long history of digital preservation of bibliographical, textual, numerical, and paleographical data and was recently incorporated to the National Library of Israel.

catalogdigital_preservationjewish_studiesmanuscript

Last edit: 2024-10-31

SHARIAsource

2015
eng

SHARIAsource is a project dedicated to digitalization of Islamic legal studies. To this end, the platform offers a growing library of primary sources for Islamic law, which were collected from all around the globe through digitization, partnerships, and individual contributions. It also uses data science and AI to curate and build digital humanities tools that make the Portal’s Islamic law sources more accessible and useful, and to facilitate new research and analysis.

corpus_outputdigital_preservationhistory_studiesislamic_studies

Last edit: 2024-10-31

Shawi

2021
2025
deu ,
eng

This project focuses on the Arabic dialects of the so-called Shawi-type spoken by small cattle breeding Bedouin tribes in various regions of the Fertile Crescent. The main objective is to create a text corpus of this dialects based on audio recordings, to provide a comprehensive grammar and a digital dictionary. The basis of these contributions is new linguistic data gathered from the field from speakers living in the Harran-Urfa region of Southeastern Turkey, Syria and Iran.

arabic_studiesaudiocorpus_resourcedialectologylanguage_documentationlexiconlinguistics

Last edit: 2024-10-31

SIHAFA: The Ideosphere of the Late Ottoman Arabic Press: Mapping the Discursive Field of Authors and Texts through Computational Approaches

2023
2025
eng

SIHAFA explores the late Ottoman Arabic ideosphere of, employing a systematic and computational approach to study the periodical press as a discursive field. The project's methodology involves analyzing a digital corpus of seven Arabic journals through stylometric authorship attribution, social network analysis, and close reading. The aim is to fill a research gap, challenge established narratives of the Arabic Renaissance, and contribute to Arab Periodical Studies, emphasizing FAIR data principles and open access. Project outputs will include groundbreaking research published in English and Arabic, improved digital scholarly editions, authority files, an OCR model for Arabic periodicals, and a plain text corpus of authorship candidates.

arabic_studiescomputational_text_analysiscorpus_resourcehistory_studiesnetwork_analysisocrstylometry

Last edit: 2024-11-15

Simplification of Arabic Masterpieces for Extensive Reading

2016
2018

The aim of the SAMER project is to create standards and tools for simplifying modern Arabic fiction for school-age learners, including a readability scale, a curated readability list, a 36k-word thesaurus, and a simplification platform for Google Docs.

arabic_studiescorpus_outpute_learninglexiconmachine_learningnlp

Last edit: 2024-10-31

Streamlining Galen

2019
2024
eng

This project is dedicated to study of influence that Galen’s works had on Islamic medicine after their translation into Arabic between 8th and 10th century. The focus of the research lies on the Arabic summaries of Galen which served as primary learning tools for medical students and doctors. The main output of the project will be a comprehensive survey of Arabic Galenic summaries and a digital corpus of selected texts. Project members will analyze and compare them with each other and with the Galen treatises in Arabic translation that formed the basis of the summaries.

arabic_studiesdigital_editiondigital_preservationhistory_studiesmanuscriptmedicine_historyscience_history

Last edit: 2024-10-31

Strukturen und Transformationen des Wortschatzes der ägyptischen Sprache: Text- und Wissenskultur im Alten Ägypten

2013
2035
deu

The aim of the project is preparation of a diachronic corpus of Egyptian texts which, linked to a digital dictionary, enables online research of Coptic vocabulary throughout all epochs and stages of Egyptian language history. The results are accessible on Thesaurus Linguae Aegyptiae (TLA), a research and publishing platform developed by a previous BBAW project, “Altägyptisches Wörterbuch,” which expired 2012.

annotationcorpus_resourceegyptologylexicographylexicontranslationvre

Last edit: 2024-10-31

Study of Averroes’s Long Commentary on the Physics in Hebrew and Latin Transmission

2016
eng

This project is a preparation for the critical edition of Hebrew and Latin translations of Averroes’s Long Commentary on Aristotle’s Physics. While the Arabic original is lost, the aim of collation of its extant Latin and Hebrew translations is to reconstruct it.

arabic_studiesdigital_editiondigital_preservationjewish_studiesmanuscriptphilosophyscience_history

Last edit: 2024-10-31

Surayt-Aramaic Online Project

2014
2017
2017
2020
ara ,
deu ,
eng ,
fra ,
nld ,
swe ,
tur

This project is a second phase of Aramaic Online Project, whose team developed online courses for Surayt language in levels A1, A2 and B1. The aim of this phase is to develop courses for intermediary and advanced levels (B1-2, C1 and C2) and producing additional linguistic material for the teaching of Surayt, such as a “Reader” (digital text corpus) which will address the needs of more advanced learners.

corpus_resourcee_learningsemitic_studiestei

Last edit: 2024-10-31

Synoptische Edition des kabbalistischen Traktats Keter Shem Ṭov

2019
2022
deu ,
eng

The research object of this project is the approximately 700 years old Hebrew treatise Keter Shem Tov, which constitutes one of the most important introductory texts into the Kabbalah. The purpose of the project to prepare a critical printed edition of the text as well as its online, interactive and openly accessible version along with an English translation. To reach this objective the project contributes to the further development of an interactive, digital tool LERA.

digital_editiondigital_preservationjewish_studiesmanuscriptreligious_texttranslationvre

Last edit: 2024-10-31

Syriaca.org: The Syriac Reference Portal

2012

Syriaca.org is a portal intended to document and preserve Syriac cultural heritage. It serves as an online hub (cyber-infrastructure) to assist research in the field of Syriac studies and to compile and organize core data related to the study of Syriac sources. The publications of Syriaca.org combine emerging methods of the Digital Humanities with traditional scholarship in history and philology, new digital tools are created for widely disseminating this data and facilitating further research.

biographical_datacorpus_resourcegeodatamapsemitic_studiestranslationvisualization

Last edit: 2024-10-31

Text+

2021
2026
deu ,
eng

Text+ is a consortium with the objective to establish a national research data infrastructure. It will preserve text- and language-based research data in the long term and enable their broad use in science.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

Textdatenbank und Wörterbuch des Klassischen Maya

2014
eng ,
deu ,
spa

This project is centered on the analysis of all known hieroglyphic Mayan texts which will serve as the basis for the compilation and editing of a Classic Mayan language dictionary. It will employ machine-readable versions of the source materials in order to integrate all occurrences of the Maya hieroglyphs into a corpus-based database, together with information concerning the original hieroglyphic spelling, transcription, and translation, as well as supplementary information. It thus lays the foundation for a systematic understanding of the structure of the writing system and of Classic Maya, the standard language underlying the hieroglyphic script.

digital_preservationepigraphyinscriptionlexicontranslation

Last edit: 2024-10-31

TextGrid

2006
2009
2009
2012
2012
2015
eng ,
deu

Answering an increasing demand for digital and collective research features in the humanities, TextGrid has established the infrastructure for a respective virtual research environment. In continuous exchange with the scientific community, TextGrid has developed a variety of tools and services available for free download in a stable version. Together with the TextGrid Repository, the virtual research environment TextGrid offers scholars in the humanities sustainable editing, storing and publishing of their data in a thoroughly tested and safe environment.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

The Digital Corpus of the Nabataean and Developing Arabic Inscriptions

2022
eng

DiCoNab aims at recording in a database easily accessible online the Nabataean and Developing Arabic inscriptions from the various countries and regions where they have been discovered since the mid-nineteenth century, primarily Syria, Jordan, Saudi Arabia, Yemen, and Egypt.

archaeologycorpus_outputdigital_preservationepigraphyhistory_studiesinscriptionpre_islamic_arabia

Last edit: 2024-11-15

The Evolution of Islamic Societies (c. 600–1600 CE): Algorithmic Analysis into Social History

2021
2027
eng

This project examines the social history of the Islamic society from its birth until the 16th century using highly advanced computational methods of text analysis and data modeling. The core of the study is a corpus of Islamic text from that period containing approximately 300 titles, 100 million words, and c. 400,000 biographical records, which—for the first time—will be treated holistically as a unified corpus of historical information. One of the project’s outputs (next to two PhD theses, multiple articles and a monograph) will be an open and expandable online research ecosystem, MasterChronicle, which will allow scholars in the field to engage in various modes of close and distant reading of the Arabic historical corpus.

annotationcomputational_text_analysiscorpus_resourcegeodataislamic_studiesmapreligion_studiesvisualizationvre

Last edit: 2024-10-31

The formation of Islam

2009
2015
eng

The purpose of this project is the examination of vastly important but largely neglected papyri from Egypt which preserve a minutely detailed transcription of daily life, as well as the only contemporary records of Islam’s rise and first wave of conquests. Based on this testimony the project aims to write the history of the formation of Islam through the striking newness of Islamic society and its debt to the diverse cultures it superseded. Beside stocktaking of the unedited papyri, the project’s team will crate a corpus of most significant texts and present the historical analysis through scholarly publications and a dedicated website.

arabic_studiescorpus_outputdigital_preservationislamic_studiesmanuscriptreligion_studies

Last edit: 2024-10-31

The Hebrew Palaeography Album

2022
eng

The aim of this resource is to offer historians, textual scholars, manuscript students, and the general learned public accessible tools for the study of Hebrew scripts and handwritings from the Middle Ages and Early Modern period. It analyses and makes freely available samples of manuscripts, both books and documents, as well as epigraphic inscriptions in Hebrew and Judeo-languages from across Jewish communities.

corpus_outputdigital_preservationepigraphyhistory_studiesinscriptionjewish_studiesmanuscriptreligious_texttranslation

Last edit: 2024-10-31

The intertwined world of the oral and written transmission of sacred traditions in the Middle East

2020
2023
eng

The scope of this project is to study the convergence of the three major religions of the Middle East – Islam, Christianity, and Judaism – at their formative periods of development during the early Islamic centuries. Two examples of this convergence are the development of notation systems in Arabic and Syriac representing oral reading traditions, and the assimilation of the mode of transmission of the Hebrew Bible in the 10th and 11th centuries to that of the Qur’an. The project will conduct digital analysis of manuscripts of the Qur’an and the Syriac and Hebrew Bible, in order to contribute to our understanding of the individual sacred traditions and to identify parallels among them.

annotationarabic_studiescorpus_outputdigital_preservationjewish_studiesmanuscriptreligion_studiesreligious_textsemitic_studiesvre

Last edit: 2024-10-31

The Islamic Manuscript Association

2008
eng

The Islamic Manuscript Association is an international non-profit organization dedicated to protecting Islamic manuscript collections and supporting those who work with them. Its main goal is to enhance the preservation and accessibility of many Islamic manuscript collections around the world. To this end, it articulates standards and guidelines for best practice in cataloguing, conservation, digitization, offers courses for professionals and awards grants to advance scholarship on Islamic manuscripts.

collaboration_platformislamic_studies

Last edit: 2024-10-31

The Middle East Virtual Library

2016
eng ,
deu

The project acquires difficult to access literature from countries of the MENA region in their original languages, allowing them to be permanently available. The subjects of literature are numerous, access to the materials is either traditional interlibrary loan or through digitized materials with Open Access.

arabic_studiescatalogdigital_preservationiranian_studiesislamic_studiessemitic_studiesturkology

Last edit: 2024-10-31

The Monumental Inscriptions of Historic Cairo

2010
eng

The Project covers Islamic inscriptions that are found in a range of the most important pre-1800 historical monuments in Cairo. The project database includes 3300 inscriptions related to 509 monuments, written on different materials such as stone, plaster, wood, and metal, and in many types of script including Kufic, Thuluth, and Naskh.

arabic_studiescorpus_outputdigital_preservationepigraphyhistory_studiesinscriptionislamic_studies

Last edit: 2024-10-31

The Multimedia Yasna

2016
2022
eng

The project MUYA employs a range of methodologies from digital humanities to delve into the oral and written traditions of the Yasna, a core ritual in Zoroastrian religion. To enhance research on oral traditions, the project has created a subtitled and interactive film of the Yasna ritual. For written traditions, the project has developed an online platform featuring transcribed manuscripts, editorial tools, digital transcriptions, and digital editions complete with a text-critical apparatus. These digital materials are supplemented by print editions, translations, and commentaries on the Yasna.

digital_editiondigital_preservationiranian_studiesreligion_studiesreligious_textteivideovre

Last edit: 2024-10-31

The Yemeni Manuscript Digitization Initiative

2010
2013
eng

The Yemen Manuscript Digitization Initiative (YMDI) is a collaborative project between Princeton University Library and the Freie Universität, Berlin. YMDI’s mission is the preservation and dissemination of the Arabic manuscripts in the private libraries of Yemen. The scope of the project is threefold: 1. preservation of three private libraries in Sana‘a, a total of 236 manuscripts. 2. digitization of 12 additional manuscripts in the rare book collections of the State Library Berlin and the Princeton University Library. 3 creation of a freely accessible repository of Islamic manuscripts based on the digital-library infrastructure developed at Princeton.

arabic_studiescatalogdigital_preservationmanuscript

Last edit: 2024-10-31

Thesaurus Indogermanischer Text- und Sprachmaterialien

1986

The project seeks to organize the collaboration of numerous scholars in developing an electronic text collection. This collection will serve as a foundation for computer-aided investigations into diverse Indo-European languages. Additionally, the project aims to expand the initiative by generating a growing repository of files containing supplementary materials, including dictionaries, word lists, linguistic analysis tools, and similar resources.

annotationcorpus_outputdigital_preservationepigraphyhistory_studiesinscriptionlexiconmanuscriptvre

Last edit: 2024-10-31

Tierrecht und Strafen im 'Gesetz zur Abwehr der Dämonen' (Vidçvdâd, Kapitel 13–15)

2010
2013
eng

The main objective of this research project is to conclude a critical edition of the Avesta and Pahlavi versions of the Vīdēvdād chapters 13–15 which was started in the pre-project Vidēvdād chapters 5–8. The edition should be enriched by a complete translation of Avesta and Pahlavi texts, the completion of two glossaries as well as preparation of a commentary on linguistic and textual peculiarities and difficulties. The translated and edited chapters will be made available to the broader audience in a digital form.

digital_editiondigital_preservationhistory_studiesiranian_studiesmanuscriptreligious_texttranslation

Last edit: 2024-10-31

Tikkoun Sofrim

2018
eng ,
fra ,
heb

The purpose of this project is to make Medieval Hebrew manuscripts openly and freely available as texts. To this end it combines automatic Handwritten Text-Recognition (HTR) and Crowdsourcing. The HTR is performed with a tool called Kraken with an error rate of less than 10%. In the next step, the resulting texts are made publicly available for correction. The amendments are than used both to improve Kraken’s capabilities and to place digital editions and scripts in libraries that will allow a literal search of the manuscripts.

htrjewish_studiesmachine_learningnlpocr

Last edit: 2024-10-31

Time, Space and Relations in Hadith Literature: A Machine Learning Approach

2020
2025
eng

The aim of this project is to achieve a broader perspective on the dynamics of narrative adaptation in ḥadīth literature by applying advanced data analysis. We now have digitized databases as well as sophisticated and efficient algorithms, which were the missing factors for large-scale studies of ḥadīth literature. Using comprehensive datasets and recent algorithmic advances in text processing and machine learning models, we will map the use, reuse, and adaptations of ḥadīth narratives to better apprehend regional and general patterns within the whole ḥadīth literature.

computational_text_analysishadithislamic_studiesmachine_learningnlpreligion_studiestext_reuse

Last edit: 2024-10-31

Topoi. The Formation and Transformation of Space and Knowledge in Ancient Civilizations

2007
2012
2012
2019

The excellence cluster TOPOI is a research network with a focus on study of the ancient world. It enhances the interdisciplinary cooperation not only in different fields of humanities, but also between humanities and natural sciences. Its various research projects cover a wide range of topics, periods and cultures from Central and Eastern Europe to the Mediterranean and the Middle East.

cultural_studieshistory_studies

Last edit: 2024-10-31

TUNOCENT

2019
2023
eng

This project studies the hitherto almost unknown Arabic varieties spoken in parts of Tunisia. Firstly, the team members will record and collect the linguistic data through fieldwork. In the second phase of the project selected dialects will be sketched and two varieties representing the main dialectal areas described in detail. Finlay a digital and fully searchable corpus of transcribed and translated narrative and ethnographic texts and conversations will be developed. The project will build on an existing research infrastructure for Arabic variations, which has been established at the Austrian Academy of Sciences over the recent years such as VICAV, TUNICO and ARCHE.

arabic_studiesaudiocorpus_resourcedialectologygeodatalanguage_documentationlinguisticsmaptei

Last edit: 2024-11-15

Understanding Written Artefacts

2019
2025
eng

Understanding Written Artefacts is a Cluster of Excellence comprising over 50 research projects from 40 academic disciplines with an overall aim to develop a global framework for the study of all written artefacts from the beginning of writing to the present day and from all regions that have produced such artefacts. It combines approaches from the humanities and the natural sciences and its research focuses on different dimensions of written artefacts, such as their materiality, spatiality, content, and format.

history_studies

Last edit: 2024-10-31

Union Catalogue of Manuscripts from the Islamicate World

2011
eng

FIHRIST is an online catalog featuring diverse manuscripts from the Islamicate world held by UK libraries. It offers detailed descriptions of manuscripts spanning from the 7th to the 19th century CE, providing valuable resources for scholars and enthusiasts while also facilitating contributions from institutions and private collectors to enrich its content.

african_studiesarabic_studiescatalogdigital_preservationiranian_studiesislamic_studiesjewish_studiesmanuscriptsemitic_studiesteiturkology

Last edit: 2024-10-31

Verse und Sprueche: Impulse und Reichweiten wissenschaftlicher und popularisierender Diskurse im arabischen Sprachraum

2020
2024
deu ,
eng

This projects studies micro-texts in poetry and gnomic texts in Arabic and related traditions. Single verses and maxims and the changes they underwent in their literary and scientific contexts bear important information about the intercultural transfer of knowledge. The project team will build a database of these minuscule texts and analyze them in order to trace the interaction of the multiple factors and impulses of epistemic change and embed it in a broader context.

arabic_studiescorpus_outputdigital_preservation

Last edit: 2024-10-31

Vienna Corpus of Arabic Varieties

2013
2028
eng

This project has two main purposes: firstly, it serves as a virtual research platform for Arabic dialectology, secondly, as a test bed for newly developed text technological methodologies and tools. In doing so, it gathers varying digital language resources for a number of different localities and offers information on the research history, available literature, salient grammatical features etc. of the particular variety. VICAV also makes accessible bibliographies, dictionaries, glossaries, and different types of transcribed texts. The project aims at providing a platform of exchange for a scientific community which is increasingly producing digital data but still lacks the infrastructure to make it widely available.

annotationarabic_studiesdialectologygeodatalanguage_documentationlexiconlinguisticsmaptranslation

Last edit: 2024-11-15

Virtuelle Archive für die geisteswissenschaftliche Forschung

2017
2020
deu

In this project is a cooperation of seven research institutions in Saxony (see project’s website) coordinated by Sächsische Akademie der Wisschenschaften (SAW). The institutions will exchange their expertise in the Digital Humanities in order to optimize research data management in human sciences. The focus of this projects lies on technical, legal and formal standardization of research data, long term archiving and accessibility.

research_data_managementresearch_infrastructure

Last edit: 2024-10-31

Weaving the Humanities into the Web and the Web into the Humanities. Preserving the Cultural Heritage of Yemen by Creating an Universally Virtual Library of Manuscripts (HumanWeb)

2013
2014
eng

This POC project will demonstrate that humanistic sciences can be productively coupled with sound business strategies to advance societal benefits and cultural sustainability. The grant will be used to get into contact and contract with potential joint venture partners for a cooperation which will result in a ground-braking universal online digital library for Yemeni manuscripts.

arabic_studiesdigital_preservationmanuscript

Last edit: 2024-10-31

WIBARAB: What is Bedouin-Type Arabic?

2021
2026
deu ,
eng

This project is dedicated to the study of Bedouin- Arabic dialects from cross the whole Arabic-speaking region. The project will collect new data through fieldwork in Saudi Arabia, Jordan, Sudan and Morocco, among other countries and publish the outputs in a database that will allow for efficient cross-dialectal comparisons, particularly with regard to phonological, morphological, syntactical, phraseological and lexical features.

arabic_studiescorpus_resourcedialectologylanguage_documentationlinguistics

Last edit: 2024-10-31

World Scripts Explorer: one stop to explore, learn and experience any graphical information system

2021
2022

The project enables digital writing in all scripts and graphical information systems, allowing users to explore and create content using standard or custom keyboard layouts. It facilitates learning about scripts, their categorization, and preservation. The modular design of the tool enables seamless integration of new languages or scripts.

vre

Last edit: 2024-11-15

Zodiac - Ancient Astral Science in Transformation

2021
2026
deu ,
eng

ZODIAC's main objective is to develop a convincing account of how zodiacal astral science emerged, was cross-culturally transmitted and able to take root in different ancient cultures. It also offers an online cross-cultural glossary of ancient astral science.

egyptologyhistory_studieslexiconotherscience_historysemitic_studies

Last edit: 2024-10-31

Zoroastrian Middle Persian Corpus and Dictionary

2021
2030
eng ,
deu

This project aims at the creation of a digital corpus of Zoroastrian Middle Persian texts in Pahlavi script as well as at the development of a comprehensive dictionary covering that corpus. The corpus consists of roughly 50 texts which will be supplied with several layers of orthographical, grammatical, semantic and intertextual annotation.

annotationdigital_editiondigital_preservationiranian_studieslexicographylexiconlinguisticsmanuscriptvre

Last edit: 2024-10-31