Posts - Page 2 of 2

Shajara-i Turk: A 17th-century Turko-Chinggisid Genealogy from Central Asia

  • 8 min read

We continue our series of guest posts by our summer paleography and codicology course students this week with an exploration by Nurlan Kabdylkhak of the manuscript tradition of an important component text of Central Asian Islamicate literature.

Shajara-i Turk stands out among historical chronicles originating in Central Asia for several compelling…

Read More

Exploring Manuscript Textual Variability With Digital Tools: Introduction

  • 3 min read

As mentioned in last week’s post, one of the goals of OpenITI over the coming months and years is to develop new ways of exploring textual variability in the manuscript tradition using the affordances of digital tools and platforms. One of the distinguishing features of texts in Islamicate manuscript cultures…

Read More

Digital Islamicate Paleography and Codicology Summer School

  • 3 min read

Roshan Institute for Persian Studies at the University of Maryland, College Park (Roshan Institute-UMD) is offering a free, stipend-supported twelve-week online summer course on digital Islamicate paleography and codicology. This course is part of the “Unsupervised Islamicate Manuscript Transcription via Lacunae Reconstruction” project, funded by a National Science Foundation grant to Roshan-UMD and the University of California, San Diego…

Read More

Using eScriptorium for Manuscript Transcription

  • 7 min read

There are any number of reasons a researcher might want to transcribe a digitized manuscript, and there are now a number of tools available to help one in doing so. Transcribing Arabic-script manuscripts poses certain challenges right from the get-go, regardless of whether one is operating in a digital environment or not. Deciphering difficult handwriting, reconstructing lacunae caused by loss of material, navigating manuscripts with text running…

Read More

Challenges of Layout Analysis across Arabic-Script Training Data

  • 3 min read

Layout Analysis is the process of identifying regions (e.g., title, body text, footnotes, etc.) on a page of text before sending it through the OCR engine. Preparing documents to train our OCR models involves several distinct steps, including semantic annotation, fixing segmentation errors, and editing faulty transcriptions. eScriptorium allows users to associate specific labels with regions…

Read More