made using Leaflet
DH explorations

Made for: AI Meets Humanities and Social Sciences conference, 23rd-25th June 2025, Vienna, Austria

Author of the overview: Jan Odstrčilík

Last update: 16th December 2025

Link to this page (QR Code)

https://shorturl.at/vbvkW

https://leaflet.pub/8a9f8f63-8c33-4f6b-926c-c314417a0337


Drag and drop systems

No configuration and no custom models; intended for processing single pages.

Transkribus.ai

🌐 URL: https://transkribus.ai/

💶 Costs: 🆓

🗒️ Note: Use of large AI models by Transkribus. Suitable only for single pages.



Transkribus public models

🌐 URL: https://app.transkribus.org/models/public

💶 Costs: 🆓

🗒️ Note: Possibility to test any public model for free for single pages. Very limited export possibilities.


Example - Carolingian Minuscule Model

🌐 URL: https://app.transkribus.org/models/public/text/51210




Simple systems

Simple installation, no manual creation of ground truth, no training of custom models.

Rescribe.xyz

🌐 URL: https://rescribe.xyz/

💶 Costs: 🆓


Full systems but without custom model training

Manual creation of ground truth and manual correction of automatically recognized text, but no custom models.

Projekt PERO

🌐 URL: https://pero.fit.vutbr.cz/

💶 Costs: 🆓

⚠️ No possibility of training custom models. ⚠️



Full Integrated Transcription Environments

Possibility to train (and use) custom models in the graphical user interface (GUI).

Transkribus

🌐 URL: https://www.transkribus.org/

💶 Costs: 50 credits/month for free, subscription model

Pros:

very large community

easy to sign up and use

a lot of public models

part of an ecosystem (ScanTent, Transkribus, Transkribus Sites)


Cons:

advanced features (field training, advanced export options, etc.) require a subscription

some solutions could be cheaper

not open-source

not possible to import/export models

eScriptorium

🌐 URL: https://escriptorium.inria.fr/

💶 Costs: 🆓 but you need your own infrastructure


Pros:

large community

open-source

suitable for non-Latin scripts

possibility to import/export Kraken models (to be found on HTR-United and Zenodo)

privacy of the data

Cons:

need for own servers/infrastructure

less user-friendly (a new version should appear in 2025)



OCR4All

🌐 URL: https://www.ocr4all.org/

💶 Costs: 🆓 but you need your own infrastructure


Source of the image: https://www.ocr4all.org/about/ocr4all


Calfa Vision

🌐 URL: https://vision.calfa.fr/

💶 Costs: 3500 Euro per 3500 pages; further pages are cheaper

🎯 Focus: scripts not based on the Latin alphabet


ScribbleSense

🌐 URL: https://scribblesense.cz/

💶 Costs: Currently 🆓

From the same developers as Projekt PERO.






Command-line tools

Programming skills required.

Kraken

🌐 URL: https://kraken.re/main/index.html

💶 Costs: 🆓 but you need your own infrastructure.

🗒️ Note: Used by eScriptorium.


Other similar command-line tools: PyLaia, Tesseract, Calamari.
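
To give a rough idea of what working with a command-line tool such as Kraken involves, the sketch below builds a typical Kraken call (baseline segmentation followed by text recognition) from Python. The file names and the helper function `build_kraken_command` are hypothetical and serve only as an illustration; the model file must be obtained separately, and the exact options may differ between Kraken versions, so the official documentation should be checked before use.

```python
import shlex
import shutil
import subprocess
from pathlib import Path


def build_kraken_command(image: str, output: str, model: str) -> list[str]:
    """Build a Kraken call: baseline segmentation followed by recognition.

    Hypothetical helper for illustration; it only assembles the arguments.
    """
    return [
        "kraken",
        "-i", image, output,  # input image and output text file
        "segment", "-bl",     # baseline layout segmentation
        "ocr", "-m", model,   # recognition with the given model file
    ]


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    cmd = build_kraken_command("page_001.jpg", "page_001.txt", "my_model.mlmodel")
    print(shlex.join(cmd))
    # Run only if Kraken is installed and the input image actually exists.
    if shutil.which("kraken") and Path("page_001.jpg").exists():
        subprocess.run(cmd, check=True)
```

Without Kraken installed, the script only prints the command it would run, which makes it safe to try before setting up any infrastructure.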


New Tools Created with Agentic Coding

Polyscriptor

🌐 URL: https://github.com/achimrabus/polyscriptor

💶 Costs: Currently 🆓



