
substack.com/@generativehistory/note/c-144981227
Preview meta tags from the substack.com website.
Linked Hostnames
3Thumbnail

Search Engine Appearance
Mark Humphries (@generativehistory)
Thanks for the comment. We did this in an article in Historical Methids using an earlier version of the software: https://www.tandfonline.com/doi/full/10.1080/01615440.2025.2500309. Transbrikus is a great tool but it is also expensive and our goal is not to create a competitor but an open-source alternative. For context, context, on a 10,000 word, 50 page English language 18th and 19th c test set using dozens of different hands, out of the box (ie without fine tuning or training), we found Gemini-2.5-pro achieved a WER of 4.89% and a CER of 2.63% (excluding punctuation and capitalization as both can be ambiguous). On the same test set, the latest Transkribus Titan model achieves 13.2% WER and 6.6% CER. Transkribus also costs around 24 cents per page versus 0.8 cents per page with Gemini-2.5-pro. Transkribus would probably approach and perhaps exceed Gemini’s performance if you fine tuned it on each ah d, but that requires around 50 pages of transcribed pages per hand. So on large datasets, Transkribus might be the best choice choice (and it might also be much better on non English sets, we don’t know). But for sets of mixed documents or small sets of documents (or where cost is an issue), Gemini-2.5-pro in the API via a program like Archive Studio offer an alternative.
Bing
Mark Humphries (@generativehistory)
Thanks for the comment. We did this in an article in Historical Methids using an earlier version of the software: https://www.tandfonline.com/doi/full/10.1080/01615440.2025.2500309. Transbrikus is a great tool but it is also expensive and our goal is not to create a competitor but an open-source alternative. For context, context, on a 10,000 word, 50 page English language 18th and 19th c test set using dozens of different hands, out of the box (ie without fine tuning or training), we found Gemini-2.5-pro achieved a WER of 4.89% and a CER of 2.63% (excluding punctuation and capitalization as both can be ambiguous). On the same test set, the latest Transkribus Titan model achieves 13.2% WER and 6.6% CER. Transkribus also costs around 24 cents per page versus 0.8 cents per page with Gemini-2.5-pro. Transkribus would probably approach and perhaps exceed Gemini’s performance if you fine tuned it on each ah d, but that requires around 50 pages of transcribed pages per hand. So on large datasets, Transkribus might be the best choice choice (and it might also be much better on non English sets, we don’t know). But for sets of mixed documents or small sets of documents (or where cost is an issue), Gemini-2.5-pro in the API via a program like Archive Studio offer an alternative.
DuckDuckGo

Mark Humphries (@generativehistory)
Thanks for the comment. We did this in an article in Historical Methids using an earlier version of the software: https://www.tandfonline.com/doi/full/10.1080/01615440.2025.2500309. Transbrikus is a great tool but it is also expensive and our goal is not to create a competitor but an open-source alternative. For context, context, on a 10,000 word, 50 page English language 18th and 19th c test set using dozens of different hands, out of the box (ie without fine tuning or training), we found Gemini-2.5-pro achieved a WER of 4.89% and a CER of 2.63% (excluding punctuation and capitalization as both can be ambiguous). On the same test set, the latest Transkribus Titan model achieves 13.2% WER and 6.6% CER. Transkribus also costs around 24 cents per page versus 0.8 cents per page with Gemini-2.5-pro. Transkribus would probably approach and perhaps exceed Gemini’s performance if you fine tuned it on each ah d, but that requires around 50 pages of transcribed pages per hand. So on large datasets, Transkribus might be the best choice choice (and it might also be much better on non English sets, we don’t know). But for sets of mixed documents or small sets of documents (or where cost is an issue), Gemini-2.5-pro in the API via a program like Archive Studio offer an alternative.
General Meta Tags
14- titleMark Humphries (@generativehistory): "Thanks for the comment. We did this in an article in Historical Methids using an earlier version of the software: https://www.tandfonline.com/doi/full/10.1080/01615440.2025.2500309. Transbrikus is a great tool but it is also expensive and our goal is not to create a competitor …"
- title
- title
- title
- title
Open Graph Meta Tags
9- og:urlhttps://substack.com/@generativehistory/note/c-144981227
- og:imagehttps://substackcdn.com/image/fetch/$s_!XCt4!,w_400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack.com%2Fimg%2Freader%2Fnotes-thumbnail.jpg
- og:image:width400
- og:image:height400
- og:typearticle
Twitter Meta Tags
8- twitter:imagehttps://substackcdn.com/image/fetch/$s_!XCt4!,w_400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack.com%2Fimg%2Freader%2Fnotes-thumbnail.jpg
- twitter:cardsummary
- twitter:label1Likes
- twitter:data10
- twitter:label2Replies
Link Tags
17- alternatehttps://substack.com/@generativehistory/note/c-144981227
- apple-touch-iconhttps://substackcdn.com/icons/substack/apple-touch-icon.png
- canonicalhttps://substack.com/@generativehistory/note/c-144981227
- iconhttps://substackcdn.com/icons/substack/icon.svg
- manifest/manifest.json
Links
6- https://generativehistory.substack.com
- https://substack.com/@generativehistory/note/c-144981227?
- https://substack.com/@generativehistory?
- https://substack.com/@generativehistory?utm_source=substack-feed-item
- https://substack.com/home?