Resources

Importing tables from websites into spreadsheets
EN
Sometimes it can be useful to take information from a website, such as document lists from archives, for future reference. This short resource will show the user how to download an extension to copy tables from websites and then import the table into a spreadsheet program.
Authors
Rachel Pistol
Read more →
Using OpenCV for Face Detection
EN
OpenCV is a very popular, free and open source software system used for a large variety of computer vision applications. This article is intended to help you get started in experimenting with OpenCV using an example of face detection in images as a case study.
Authors
Mike Bryant
Read more →
Building and Linking Humanities' Digital Spatial Infrastructures
EN
This workshop, focussing on "Spatial data medieval to modern", is the first of a series of workshops from the NOS-HS project "Linking, Building, and Sustaining Humanities Digital Spatial Infrastructures for Research in the Nordic Countries". The main aims of this workshop were to define key concepts (spatial infrastructures, Linked Open Data, metadata, ontology), outline major challenges in the field, and to provide an opportunity to share experiences of addressing the issues in individual and national projects across the Nordic countries.
Authors
Alexandra Petrulevich
Sara Ellis Nilsson
Peder Gammeltoft
Read more →
quod: A Tool for Querying and Organising Digitised Historical Documents
EN
This blog post from EHRI introduces 'quod' (querying OCRed documents), a prototype Python-based command line tool for OCRing and querying digitised historical documents, which can be used to organise large collections and improve information about provenance. To demonstrate its use in context, the post takes the reader through a case study of the International Tracing Service, showing workflows and the steps taken from start to finish.
Authors
Reinier De Valk
Read more →
Exploratory Topic Modelling in Python
EN
Topic modelling is a technique by which documents within a corpus are clustered based on how certain groups of terms are used together within the text. The commonalities between such term groupings tend to form what we would normally call “topics”, providing a way to automatically categorise documents by their structural content, rather than a more metadata-based knowledge system. Using resources held with EHRI's collections, this notebook offers learners an introduction to 'LDA' topic modelling using Python in a step-by-step guide.
Authors
Mike Bryant
Maria Dermentzi
Read more →
Mapping Science in Immersive Architectures
EN
In this webinar from Friday Frontiers, Dario Rodighiero (University of Groningen) discusses visualisation and representation of scholarly knowledge. This presentation brings science mapping back to its original meaning by widening its context to arts and humanities with the help of design.
Authors
Dario Rodighiero
Read more →
How to Capture and Reference a Webpage in your Research Using Zotero
EN
The need to reference webpages in academic work is growing all the time, particularly in the digital humanities. Many reference management systems exist to help researchers sort and find their sources, and the most accessible of these is Zotero.
Authors
Rachel Pistol
Read more →
Tutorial for VOICE 3.0
EN
This tutorial explains how to navigate in and use the new VOICE 3.0 Online interface for the Vienna-Oxford International Corpus of English, developed by the VOICE CLARIAH project team and released in September 2021. The tutorial introduces the web interface, explains how to run search queries, apply filters for the creation of sub-corpora and set bookmarks. In addition, it provides short quizzes and links to short videos explaining the design and functions of the VOICE 3.0 interface.
Authors
Marie-Luise Pitzl
Stefanie Riegler
Ruth Osimk-Teasdale
Read more →
EHRI in TEITOK
EN
This blog post examines TEITOK, a corpus framework used as an alternative to Omeka. TEITOK is centered around texts and is similar to the Omeka interface: both allow you to search through the documents and display the transcription. The main difference is that Omeka treats the transcription as an object description, whereas TEITOK shows not only that a word appears in a document, but also where it appears and how it is used.
Authors
Maarten Janssen
Read more →
Has Anyone Cited A Woman?
EN
Women have long been under-represented in science, and their output also appears to be under-represented in citations. In this talk, presented as part of the DARIAH Friday Frontiers webinar series, Sally Wyatt (Maastricht University) addresses how to achieve citational justice.
Authors
Sally Wyatt
Read more →
Copyright and Academia in the Digital Era
EN
This webinar introduces the foundations of copyright and offers snapshots on the most relevant topics for academic authors, intermediaries and users, such as copyright flexibilities, exceptions and limitations in the field of cultural heritage access and preservation (digitization, e-lending, orphan and out-of-commerce works), copyright authorship and ownership, law and praxis of academic publishing, commercial and non-commercial licensing, collective management of authors’ rights, with brief references to open access.
Authors
Caterina Sganga
Read more →
CLS-INFRA Training School on Data and Annotation
EN
This event, organised and provided by the CLS INFRA project, offers an introductory course on textual data annotation. The workshop shows learners how to edit, annotate, and query a text corpus without writing a single line of code, how to structure texts with XML-TEI, and how to run an NLP tool to add linguistic information.
Authors
Lisanne van Rossum
Maarten Janssen
Silvie Cinková
Read more →
