Home
University of Bergen Library
The Digital Lab

Corpus Workshop: Text mining of Norwegian public records

Text mining is a collective term for techniques for text analysis using digital tools. This workshop covers a selection of such tools and some of the methodological issues you may encounter when applying them to public records.

Ryggen av serien Stortingsforhandlinger, 1992-1993
The back cover of the book "Stortingsforhandlinger"
Photo:
NBo-HS, Wikimedia Commons

Main content

The Workshop will be held in Norwegian.

The program consists of three lectures as well as hands-on sessions where you get to try tools such as UiB's Corpuscle and the National Library's dhlab. During the practical sessions, we will present how to define and analyze text collections in these tools.

The target group for the workshop is students and researchers within the fields of history, law, and the social sciences.

Schedule:

9:00-9:30:  Introductory lecture: "Do we need full text? Bureaucrats, researchers and disciplines on different perspectives on political text corpora" (Arne Solli, associate professor of history at UiB)

9:30-9:50: Retrieve the case: Storting proceedings and NB/Statsmaktene (Arne Solli)

10:00-10:50: Clarino and the National Library's web interfaces (Henrik Askjer)

11:00-11:30: Lecture on corpus analysis (Heidi Karlsen, researcher and librarian at BI and lecturer in digital humanities at UiO)

11:30-12:00: The National Library's dhlab in Jupyter Notebook (Henrik Askjer)

12:00-12:30: Lunch

12:30-13:00:  Lecture: Tove Bruland on how she used #lancsbox to analyze curricula 1974-2000 in her master’s thesis

13:10-14:30: Discussion

14:30-15:00: Further exploration of tools for anyone interested