- E-mailClaus.Huitfeldt@uib.no
- Phone+47 55 58 30 13913 48 030
- Visitor AddressSydnesplassen 12/13 / Harald Hårfagresgt 1BergenRoom507
- Postal AddressPostboks 78055020 Bergen
After having finished my "magister" degree at the University of Trondheim in 1984 I worked on the Norwegian Wittgenstein Project, until in 1990 I founded the Wittgenstein Archives at the University of Bergen. I directed this project until the publication of Wittgenstein's Nachlass: The Bergen Electronic Edition (Oxford University Press, 2000, ISBN-10: 0192686917). This was the first complete publication of Ludwig Wittgenstein's Nachlass, and one of the first digital text critical editions of its size and kind.
Since then I have worked, philosophically as well as text-technologically, with methods for the representation, manipulation and analysis of documents, as well as with questions about their ontological and epistemological status. Text encoding and transcription are central themes in this work, which focuses especially on two subjects:
- Identity conditions and similarity measures for texts in general, and for digital documents in particular
- Representation, manipulation and analysis of complex documents with overlapping, discontinuous, unordered or multiply ordered elements.
Some recent talks:
Huitfeldt, Claus, and C. M. Sperberg-McQueen. “Document similarity: Transcription, edit distances, vocabulary overlap, and the metaphysics of documents.” Presented at Balisage: The Markup Conference 2020, Washington, DC, July 27 - 31, 2020. In Proceedings of Balisage: The Markup Conference 2020. Balisage Series on Markup Technologies, vol. 25 (2020). https://doi.org/10.4242/BalisageVol25.Huitfeldt01.
Sperberg-McQueen, C.M., and Huitfeldt, Claus: "Bootstrapping Project-specific Spell-checkers". Paper given at Digital Humanities 2019, Utrecht, 8-.12-07.2019. Slides: http://mlcd.blackmesatech.com/mlcd/2019/Talks/Utrecht-201907/index.xml Abstract: https://dev.clariah.nl/files/dh2019/boa/0961.html
Huitfeldt, Claus, and Sperberg-McQueen, C.M.: "Interpreting Difference Among Transcripts". Paper given at Digital Humanities 2018, Mexico City, 26.-29.06.2018. Slides: http://mlcd.blackmesatech.com/mlcd/2018/Talks/MexicoDF-201806/index.en.xml Abstract: https://dh2018.adho.org/en/interpreting-difference-among-transcripts/
- (2020). Document similarity. Balisage Series on Markup Technologies.
- (2019). Bootstrapping Project-specific Spell-checkers.
- (2018). Interpreting Difference Among Transcripts.
- (2017). Transcriptional Implicature: Using a Transcript to Reason about an Exemplar.
- (2017). Systèmes de balisage de textes et editions numérique. 24 pages.
- (2015). XML Structured attributes, change tracking, and the metaphysics of documents.
- (2015). UnderDok: XML Structured attributes, change tracking, and the metaphysics of document. Balisage Series on Markup Technologies.
- (2014). Transcriptional Implicature: A Contribution To Markup Semantics.
- (2014). Markup technology and textual scholarship. 22 pages.
- (2014). Document lattices: Equivalence, compatibility, and contradiction in document markup. Balisage Series on Markup Technologies.
- (2013). Modeling overlapping structures: Graphs and serializability. Balisage Series on Markup Technologies.
- (2012). The MLCD Overlap Corpus (MOC): Project report. Balisage Series on Markup Technologies.
- (2012). Ten problems in the interpretation of XML documents. 18 pages.
- (2012). Modeling overlapping structures: Graphs and serializability.
- (2012). Documents as Timed Abstract Objects. Balisage Series on Markup Technologies.
- (2011). TagAl: A tag algebra for document markup. Balisage Series on Markup Technologies.
- (2011). TagAl: A tag algebra for document markup.
- (2011). Expressive power of markup languages and graph structures.
- (2010). Visualizing the semantics of TEI Lite.
- (2010). Two representations of the semantics of TEI Lite.
- (2010). The MLCD overlap corpus: A markup research infrastructure.
- (2010). The MLCD Overlap Corpus (MOC).
- (2010). Extension of the type/token distinction to document structure. Balisage Series on Markup Technologies.
- (2009). What is transcription? (part 2).
- (2009). Markup Meaning and Mereology. Balisage Series on Markup Technologies.
- (2009). Formal and informal meaning from documents through skeleton sentences: Complementing formal tag-set descriptions with intertextual semantics and vice-versa. Balisage Series on Markup Technologies.
- (2008). What is transcription? Literary & Linguistic Computing. 295-310.
- (2008). Markup Discontinued: Discontinuity in TexMecs, Goddag structures, and rabbit/duck grammars. Balisage Series on Markup Technologies.
- (2008). Goddag.
- (2008). Complex Document Structures.
- (2007). What is transcription?
- (2007). Thoughts about Texts and other Things.
- (2007). Text Technology and Philosophy.
- (2007). Preserving Information About Linearization in Document Graphs.
- (2007). Markup Technology and Textual Structures.
- (2006). Representation and processing of Goddag structures: implementation strategies and progress report.
- (2006). Philosophy Case Study. 16 pages.
- (2006). Markup Languages for Complex Documents - An Interim Project Report.
- (2004). Text Technology and Textual Criticism. Linguistica Computazionale. 259-275.
- (2004). Text Technology and Critical Editing.
- (2004). Text Technology and Critical Editing.
- (2004). Scholarly Text Processing and Future Markup Systems. 17 pages.
- (2004). Notation, data structure and grammar for non-hierarchical structures.
- (2004). Markup Languages for Complex Documents.
- (2004). GODDAG: A data structure for overlapping hierarchies. Lecture Notes in Computer Science (LNCS). 139-160.
- (2004). Editorial Principles of Wittgenstein's Nachlass: The Bergen Electronic Edition. 15 pages.
- (2004). Digitaliseringen av Holbergs samlede skrifter. 14 pages.
- (2003). XML Semantics and Digital Libraries.
- (2003). Towards a semantics for XML.
- (2003). Scholarly Text Processing and Future Markup Systems.
- (2003). Kommentering i lys av de muligheter datateknologi tilbyr.
- (2003). Hva er tekst.
- (2003). Editorial Principles of Wittgensteins Nachlass The Bergen Electronic Edition.
- (2003). Editorial Principles of Wittgensteins Nachlass The Bergen Electronic Edition.
- (2003). Digitalisering for gjenutgivelse av Holbergs skrifter.
- (2003). A logic programming environment for document semantics and inference. Literary & Linguistic Computing. 225-233.
- (2002). Drawing inferences on the basis of markup.
- (2001). Markup Discontinued. Balisage Series on Markup Technologies.
- (1998). Concurrent Document Hierarchies in MECS and SGML.
- (1998). Computer Aided Text Encoding and Textual Scholarship.
- (1997). Philosophy and Electronic Publishing. Theory and Metatheory in the Development of Text Encoding. The Monist. 348-367.
- (1995). Wittgenstein's Nachlass revisited.
- (1995). Multi-Dimensional Texts in a One-Dimensional Medium. Computers and the Humanities. 235-241.
- (1994). Toward a Machine-Readable Version of Wittgenstein's Nachlass: Some Editorial Problems. Editio : Internationales Jahrbuch für Editionswissenschaft. 37-43.
- (1994). Computerizing Wittgenstein - The Wittgenstein Archives at the University of Bergen. 20 pages.
- (1993). Manuscript Encoding: Alphatexts and Betatexts.
- (1993). MECS - A Multi-Element Code System.
- (1993). Encoding Wittgenstein.
- (1992). The Wittgenstein Archives at the University of Bergen - Annual Report 1991. (Norwegian-English parallel text.). .
- (1992). Multi-Dimensional Texts in a One-Dimensional Medium. 5. 5. .
- (1991). The Wittgenstein Archives at the University of Bergen - Background, Project Plan and Annual Report 1990. (Norwegian-English parallel text.). .
- (1991). Das Wittgenstein-Archiv der Universit@æt Bergen. Hintergrund und erster Arbeitsbericht mit Nachtrag: Wittgenstein-Nachla@s: Nothing is hidden. Mitteilungen aus dem Brenner-Archiv. 93-106.
More information in national current research information system (CRIStin)