Gunn Inger Lyse Samdal

Researcher, Lexicographer (dictionary editor)
  • Email: Gunn.Lyse@uib.no
  • Phone: +47 55 58 32 24, +47 476 32 349
  • Visiting address
    Universitetsbiblioteket
    Haakon Sheteligs plass 7
    5007 Bergen
    Room 017
  • Postal address
    Postboks 7805
    5020 Bergen

Current position

I work as a researcher at the University of Bergen (since 1 October 2018), in a project position as a lexicographer (dictionary editor). Over the course of a five-year project, we are revising and updating the two public dictionaries Bokmålsordboka and Nynorskordboka, which are part of Språksamlingane at the University of Bergen.

Research interests

  • Lexicography
  • Lexical semantics and semantic wordnets
  • Multiword expressions
  • Computational linguistics and language technology
  • Corpus linguistics
  • Statistical and machine learning methods for language analysis and language modelling
  • Data management
  • Research infrastructure (FAIR data, data management and sharing, metadata)

Education

I defended my PhD thesis at the University of Bergen in 2011 (title: Translation-based Word Sense Disambiguation). I hold a master's-level degree (hovedfag) in Computational Linguistics and Language Technology from the University of Bergen (2003), with supporting subjects in general linguistics, humanistic informatics, German and English.

Work experience

Before my current position, I worked in research and teaching connected to Computational Linguistics and Language Technology at the University of Bergen (2003-2007, 2012-2016 and 2018-). I worked as a researcher in language technology projects at Uni Research Computing in Bergen (2008-2009). I have also worked at NSD - Norsk Senter for Forskningsdata AS (Norwegian Centre for Research Data) as a special adviser, further developing NSD's training portfolio on data management, data archiving, FAIR data and data protection (2016-2018).

See the sections on projects and publications for further details.

Books
  • De Smedt, Koenraad; Samdal, Gunn Inger Lyse; Gjesdal, Anje Müller; Losnegaard, Gyri Smørdal. 2012. Norsk i den digitale tidsalderen (Bokmål version). Springer Science+Business Media B.V. 81 pages. ISBN: 9783642313882.
  • De Smedt, Koenraad; Samdal, Gunn Inger Lyse; Gjesdal, Anje Müller; Losnegaard, Gyri Smørdal. 2012. Norsk i den digitale tidsalderen (Nynorsk version). Springer Science+Business Media B.V. 81 pages. ISBN: 9783642314322.
  • Vold, Eva Thue; Lyse, Gunn Inger; Gjesdal, Anje Muller. 2007. New Voices in Linguistics. Cambridge Scholars Publishing. 271 pages. ISBN: 1-84718-014-0.
Journal articles
  • Gjesdal, Anje Müller; Samdal, Gunn Inger Lyse. 2016. Exploring an environmental neologism in Norwegian across corpora. Neologica. 10: 39-57.
  • Rosén, Victoria; Thunes, Martha; Haugereid, Petter; Losnegaard, Gyri Smørdal; Dyvik, Helge J. Jakhelln; Meurer, Paul; Samdal, Gunn Inger Lyse; De Smedt, Koenraad. 2016. The enrichment of lexical resources through incremental parsebanking. Language Resources and Evaluation. 50: 291-319. doi: 10.1007/s10579-016-9356-5
  • Gjesdal, Anje Müller; Lyse, Gunn Inger. 2011. Dårlig språk er dårlig butikk. Dagens næringsliv. 1 page. Published 2011-09-26.
  • Kvale, Knut; Nordgård, Torbjørn; Svendsen, Torbjørn; Lyse, Gunn Inger; Gjesdal, Anje Müller. 2011. Datamaskinen må skjønne norsk. Bergens Tidende. Published 2011-11-19.
Reports/theses
  • Lyse, Gunn Inger. 2011. Translation-based Word Sense Disambiguation. University of Bergen.
Book chapters
  • De Smedt, Koenraad; Samdal, Gunn Inger Lyse; Kyrkjebø, Rune; Al Ruwehy, Hemed Ali Hemed; Gjesdal, Øyvind Liland; Rosén, Victoria; Meurer, Paul. 2016. The CLARINO Bergen Centre: Development and Deployment. 1, pages 1-12. In:
    • De Smedt, Koenraad. 2016. Selected Papers from the CLARIN Annual Conference 2015, October 14–16, 2015, Wroclaw, Poland. Linköping University Electronic Press. 117 pages. ISBN: 978-91-7685-765-6.
  • Dyvik, Helge J. Jakhelln; Meurer, Paul; Rosén, Victoria; De Smedt, Koenraad; Haugereid, Petter; Losnegaard, Gyri Smørdal; Samdal, Gunn Inger Lyse; Thunes, Martha. 2016. NorGramBank: A ‘Deep’ Treebank for Norwegian. Paper 943, pages 3555-3562. In:
    • Calzolari, Nicoletta; Choukri, Khalid; Declerck, Thierry; Grobelnik, Marko; Maegaard, Bente; Mariani, Joseph; Moreno, Asunción; Odijk, Jan; Piperidis, Stelios. 2016. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). European Language Resources Association. 6000 pages. ISBN: 978-2-9517408-9-1.
  • Samdal, Gunn Inger Lyse; Meurer, Paul; De Smedt, Koenraad. 2015. COMEDI: A component metadata editor. 116:008, pages 82-98. In:
    • Odijk, Jan. 2015. Selected Papers from the CLARIN 2014 Conference, October 24-25, 2014, Soesterberg, The Netherlands. Linköping University Electronic Press. 110 pages. ISBN: 978-91-7685-954-4.
  • Losnegaard, Gyri Smørdal; Samdal, Gunn Inger Lyse; Gjesdal, Anje Müller; De Smedt, Koenraad; Meurer, Paul; Rosén, Victoria. 2013. Linking Northern European Infrastructures for Improving the Accessibility and Documentation of Complex Resources. Article, pages 44-59. In:
    • De Smedt, Koenraad; Borin, Lars; Lindén, Krister; Maegaard, Bente; Rögnvaldsson, Eiríkur; Vider, Kadri. 2013. Proceedings of the workshop on Nordic language research infrastructure at NODALIDA 2013. Linköping University Electronic Press. 76 pages. ISBN: 978-91-7519-585-8.
  • Meurer, Paul; Dyvik, Helge J. Jakhelln; Rosén, Victoria; De Smedt, Koenraad; Samdal, Gunn Inger Lyse; Losnegaard, Gyri Smørdal; Thunes, Martha. 2013. The INESS Treebanking Infrastructure. 85, pages 453-458. In:
    • Oepen, Stephan; Hagen, Kristin; Johannessen, Janne Bondi. 2013. Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013). Linköping University Electronic Press. 474 pages. ISBN: 978-91-7519-589-6.
  • Parra Escartín, Carla; Losnegaard, Gyri Smørdal; Samdal, Gunn Inger Lyse; Patiño, Pedro. 2013. Representing Multiword Expressions in Lexical and Terminological Resources: An Analysis for Natural Language Processing Purposes. Chapter 23, pages 338-357. In:
    • Kosem, Iztok; Kallas, Jelena; Gantar, Polona; Krek, Simon; Langements, Margit; Tuulik, Maria. 2013. Electronic lexicography in the 21st century: thinking outside the paper. Proceedings of the eLex 2013 conference, 17-19 October 2013, Tallinn, Estonia. 516 pages. ISBN: 978-961-93594-0-2.
  • Losnegaard, Gyri Smørdal; Samdal, Gunn Inger Lyse. 2012. A data-driven approach to anglicism identification in Norwegian. Part 1:6, pages 131-154. In:
    • Andersen, Gisle. 2012. Exploring newspaper language: using the web to create and investigate a large corpus of modern Norwegian. John Benjamins Publishing Company. 356 pages. ISBN: 978-90-272-0354-0.
  • Losnegaard, Gyri Smørdal; Samdal, Gunn Inger Lyse; Thunes, Martha; Rosén, Victoria; De Smedt, Koenraad; Dyvik, Helge J. Jakhelln; Meurer, Paul. 2012. What we have learned from Sofie: Extending lexical and grammatical coverage in an LFG parsebank. Chapter, pages 69-76. In:
    • Hajič, Jan; De Smedt, Koenraad; Tadič, Marko; Branco, António. 2012. META-RESEARCH Workshop on Advanced Treebanking at LREC2012. European Language Resources Association. 76 pages. ISBN: 0000000000000.
  • Rosén, Victoria; Meurer, Paul; Losnegaard, Gyri Smørdal; Samdal, Gunn Inger Lyse; De Smedt, Koenraad; Thunes, Martha; Dyvik, Helge J. Jakhelln. 2012. An integrated web-based treebank annotation system. Chapter, pages 157-167. In:
    • Hendrickx, Iris; Kübler, Sandra; Simov, Kiril. 2012. Proceedings of the Eleventh International Workshop on Treebanks and Linguistic Theories (TLT11). Edições Colibri. 205 pages. ISBN: 978-989-689-274-6.
  • Samdal, Gunn Inger Lyse; Andersen, Gisle. 2012. Collocations and statistical analysis of n-grams: multiword expressions in newspaper text. Part 1, chapter 4, pages 79-109. In:
    • Andersen, Gisle. 2012. Exploring newspaper language: using the web to create and investigate a large corpus of modern Norwegian. John Benjamins Publishing Company. 356 pages. ISBN: 978-90-272-0354-0.
  • Samdal, Gunn Inger Lyse; Parra Escartín, Carla; De Smedt, Koenraad. 2012. Applying current metadata initiatives: The META-NORD experience. Chapter, pages 13-20. In:
    • Arranz, Victoria; Broeder, Daan; Gaiffe, Bertrand; Gavrilidou, Maria; Monachini, Monica; Trippel, Thorsten. 2012. Proceedings of the Workshop on Describing LRs with Metadata: Towards Flexibility and Interoperability in the Documentation of LRs. European Language Resources Association. 83 pages. ISBN: 0000000000000.
  • Vasiļjevs, Andrejs; Forsberg, Markus; Gornostay, Tatiana; Hansen, Dorte H.; Jóhannsdóttir, Kristín; Lindén, Krister; Lyse, Gunn I.; Offersgaard, Lene; Oksanen, Ville; Olsen, Sussi; Pedersen, Bolette S.; Rögnvaldsson, Eiríkur; Rozis, Roberts; Skadiņa, Inguna; De Smedt, Koenraad. 2012. Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries. Paper 744, pages 1076-1083. In:
    • Calzolari, Nicoletta; Choukri, Khalid; Declerck, Thierry; Ugur Dogan, Mehmet; Maegaard, Bente; Mariani, Joseph; Odijk, Jan. 2012. Proceedings of the Eighth International Conference on Language Resources and Evaluation. European Language Resources Association. 1244 pages. ISBN: 978-2-9517408-7-7.
  • Lyse, Gunn Inger. 2007. "Making sense of translations": Translation-based lexical information for word sense disambiguation (WSD). Chapter, pages 233-246. In:
    • Vold, Eva Thue; Lyse, Gunn Inger; Gjesdal, Anje Muller. 2007. New Voices in Linguistics. Cambridge Scholars Publishing. 271 pages. ISBN: 1-84718-014-0.

See the complete list of publications in CRIStin.

  • I work as a lexicographer (dictionary editor) in Revisjonsprosjektet, the revision project for Bokmålsordboka and Nynorskordboka (www.ordbok.uib.no). The project (2018-2023) is funded by Språkrådet (the Language Council of Norway).

Previous projects

  • The NORDi project (2016-2021), supported by the Research Council of Norway, is owned by NSD - Norwegian Centre for Research Data. In 2016-2018, I worked at NSD as a special adviser on training and data management. In the NORDi project, I led the work package on training and support. The main objective was to strengthen and expand the guidance and training services at NSD, in order to increase awareness of data management and practical data management skills among researchers and research support staff in the Norwegian research sector. I worked with both digital training material and face-to-face training activities.
  • The national infrastructure project CLARINO (2012-2016), supported by the Research Council of Norway, was owned by UiB - University of Bergen. CLARINO is the Norwegian contribution to the international CLARIN infrastructure. I worked as a project manager in CLARINO (2012-2015). My main fields of expertise in this context were data management, metadata and IPR/licensing issues.
  • In the national infrastructure project INESS (2012-2016), supported by the Research Council of Norway, we developed a database of syntactically analysed sentences (a treebank). On the infrastructure side, I participated in developing the technical solutions of the INESS treebank (user interface, metadata, IPR issues). On the linguistic side, I added Norwegian sentence analyses to the treebank, mainly by manually annotating sentences that had first been parsed automatically by the NorGram grammar (LFG framework). In adding material to the Norwegian treebank, I also did text preprocessing and some grammar development.
  • The infrastructure project DASISH, supported by the EU, aimed to create synergies between infrastructure initiatives in the Social Sciences and Humanities (SSH). Among other things, we developed a Virtual Competence Center (VCC) where future researchers can find information about the moral, ethical and legal aspects of compiling research material. The project ended in December 2014.
  • In the project META-NORD (2011-2013), supported by the EU, we worked to make language technology resources more easily available by offering standardized metadata and negotiating licences. We catalogued 67 Norwegian language resources relevant for language technology research and development. This work has been inherited by the projects CLARINO and (where relevant) INESS. My work in this project included writing a national language white paper, creating metadata, encoding and reformatting research material, and clearing resource ownership, rights and licences.
  • In the national project Norsk aviskorpus / Norwegian Newspaper Corpus (Uni Research, Dep. of Computing), I worked on automatic methods to identify multiword expressions, terms and anglicisms in Norwegian text. I explored a variety of association measures to rank two- and three-word sequences (bigrams and trigrams) by how likely they are to constitute multiword expressions. This work is closely related to automatic term extraction and the automated detection of anglicisms in Norwegian text. We also explored the use of machine learning systems to detect anglicisms automatically.
  • I was project leader for a development project at Uni Research Computing, where we provided language technology expertise to improve a commercial writing aid system for people with dyslexia. In particular, I produced mappings between orthography and pronunciation, and worked with n-grams to find contexts for producing intelligent spelling suggestions.
  • In the national Machine Translation (MT) project LOGON (2003-2006), I worked with statistical methods for Word Sense Disambiguation. My resulting Ph.D. thesis was defended in 2011. 
  • In connection with the national project Fra parallellkorpus til ordnett (From parallel corpora to word nets) at the University of Bergen, I wrote my master's thesis and also worked as a scientific assistant in 2002-2003.
