5,059 Works

IceMorph morphological analysis data files

Timothy Tangherlini, Sean Crist, Peter M. Broadwell, David Gabriel, Kryztof Urban, Aurelijus Vijunas & Jackson Crawford
This dataset consists of four main resources: a concatenated dictionary of Old Icelandic parsed for word class and inflectional detail; a corpus of Old Icelandic sagas in plain text and chunked by chapter; a tagged version of the same text, output of the IceMorph system; a training corpus labeled "Expert" for training and testing a machine learning module; and a training corpus labeled "Gold" for training and testing a machine learning module.

Registration Year

  • 2011
    4
  • 2012
    2
  • 2014
    3
  • 2015
    6
  • 2016
    29
  • 2017
    4,403
  • 2018
    283
  • 2019
    329

Resource Types

  • Other
    5,059

Data Centers

  • UC San Diego
    5,037
  • California Digital Library
    10
  • UC Berkeley
    6
  • UC Santa Barbara
    4
  • UC Los Angeles
    1
  • UC San Francisco
    1