104 Works

Digitised Hebrew Manuscripts: Or 2518 to Or 5834

Ellie King
This dataset comprises 33 digitised Hebrew manuscripts (900 - 1899), with their shelfmarks in alphabetical order (Or 2518 to Or 5834). These manuscripts are out of copyright.

OCR text derived from digitised books published 1820 - 1829. ALTO XML.

British Library Labs
This set consists of 2739 volumes published between 1820 and 1829. The dataset comprises text from the collection of digitised books, created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO) Extensible Markup Language (XML) format.

AAS Card Catalogues: Tamil

British Library
This dataset contains digitised microfilms of Tamil card catalogues.

C M Taylor Keylogging Data: 17 April 2016 – 22 July 2016

C M Taylor
This dataset comprises keylogging data from the author C M Taylor, captured April – May 2016 (keystroke files: 17/04/2016 – 22/07/2016). It consists of screenshots and keystroke logs. The screenshots are saved individually as JPGs and BMPs, and also as an AVI file so that the individual captures play as a film. Keystrokes are saved as either .rtf or .txt files.

Digitised Hebrew Manuscripts: Or 1103 - Or 2201

British Library
This dataset comprises 46 digitised Hebrew manuscripts (1250 - 1699; unknown date), with their shelfmarks in alphabetical order (Or 1103 - Or 2201). These manuscripts are out of copyright.

Digitised Quarterly Lists and Metadata

Tom Derrick
The files in this dataset are derived from the British Library’s collection of bound volume Quarterly Lists: printed catalogue records of Indian books published quarterly and by province of British India between 1867 and 1947. The dataset comprises full-text searchable PDFs of 215 volumes as well as the associated metadata for each volume and represents a rich source for researchers interested in the publishing industry and book history in India. The catalogues are predominantly in...

Digitised Hebrew Manuscripts: Harley 5772 to Or 14580

British Library
This dataset comprises 42 digitised Hebrew manuscripts (1200 - 1871), with their shelfmarks in alphabetical order (Harley 5772 - Or 14580). These manuscripts are out of copyright.

UK Doctoral Theses (EThOS) Abstracts and Metadata - 01/03/2015. XLS.

EThOS
The dataset comprises metadata descriptions for over 350,000 PhD theses awarded by UK Higher Education institutions, aggregated by the British Library's EThOS service. Abstracts are included for around one third of the records. Permission to reuse this data must be sought.

OCR text derived from digitised books published 1700 - 1799. ALTO XML.

British Library Labs
This set consists of 2070 volumes published between 1700 and 1799. The dataset comprises text from the collection of digitised books, created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO) Extensible Markup Language (XML) format.

AAS Card Catalogues: Nepali

British Library
This dataset contains digitised microfilms of Nepali card catalogues (1958-1977).

AAS Card Catalogues: Gujurati

British Library
This dataset contains digitised microfilms of Gujurati card catalogues (1909-1983).

Pelagios Project: Digitised Insularium Illustratum. Additional MS 15760

British Library
This dataset comprises 123 images from the Insularium Illustratum, an account of the islands of the Mediterranean and of some others, produced by Henricus Martellus Germanus in 1495. The digitisation was sponsored by the A. W. Mellon Foundation through the Pelagios Project.

AAS Card Catalogues: South Asian Minor Languages

British Library
This dataset contains digitised microfilms of South Asian Minor Language card catalogues.

AAS Card Catalogues: Chinese (Pinyin)

British Library
This dataset contains digitised cards from the Pinyin card catalogue.

Digitised Hebrew Manuscripts: Harley MS 5709 - Or 11016

British Library
This dataset comprises 25 digitised Hebrew manuscripts (1100 - 1699; unknown date), with their shelfmarks in alphabetical order (Harley MS 5709 - Or 11016). These manuscripts are out of copyright.

Digitised Hebrew Manuscripts: Add MS 18229 - Add MS 26897

British Library
This dataset comprises 27 digitised Hebrew manuscripts (1100 - 1799; unknown date), with their shelfmarks in alphabetical order (Add MS 18229 - Add MS 26897). These manuscripts are out of copyright.

OCR text derived from digitised books published 1860 - 1869. ALTO XML.

British Library Labs
This set consists of 7498 volumes published between 1860 and 1869. The dataset comprises text from the collection of digitised books, created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO) Extensible Markup Language (XML) format.

Judicial Committee of the Privy Council: Linked Appeals Data

From a spreadsheet created by Jonathan Sims and Sophie Flynn Piercy; Linked Data produced by Sarah Middle
Linked Data about appeal cases heard by the Judicial Committee of the Privy Council between 1860 and 1998. Personal and organisation names have been reconciled to VIAF and Wikidata, and place names have been reconciled to Geonames (where possible).

OCR text derived from digitised books published 1800 - 1809. ALTO XML.

British Library Labs
This set consists of 1502 volumes published between 1800 and 1809. The dataset comprises text from the collection of digitised books, created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO) Extensible Markup Language (XML) format.

C M Taylor Keylogging Data: 07 Jan 2015 – 09 Feb 2015

C M Taylor
This dataset comprises keylogging data from the author C M Taylor, captured October 2014 – 9 February 2015 (keystroke files: 07/01/2015 – 09/02/2015). It consists of screenshots and keystroke logs. The screenshots are saved individually as JPGs and BMPs, and also as an AVI file so that the individual captures play as a film. Keystrokes are saved as either .rtf or .txt files.

AAS Card Catalogues: Thai

British Library
This dataset contains digitised microfilms of Thai card catalogues.

Digitised Hebrew Manuscripts: Add MS 9399 - Harley MS 5708

British Library
This dataset comprises 27 digitised Hebrew manuscripts (1200 - 1499; unknown date), with their shelfmarks in alphabetical order (Add MS 9399 - Harley MS 5708). These manuscripts are out of copyright.

Digitised Hebrew Manuscripts: Add MS 5242 to Arundel Or 50

Ellie King
This dataset comprises 22 digitised Hebrew manuscripts (1100 - 1799), with their shelfmarks in alphabetical order (Add MS 5242 to Arundel Or 50). These manuscripts are out of copyright.

Volumes of Lysons Collectanea (Amusements), comprising broadsides, cuttings, advertisements on amusements. 1660-1840.

British Library Labs
The dataset comprises nine digitised volumes of a collection of broadsides, cuttings and advertisements relating to public exhibitions and places of amusement from 1660 - 1840 (with OCR-derived text).
