105 Works

C M Taylor Keylogging Data: 27 Mar 2017 – 05 Mar 2018

C M Taylor
This dataset is comprised of keylogging data from the author C M Taylor captured March 2017 – March 2018; Keystroke files: 27/03/2017 – 05/03/2018.The dataset is comprised of screenshots and keystroke logs. The screenshots are saved individually as JPGs and BMPs as well as an AVI file, so the individual captures play as a film. Keystrokes are saved either as .rtf files or .txt. Please note the .avi file is only from 27 March to...

Portraits of actors, views of theatres and playbills (covering 1750 - 1821 in a single volume)

The dataset comprises one digitised volume (166 pages) of a collection of portraits of celebrated actors and actresses, views of theatres and playbills, dating 1750 - 1821. The dataset is in Portable Document Format (PDF).

Volume of Christmas ballads and broadsides. 1750 - 1840.

The dataset comprises one digitised volume (110 pages) of a collection of Christmas ballads and prose broadsides chiefly printed in London by J. Pitts between 1750 - 1840. The dataset is in Portable Document Format (PDF).

Volumes of Lysons Collectanea (Trades), comprising advertisements, cuttings, and illustrations relating to trades, professions, medical cures. 1660-1825.

The dataset comprises four digitised volumes of a collection of advertisements, cuttings and illustrations relating to trades, professions and medical cures from 1660 - 1825 (with OCR-derived text.)

AAS Card Catalogues: Marathai

This dataset contains digitised microfilms of Marathai card catalogues.

AAS Card Catalogues: Oriya

This dataset contains digitised microfilms of Oriya card catalogues (1904-1983).

AAS Card Catalogues: Sindhi

This dataset contains digitised microfilms of Sindhi card catalogues.

AAS Card Catalogues: South Asian Minor Languages

This dataset contains digitised microfilms of South Asian Minor Language card catalogues.

OCR text derived from digitised books published 1810 - 1819. ALTO XML.

This set consists 2338 volumes, published between 1810-1819. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO) Extensible Markup Language (XML) format.

OCR text derived from digitised books published 1860 - 1869. ALTO XML.

This set consists 7498 volumes, published between 1860-1869. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO) Extensible Markup Language (XML) format.

OCR text derived from digitised books published 1880 - 1889. ALTO XML.

This set consists 10856 volumes, published between 1880-1889. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO) Extensible Markup Language (XML) format.

Digitised Hebrew Manuscripts: Add MS 18229 - Add MS 26897

This dataset comprises of 27 digitised Hebrew manuscripts (1100 - 1799; unknown date), with their shelfmarks in alphabetical order (Add MS 18229 - Add MS 26897). These manuscripts are out of copyrights.

Digitised Hebrew Manuscripts: Or 2406 - Or 2509

This dataset comprises of 41 digitised Hebrew manuscripts (1300 - 1799; unknown date), with their shelfmarks in alphabetical order (Or 2406 - Or 2509). These manuscripts are out of copyrights.

Digitised Hebrew Manuscripts: Or 2626 - Or 6425

This dataset comprises of 43 digitised Hebrew manuscripts (920 - 1845; unknown date), with their shelfmarks in alphabetical order (Or 2626 - Or 6425). These manuscripts are out of copyrights.

Theatrical playbills from Britain and Ireland. (OCR text only)

OCR-derived text for the playbills, encoded in UTF-8. The dataset comprises 264 volumes of digitised theatrical playbills published between 1660 – 1902 (mostly 19th century) from England, Scotland, Wales and Ireland. Digitised from the British Library's physical collection of over 500 volumes of playbills. The dataset containes text files (.TXT) in Optical Character Recognition (OCR) format. The playbills cover theatres in Bath (Royal), Bristol (Royal), Dublin (Royal), Edinburgh (miscellaneous), Hull (Royal), King's Lynn, Liverpool (Royal),...

Digitised Books - Images identified as Embellishments. c. 1510 - c. 1946. JPG.

The dataset comprises c. 41,6951 images identified as ‘Embellishments’ from the British Library's Flickr Commons collections, dating between c. 1510 - c. 1946. The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1946. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The images are in .JPEG format.

Digitised Hebrew Manuscripts: Metadata

This dataset contains metadata (TEI XML catalogue records) of all British Library digitised Hebrew manuscripts.

AAS Card Catalogues: Armenian

This dataset contains digitised microfilms of Armenian card catalogues.

AAS Card Catalogues: Gujurati

This dataset contains digitised microfilms of Gujurati card catalogues (1909-1983)

AAS Card Catalogues: Tamil

This dataset contains digitised microfilms of Tamil card catalogues.

AAS Card Catalogues: Turkish

This dataset contains digitised microfilms of Turkish card catalogues.

Pelagios Project: Digitised Cornaro Atlas. Egerton MS 73

This dataset comprises of 37 images from a Portolano executed by different Venetian artists between 1489 and 1492, known as the 'Cornaro Atlas'. The digitisation was sponsored by A. W. Mellon Foundation through the Pelagios Project.

Pelagios Project: Digitised Liber insularum Arcipelagi Cotton MS Vespasian a.XIII.art.1

This dataset comprises of 82 images from the Liber insularum Arcipegelagi, an illustrated account of the islands and major ports of the Mediterranean produced by Christophori Bondelmonti around 1422. The digitisation was sponsored by A. W. Mellon Foundation through the Pelagios Project.

Pelagios Project: Maps after Ptolemy's Geographia. Burney MS 111

This dataset comprises of 68 images from a Greek manuscript edition of Ptolemy's Geography containing many diagrams and coloured maps, and produced between the 1375 and 1425. The digitisation was sponsored by A. W. Mellon Foundation through the Pelagios Project.

India Office Medical Archives samples

This dataset comprises 13 samples of digitised India Office Medical Archives on cholera and medical topography. All Open Government Licence.

Registration Year

  • 2019
    1
  • 2018
    17
  • 2017
    43
  • 2016
    44

Resource Types

  • Dataset
    99
  • Image
    4