23,923 Works

The Microsoft Academic Graph in RDF: A Linked Data Source with 8 Billion Triples of Scholarly Data

Michael Färber
We present the Microsoft Academic Knowledge Graph (MAKG), a large RDF data set with over eight billion triples with information about scientific publications and related entities, such as authors, institutions, journals, and fields of study. The data set is based on the Microsoft Academic Graph and licensed under the Open Data Attributions license. Furthermore, we provide entity embeddings for all 210M represented scientific papers. More information can be found at http://ma-graph.org/ and in the ISWC'19...

The Microsoft Academic Graph in RDF: A Linked Data Source with 8 Billion Triples of Scholarly Data

Michael Färber
We present the Microsoft Academic Knowledge Graph (MAKG), a large RDF data set with over eight billion triples with information about scientific publications and related entities, such as authors, institutions, journals, and fields of study. The data set is based on the Microsoft Academic Graph and licensed under the Open Data Attributions license. Furthermore, we provide entity embeddings for all 210M represented scientific papers. More information can be found at http://ma-graph.org/ and in the ISWC'19...

Dividing the Ontology Alignment Task

Ernesto Jimenez-Ruiz, Asan Agibetov, Jiaoyan Chen, Matthias Samwald & Valerie Cross
Large ontologies still pose serious challenges to state of the art ontology alignment systems. In the paper we present an approach that combines a lexical index, a neural embedding model and locality modules to effectively segment an input ontology matching task into smaller and more tractable (sub)matching tasks. We have conducted a comprehensive evaluation using the datasets of the Ontology Alignment Evaluation Initiative. The results are encouraging and suggest that the proposed methods are adequate...

weecology/PortalData 1.141.0

S. K. Morgan Ernest, Glenda M. Yenni, Ginger Allington, Ellen K. Bledsoe, Erica M. Christensen, Renata Diaz, Keith Geluso, Jacob R. Goheen, Qinfeng Guo, Edward Heske, Douglas Kelt, Joan M. Meiners, Jim Munger, Carla Restrepo, Douglas A. Samson, Michele R. Schutzenhofer, Marian Skupski, Sarah R. Supp, Katherine M. Thibault, Shawn D. Taylor, Ethan P. White, Diane W. Davidson, James H. Brown & Thomas J. Valone
Official Repo of the Portal Project Data

Dividing the Ontology Alignment Task

Ernesto Jimenez-Ruiz, Asan Agibetov, Jiaoyan Chen, Matthias Samwald & Valerie Cross
Large ontologies still pose serious challenges to state of the art ontology alignment systems. In the paper we present an approach that combines a lexical index, a neural embedding model and locality modules to effectively segment an input ontology matching task into smaller and more tractable (sub)matching tasks. We have conducted a comprehensive evaluation using the datasets of the Ontology Alignment Evaluation Initiative. The results are encouraging and suggest that the proposed methods are adequate...

MAS corpus

María Navas-Loro, Víctor Rodríguez-Doncel, Alba Fernández-Izquierdo, Idafen Santana-Pérez & Alberto Sánchez
Corpus of Spanish tweets for Marketing, including tags on Sentiment Analysis (emotions), Marketing Mix and Purchase Funnel by human taggers.

MAS corpus

María Navas-Loro, Víctor Rodríguez-Doncel, Alba Fernández-Izquierdo, Idafen Santana-Pérez & Alberto Sánchez
Corpus of Spanish tweets for Marketing, including tags on Sentiment Analysis (emotions), Marketing Mix and Purchase Funnel by human taggers.

Australian Federal Legislation - Principal acts in force

Víctor Rodríguez-Doncel
This dataset is a corpus with 884 documents in legal Australian English. These documents are part of the Australian Federal Legislation principal acts in force as of 29 December 2018. The documents are offered in two forms: (1) as pure txt documents (2) as RDF turlte documents following the ELI schema (European Legislation Identifier ontology).

Australian Federal Legislation - Principal acts in force

Víctor Rodríguez-Doncel
This dataset is a corpus with 884 documents in legal Australian English. These documents are part of the Australian Federal Legislation principal acts in force as of 29 December 2018. The documents are offered in two forms: (1) as pure txt documents (2) as RDF turlte documents following the ELI schema (European Legislation Identifier ontology).

CORAL: A corpus of ontological requirements annotated with Lexico-Syntactic Patterns

Alba Fernández-Izquierdo, María Poveda-Villalón & Raúl García-Castro
In this work we present CORAL (Corpus of Ontological Requirements Annotated with Lexico-syntactic patterns), an openly available corpus of 834 ontological requirements annotated and 29 lexico-syntactic patterns, from which 12 are proposed in this work. CORAL is openly available in three different open formats, namely, HTML, CSV and RDF.

CORAL: A corpus of ontological requirements annotated with Lexico-Syntactic Patterns

Alba Fernández-Izquierdo, María Poveda-Villalón & Raúl García-Castro
In this work we present CORAL (Corpus of Ontological Requirements Annotated with Lexico-syntactic patterns), an openly available corpus of 834 ontological requirements annotated and 29 lexico-syntactic patterns, from which 12 are proposed in this work. CORAL is openly available in three different open formats, namely, HTML, CSV and RDF.

The List Of Qps Status Recommended Biological Agents For Safety Risk Assessments Carried Out By Efsa

Antonia Ricci, Ana Allende, Declan Bolton, Marianne Chemaly, Robert Davies, Rosina Girones, Konstantinos Koutsoumanis, Roland Lindqvist, Birgit Nørrung, Lucy Robertson, Giuseppe Ru, Pablo Salvador Fernández Escámez, Moez Sanaa, Marion Simmons, Panagiotis Skandamis, Emma Snary, Niko Speybroek, Benno Ter Kuile, John Threlfall, Helene Wahlström, Pier Sandro Cocconcelli, Luisa Peixe, Günter Klein, Miguel Prieto Maradona, Amparo Querol … & Lieve Herman
The European Food Safety Authority (EFSA) asked the Panel on Biological Hazards (BIOHAZ) to deliver a scientific Opinion on the maintenance of the list of qualified presumption of safety (QPS) biological agents. The QPS approach was developed by the EFSA Scientific Committee to provide a harmonised generic pre-evaluation to support safety risk assessments of biological agents intentionally introduced into the food and feed chain, in support of the concerned scientific Panels and Units in the...

ConvLMD

Verónica Panadeiro, Antón García & Adrián Pallas
Dataset of MWIR coaxial images of Laser Metal Deposition process using different laser power and robot speed configurations.

Computational History of Philosophy of Science (Comp HOPOS) Dataset

Daniel J. Hicks, Rick Morris & Evelyn Brister
The Computational History of Philosophy of Science (Comp HOPOS) aims to be a comprehensive set of article and (when available) book chapter metadata for philosophy of science. The dataset covers the full run of over 40 journals and 3 major book series in the field. An automated author disambiguation script is used to construct canonical names for each author, and a combination of gender attribution methods is used to attribute the gender of each author....

Supplementary material for 'Revealing patterns of nocturnal migration using the European weather radar network'

Cecilia Nilsson, Adriaan Dokter, Liesbeth Verlinden, Judy Shamoun-Baranes, Baptiste Schmid, Peter Desmet, Silke Bauer, Jason Chapman, Jose A. Alves, Phillip M. Stepanian, Nir Sapir, Charlotte Wainwright, Mathieu Boos, Anna Górska, Myles H. M. Menz, Pedro Rodrigues, Hidde Leijnse, Pavel Zehtindjiev, Robin Brabant, Günther Haase, Nadja Weisshaupt, Michał Ciach & Felix Liechti
Introduction This package contains data, filters and visualizations from Nilsson and Dokter et al. (2019). Files radar_metadata.csv: Metadata for the 84 European radars considered for this study. Includes radar code (odim_code = country + odim_code_3char and alternative radar code vp_radar), radar site location (location, latitude, longitude), radar site elevation (site_altitude_asl in meters above sea level) and radar altitude range used in this study (min_height_cut_asl and max_height_cut_asl in meters above sea level). vp.zip: Vertical profiles of...

Supplementary material for 'Revealing patterns of nocturnal migration using the European weather radar network'

Cecilia Nilsson, Adriaan Dokter, Liesbeth Verlinden, Judy Shamoun-Baranes, Baptiste Schmid, Peter Desmet, Silke Bauer, Jason Chapman, Jose A. Alves, Phillip M. Stepanian, Nir Sapir, Charlotte Wainwright, Mathieu Boos, Anna Górska, Myles H. M. Menz, Pedro Rodrigues, Hidde Leijnse, Pavel Zehtindjiev, Robin Brabant, Günther Haase, Nadja Weisshaupt, Michał Ciach & Felix Liechti
Introduction This package contains data, filters and visualizations from Nilsson and Dokter et al. (2019). Files radar_metadata.csv: Metadata for the 84 European radars considered for this study. Includes radar code (odim_code = country + odim_code_3char and alternative radar code vp_radar), radar site location (location, latitude, longitude), radar site elevation (site_altitude_asl in meters above sea level) and radar altitude range used in this study (min_height_cut_asl and max_height_cut_asl in meters above sea level). vp.zip: Vertical profiles of...

DOIBoost Dataset Dump

Sandro La Bruzzo, Paolo Manghi & Andrea Mannocci
Research in information science and scholarly communication strongly relies on the availability of openly accessible datasets of metadata and, where possible, their relative payloads. To this end, CrossRef plays a pivotal role by providing free access to its entire metadata collection, and allowing other initiatives to link and enrich its information. Therefore, a number of key pieces of information result scattered across diverse datasets and resources freely available online. As a result of this fragmentation,...

DOIBoost Dataset Dump

Sandro La Bruzzo, Paolo Manghi & Andrea Mannocci
Research in information science and scholarly communication strongly relies on the availability of openly accessible datasets of metadata and, where possible, their relative payloads. To this end, CrossRef plays a pivotal role by providing free access to its entire metadata collection, and allowing other initiatives to link and enrich its information. Therefore, a number of key pieces of information result scattered across diverse datasets and resources freely available online. As a result of this fragmentation,...

OpenAIRE ScholeXplorer Service: Scholix JSON Dump

Sandro La Bruzzo & Paolo Manghi
This datasets contains the GZ-compressed dump of the information space of the OpenAIRE ScholeXplorer service. The dataset consists of 126+ Million literature-dataset and dataset-dataset links between 12+ Million objects, where links are encoded as records in Scholix format (schema Version 3). Links were collected from publishers (CrossRef, EventData), data centers (DataCite), and institutional/thematic repositories (OpenAIRE). the links are organized in 29 compressed files, each of ~500MB, for a total of ~15GB,.

LexiRumah v3.0.0

Owen Edwards, Gereon A. Kaiping & Marian Klamer
LexiRumah database of languages of Eastern Indonesia.

21 coffee makers energy consumption dataset

Diego Casado Mansilla
This dataset represents the use of 21 coffee machines over time, with each line representing an energy consumption event of a specific machine.

21 coffee makers energy consumption dataset

Diego Casado Mansilla
This dataset represents the use of 21 coffee machines over time, with each line representing an energy consumption event of a specific machine.

lexibank/chaconbaniwa: The diversity of Arawakan languages from the upper Rio Negro in recordings from the 1950s

Johann-Mattis List & Thiago Costa Chacon
Data accompanying the paper "The diversity of Arawakan languages from the upper Rio Negro in recordings from the 1950s" by Thiago C. Chacon et al. Cite the source of the dataset as: Chacon, T. C.; Gonçalves, A. G.; and da Silva, L. F (forthcoming): A diversidade linguística Aruák no Alto Rio Negro em gravações da década de 1950 [The diversity of Arawakan languages from the upper Rio Negro in recordings from the 1950s]

CIViCmine

Jake Lever
This describes the output files for the CIViCmine project. These files are loaded directly by the CIViCmine viewer. The code for this viewer is available in the CIViCmine Github repo if you want to run it independently. Each file is a tab-delimited file with a header, no comments and no quoting. You likely want civicmine_collated.tsv if you just want the list of cancer biomarkers. If you want the supporting sentences, look at civicmine_sentences.tsv. You can...

lingpy/language-island-paper: Bangime and Friends

Johann-Mattis List & Abbie Hantgan
This package offers source code and data for the forthcoming paper by Hantgan and List ""Bangime: Secret Language, Language Isolate, or Language Island?" by Hantgan and List " If you use the code, please also cite the original paper: Hantgan, Abbie and List, Johann-Mattis (forthcoming): Bangime. Secret language, language isolate, or language island? Journal of Language Contact.

Registration Year

  • 2018
    23,923

Resource Types

  • Dataset
    23,923

Data Centers

  • Zenodo
    23,923