Text and Data miners

Navigation

Resources

By  Marjan Grootveld
Publication year: 2018  |  Research Data Management  |  Open Repositories
Intended audience Programmers, Researchers and Students, Text and Data miners
Level: Advanced: apply

The Freeling component provides basic language analysis functionalities (tokenization, lemmatization, Pos Tagging and dependency parsers.) for the variety of languages that Freeling includes (English, Spanish, Portuguese, Italian, French, German, Russian, Catalan, Galician, Croatian, Slovene). The specific usage scenario for this component concerns scientific publications in non-English languages. The component has been shared as a Docker a...

Intended audience Programmers, Researchers and Students, Text and Data miners
Level: Advanced: apply

The objective of this component is to scan a tokenized text to detect entries in BabelNet in the input document. This component is the base of entity linking and word sense disambiguation as it detects the candidates to be disambiguated. The component produces WSD item annotations as defined in the DKPro WSD typesystem. Afterwards, disambiguation can be performed by other components (like DKPro WSD). The component has been shared as a Docke...

Intended audience Industry and Business, Programmers, Project Managers, Publishers, Researchers and Students, Text and Data miners
Level: Introductory: no previous knowledge is required

This tutorial includes three parts that describe how to use the Wheat Phenotypic Information Extractor and the two end-user applications, WheatIS and AlvisIR, that integrates its results for the use case developed by Inra during the OpenMinTeD project.The application extracts information related to wheat on phenotypes, genes, markers, species, and varieties.

Intended audience Policy makers and Funders, Project Managers, Publishers, Researchers and Students, Text and Data miners
Level: Introductory: no previous knowledge is required

The objective of this tutorial is to showcase how Social Science researchers can take full advantage of the OpenMinTeD TDM platform for Detecting and Linking Variables in Scientific Publications.