OpenMinTed sets out to create an open, service-oriented e-Infrastructure for Text and Data Mining (TDM) of scientific and scholarly content. Researchers can collaboratively create, discover, share and re-use Knowledge from a wide range of text-based scientific related sources in a seamless way.
In the current context of scientific information overload in which new knowledge is created at a rapid pace, we propose to develop text summarization services for automatically identifying the m...
The purpose of this introductory course is to provide a starting point to the concepts of Text and Data Mining (TDM), since the field is gradually gaining more attention from funders a...
The Freeling component provides basic language analysis functionalities (tokenization, lemmatization, Pos Tagging and dependency parsers.) for the variety of languages that Freeling includes (En...
This tutorial includes three parts that describe how to use the Wheat Phenotypic Information Extractor and the two end-user applications, WheatIS and AlvisIR, that integrates its results for the...
The objective of this tutorial is to showcase how Social Science researchers can take full advantage of the OpenMinTeD TDM platform for Detecting and Linking Variables in Scientific Publications...
This tutorial describes how to use TDM to build a Recommender system for scholarly resources and utilise OpenMinTed platform to build and annotate corpuses for this purpose.
This tutorial explains how to use the Bio Term Hub, an aggregator of biomedical terminologies sourced from manually curated databases, to create a terminology suited to the users need....
This tutorial walks users through the simple process of creating a workflow in the OpenMinTeD platform that allows them to identify acknowledged projects (i.e. funding information) from scientif...
This tutorial walks users through the simple process of creating a workflow in the OpenMinTeD platform that allows them to extract links to DataCite (https://www.datacite.org) - mainly citations...
This tutorial will users through the simple process of creating a workflow in the OpenMinTeD platform that allows them to perform content-based document classification on scientific publications...
This tutorial is made up of two parts:
Part I is the OpenMinTED guide to create a workflow that reads from a data source and annotate articles related to chronic liver diseases.
Pa...
In this course we will explain how IXA pipes have been integrated as Docker images in the OpenMinTeD (OMTD) platform and how can they be used (http://ixa2.si.ehu.es/ixa-pipes/).
This tutorial focuses on using the Docker image to annotate raw text files. It shows how to install the docker system on a machine, how to pull the UPFMT image and how to pass the input/output p...
This tutorial focuses on using the code directly on a host machine. It gives access to the code (Python) + models and shows the user how to run the code from the console. Also, all the steps nee...
The objective of this tutorial is to showcase how the Neuroscience use case available at the OpenMinTeD platform can facilitate the curation of neuroscience entities from the literature with the...
The objective of this tutorial is to showcase the use case of “Extract Metabolites and their Properties and Modes of Actions”. The tutorial describes step-by-step how to create a workflow in the...
This tutorial explains how to use the “Arabidopsis Gene Regulation Extractor” application available from the OpenMinTeD platform. It also explains the scientific issues it addresses, and how res...
This tutorial explains how to use the “Habitat-Phenotype Relation Extractor for Microbes” application available from the OpenMinTeD platform. It also explains the scientific issues it addresses,...
The OpenMinTeD project offers an integrated registry of text mining components alongside a powerful corpus builder. The platform can be used to identify a set of documents of interest and then r...
The Unstructured Information Management Architecture (UIMA) is a widely used software framework and specification to create multi modal analysis systems, in particular for Natural Language Proce...
The purpose of this tutorial is to explain how to add annotation resources to the OMTD platform. In this context, annotation resources mean ontologies or vocabularies selected in different ontol...
The objective of this component is to scan a tokenized text to detect entries in BabelNet in the input document. This component is the base of entity linking and word sense disambiguation as it ...
Abbreviations and basic subject-object relations can be valuable data to understand the context and meaning of text passages. This tutorial explains how to use the two tools Ab3P and OpenSesamIE...
The objective of this tutorial is to showcase how the use case application on Agriculture, and more specifically Viticulture, can be utilized by researchers of this domain on a specific topic by...
The objective of this tutorial is to showcase how the use case application around the Food Safety thematic area, and more specifically around Food Safety and Water Health, can be utilized by res...