Go to the project website

Tutorials and Courses

Scientific Summarization Services

In the current context of scientific information overload in which new knowledge is created at a rapid pace, we propose to develop text summarization services for automatically identifying the m...

Introduction to Text and Data Mining

The purpose of this introductory course is to provide a starting point to the concepts of Text and Data Mining (TDM), since the field is gradually gaining more attention from funders a...

Assessing the FAIRness of Data

In this course you'll learn how to go about assessing the FAIRness of research data using freely accessible tools and resources. This course will:

introduce you ...

FreeLing Component Tutorial

The Freeling component provides basic language analysis functionalities (tokenization, lemmatization, Pos Tagging and dependency parsers.) for the variety of languages that Freeling includes (En...

Text mining for linking wheat data with literature

This tutorial includes three parts that describe how to use the Wheat Phenotypic Information Extractor and the two end-user applications, WheatIS and AlvisIR, that integrates its results for the...

Tutorial: Variable Detection and Linking in Social Sciences Publications

The objective of this tutorial is to showcase how Social Science researchers can take full advantage of the OpenMinTeD TDM platform for Detecting and Linking Variables in Scientific Publications...

OpenMinTeD Use Case Recommender System for Scholarly Resources

This tutorial describes how to use TDM to build a Recommender system for scholarly resources and utilise OpenMinTed platform to build and annotate corpuses for this purpose.

Ontogene entity recognition OGER

This tutorial explains how to use the Bio Term Hub, an aggregator of biomedical terminologies sourced from manually curated databases, to create a terminology suited to the users need....

OpenMinTeD Use Case – Funding Mining Extractor

This tutorial walks users through the simple process of creating a workflow in the OpenMinTeD platform that allows them to identify acknowledged projects (i.e. funding information) from scientif...

DataCite Linking

This tutorial walks users through the simple process of creating a workflow in the OpenMinTeD platform that allows them to extract links to DataCite (https://www.datacite.org) - mainly citations...

Document Classification

This tutorial will users through the simple process of creating a workflow in the OpenMinTeD platform that allows them to perform content-based document classification on scientific publications...

Using the OpenMinTeD platform to build article based HSM - Health State Models

This tutorial is made up of two parts:
Part I is the OpenMinTED guide to create a workflow that reads from a data source and annotate articles related to chronic liver diseases.
Pa...

Using IXA pipes in the OpenMinTeD platform

In this course we will explain how IXA pipes have been integrated as Docker images in the OpenMinTeD (OMTD) platform and how can they be used (http://ixa2.si.ehu.es/ixa-pipes/).

The ai...

UPFMT Docker Usage Tutorial

This tutorial focuses on using the Docker image to annotate raw text files. It shows how to install the docker system on a machine, how to pull the UPFMT image and how to pass the input/output p...

UPFMT Direct API Usage Tutorial

This tutorial focuses on using the code directly on a host machine. It gives access to the code (Python) + models and shows the user how to run the code from the console. Also, all the steps nee...

Text Mining Neuroscience Literature using the OpenMinTeD Platform

The objective of this tutorial is to showcase how the Neuroscience use case available at the OpenMinTeD platform can facilitate the curation of neuroscience entities from the literature with the...

Using the OpenMinTeD platform to aid curators of the ChEBI database

The objective of this tutorial is to showcase the use case of “Extract Metabolites and their Properties and Modes of Actions”. The tutorial describes step-by-step how to create a workflow in the...

Integrative biology gene regulations of Arabidopsis thaliana literature in FLAGdb

This tutorial explains how to use the “Arabidopsis Gene Regulation Extractor” application available from the OpenMinTeD platform. It also explains the scientific issues it addresses, and how res...

Florilege, a new database of habitats and phenotypes of food microbe flora

This tutorial explains how to use the “Habitat-Phenotype Relation Extractor for Microbes” application available from the OpenMinTeD platform. It also explains the scientific issues it addresses,...

Using the Text Mining For Journalism Application with the OpenMinted platform

The OpenMinTeD project offers an integrated registry of text mining components alongside a powerful corpus builder. The platform can be used to identify a set of documents of interest and then r...

Sharing web services through OpenMinTeD platform

In this tutorial we describe a way to wrap a command line tool (NLProt) as web service and share it through the OpenMinTeD platform.

How to wrap your Java NLP tool into an UIMA component

The Unstructured Information Management Architecture (UIMA) is a widely used software framework and specification to create multi modal analysis systems, in particular for Natural Language Proce...

How to add semantic knowledge resources to the OMTD platform?

The purpose of this tutorial is to explain how to add annotation resources to the OMTD platform. In this context, annotation resources mean ontologies or vocabularies selected in different ontol...

BOLSTM classifying relations via long short term memory networks along biomedical ontologies

BO-LSTM is a model based on biomedical ontologies and Long short-term memory networks. T...

BabelNet Concept Detector

The objective of this component is to scan a tokenized text to detect entries in BabelNet in the input document. This component is the base of entity linking and word sense disambiguation as it ...

Ab3P and OpenSesamIE for abbreviations and relations with PubRunner

Abbreviations and basic subject-object relations can be valuable data to understand the context and meaning of text passages. This tutorial explains how to use the two tools Ab3P and OpenSesamIE...

VineSum

A software component for vine/grape variety named entity extraction and clustering

Tutorial on Text Mining over Viticulture Bibliographic Data

The objective of this tutorial is to showcase how the use case application on Agriculture, and more specifically Viticulture, can be utilized by researchers of this domain on a specific topic by...

Tutorial on Text mining over Food Safety and Water Health Bibliographic Data

The objective of this tutorial is to showcase how the use case application around the Food Safety thematic area, and more specifically around Food Safety and Water Health, can be utilized by res...

Video Explanations

What are the current challenges in Text and Data Mining?

What are the benefits of Text and Data Mining?

What is State-of-the-art in Recommender Systems?

What are the current challenges in Knowledge Discovery?

How would you define Deep Learning?

What is State-of-the-art in Deep Learning?

What are the current challenges in Semantic Search?

Top Resources in Text and Data Mining

Tutorial : Legal and ethical considerations for sharing research data (clip)

Tutorial : TDM and Machine Readability of Open Access research

By Nancy Pontika | Text And Data Mining

Tutorial : TDM and Machine Readability of Open Access research

By Nancy Pontika | Text And Data Mining

Introduction to RDM concepts and tools

Les enjeux de l'open access

OpenMinTeD @ CORIA/TALN

By Mouhamadou Ba, Robert Bossy | Reproducibility Guidelines | Analysis/experimentation

Introduction to the OpenMinTeD platform

By Martine Oudenhoven | Text And Data Mining

Creative Commons badge Unless otherwise stated, all materials created by the FOSTER consortium are licensed under a CREATIVE COMMONS ATTRIBUTION 4.0 INTERNATIONAL LICENSE.

This project has received funding from the European Union’s Seventh Framework Programme for research, technological development and demonstration under grant agreement no 612425.
This project has also received funding from the European Union's Horizon2020 programme for research, technological development and demonstration under agreement no 741839.