Text And Data Mining

The process of extracting high-quality, meaningful information from text in order to answer open questions.

Resources

Intended audience: Policy makers and Funders, Project Managers, Publishers, Researchers and Students, Text and Data miners
Level: Introductory: no previous knowledge is required

The objective of this tutorial is to showcase how Social Science researchers can take full advantage of the OpenMinTeD TDM platform for Detecting and Linking Variables in Scientific Publications.
 

Intended audience: Policy makers and Funders, Programmers, Researchers and Students, Text and Data miners
Level: Introductory: aware of

This tutorial describes how to use TDM to build a recommender system for scholarly resources and how to use the OpenMinTeD platform to build and annotate corpora for this purpose.

Intended audience: Industry and Business, Policy makers and Funders, Project Managers, Publishers, Researchers and Students, Text and Data miners
Level: Introductory: aware of

This tutorial is made up of two parts:
Part I is the OpenMinTeD guide to creating a workflow that reads from a data source and annotates articles related to chronic liver diseases.
Part II is the guide to using the tools to build a graph that displays the progression between liver conditions.
 

Liver diseases and progression annotation

Intended audience: Text and Data miners, Industry and Business, Programmers, Researchers and Students
Level: Introductory: no previous knowledge is required

This tutorial focuses on using the Docker image to annotate raw text files. It shows how to install Docker on a machine, how to pull the UPFMT image, and how to pass the input/output parameters and instantiate the container. The user only has to provide an input folder containing any number of files to be annotated (.txt or .xmi) and an output folder where the annotated files (.conllu and .xmi) will be generated. No programming ...
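As a rough illustration of the input/output folder idea, the sketch below assembles a `docker run` command that mounts a host input folder and a host output folder into a container. The image name `example/upfmt:latest` and the container paths `/input` and `/output` are hypothetical placeholders, not taken from the tutorial; the real values should be read from the tutorial itself.

```python
from pathlib import Path

# Hypothetical placeholders: the real image name and the paths expected
# inside the container are documented in the tutorial, not here.
IMAGE = "example/upfmt:latest"
CONTAINER_INPUT = "/input"
CONTAINER_OUTPUT = "/output"


def build_docker_command(input_dir, output_dir, image=IMAGE):
    """Return the argument list for a `docker run` call that bind-mounts
    the host input and output folders into the container."""
    in_path = Path(input_dir).resolve()
    out_path = Path(output_dir).resolve()
    return [
        "docker", "run", "--rm",                   # remove the container on exit
        "-v", f"{in_path}:{CONTAINER_INPUT}",      # files to be annotated
        "-v", f"{out_path}:{CONTAINER_OUTPUT}",    # where annotated files land
        image,
    ]


# Print the command instead of running it, so it can be inspected first.
print(" ".join(build_docker_command("texts", "annotated")))
```

The returned list can be passed to `subprocess.run` once the placeholders have been replaced with the values given in the tutorial.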

Intended audience: Text and Data miners, Industry and Business, Programmers, Researchers and Students
Level: Introductory: aware of

This tutorial focuses on running the code directly on a host machine. It gives access to the Python code and the models, and shows the user how to run the code from the console. It also covers the steps needed to train new models, along with other pointers. The user will be able to download the code, run a tokenizer/tagger/parser on a set of files (either .xmi or .txt) and obtain the annotated files as output (CoNLL-U format).
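For readers unfamiliar with the output format, here is a minimal sketch of reading CoNLL-U text in plain Python. It relies only on the standard CoNLL-U layout (ten tab-separated columns per token, `#` comment lines, blank lines between sentences). The function name `read_conllu` and the sample sentence are illustrative and are not part of the tutorial's code.

```python
# The ten standard CoNLL-U columns, in order.
CONLLU_FIELDS = ["id", "form", "lemma", "upos", "xpos",
                 "feats", "head", "deprel", "deps", "misc"]


def read_conllu(text):
    """Split CoNLL-U text into sentences, each a list of token dicts."""
    sentences, current = [], []
    for line in text.splitlines():
        if not line.strip():          # blank line closes the current sentence
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):      # sentence-level comment, skipped here
            continue
        cols = line.split("\t")
        if len(cols) != len(CONLLU_FIELDS):
            raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
        current.append(dict(zip(CONLLU_FIELDS, cols)))
    if current:                       # last sentence without a trailing blank line
        sentences.append(current)
    return sentences


# Illustrative sample, written by hand for this sketch.
SAMPLE = "\n".join([
    "# text = Mining works.",
    "\t".join(["1", "Mining", "mining", "NOUN", "_", "_", "2", "nsubj", "_", "_"]),
    "\t".join(["2", "works", "work", "VERB", "_", "_", "0", "root", "_", "SpaceAfter=No"]),
    "\t".join(["3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"]),
    "",
])

for token in read_conllu(SAMPLE)[0]:
    print(token["form"], token["upos"], token["head"], token["deprel"])
```

Every value is kept as a string, so multiword-token IDs such as `1-2` pass through unchanged; convert `id` and `head` to integers only where the data is known to need it.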

Intended audience: Industry and Business, Policy makers and Funders, Project Managers, Publishers, Researchers and Students, Text and Data miners
Level: Introductory: no previous knowledge is required

The objective of this tutorial is to showcase how the Neuroscience use case available at the OpenMinTeD platform can facilitate the curation of neuroscience entities from the literature with the aim of supporting ongoing curation efforts in the Blue Brain Project (BBP), at the École Polytechnique Fédérale de Lausanne (EPFL).
 

Intended audience: Industry and Business, Policy makers and Funders, Project Managers, Publishers, Researchers and Students, Text and Data miners
Level: Introductory: no previous knowledge is required

The objective of this tutorial is to showcase the use case of “Extract Metabolites and their Properties and Modes of Actions”. The tutorial describes step-by-step how to create a workflow in the OpenMinTeD platform that can read input from a source and annotate entities useful for the curation of the ChEBI database.
 

Intended audience: Programmers
Level: Intermediate: able to

In this tutorial we describe a way to wrap a command-line tool (NLProt) as a web service and share it through the OpenMinTeD platform.
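The general idea of wrapping a command-line tool as a web service can be sketched with the Python standard library alone. This is a generic illustration under stated assumptions, not the approach the tutorial takes: the stand-in command below simply upper-cases its input, because NLProt's actual command-line options are not given here, and the names `run_tool`, `ToolHandler`, and `serve` are hypothetical. The platform may also expect a specific service interface that this sketch does not cover.

```python
import subprocess
import sys
from http.server import BaseHTTPRequestHandler, HTTPServer

# Stand-in for the real tool: a tiny command that upper-cases stdin.
# Replace it with the actual command line of the tool being wrapped.
TOOL_COMMAND = [sys.executable, "-c",
                "import sys; sys.stdout.write(sys.stdin.read().upper())"]


def run_tool(text, command=TOOL_COMMAND):
    """Feed text to the command on stdin and return what it writes to stdout."""
    result = subprocess.run(command, input=text, capture_output=True,
                            text=True, check=True)
    return result.stdout


class ToolHandler(BaseHTTPRequestHandler):
    """Answer each POST request with the tool's output for the request body."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        body = self.rfile.read(length).decode("utf-8")
        try:
            output, status = run_tool(body), 200
        except subprocess.CalledProcessError as err:
            output, status = err.stderr or "tool failed", 500
        payload = output.encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)


def serve(port=8080):
    """Start the service on localhost; blocks until interrupted."""
    HTTPServer(("127.0.0.1", port), ToolHandler).serve_forever()
```

Calling `serve()` starts the service on the local machine, after which a POST request carrying plain text in its body returns the tool's output. Keeping `run_tool` separate from the HTTP handler lets the wrapped command be tested without starting a server.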

Intended audience: Industry and Business, Policy makers and Funders, Project Managers, Publishers, Researchers and Students, Text and Data miners
Level: Introductory: no previous knowledge is required

The objective of this tutorial is to showcase how researchers in Agriculture, and more specifically Viticulture, can apply the use case application to a specific topic by using the components and workflows available on the OpenMinTeD platform.