Text Categorisation/document Classification

Text categorisation Is a problem of organising objects, such as documents or named entities, into classes or hierarchies.

Navigation

Parent Topic

Resources

Intended audience Policy makers and Funders, Project Managers, Publishers, Researchers and Students, Text and Data miners
Level: Introductory: aware of

This tutorial will users through the simple process of creating a workflow in the OpenMinTeD platform that allows them to perform content-based document classification on scientific publications, based on the arXiv, MeSH, ACM and DCC taxonomies.
 

Intended audience Industry and Business, Programmers, Researchers and Students, Text and Data miners
Level: Introductory: no previous knowledge is required

In this course we will explain how IXA pipes have been integrated as Docker images in the OpenMinTeD (OMTD) platform and how can they be used (http://ixa2.si.ehu.es/ixa-pipes/).

The aim of IXA pipes is to provide a modular set of ready to use Natural Language Processing (NLP) tools. IXA pipes uses the same approach across NLP tasks in order to create robust processors both across domains and languages.
 

&nbs...

Intended audience Programmers, Researchers and Students, Text and Data miners
Level: Intermediate: able to

In the current context of scientific information overload in which new knowledge is created at a rapid pace, we propose to develop text summarization services for automatically identifying the most important information of a research article. The work will be based on an adaptation of our current scientific text mining and summarization technology at our  LaSTUS/TALN lab. The summarization system will apply a natural language processin...

By  Fabrizio Celli, Johannes Keizer, Yves Jaques, Stasinos Konstantopoulos, Dušan Vudragović