Materials

...

ICT - Inductive Classification Tool

The Inductive Classification Tool (ICT) was developed in Java. It generates results by running traditional inductive algorithms, under different parameter settings, on datasets represented in ARFF format.

...

TPT - Text Preprocessing Tool

This is a Java tool that transforms text files into a document-term matrix.
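As an illustration of what a document-term matrix is, here is a minimal Python sketch. It is not TPT's code (TPT is written in Java). The function name `document_term_matrix`, the tokenization rule, and the sample documents are assumptions made for this example only.

```python
import re
from collections import Counter


def document_term_matrix(documents):
    """Return (vocabulary, matrix); matrix[i][j] counts term j in document i."""
    # Assumed tokenization: lowercase, keep runs of ASCII letters only.
    # A real preprocessing tool would typically also handle stopwords,
    # stemming, and term weighting.
    tokenized = [re.findall(r"[a-z]+", doc.lower()) for doc in documents]

    # One column per distinct term, in alphabetical order.
    vocabulary = sorted({term for tokens in tokenized for term in tokens})

    matrix = []
    for tokens in tokenized:
        counts = Counter(tokens)
        matrix.append([counts[term] for term in vocabulary])
    return vocabulary, matrix


docs = ["data mining and text mining", "text preprocessing"]
vocab, dtm = document_term_matrix(docs)
print(vocab)
for row in dtm:
    print(row)
```

Each row corresponds to one document and each column to one term, so the matrix can be passed to any algorithm that expects attribute-value data.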

...

SKET - Statistical Keyword Extraction Tool

This tool extracts keywords from single documents using statistical methods.

...

FEATuRE - Features gEnerator based on AssociaTion RulEs

The steps required to generate the bag-of-related-words are implemented in this tool. There are also functionalities to analyze the generated bag-of-related-words.

...

Mldatagen - A multi-label dataset generator

This framework, which is described in an ICMC-USP technical report, can generate synthetic multi-label datasets using two strategies: hyperspheres or hypercubes. For each label in a dataset, these strategies randomly generate a geometric shape (hypersphere or hypercube), which is then populated with randomly generated points (instances or examples). Afterwards, each instance is labeled according to the shapes it belongs to, which defines the instance's multi-label.
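The hypersphere strategy described above can be sketched roughly as follows. This is a simplified Python illustration under stated assumptions, not Mldatagen's actual implementation: the function names, the ranges used for centers and radii, and the rejection-sampling step are all hypothetical choices made for this example.

```python
import math
import random


def random_hypersphere(dim, rng):
    # Assumed ranges for center and radius; the real tool's ranges may differ.
    center = [rng.uniform(-0.5, 0.5) for _ in range(dim)]
    radius = rng.uniform(0.3, 0.6)
    return center, radius


def point_in_sphere(center, radius, rng):
    # Rejection sampling: draw from the bounding cube until the point falls
    # inside the sphere. Fine for small dimensions, slow for large ones.
    while True:
        point = [c + rng.uniform(-radius, radius) for c in center]
        if math.dist(point, center) <= radius:
            return point


def generate(num_labels, dim, points_per_label, seed=0):
    rng = random.Random(seed)
    # One randomly generated hypersphere per label.
    spheres = [random_hypersphere(dim, rng) for _ in range(num_labels)]
    dataset = []
    for center, radius in spheres:
        for _ in range(points_per_label):
            point = point_in_sphere(center, radius, rng)
            # The multi-label marks every sphere that contains the point.
            labels = [int(math.dist(point, c) <= r) for c, r in spheres]
            dataset.append((point, labels))
    return spheres, dataset


spheres, dataset = generate(num_labels=3, dim=2, points_per_label=5)
for point, labels in dataset[:3]:
    print([round(x, 3) for x in point], labels)
```

Because spheres may overlap, a point generated inside one sphere can also fall inside others, which is what produces instances with more than one label.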

...

Gráficos Labic

Gráficos Labic is a very simple tool that helps the user quickly choose from a set of graphs commonly used in the pre-processing phase of the Data Mining process, and retrieve the corresponding R code for the chosen graph.

...

TaXEm - a tool for helping evaluate domain topics

TaXEm (Taxonomia em XML da Embrapa) is a fast and efficient tool to organize, retrieve, browse and extract knowledge from textual documents. To organize domain-specific information, TaXEm builds a taxonomy that can be evaluated automatically or semi-automatically. This evaluation can be carried out using objective measures or a subjective analysis based on the domain specialist's judgment.

...

PRETEXT - Text preprocessing

PRETEXT is a computational tool implemented in Perl using the object-oriented paradigm, which automatically performs most of the Text Mining pre-processing tasks on a collection of documents. The documents may be written in any of three languages: Portuguese, Spanish and English. In addition, the tool includes facilities to reduce the dimensionality of any pre-processed text data set by using Zipf’s law and Luhn cut-offs.
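To illustrate the idea behind Luhn cut-offs, the Python sketch below keeps only terms whose frequency lies between a lower and an upper threshold. Very rare and very frequent terms are dropped, which reduces the number of attributes. This is not PRETEXT's code (PRETEXT is written in Perl). The function name `luhn_filter`, the sample frequencies, and the threshold values are hypothetical.

```python
def luhn_filter(term_frequencies, lower, upper):
    """Keep terms whose frequency f satisfies lower <= f <= upper."""
    return {
        term: freq
        for term, freq in term_frequencies.items()
        if lower <= freq <= upper
    }


# Hypothetical term frequencies for a small collection.
frequencies = {"the": 120, "mining": 14, "text": 9, "zipf": 1}

# Assumed cut-offs: drop terms seen fewer than 2 or more than 50 times.
kept = luhn_filter(frequencies, lower=2, upper=50)
print(kept)
```

In practice the cut-off values are chosen by inspecting the ranked term-frequency curve of the collection, so the thresholds above are placeholders only.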

...

IESystem - Information Extraction System

The IESystem extracts metadata from scientific articles, even when they come from different sources or are written in different languages. The metadata extraction process is based on models that describe the relative positions or content indicators of the content to be extracted. A set of functionalities for pre-processing and for assisting the user in constructing models is also available.
