
Revision 12 (François Rioult, 23/06/2010 21:07) → Revision 13/37 (François Rioult, 23/06/2010 22:08)

h1. Documentation 

 KDAriane is a set of operators for data mining and machine learning, together with a set of scenarios (supervised classification, missing value completion, strong emerging pattern mining, etc.). It uses Ariane as a graphical platform for designing data flows. 

 h2. Installation 

 KDAriane requires Ariane, which in turn requires Pandore, so the components must be installed in the following order: 
 * [[Prerequisite]]  
 * [[Pandore]] 
 * [[Ariane]] 
 * [[KDAriane]]  

 h2. Special operators for shell scripting 

 KDAriane comes with basic components for executing shell scripts. Which one to use depends on the number of parameters (p), inputs (i) and outputs (o) you need. The operators are named @"eval" + p + i + o@ and each one calls the .sh script of the same name. 
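
As an illustration of this naming scheme, here is a minimal sketch that builds an operator name from the three counts. The function @operator_name@ is hypothetical and not part of KDAriane. It assumes the counts are simply concatenated after @eval@, as the formula above suggests.

```shell
#!/bin/sh
# Hypothetical helper (not part of KDAriane): builds an operator name
# from the number of parameters (p), inputs (i) and outputs (o),
# assuming the three counts are concatenated after "eval".
operator_name() {
    p=$1; i=$2; o=$3
    printf 'eval%s%s%s\n' "$p" "$i" "$o"
}

# Example: one parameter, two inputs, one output.
operator_name 1 2 1   # prints "eval121"
```

Under this assumption, such an operator would call a script named @eval121.sh@.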

 When an operator is executed, Ariane launches the script associated with that operator (for example @script.sh@), passing the following arguments: 
 <pre> 
 script.sh parameter-1 parameter-2 ... parameter-p input-1 input-2 ... input-i output-1 output-2 ... output-o 
 </pre> 
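
The argument order above can be unpacked inside a script roughly as follows. This is only a sketch for an operator with one parameter, two inputs and one output. The counts, the function name @split_args@ and the file names in the example call are chosen for illustration and do not come from KDAriane.

```shell
#!/bin/sh
# Sketch: unpack the positional arguments in the order
# parameters, then inputs, then outputs.
# Assumed counts for illustration: 1 parameter, 2 inputs, 1 output.
split_args() {
    P=1; I=2; O=1
    # Refuse to run if the argument count does not match p + i + o.
    if [ "$#" -ne $((P + I + O)) ]; then
        echo "usage: script.sh parameter-1 input-1 input-2 output-1" >&2
        return 1
    fi
    param1=$1
    input1=$2
    input2=$3
    output1=$4
    echo "parameter: $param1"
    echo "inputs: $input1 $input2"
    echo "output: $output1"
}

# Example call with hypothetical values.
split_args 0.5 in1.csv in2.csv out.csv
```

In a real script, the body of the function would sit at top level and read @"$@"@ directly. Wrapping it in a function here only makes the example easy to try out.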

 In Ariane, every operator has a return value, even if it has no output. 

 The operators are divided into two categories:  
 * [[KDD operators]] are special components for calling Weka components, RapidMiner processes or MVminer binaries. 
 * [[Shell operators]] directly execute the commands entered in Ariane.  


 h2. Scenarios 

 KDAriane provides several examples of KDD scenarios built with Ariane: 
 * [[Data preparation]]: a first scenario for the binarization of CSV data. 
 * Pattern mining and complexity visualization. 
 * Supervised classification with association rules. 
 * Experiments on perturbing training and test files with Weka classifiers and RapidMiner processes.