Hex, Bugs and More Physics | Emre S. Tasci

a blog about physics, computation, computational physics and materials…

DataMining Functions Library for Mathematica

November 19, 2007 Posted by Emre S. Tasci

Mathematica Library for DataMining
Functions Library Intended To Be Used In DataMining Procedures
11/2007
Emre S. Tasci
S&C / MSE / 3mE / TUDelftFunctions List (EF: Required External Functions)

  • Int DMBooleEST[statement_]
    Converts True/False => 1/0
    EF – NONE
  • Bool DMBooleEST[statement_]
    Converts 1/0 => True/False
    EF – NONE
  • Int DMCondEntropyEST[MatrixIN_,var2_,var1_, posVal1_, posVal2_,probVar1_]
    Calculates the Conditional Entropy H(var2|var1)
    EF – MSelectEST, DMProbeEST, DMEntropyEST
  • Int DMEntropyEST[problist_]
    Calculates the entropy given the probability list “problist”
    EF – NONE
  • Matrix DMGenMatrixEST[dim_,file_]
    Generates a Matrix of [dim,dim] dimension and writes this matrix to the “file” file. Returns the matrix generated.
    EF – DMBooleEST, DMUnitizeEST
  • List DMInfGainEST[MatrixIN_,varno_]
    Calculates the Information Gain related to the “varno” variable wrt to the remaining variables.
    EF -Â MDiffElementEST , MSwapEST, DMCondEntropyEST, DMEntropyEST, DMProbeEST
  • Vector DMProbeEST[MatrixIN_,varno_,varlist_]
    Calculates the probabilities of the variables’ possible outcomes.
    EF – NONE;

Important!!! ————————————

Assumed form of the MatrixIN: ;
Formula: $$ a_1   a_2 ...  a_n$$
Formula: $$ b_1   b_2 ...  b_n$$
Formula: $$ c_1   c_2 ...  c_n$$
(i.e., values for the same variable are stored in ROWS, NOTÂ in COLUMNS!)

One Response to “DataMining Functions Library for Mathematica”

  1. A Mathematica Example for Information Gain at Hex, Bugs and More Physics Says:

    […] will be needing the two libraries and you can download them from the related entries for Matrix and DataMining […]

Leave a Reply