Professional Documents
Culture Documents
Integrated Learners
This page lists the learning methods already integrated in mlr (http://www.rdocumentation.org/packages/mlr/).
Columns Num., Fac., Ord., NAs, and Weights indicate if a method can cope with numerical, factor, and ordered factor predictors, if it can deal with missing values in a
meaningful way (other than simply removing observations with missing values) and if observation weights are supported.
Column Props shows further properties of the learning methods specific to the type of learning task. See also RLearner
(http://www.rdocumentation.org/packages/mlr/functions/RLearner.html) for details.
Classification (84)
For classification the following additional learner properties are relevant and shown in column Props:
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
ada Boosting
classif.bartMachine bartMachine X X X prob use_missing_data has been
bartmachine (http://www.rdocumentation.org/packages/bartMachine/) twoclass set to TRUE by default to allow
missing data support.
Bayesian Additive Regression
Trees
classif.bdk kohonen (http://www.rdocumentation.org/packages/kohonen/) X prob keep.data is set to FALSE to
bdk twoclass reduce memory requirements.
multiclass
Bi-Directional Kohonen map
classif.binomial stats (http://www.rdocumentation.org/packages/stats/) X X X prob Delegates to glm with freely
binomial twoclass choosable binomial link function
via learner parameter link .
Binomial Regression We set 'model' to FALSE by
default to save memory.
classif.blackboost mboost (http://www.rdocumentation.org/packages/mboost/) X X X X prob See ?ctree_control for
blackboost party (http://www.rdocumentation.org/packages/party/) twoclass possible breakage for nominal
features with missingness.
Gradient Boosting With family has been set to
Regression Trees Binomial by default. For
'family' 'AUC' and 'AdaExp'
probabilities cannot be
predcited.
classif.boosting adabag (http://www.rdocumentation.org/packages/adabag/) X X X prob xval has been set to 0 by
adabag rpart (http://www.rdocumentation.org/packages/rpart/) twoclass default for speed.
multiclass
Adabag Boosting featimp
classif.bst bst (http://www.rdocumentation.org/packages/bst/) X twoclass Renamed parameter learner
bst to Learner due to nameclash
with setHyperPars . Default
Gradient Boosting changes: Learner = "ls" ,
xval = 0 , and maxdepth =
1 .
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 1/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 2/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
Geometric Predictive
Discriminant Analysis
classif.glmboost mboost (http://www.rdocumentation.org/packages/mboost/) X X X prob family has been set to
glmboost twoclass Binomial by default. For
'family' 'AUC' and 'AdaExp'
Boosting for GLMs probabilities cannot be
predcited.
classif.glmnet glmnet (http://www.rdocumentation.org/packages/glmnet/) X X X prob The family parameter is set to
glmnet twoclass binomial for two-class
multiclass problems and to multinomial
GLM with Lasso or Elasticnet otherwise. Factors automatically
Regularization get converted to dummy
columns, ordered factors to
integer. Parameter s (value of
the regularization parameter
used for predictions) is set to
0.1 by default, but needs to
be tuned by the user. glmnet
uses a global control object for
its parameters. mlr resets all
control parameters to their
defaults before setting the
specified parameters and after
training. If you are setting
glmnet.control parameters
through glmnet.control, you
need to save and re-set them
after running the glmnet learner.
classif.h2o.deeplearning h2o (http://www.rdocumentation.org/packages/h2o/) X X X prob
h2o.dl twoclass
multiclass
h2o.deeplearning
classif.h2o.gbm h2o (http://www.rdocumentation.org/packages/h2o/) X X prob 'distribution' is set automatically
h2o.gbm twoclass to 'gaussian'.
multiclass
h2o.gbm
classif.h2o.glm h2o (http://www.rdocumentation.org/packages/h2o/) X X X prob 'family' is always set to 'binomial'
h2o.glm twoclass to get a binary classifier.
h2o.glm
classif.h2o.randomForest h2o (http://www.rdocumentation.org/packages/h2o/) X X prob
h2o.rf twoclass
multiclass
h2o.randomForest
classif.hdrda sparsediscrim X prob
hdrda (http://www.rdocumentation.org/packages/sparsediscrim/) twoclass
High-Dimensional Regularized
Discriminant Analysis
classif.IBk RWeka (http://www.rdocumentation.org/packages/RWeka/) X X prob
ibk twoclass
multiclass
k-Nearest Neighbours
classif.J48 RWeka (http://www.rdocumentation.org/packages/RWeka/) X X X prob NAs are directly passed to
j48 twoclass WEKA with na.action =
multiclass na.pass .
J48 Decision Trees
classif.JRip RWeka (http://www.rdocumentation.org/packages/RWeka/) X X X prob NAs are directly passed to
jrip twoclass WEKA with na.action =
multiclass na.pass .
Propositional Rule Learner
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 3/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
k-Nearest Neighbor
classif.ksvm kernlab (http://www.rdocumentation.org/packages/kernlab/) X X prob Kernel parameters have to be
ksvm twoclass passed directly and not by using
multiclass the kpar list in ksvm . Note
Support Vector Machines class.weights that fit has been set to
FALSE by default for speed.
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 4/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
Node Harvest
classif.OneR RWeka (http://www.rdocumentation.org/packages/RWeka/) X X X prob NAs are directly passed to
oner twoclass WEKA with na.action =
multiclass na.pass .
1-R Classifier
classif.pamr pamr (http://www.rdocumentation.org/packages/pamr/) X prob Threshold for prediction
pamr twoclass ( threshold.predict ) has
been set to 1 by default.
Nearest shrunken centroid
classif.PART RWeka (http://www.rdocumentation.org/packages/RWeka/) X X X prob NAs are directly passed to
part twoclass WEKA with na.action =
multiclass na.pass .
PART Decision Lists
classif.penalized.fusedlasso penalized X X prob trace=FALSE was set by default
fusedlasso (http://www.rdocumentation.org/packages/penalized/) twoclass to disable logging output.
lambda1 and lambda2 have
Logistic Fused Lasso Regression been set to 1 by default, as
fusedlasso needs both
penalizations > 0.
classif.penalized.lasso penalized X X X prob trace=FALSE was set by default
lasso (http://www.rdocumentation.org/packages/penalized/) twoclass to disable logging output.
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 5/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
Random k-Nearest-Neighbors
classif.rotationForest rotationForest X X X prob
rotationForest (http://www.rdocumentation.org/packages/rotationForest/) twoclass
Rotation Forest
classif.rpart rpart (http://www.rdocumentation.org/packages/rpart/) X X X X X prob xval has been set to 0 by
rpart twoclass default for speed.
multiclass
Decision Tree featimp
classif.RRF RRF (http://www.rdocumentation.org/packages/RRF/) X X prob
RRF twoclass
multiclass
Regularized Random Forests featimp
classif.rrlda rrlda (http://www.rdocumentation.org/packages/rrlda/) X twoclass
rrlda multiclass
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 6/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
Regression (64)
Additional learner properties:
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
Bayesian CART
regr.bdk kohonen (http://www.rdocumentation.org/packages/kohonen/) X keep.data is set to FALSE to
bdk reduce memory requirements.
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 7/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
Regression Splines
regr.ctree party (http://www.rdocumentation.org/packages/party/) X X X X X See ?ctree_control for
ctree possible breakage for nominal
features with missingness.
Conditional Inference Trees
regr.cubist Cubist (http://www.rdocumentation.org/packages/Cubist/) X X X
cubist
Cubist
regr.cvglmnet glmnet (http://www.rdocumentation.org/packages/glmnet/) X X X Factors automatically get
cvglmnet converted to dummy columns,
ordered factors to integer. glmnet
GLM with Lasso or Elasticnet uses a global control object for its
Regularization (Cross Validated parameters. mlr resets all control
Lambda) parameters to their defaults before
setting the specified parameters
and after training. If you are setting
glmnet.control parameters through
glmnet.control, you need to save
and re-set them after running the
glmnet learner.
regr.earth earth (http://www.rdocumentation.org/packages/earth/) X X
earth
Multivariate Adaptive
Regression Splines
regr.elmNN elmNN (http://www.rdocumentation.org/packages/elmNN/) X nhid has been set to 1 and
elmNN actfun has been set to "sig"
by default.
Extreme Learning Machine for
Single Hidden Layer
Feedforward Neural Networks
regr.evtree evtree (http://www.rdocumentation.org/packages/evtree/) X X X X pmutatemajor ,
evtree pmutateminor , pcrossover ,
psplit , and pprune , are
Evolutionary learning of globally scaled internally to sum to 100.
optimal trees
regr.extraTrees extraTrees X X
extraTrees (http://www.rdocumentation.org/packages/extraTrees/)
Featureless regression
regr.fnn FNN (http://www.rdocumentation.org/packages/FNN/) X
fnn
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 8/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
h2o.deeplearning
regr.h2o.gbm h2o (http://www.rdocumentation.org/packages/h2o/) X X 'distribution' is set automatically to
h2o.gbm 'gaussian'.
h2o.gbm
regr.h2o.glm h2o (http://www.rdocumentation.org/packages/h2o/) X X X 'family' is always set to 'gaussian'.
h2o.glm
h2o.glm
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 9/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
h2o.randomForest
regr.IBk RWeka (http://www.rdocumentation.org/packages/RWeka/) X X
ibk
K-Nearest Neighbours
regr.kknn kknn (http://www.rdocumentation.org/packages/kknn/) X X
kknn
K-Nearest-Neighbor regression
regr.km DiceKriging X se In predict, we currently always use
km (http://www.rdocumentation.org/packages/DiceKriging/) type = "SK" . The extra
parameter jitter (default is
Kriging FALSE ) enables adding a very
small jitter (order 1e-12) to the x-
values before prediction, as
predict.km reproduces the
exact y-values of the training data
points, when you pass them in,
even if the nugget effect is turned
on. We further introduced
nugget.stability which sets
the nugget to
nugget.stability * var(y)
before each training to improve
numerical stability. We recommend
a setting of 10^-8
regr.ksvm kernlab (http://www.rdocumentation.org/packages/kernlab/) X X Kernel parameters have to be
ksvm passed directly and not by using
the kpar list in ksvm . Note that
Support Vector Machines fit has been set to FALSE by
default for speed.
regr.laGP laGP (http://www.rdocumentation.org/packages/laGP/) X se
laGP
Multivariate Adaptive
Regression Splines
regr.mob party (http://www.rdocumentation.org/packages/party/) X X X
mob
Model-based Recursive
Partitioning Yielding a Tree with
Fitted Models Associated with
each Terminal Node
regr.nnet nnet (http://www.rdocumentation.org/packages/nnet/) X X X size has been set to 3 by
nnet default.
Neural Network
regr.nodeHarvest nodeHarvest X X
nodeHarvest (http://www.rdocumentation.org/packages/nodeHarvest/)
Node Harvest
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 10/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
Principal Component
Regression
regr.penalized.fusedlasso penalized X X trace=FALSE was set by default to
fusedlasso (http://www.rdocumentation.org/packages/penalized/) disable logging output. lambda1
and lambda2 have been set to 1
Fused Lasso Regression by default, as fusedlasso needs
both penalizations > 0.
regr.penalized.lasso penalized X X trace=FALSE was set by default to
lasso (http://www.rdocumentation.org/packages/penalized/) disable logging output.
Lasso Regression
regr.penalized.ridge penalized X X trace=FALSE was set by default to
ridge (http://www.rdocumentation.org/packages/penalized/) disable logging output.
Ridge Regression
regr.plsr pls (http://www.rdocumentation.org/packages/pls/) X X
plsr
Random k-Nearest-Neighbors
regr.rpart rpart (http://www.rdocumentation.org/packages/rpart/) X X X X X featimp xval has been set to 0 by
rpart default for speed.
Decision Tree
regr.RRF RRF (http://www.rdocumentation.org/packages/RRF/) X X X featimp
RRF
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 11/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 12/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
LassoRegression
surv.penalized.ridge penalized X X X rcens trace=FALSE was set by default
ridge (http://www.rdocumentation.org/packages/penalized/) to disable logging output.
Ridge Regression
surv.randomForestSRC survival (http://www.rdocumentation.org/packages/survival/) X X X X X featimp na.action has been set to
rfsrc randomForestSRC oobpreds "na.impute" by default to
(http://www.rdocumentation.org/packages/randomForestSRC/) rcens allow missing data support.
Random Forest
surv.ranger ranger (http://www.rdocumentation.org/packages/ranger/) X X X featimp By default, internal parallelization
ranger rcens is switched off ( num.threads =
1 ), verbose output is
Random Forests disabled,
respect.unordered.factors
is set to TRUE . All settings are
changeable.
surv.rpart rpart (http://www.rdocumentation.org/packages/rpart/) X X X X X featimp xval has been set to 0 by
rpart rcens default for speed.
Survival Tree
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
Expectation-Maximization
Clustering
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 13/15
20/07/2017 Integrated Learners - mlr tutorial
Class / Short Name / Name Packages Num. Fac. Ord. NAs Weights Props Note
cluster.FarthestFirst RWeka X
farthestfirst (http://www.rdocumentation.org/packages/RWeka/)
FarthestFirst Clustering
Algorithm
cluster.kkmeans kernlab X centers has been set to 2L by default.
kkmeans (http://www.rdocumentation.org/packages/kernlab/) The nearest center in kernel distance
determines cluster assignment of new data
Kernel K-Means points. Kernel parameters have to be passed
directly and not by using the kpar list in
kkmeans
K-Means Clustering
cluster.XMeans RWeka X You may have to install the XMeans Weka
xmeans (http://www.rdocumentation.org/packages/RWeka/) package: WPM('install-package',
'XMeans') .
XMeans (k-means with
automatic determination of k)
Cost-sensitive classification
For ordinary misclassification costs you can use all the standard classification methods listed above.
For example-dependent costs there are several ways to generate cost-sensitive learners from ordinary regression and classification learners. See section cost-sensitive
classification (../cost_sensitive_classif/index.html) and the documentation of makeCostSensClassifWrapper
(http://www.rdocumentation.org/packages/mlr/functions/makeCostSensClassifWrapper.html), makeCostSensRegrWrapper
(http://www.rdocumentation.org/packages/mlr/functions/makeCostSensRegrWrapper.html) and makeCostSensWeightedPairsWrapper
(http://www.rdocumentation.org/packages/mlr/functions/makeCostSensWeightedPairsWrapper.html) for details.
Random ferns
Moreover, you can use the binary relevance method to apply ordinary classification learners to the multilabel problem. See the documentation of function
makeMultilabelBinaryRelevanceWrapper (http://www.rdocumentation.org/packages/mlr/functions/makeMultilabelBinaryRelevanceWrapper.html) and the tutorial section on
multilabel classification (../multilabel/index.html) for details.
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 14/15
20/07/2017 Integrated Learners - mlr tutorial
https://mlr-org.github.io/mlr-tutorial/release/html/integrated_learners/index.html 15/15