Repository for the AdaptiveRandomForest (also known as ARF) algorithm implemented in MOA 2016-04
The Adaptive Random Forest (ARF) algorithm was originally planned to be released as an extension to MOA. Until then, this repository provided access to its source code and to an executable MOA-2016-04 jar, and you may still use it for that purpose.
The Adaptive Random Forest algorithm has been part of the MOA main code since July 2017. The code in this repository has also been updated to make it clearer and to keep it aligned with the MOA version. The main change is that ARF now uses the ChangeDetector abstract class, which gives more flexibility when selecting the drift and warning detection algorithms.
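As an illustration of that flexibility, the drift and warning detectors can be swapped when configuring the learner. The line below is only a sketch: it assumes the detectors are exposed through the -x (drift) and -p (warning) options, as in the MOA version of ARF, and the ADWINChangeDetector delta values are arbitrary examples, so check the learner's CLI help for the exact option names in your build.

meta.AdaptiveRandomForest -s 100 -x (ADWINChangeDetector -a 1.0E-5) -p (ADWINChangeDetector -a 1.0E-4)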
For more information about MOA, check out the official website: http://moa.cms.waikato.ac.nz
To cite ARF in a publication, please cite the following paper:
Heitor Murilo Gomes, Albert Bifet, Jesse Read, Jean Paul Barddal, Fabricio Enembreck, Bernhard Pfahringer, Geoff Holmes, Talel Abdessalem. Adaptive random forests for evolving data stream classification. In Machine Learning, DOI: 10.1007/s10994-017-5642-8, Springer, 2017.
If you are here, then you are probably looking for the implementations used in the original AdaptiveRandomForest paper, which are:
- AdaptiveRandomForest.java: The ensemble learner AdaptiveRandomForest
- ARFHoeffdingTree.java: The base tree learner used by AdaptiveRandomForest
- EvaluatePrequentialDelayed.java: The evaluation task that includes a k delay (see the sample command after this list)
- EvaluatePrequentialDelayedCV.java: Similar to EvaluatePrequentialDelayed.java, but it also simulates cross-validation.
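A minimal configuration for the delayed (non cross-validated) evaluation might look like the line below. This is only a sketch: it assumes the delay length is exposed through a -k option (the "k delay" mentioned above), and the delay of 1000 and sample frequency of 10000 are arbitrary example values, so check the task's CLI help for the exact option names before running it.

EvaluatePrequentialDelayed -l (meta.AdaptiveRandomForest -s 100) -s (ArffFileStream -f elecNormNew.arff) -e BasicClassificationPerformanceEvaluator -k 1000 -f 10000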
To test AdaptiveRandomForest in either the delayed or the immediate setting, execute the latest MOA jar. You can copy and paste the following command into the interface (right-click the configuration text field and select "Enter configuration"). Sample command:
EvaluatePrequentialDelayedCV -l (meta.AdaptiveRandomForest -s 100) -s (ArffFileStream -f elecNormNew.arff) -e BasicClassificationPerformanceEvaluator -f 100000000
Explanation: this command executes a 10-fold cross-validation delayed prequential evaluation of ARF with 100 classifiers (-s 100), using m = sqrt(total features) + 1 features per split (the default option; see parameter -o for others) on the ELEC dataset (-f elecNormNew.arff). Make sure to extract the elecNormNew.arff dataset and set -f to its location before executing the command.
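For the immediate setting, where each label is assumed to arrive right after the instance is classified, a similar configuration can be used with MOA's standard EvaluatePrequential task. The line below is a sketch under that assumption; the sample frequency (-f 10000) is an arbitrary example value.

EvaluatePrequential -l (meta.AdaptiveRandomForest -s 100) -s (ArffFileStream -f elecNormNew.arff) -e BasicClassificationPerformanceEvaluator -f 10000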
The real datasets are compressed and available in the root directory. The synthetic stream configurations are available in SYNTHETIC_STREAMS.txt, also in the root directory.
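To run any of these tasks from a terminal instead of the GUI, the same configuration string can be passed to MOA's moa.DoTask runner. The command below is a sketch; the jar file name depends on the build you are using.

java -cp moa-2016-04.jar moa.DoTask "EvaluatePrequentialDelayedCV -l (meta.AdaptiveRandomForest -s 100) -s (ArffFileStream -f elecNormNew.arff) -e BasicClassificationPerformanceEvaluator -f 100000000"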