The following points should be covered in the documentation:
There are three sampling methods. We need to explain each of them and their optimization strategies.
random_matrixshould be set to a different value in each run, such as,
seed=as.integer(Sys.time()), unless the user wants reproducible results. Results reproducibility is possible in two ways.
- At a specific step.
- An entire classification run.
In case it is not possible to find models, increasing the number of
init.samplesis not always the solution. The user should also try to re-sample so a new set of sample points is picked.
randomForestit is possible to set the number of trees, which should be 1/3 of the total number of predictors. For small values, below 100, the value should be odd so the models can be used by different predict functions, but also reproducible between runs. Check related issues for more information and to add support material.
Information to cover issue #57
Add information about issue #61 (closed)
- Describe what are the optimizations for the