Skip to main content

Creating Classification model

From a workbook, click Actions > ML Dashboard. The ML Dashboard page is displayed. From the Create Model section, click Classification and specify the following:

FeatureRandom Forest
Model TypeEnsemble tree-based model
Best Suited ForComplex, non-linear relationships; mixed data types
Relationship ModelingCaptures non-linear patterns via aggregated decision trees
Handling of InteractionsAutomatically captures feature interactions
InterpretabilityModerate – feature importances available, but overall model is a “black box”
Parameter TuningSeveral hyperparameters (n_estimators, max_depth, min_samples_split, etc.)
Data RequirementsWorks with high-dimensional, noisy data; tolerates outliers and missing values
Handling Imbalanced DataSupports class weights or balanced subsampling
Feature ImportanceBuilt-in via mean decrease impurity or permutation importance
Library Supportscikit-learn (RandomForestClassifier), XGBoost (RF variant), Spark MLlib
Prediction OutputHard labels and class probabilities
Deployment ReadinessScales well; fast inference per tree; model size grows with number of trees
External FeaturesEasy to add new features without changing core algorithm
Community & ResourcesExtensive tutorials, academic papers, and widespread industry adoption

Click Create and commence the model training process.