| Title: | Multi-Model Stacking Optimization and Continuous Regression Ensembles |
|---|---|
| Description: | Automated parallelized regression ensemble pipeline executing concurrent evaluations across 17 base machine learning architectures, 6 specialized stacking meta-learners, and 136 pairwise hybrid combinations. Implements dynamic performance-weighted stacking blends, internal data dictionary profiling metrics, iterative multicollinearity pruning via Variance Inflation Factors (VIF), and programmatic executive Quarto report generation. |
| Authors: | Russ Conte [aut, cre, cph] |
| Maintainer: | Russ Conte <[email protected]> |
| License: | MIT + file LICENSE |
| Version: | 1.2.1 |
| Built: | 2026-06-07 18:15:51 UTC |
| Source: | https://github.com/infinitecuriosity/numericensembles |
Concrete is the most important material in civil engineering. The concrete compressive strength is a highly nonlinear function of age and ingredients.
ConcreteConcrete
Concrete A data frame with 1030 rows and 9 columns:
quantitative – kg in a m3 mixture – Input Variable
quantitative – kg in a m3 mixture – Input Variable
quantitative – kg in a m3 mixture – Input Variable
quantitative – kg in a m3 mixture – Input Variable
quantitative – kg in a m3 mixture – Input Variable
quantitative – kg in a m3 mixture – Input Variable
quantitative – kg in a m3 mixture – Input Variable
Day (1~365) – Input Variable
quantitative – MPa – Output Variable
https://archive.ics.uci.edu/dataset/165/concrete+compressive+strength
Export Numeric Pipeline Outcomes and Graphs
ExportNumericResults(pipeline_object = NULL, export_directory = NULL)ExportNumericResults(pipeline_object = NULL, export_directory = NULL)
pipeline_object |
An object generated by the |
export_directory |
Character string path targeting file output folders. |
Invisible character string tracking file save directory destinations.
This dataset contains detailed information about insurance customers, including their age, sex, body mass index (BMI), number of children, smoking status and region. Having access to such valuable insights allows analysts to get a better view into customer behaviour and the factors that contribute to their insurance charges.
InsuranceInsurance
Insurance A data frame with 1338 rows and 7 columns Credit to Bob Wakefield
The age of the customer. (Integer)
The number of children the customer has. (Integer)
Whether or not the customer is a smoker. (Boolean)
The region the customer lives in. (String)
The insurance charges for the customer. (Float)
https://www.kaggle.com/datasets/thedevastator/prediction-of-insurance-charges-using-age-gender
Fires up a local instance of the interactive OLS vs. GLM tuning, diagnostic, and residual validation web dashboard system.
LaunchNumericApp()LaunchNumericApp()
Load Serialized Numeric Pipeline Environment from Disk File
load_pipeline(file_path)load_pipeline(file_path)
file_path |
Character string tracking destination directory file path. |
A re-hydrated numeric_pipeline object.
Core Performance Pipeline Engine for Continuous Data
Numeric( dataset = NULL, target_col = NULL, facet_col = "", color_col = "", stratify_col = "", palette_style = c("standard", "viridis", "modern"), config = NumericEnsemblesConfig(), verbose = TRUE )Numeric( dataset = NULL, target_col = NULL, facet_col = "", color_col = "", stratify_col = "", palette_style = c("standard", "viridis", "modern"), config = NumericEnsemblesConfig(), verbose = TRUE )
dataset |
A data.frame containing continuous target outputs and features. |
target_col |
Character string specifying the name of the target column. |
facet_col |
Character string specifying a column to facet EDA charts by. Default = "". |
color_col |
Character string specifying a column to color EDA charts by. Default = "". |
stratify_col |
Character string specifying a categorical column to anchor stratified sampling splits. Default = "". |
palette_style |
Character string choosing a color palette: "standard", "viridis", or "modern". |
config |
A pre-configured architecture parameter matrix block from |
verbose |
Logical console logging status indicator. Default TRUE. |
An integrated execution bundle data object of class type numeric_pipeline.
Configuration Parameters Matrix for Numeric Ensemble Pipeline
NumericEnsemblesConfig( cv_folds = 5, train_pct = 0.6, vif_threshold = 5, transform_steps = c("nzv", "medianImpute", "center", "scale", "YeoJohnson"), rf_grid = NULL, glmnet_grid = expand.grid(alpha = seq(0, 1, length = 5), lambda = seq(0.001, 0.2, length = 10)), svm_tune_length = 8, mars_tune_length = 5, pcr_tune_length = 10, tree_tune_length = 10 )NumericEnsemblesConfig( cv_folds = 5, train_pct = 0.6, vif_threshold = 5, transform_steps = c("nzv", "medianImpute", "center", "scale", "YeoJohnson"), rf_grid = NULL, glmnet_grid = expand.grid(alpha = seq(0, 1, length = 5), lambda = seq(0.001, 0.2, length = 10)), svm_tune_length = 8, mars_tune_length = 5, pcr_tune_length = 10, tree_tune_length = 10 )
cv_folds |
Integer. Number of cross-validation folds. Default 5. |
train_pct |
Decimal fraction between 0 and 1 for the training split. Default 0.60. |
vif_threshold |
Numeric. Maximum allowed Variance Inflation Factor. Default 5. |
transform_steps |
Character vector. Preprocessing transformations applied via caret. |
rf_grid |
Data frame or NULL. Explicit tuning grid for Random Forest. |
glmnet_grid |
Data frame. Hyperparameter grid for elastic net. |
svm_tune_length |
Integer. Total tuning resolution steps for svmRadial. Default 8. |
mars_tune_length |
Integer. Total tuning resolution steps for MARS. Default 5. |
pcr_tune_length |
Integer. Total tuning resolution steps for PCR. Default 10. |
tree_tune_length |
Integer. Total tuning resolution steps for decision trees. Default 10. |
A structured list containing isolated operational tuning parameters.
Run a Demonstration of the Numeric Ensembles Pipeline
NumericEnsemblesDemo()NumericEnsemblesDemo()
Fast-Execution Configuration Matrix
NumericEnsemblesFastConfig()NumericEnsemblesFastConfig()
A structured list containing optimized, fast-track hyperparameter settings.
Plot Numeric Pipeline Performance Metrics and Visual Diagnostics
## S3 method for class 'numeric_pipeline' plot(x, pace_output = TRUE, ...)## S3 method for class 'numeric_pipeline' plot(x, pace_output = TRUE, ...)
x |
A numeric_pipeline object. |
pace_output |
Logical. If TRUE and session is interactive, paces chart pages. Default = TRUE. |
... |
Additional arguments. |
Generate Executive Production Projections with Row-Level 95 percent Confidence Intervals
predict_production(object, newdata)predict_production(object, newdata)
object |
A trained 'numeric_pipeline' object. |
newdata |
A data.frame containing new data configurations to score. |
A data.frame structured with prediction vectors and interval metrics for the top 3 models.
Predict with Numeric Pipeline Framework
## S3 method for class 'numeric_pipeline' predict(object, newdata, model_name = "best", ...)## S3 method for class 'numeric_pipeline' predict(object, newdata, model_name = "best", ...)
object |
A numeric_pipeline object. |
newdata |
A data.frame containing new data configurations to score. |
model_name |
Character string specifying the target model from the leaderboard to score. Default = "best". |
... |
Additional arguments. |
A numeric vector of predictions.
Print Numeric Pipeline Summary Report
## S3 method for class 'numeric_pipeline' print(x, ...)## S3 method for class 'numeric_pipeline' print(x, ...)
x |
A numeric_pipeline object. |
... |
Additional arguments. |
Render Automated Quarto Numeric Executive Report
RenderExecutiveReport(pipeline_object = NULL, output_directory = getwd())RenderExecutiveReport(pipeline_object = NULL, output_directory = getwd())
pipeline_object |
An object generated by the |
output_directory |
Character string specifying where to save the report files. |
Save Serialized Numeric Pipeline Environment to Disk File
save_pipeline(object, file_path)save_pipeline(object, file_path)
object |
A numeric_pipeline object. |
file_path |
Character string tracking destination directory file path. |