healthcareai-package | healthcareai: a streamlined way to develop and deploy models |
addSAMUtilityCols | Add SAM utility columns to table |
assignClusterLabels | Assign labels to the kmeans confusion matrix |
calculateAllCorrelations | Correlation analysis on an input table over all numeric columns |
calculateConfusion | Generate confusion matrix of percentages |
calculateCOV | Calculate coefficient of variation |
calculateHourBins | Calculate a vector of reasonable time bins |
calculatePerformance | Generate performance metrics after model has been trained |
calculateSDChanges | Calculate std deviation up/down for each numeric field in row |
calculateTargetedCorrelations | Correlation analysis on an input table, focusing on one target variable |
calulcateAlternatePredictions | Recalculate predicted value based on alternate scenarios |
catalyst_test_deploy_in_prod | Test function to check that the production environment is active. |
convertDateTimeColToDummies | Converts datetime columns into dummy columns |
countDaysSinceFirstDate | Creates column based on days since first date |
countMissingData | Function to find proportion of NAs in each column of a dataframe or matrix |
countPercentEmpty | DEPRECATED. Calculates percentage of each column in df that is NULL (NA) |
createAllCombinations | Find all possible unique combinations |
createVarianceTallTable | Transform a dataframe to be three columns and tall instead of wide |
dataScale | Center and scale columns in a numeric data frame |
distancePointLine | Compute the distance of a point from a line |
distancePointSegment | Compute the distance of a point from a line segment |
featureAvailabilityProfiler | Calculate and plot data availability over time |
findBestAlternateScenarios | Find most biggest drop in predictive probability across alternate features |
findElbow | Find the elbow in a curve |
findTrends | Find any columns that have a trend above a particular threshold |
findVariation | Find high variation |
generateAUC | Generate ROC or PR curve for a dataset. |
getCutOffList | Function to return ideal cutoff and TPR/FPR or precision/recall. |
getPipedValue | Grab number after single pipe in pipe-delimited string |
getPipedWordCount | Count number of words in pipe-delimited string |
groupedLOCF | Last observation carried forward |
healthcareai | healthcareai: a streamlined way to develop and deploy models |
ignoreSpecWarn | Function to suppress specific warnings in unit tests |
imputeColumn | Depreciated in favor of 'imputeDF' |
imputeDF | Perform imputation on a dataframe |
initializeParamsForTesting | Function to initialize and populate the SupervisedModelDevelopmentParams each time a unit test is run. |
isBinary | Check if a vector has only two unique values. |
isNumeric | Check if a data frame only has numeric columns. |
isTargetYN | Tests whether predictedCol is Y/N. Allows for NAs to be present. |
KmeansClustering | Build clusters using kmeans() |
LassoDeployment | Deploy a production-ready predictive Lasso model |
LassoDevelopment | Compare predictive models, created on your data |
LinearMixedModelDeployment | Deploy a production-ready predictive Linear Mixed Model model |
LinearMixedModelDevelopment | Compare predictive models, created on your data |
lineMagnitude | Compute the distance between two points |
nelsonRule1 | Analyze points in time to determine whether or not Nelson Rule 1 was violated |
orderByDate | Order the rows in a data frame by date |
pcaAnalysis | Perform principle component analysis |
percentDataAvailableInDateRange | Find the percent of a column that's filled |
plotPRCurve | Plot PR Curves from SupervisedModel classes |
plotProfiler | Display availability feature profile over time |
plotROCs | Plot ROCs from SupervisedModel classes |
RandomForestDeployment | Deploy a production-ready predictive RandomForest model |
RandomForestDevelopment | Compare predictive models, created on your data |
removeColsWithAllSameValue | Remove columns from a data frame when those columns have the same values in each row |
removeColsWithDTSSuffix | Remove columns with DTS suffix |
removeColsWithOnlyNA | Remove columns from a data frame that are only NA |
removeRowsWithNAInSpecCol | Remove rows where specified col is NA |
returnColsWithMoreThanFiftyCategories | Return vector of columns in a data frame with greater than 50 categories |
RiskAdjustedComparisons | Make risk adjusted comparisons between groups/units or years/months |
selectData | Pull data into R via an ODBC connection |
skip_on_not_appveyor | Function to skip specific tests if they are not being run on Appveyor. |
splitOutDateTimeCols | Splits datetime column into multiple date features |
start_prod_logs | Sets console logging to a file in the working directory. |
stop_prod_logs | Stops all console logging. |
SupervisedModelDeployment | Deploy predictive models, created on your data |
SupervisedModelDeploymentParams | SupervisedModelDeploymentParams class to set up parameters required to build SupervisedModelDeployment class |
SupervisedModelDevelopment | Compare predictive models, created on your data |
SupervisedModelDevelopmentParams | SupervisedModelDevelopmentParams class to set up parameters required to build SupervisedModel classes |
UnsupervisedModel | Build clusters based on your data. |
UnsupervisedModelParams | UnsupervisedModelParams class to set up parameters required to build UnsupervisedModel classes |
variationAcrossGroups | Find variation across groups |
writeData | Write data to database |
XGBoostDeployment | Deploy a production-ready predictive XGBoost model |
XGBoostDevelopment | Compare predictive models, created on your data |