Tools for Healthcare Machine Learning


[Up] [Top]

Documentation for package ‘healthcareai’ version 1.2.3

Help Pages

healthcareai-package healthcareai: a streamlined way to develop and deploy models
addSAMUtilityCols Add SAM utility columns to table
assignClusterLabels Assign labels to the kmeans confusion matrix
calculateAllCorrelations Correlation analysis on an input table over all numeric columns
calculateConfusion Generate confusion matrix of percentages
calculateCOV Calculate coefficient of variation
calculateHourBins Calculate a vector of reasonable time bins
calculatePerformance Generate performance metrics after model has been trained
calculateSDChanges Calculate std deviation up/down for each numeric field in row
calculateTargetedCorrelations Correlation analysis on an input table, focusing on one target variable
calulcateAlternatePredictions Recalculate predicted value based on alternate scenarios
catalyst_test_deploy_in_prod Test function to check that the production environment is active.
convertDateTimeColToDummies Converts datetime columns into dummy columns
countDaysSinceFirstDate Creates column based on days since first date
countMissingData Function to find proportion of NAs in each column of a dataframe or matrix
countPercentEmpty DEPRECATED. Calculates percentage of each column in df that is NULL (NA)
createAllCombinations Find all possible unique combinations
createVarianceTallTable Transform a dataframe to be three columns and tall instead of wide
dataScale Center and scale columns in a numeric data frame
distancePointLine Compute the distance of a point from a line
distancePointSegment Compute the distance of a point from a line segment
featureAvailabilityProfiler Calculate and plot data availability over time
findBestAlternateScenarios Find most biggest drop in predictive probability across alternate features
findElbow Find the elbow in a curve
findTrends Find any columns that have a trend above a particular threshold
findVariation Find high variation
generateAUC Generate ROC or PR curve for a dataset.
getCutOffList Function to return ideal cutoff and TPR/FPR or precision/recall.
getPipedValue Grab number after single pipe in pipe-delimited string
getPipedWordCount Count number of words in pipe-delimited string
groupedLOCF Last observation carried forward
healthcareai healthcareai: a streamlined way to develop and deploy models
ignoreSpecWarn Function to suppress specific warnings in unit tests
imputeColumn Depreciated in favor of 'imputeDF'
imputeDF Perform imputation on a dataframe
initializeParamsForTesting Function to initialize and populate the SupervisedModelDevelopmentParams each time a unit test is run.
isBinary Check if a vector has only two unique values.
isNumeric Check if a data frame only has numeric columns.
isTargetYN Tests whether predictedCol is Y/N. Allows for NAs to be present.
KmeansClustering Build clusters using kmeans()
LassoDeployment Deploy a production-ready predictive Lasso model
LassoDevelopment Compare predictive models, created on your data
LinearMixedModelDeployment Deploy a production-ready predictive Linear Mixed Model model
LinearMixedModelDevelopment Compare predictive models, created on your data
lineMagnitude Compute the distance between two points
nelsonRule1 Analyze points in time to determine whether or not Nelson Rule 1 was violated
orderByDate Order the rows in a data frame by date
pcaAnalysis Perform principle component analysis
percentDataAvailableInDateRange Find the percent of a column that's filled
plotPRCurve Plot PR Curves from SupervisedModel classes
plotProfiler Display availability feature profile over time
plotROCs Plot ROCs from SupervisedModel classes
RandomForestDeployment Deploy a production-ready predictive RandomForest model
RandomForestDevelopment Compare predictive models, created on your data
removeColsWithAllSameValue Remove columns from a data frame when those columns have the same values in each row
removeColsWithDTSSuffix Remove columns with DTS suffix
removeColsWithOnlyNA Remove columns from a data frame that are only NA
removeRowsWithNAInSpecCol Remove rows where specified col is NA
returnColsWithMoreThanFiftyCategories Return vector of columns in a data frame with greater than 50 categories
RiskAdjustedComparisons Make risk adjusted comparisons between groups/units or years/months
selectData Pull data into R via an ODBC connection
skip_on_not_appveyor Function to skip specific tests if they are not being run on Appveyor.
splitOutDateTimeCols Splits datetime column into multiple date features
start_prod_logs Sets console logging to a file in the working directory.
stop_prod_logs Stops all console logging.
SupervisedModelDeployment Deploy predictive models, created on your data
SupervisedModelDeploymentParams SupervisedModelDeploymentParams class to set up parameters required to build SupervisedModelDeployment class
SupervisedModelDevelopment Compare predictive models, created on your data
SupervisedModelDevelopmentParams SupervisedModelDevelopmentParams class to set up parameters required to build SupervisedModel classes
UnsupervisedModel Build clusters based on your data.
UnsupervisedModelParams UnsupervisedModelParams class to set up parameters required to build UnsupervisedModel classes
variationAcrossGroups Find variation across groups
writeData Write data to database
XGBoostDeployment Deploy a production-ready predictive XGBoost model
XGBoostDevelopment Compare predictive models, created on your data