bnpredict

Imputation and Value Prediction

Missing data is a major problem for anyone working with large datasets. Approaches to handling it range from listwise deletion to imputation via multivariate logistic regression. This package provides a framework that uses Bayesian networks to predict, evaluate, and impute missing or held-out data using network-based expectation maximization. The method is particularly well suited to high-dimensional categorical data that does not behave well with traditional imputation approaches.

In the following vignette, I will walk through an example in which we:

Create the structure of a Bayesian network
Hold out 25% of the values of the job feature in the dataset, creating missing data to predict
Train the Bayesian network on the truncated dataset
Predict the "missing" values using the Most Probable Explanation (MPE) technique
Evaluate the model
Visualize the evaluation process
Impute the data
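
The steps above can be sketched in a few lines of plain Python. This is only a language-agnostic illustration of the hold-out, train, predict, evaluate, and impute loop, not this package's API: the toy data, the three-node structure (education -> job -> housing), and every function name are assumptions made for the example. It also simplifies training by estimating conditional probability tables from complete rows with add-one smoothing rather than by expectation maximization, and it skips the visualization step.

```python
import random
from collections import Counter, defaultdict

random.seed(0)
JOBS = ["admin", "services", "technician"]
HOUSING = ["yes", "no"]

def make_row():
    """Generate one hypothetical categorical record (toy data only)."""
    edu = random.choice(["secondary", "tertiary"])
    job_weights = [1, 1, 8] if edu == "tertiary" else [4, 5, 1]
    job = random.choices(JOBS, weights=job_weights)[0]
    house_weights = [8, 2] if job == "technician" else [2, 8]
    housing = random.choices(HOUSING, weights=house_weights)[0]
    return {"education": edu, "job": job, "housing": housing}

rows = [make_row() for _ in range(400)]

# Step 1: assumed network structure: education -> job -> housing.

# Step 2: hold out 25% of the job values and remember the true labels.
held = set(random.sample(range(len(rows)), len(rows) // 4))
truth = {i: rows[i]["job"] for i in held}
train = [dict(r, job=None) if i in held else r for i, r in enumerate(rows)]

# Step 3: estimate P(child | parent) from complete rows (add-one smoothing).
complete = [r for r in train if r["job"] is not None]

def fit_cpt(child, parent, child_levels):
    counts = defaultdict(Counter)
    for r in complete:
        counts[r[parent]][r[child]] += 1

    def prob(child_value, parent_value):
        seen = counts[parent_value]
        return (seen[child_value] + 1) / (sum(seen.values()) + len(child_levels))

    return prob

p_job_given_edu = fit_cpt("job", "education", JOBS)
p_housing_given_job = fit_cpt("housing", "job", HOUSING)

# Step 4: with one missing node, the most probable explanation is the job
# value that maximizes P(job | education) * P(housing | job).
def predict_job(r):
    return max(
        JOBS,
        key=lambda j: p_job_given_edu(j, r["education"])
        * p_housing_given_job(r["housing"], j),
    )

predictions = {i: predict_job(train[i]) for i in held}

# Step 5: evaluate against the held-out truth.
accuracy = sum(predictions[i] == truth[i] for i in held) / len(held)

# Step 7: impute the predicted values back into the dataset.
imputed = [dict(r, job=predictions[i]) if i in held else r
           for i, r in enumerate(train)]

print(f"held out: {len(held)}, accuracy: {accuracy:.2f}")
```

Because only one variable is missing per row, the prediction depends only on that node's parent and child (its Markov blanket). Rows with several missing values would need joint inference over the whole network, which is where a dedicated Bayesian network library becomes necessary.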

See the example in the vignettes folder for further instructions and information.

