
Visualization and Imputation of Missing Data

Learn to create numerous unique visualizations to better understand patterns of missing data in your data sample.
(35 reviews)
1,275 students
Created by Geoffrey Hubona, Ph.D.


Platform: Udemy
Video: 4h 53m
Language: English
Next start: On Demand


There are many problems associated with analyzing data sets that contain missing data. However, there are various techniques to ‘fill in,’ or impute, missing data values with reasonable estimates based on the characteristics of the data itself and on the patterns of ‘missingness.’ Generally, techniques appropriate for imputing missing values in multivariate normal data are not as useful when applied to non-multivariate-normal data. This Visualization and Imputation of Missing Data course focuses on understanding patterns of ‘missingness’ in a data sample, especially in non-multivariate-normal data sets, and teaches how to use appropriate imputation techniques to “fill in” the missing data. Using the VIM and VIMGUI packages in R, the course also teaches how to create dozens of different visualizations to better understand existing patterns of both the missing and imputed data in your samples.
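As a rough illustration of what a "pattern of missingness" is, the sketch below counts how often each combination of missing cells occurs in a small table. It is written in plain Python rather than with the VIM package, and the function name `missingness_patterns` and the toy sample are hypothetical, chosen only for this example.

```python
from collections import Counter


def missingness_patterns(rows):
    """Count how often each pattern of missing cells occurs.

    Each row is a list in which None marks a missing value. A pattern is a
    tuple of booleans, True where the cell is missing.
    """
    return Counter(tuple(value is None for value in row) for row in rows)


# Hypothetical toy sample with three variables per record.
sample = [
    [1.0, 2.0, None],
    [1.5, None, None],
    [2.0, 3.0, 4.0],
    [2.5, 1.0, None],
]

patterns = missingness_patterns(sample)
for pattern, count in patterns.most_common():
    print(pattern, count)
```

A tabulation like this is the kind of summary that missing-data plots are typically built on: if one pattern dominates, the missingness is concentrated in a few variables rather than scattered across the sample.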
The course both teaches the concepts and provides software to apply the latest non-multivariate-normal-friendly data imputation techniques, including: (1) hot-deck imputation: the sequential and random hot-deck algorithms; (2) the distance-based, k-nearest neighbor imputation approach; (3) individual, regression-based imputation; and (4) the iterative, model-based, stepwise regression imputation technique with both standard and robust methods (the IRMI algorithm). Furthermore, the course trains one to recognize patterns of missingness using many varied visualizations of the missing data, created with the VIMGUI software that is included in the course materials and made available to all course participants.
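The distance-based, k-nearest neighbor idea mentioned above can be sketched in a few lines. This is a minimal illustration in plain Python, not the implementation used by the VIM package: the function name `knn_impute`, the Euclidean distance over shared observed columns, and the use of the median of the donors are all assumptions made for this example.

```python
import math
from statistics import median


def knn_impute(rows, target, k=2):
    """Return a copy of rows with missing values in column `target` filled in.

    Donors are rows whose target value is observed. Distance is a root mean
    squared difference over the other columns observed in both rows.
    """
    donors = [r for r in rows if r[target] is not None]
    result = []
    for row in rows:
        if row[target] is not None:
            result.append(list(row))
            continue
        scored = []
        for donor in donors:
            shared = [j for j in range(len(row))
                      if j != target and row[j] is not None and donor[j] is not None]
            if not shared:
                continue  # no common observed columns, so no usable distance
            dist = math.sqrt(sum((row[j] - donor[j]) ** 2 for j in shared) / len(shared))
            scored.append((dist, donor[target]))
        scored.sort(key=lambda pair: pair[0])
        nearest = [value for _, value in scored[:k]]
        filled = list(row)
        if nearest:
            filled[target] = median(nearest)  # stays None if no donor was usable
        result.append(filled)
    return result


# Hypothetical toy data: the second column is missing in the last record.
data = [
    [1.0, 10.0],
    [2.0, 20.0],
    [3.0, 30.0],
    [10.0, 100.0],
    [2.1, None],
]
imputed = knn_impute(data, target=1, k=2)
print(imputed[-1])
```

Because the value is borrowed from observed neighbors rather than drawn from a fitted normal model, this kind of approach makes no distributional assumption, which is why it is often suggested for non-multivariate-normal data.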
This course is useful to anyone who regularly analyzes large or small data sets that may contain missing data. This includes graduate students and faculty engaged in empirical research and working professionals who are engaged in quantitative research and/or data analysis. The visualizations that are taught are especially useful for understanding the types of data missingness that may be present in your data and, consequently, how best to deal with this missing data using imputation. The course includes the means to apply the appropriate imputation techniques, especially for non-multivariate-normal sets of data, which tend to be the most problematic to impute.
With the course materials, the course author provides free of charge his own VIMGUI toolbar, developed with the RGtk2 package in R. However, please note that both the VIMGUI package available for R and the course author’s VIMGUI toolbar application (both built on RGtk2) may have problems starting up properly on a Mac computer. If you only have a Mac available, you may have some initial difficulty getting the applications to run properly.

You will learn

✓ Use visualizations created by R software to identify patterns of ‘missingness’ in data sets and to impute reasonable values to replace the missing data.
✓ Recognize and identify the different patterns of missing data and the relative severity of their likely consequences.
✓ Learn to use the VIM and VIMGUI R packages to create unique, novel and vibrant images which promote the understanding of patterns of both missing and imputed data in a set of data.
✓ Learn the different historical approaches to impute reasonable values for missing data and their relative advantages and disadvantages.
✓ Learn the characteristics of: (1) Hot-Deck; (2) K-Nearest Neighbor; (3) Regression-Based; and (4) Iterative, Model-Based, Stepwise Regression (IRMI) imputation techniques to “fill in” missing data and when and how to implement them with provided software.
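To make the first technique in the list above concrete, here is a minimal sketch of sequential hot-deck imputation in plain Python. It is not the course's software: the function name `sequential_hot_deck`, the choice of sorting variable, and the toy records are hypothetical, and real implementations offer more options (for example imputing within domains or picking donors at random).

```python
def sequential_hot_deck(rows, target, order_by):
    """Fill missing values in column `target` from the preceding donor.

    Records are visited in ascending order of column `order_by`. Each missing
    target value is replaced by the most recent observed value seen so far.
    A missing value with no earlier donor is left as None.
    """
    order = sorted(range(len(rows)), key=lambda i: rows[i][order_by])
    result = [list(r) for r in rows]
    last_seen = None
    for i in order:
        if result[i][target] is None:
            if last_seen is not None:
                result[i][target] = last_seen
        else:
            last_seen = result[i][target]
    return result


# Hypothetical records: [age, score], with two missing scores.
records = [
    [30, 55.0],
    [22, None],
    [41, 70.0],
    [25, 48.0],
    [35, None],
]
filled = sequential_hot_deck(records, target=1, order_by=0)
print(filled)
```

Sorting first means each donor is a record that is similar on the ordering variable, which is the main design choice behind the sequential variant.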


Requirements

• Students will need to install R software, but ample instructions for doing so are provided.

This course is for

• This course is useful for anyone analyzing large or small data sets that may contain missing data.
• The course is useful for graduate students conducting quantitative, empirical research and/or practicing quantitative analytic professionals.
• Please note that the VIMGUI software is built with the R package RGtk2 (based on GTK+), which has been known to be problematic when running on a Mac computer.

How much does the Visualization and Imputation of Missing Data course cost? Is it worth it?

The course costs $14.99. There is currently a 70% discount on the original price of $49.99, so you save $35 if you enroll in the course now.
The average price of the 2 Data Imputation courses on Udemy is $15.00, so this course is priced at about the average.

Does the Visualization and Imputation of Missing Data course have a money back guarantee or refund policy?

YES, Visualization and Imputation of Missing Data has a 30-day money back guarantee. The 30-day refund policy is designed to allow students to study without risk.

Are there any SCHOLARSHIPS for this course?

Currently we could not find a scholarship for the Visualization and Imputation of Missing Data course, but there is a $35 discount from the original price ($49.99). So the current price is just $14.99.

Who is the instructor? Is Geoffrey Hubona, Ph.D. a SCAM or a TRUSTED instructor?

Geoffrey Hubona, Ph.D. has created 27 courses, taught 30,922 students, and received an average rating of 4.0 across 4,030 reviews, which are generally positive. Based on the information available, Geoffrey Hubona, Ph.D. is a TRUSTED instructor.
Associate Professor of Information Systems
Dr. Geoffrey Hubona has held full-time tenure-track, and tenured, assistant and associate professor faculty positions at 4 major state universities in the United States since 1993. Currently, he is an associate professor of MIS at Texas A&M International University where he teaches for-credit courses on Business Data Visualization (undergrad), Advanced Programming using R (graduate), and Data Mining and Business Analytics (graduate). In previous academic faculty positions, he taught dozens of various statistics, business information systems, and computer science courses to undergraduate, master’s and Ph.D. students. He earned a Ph.D. in Business Administration (Information Systems and Computer Science) from the University of South Florida (USF) in Tampa, FL; an MA in Economics, also from USF; an MBA in Finance from George Mason University in Fairfax, VA; and a BA in Psychology from the University of Virginia in Charlottesville, VA. He is the founder of the Georgia R School (2010-2014) and of R-Courseware (2014-Present), online educational organizations that teach research methods and quantitative analysis techniques. These research methods techniques include linear and non-linear modeling, multivariate methods, data mining, programming and simulation, and structural equation modeling and partial least squares (PLS) path modeling.