Data Science Foundations: Data Assessment for Predictive Modeling
Duration: 4h 3m | .MP4 1280x720, 30 fps® | AAC, 48000 Hz, 2ch |
Genre: eLearning | Language: English
CRISP-DM, the cross-industry standard process for data mining, is composed of six phases. Most new data scientists rush to modeling because it's the phase in which they have the most training.

But whether the project succeeds or fails is actually determined far earlier. This course introduces a systematic approach to the data understanding phase for predictive modeling. Instructor Keith McCormick teaches principles, guidelines, and tools, such as KNIME and R, to properly assess a data set for its suitability for machine learning. Discover how to collect data, describe data, explore data by running bivariate visualizations, and verify your data quality, as well as make the transition to the data preparation phase. The course includes case studies and best practices, as well as challenge and solution sets for enhanced knowledge retention. By the end, you should have the skills you need to pay proper attention to this vital phase of all successful data science projects.

Topics include:

Distinguishing data assessment from data viz
Mastering the four data understanding tasks
Collecting initial data
Identifying the level of measurement
Loading data
Describing data
Visualizing data
Working with top predictors
Using ggDescription2 for data viz
Verifying data quality
Transitioning to data preparation

Download link:

Links are Interchangeable - No Password - Single Extraction