Introduction to Data Science



Download free course Introduction to Data Science, pdf file on 722 pages by Rafael A Irizarry.
The demand for skilled data science practitioners in industry, academia, and government is rapidly growing. This book introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression and machine learning. It also helps you develop skills such as R programming, data wrangling with dplyr, data visualization with ggplot2, algorithm building with caret, file organization with UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation with knitr and R markdown. The book is divided into six parts: R, Data Visualization, Data Wrangling, Probability, Inference and Regression with R, Machine Learning, and Productivity Tools. Each part has several chapters meant to be presented as one lecture. The book includes dozens of exercises distributed across most chapters.

Table of contents

  • R
  • Getting Started with R and RStudio
  • R Basics
  • Programming basics
  • The tidyverse
  • Importing data
  • Data Visualization
  • Introduction to data visualization
  • ggplot2
  • Visualizing data distributions
  • Data visualization in practice
  • Data visualization principles
  • Robust summaries
  • Statistics with R
  • Introduction to Statistics with R
  • Probability
  • Random variables
  • Statistical Inference
  • Statistical models
  • Regression
  • Linear Models
  • Association is not causation
  • Data Wrangling
  • Introduction to Data Wrangling
  • Reshaping data
  • Joining tables
  • Web Scraping
  • String Processing
  • Parsing Dates and Times
  • Text mining
  • Machine Learning
  • Introduction to Machine Learning
  • Smoothing
  • Cross validation
  • The caret package
  • Examples of algorithms
  • Machine learning in practice
  • Large datasets
  • Clustering
  • Productivity tools
  • Introduction to productivity tools
  • Organizing with Unix
  • Git and GitHub
  • Reproducible projects with RStudio and R markdown
Pages : 722
Size : 55.8 MB
File type : PDF
Downloads: 185
Created: 2022-02-03
License: CC BY-NC-SA
Author(s): Rafael A Irizarry
Introduction to Data Science

Others Computer science Tutorials

PowerShell Notes for Professionals

Getting Started with InnerSource

Hibernate Notes for Professionals

Computational Thinking Education

Accelerators in Silicon Valley

Others related eBooks about Introduction to Data Science

Rethinking the Internet of Things

Over the next decade, most devices connected to the Internet will not be used by people in..., download free Internet of Things tutorial in PDF (192 pages) created by Francis daCosta ....

Hacker hightschool, learn how being a Hacker

Free tutorial intended to beginners to learn how being a hacker for educational purpose....

Using Adobe Soundbouth CS5

This PDF tutorial presents an understanding of key audio concepts helps you get the most out of Adobe Soundbooth CS5,download free training document for intermediat level-users....

Software Innovation

Download free course Software Innovation, pdf file on 129 pages by Jeremy Rose....

Cloud Native Applications

Cloud computing is a game changer. Being able to automate and constantly adjust infrastruc..., download free Cloud tutorial in PDF (123 pages) created by ....

Contextual Process Digitalization

Download free course Contextual Process Digitalization, pdf file on 282 pages by Albert Fleischmann, Stefan Oppl, Werner Schmidt, Christian Stary....

SLO Adoption and Usage in Site Reliability Engineering

Download free course SLO Adoption and Usage in Site Reliability Engineering, pdf file on 104 pages by Julie McCoy, Nicole Forsgren....

Electron Succinctly

Download free course Electron Succinctly, pdf file on 131 pages by Ed Freitas....

Lightweight Systems for Realtime Monitoring

Download free course Lightweight Systems for Realtime Monitoring, pdf file on 27 pages by Sam Newman....

R for Data Science

Download free course R for Data Science, pdf file on 520 pages by Garrett Grolemund, Hadley Wickham....