Introduction to Data Science



Download the free book Introduction to Data Science, a 722-page PDF by Rafael A Irizarry.
The demand for skilled data science practitioners in industry, academia, and government is growing rapidly. This book introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression, and machine learning. It also helps you develop skills such as R programming, data wrangling with dplyr, data visualization with ggplot2, algorithm building with caret, file organization with the UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation with knitr and R markdown. The book is divided into six parts: R; Data Visualization; Data Wrangling; Probability, Inference and Regression with R; Machine Learning; and Productivity Tools. Each part has several chapters, each meant to be presented as one lecture. The book includes dozens of exercises distributed across most chapters.

Table of contents

  • R
      • Getting Started with R and RStudio
      • R Basics
      • Programming basics
      • The tidyverse
      • Importing data
  • Data Visualization
      • Introduction to data visualization
      • ggplot2
      • Visualizing data distributions
      • Data visualization in practice
      • Data visualization principles
      • Robust summaries
  • Statistics with R
      • Introduction to Statistics with R
      • Probability
      • Random variables
      • Statistical Inference
      • Statistical models
      • Regression
      • Linear Models
      • Association is not causation
  • Data Wrangling
      • Introduction to Data Wrangling
      • Reshaping data
      • Joining tables
      • Web Scraping
      • String Processing
      • Parsing Dates and Times
      • Text mining
  • Machine Learning
      • Introduction to Machine Learning
      • Smoothing
      • Cross validation
      • The caret package
      • Examples of algorithms
      • Machine learning in practice
      • Large datasets
      • Clustering
  • Productivity tools
      • Introduction to productivity tools
      • Organizing with Unix
      • Git and GitHub
      • Reproducible projects with RStudio and R markdown
Pages: 722
Size: 55.8 MB
File type: PDF
Downloads: 181
Created: 2022-02-03
License: CC BY-NC-SA
Author(s): Rafael A Irizarry
Other Computer Science Tutorials

Basic Computer Book PDF Download Computer

Intelligence Unleashed

Introduction to OKRs

PowerShell Notes for Professionals

HackSpace Magazine: Issue 46

Other eBooks related to Introduction to Data Science

Application Insights Succinctly

Download the free eBook Application Insights Succinctly, a 75-page PDF by Roberto Albano....

TensorFlow Roadmap

Download the free eBook TensorFlow Roadmap, a 22-page PDF by Amirsina Torfi....

S-BPM Illustrated

Download the free eBook S-BPM Illustrated, a 144-page PDF by Albert Fleischmann, Stefan Raß, Robert Singer....

Trigonometry: A Trig Cheat Sheet for Solving Problems

In this tutorial on trigonometry, we'll cover the basics of right triangles and the primary trigonometric functions, and we'll refer to a Trig Cheat Sheet to help you quickly recall key concepts and formulas....

The Brain of the Computer

The purpose of the book is to take a basic computer system and show you how every part works. It is taught from a technician's point of view, not an engineer's. The book covers digital electronic components, digital logic circuits, CPU theory, computer system theor...

Docker Succinctly

Download the free eBook Docker Succinctly, a 98-page PDF by Elton Stoneman....

GIS Succinctly

Download the free eBook GIS Succinctly, a 108-page PDF by Peter Shaw....

Retro Gaming with Raspberry Pi

This book shows you how to set up a Raspberry Pi to play classic games, and a whole lot mo... Download this free Raspberry Pi tutorial as a PDF (164 pages) by Bob Clagett....

A Practical Guide to TPM 2.0

Download the free eBook A Practical Guide to TPM 2.0, a 375-page PDF by Will Arthur, David Challener, Kenneth Goldman....

Foundations of Software Science and Computation Structures

Download the free eBook Foundations of Software Science and Computation Structures, a 556-page PDF by Mikołaj Bojańczyk, Alex Simpson....