Introduction to Data Science

Download the free course Introduction to Data Science, a 722-page PDF file by Rafael A Irizarry.
The demand for skilled data science practitioners in industry, academia, and government is growing rapidly. This book introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression, and machine learning. It also helps you develop skills such as R programming, data wrangling with dplyr, data visualization with ggplot2, algorithm building with caret, file organization with the UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation with knitr and R markdown. The book is divided into six parts: R; Data Visualization; Data Wrangling; Probability, Inference and Regression with R; Machine Learning; and Productivity Tools. Each part has several chapters, each meant to be presented as one lecture. The book includes dozens of exercises distributed across most chapters.

Table of contents

  • R
      • Getting Started with R and RStudio
      • R Basics
      • Programming basics
      • The tidyverse
      • Importing data
  • Data Visualization
      • Introduction to data visualization
      • ggplot2
      • Visualizing data distributions
      • Data visualization in practice
      • Data visualization principles
      • Robust summaries
  • Statistics with R
      • Introduction to Statistics with R
      • Probability
      • Random variables
      • Statistical Inference
      • Statistical models
      • Regression
      • Linear Models
      • Association is not causation
  • Data Wrangling
      • Introduction to Data Wrangling
      • Reshaping data
      • Joining tables
      • Web Scraping
      • String Processing
      • Parsing Dates and Times
      • Text mining
  • Machine Learning
      • Introduction to Machine Learning
      • Smoothing
      • Cross validation
      • The caret package
      • Examples of algorithms
      • Machine learning in practice
      • Large datasets
      • Clustering
  • Productivity tools
      • Introduction to productivity tools
      • Organizing with Unix
      • Git and GitHub
      • Reproducible projects with RStudio and R markdown
Pages: 722
Size: 55.8 MB
File type: PDF
Downloads: 183
Created: 2022-02-03
License: CC BY-NC-SA
Author(s): Rafael A Irizarry