Introduction to Data Science



Download free course Introduction to Data Science, pdf file on 722 pages by Rafael A Irizarry.
The demand for skilled data science practitioners in industry, academia, and government is rapidly growing. This book introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression and machine learning. It also helps you develop skills such as R programming, data wrangling with dplyr, data visualization with ggplot2, algorithm building with caret, file organization with UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation with knitr and R markdown. The book is divided into six parts: R, Data Visualization, Data Wrangling, Probability, Inference and Regression with R, Machine Learning, and Productivity Tools. Each part has several chapters meant to be presented as one lecture. The book includes dozens of exercises distributed across most chapters.

Table of contents

  • R
  • Getting Started with R and RStudio
  • R Basics
  • Programming basics
  • The tidyverse
  • Importing data
  • Data Visualization
  • Introduction to data visualization
  • ggplot2
  • Visualizing data distributions
  • Data visualization in practice
  • Data visualization principles
  • Robust summaries
  • Statistics with R
  • Introduction to Statistics with R
  • Probability
  • Random variables
  • Statistical Inference
  • Statistical models
  • Regression
  • Linear Models
  • Association is not causation
  • Data Wrangling
  • Introduction to Data Wrangling
  • Reshaping data
  • Joining tables
  • Web Scraping
  • String Processing
  • Parsing Dates and Times
  • Text mining
  • Machine Learning
  • Introduction to Machine Learning
  • Smoothing
  • Cross validation
  • The caret package
  • Examples of algorithms
  • Machine learning in practice
  • Large datasets
  • Clustering
  • Productivity tools
  • Introduction to productivity tools
  • Organizing with Unix
  • Git and GitHub
  • Reproducible projects with RStudio and R markdown
Pages : 722
Size : 55.8 MB
File type : PDF
Downloads: 214
Created: 2022-02-03
License: CC BY-NC-SA
Author(s): Rafael A Irizarry
Introduction to Data Science

Warning: Trying to access array offset on false in /home/tutovnfz/public_html/article.php on line 233

Others Computer science Tutorials

Entity-Oriented Search

Learning LaTeX

Learning iOS

Docker for Developers

Anti-fragile ICT Systems

Others related eBooks about Introduction to Data Science

WPF Debugging and Performance Succinctly

Download free course WPF Debugging and Performance Succinctly, pdf file on 79 pages by Alessandro Del Sole....

Machine Learning Yearning

AI is transforming numerous industries. Machine Learning Yearning, a free ebook from Andre..., download free Machine Learning tutorial in PDF (118 pages) created by Andrew Ng ....

Cloud Native Applications

Download free course Cloud Native Applications, pdf file on 120 pages by Michael Wittig, Andreas Wittig....

What is the Text Encoding Initiative? How to add intelligent markup to digital resources

The Text Encoding Initiative (TEI) Guidelines have long been regarded as the de facto standard for the preparation of digital textual resources in the scholarly research community. For the beginner, they offer a daunting range of possibilities, reflecting the huge range of potential applications f...

MATLAB Introduction

This tutorial provides a small overview about Matlab by Stephan Roth ,training PDF courses to learn the basics of MATLAB programming designated to beginners....

Test-Driven iOS Development with Swift

Test-driven development (TDD) is a proven way to find software bugs early. Writing tests b..., download free Testing tutorial in PDF (218 pages) created by Dr. Dominik Hauser ....

Making Servers Work

This book highlights practical sysadmin skills, common architectures that you'll encounter..., download free Making Servers tutorial in PDF (280 pages) created by ....

The Privacy Engineer's Manifesto

The Privacy Engineer's Manifesto: Getting from Policy to Code to QA to Value is the first ..., download free Manifesto tutorial in PDF (400 pages) created by Michelle Finneran Dennedy ....

Advances in Satellite Communications

Satellite communication systems are now a major part of most telecommunications networks as well as our everyday lives through mobile personal communication systems and broadcast television. A sound understanding of such systems is therefore important for a wide range of system designers, engineers ...

Think Bayes

Download free course Think Bayes, pdf file on 213 pages by Allen Downey....