Introduction to Data Science



Download the free book Introduction to Data Science by Rafael A Irizarry, a 722-page PDF.
The demand for skilled data science practitioners in industry, academia, and government is rapidly growing. This book introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression, and machine learning. It also helps you develop skills such as R programming, data wrangling with dplyr, data visualization with ggplot2, algorithm building with caret, file organization with UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation with knitr and R markdown. The book is divided into six parts: R; Data Visualization; Data Wrangling; Probability, Inference, and Regression with R; Machine Learning; and Productivity Tools. Each part has several chapters, and each chapter is meant to be presented as one lecture. The book includes dozens of exercises distributed across most chapters.

Table of contents

  • R
      • Getting Started with R and RStudio
      • R Basics
      • Programming basics
      • The tidyverse
      • Importing data
  • Data Visualization
      • Introduction to data visualization
      • ggplot2
      • Visualizing data distributions
      • Data visualization in practice
      • Data visualization principles
      • Robust summaries
  • Statistics with R
      • Introduction to Statistics with R
      • Probability
      • Random variables
      • Statistical Inference
      • Statistical models
      • Regression
      • Linear Models
      • Association is not causation
  • Data Wrangling
      • Introduction to Data Wrangling
      • Reshaping data
      • Joining tables
      • Web Scraping
      • String Processing
      • Parsing Dates and Times
      • Text mining
  • Machine Learning
      • Introduction to Machine Learning
      • Smoothing
      • Cross validation
      • The caret package
      • Examples of algorithms
      • Machine learning in practice
      • Large datasets
      • Clustering
  • Productivity tools
      • Introduction to productivity tools
      • Organizing with Unix
      • Git and GitHub
      • Reproducible projects with RStudio and R markdown
Pages: 722
Size: 55.8 MB
File type: PDF
Downloads: 202
Created: 2022-02-03
License: CC BY-NC-SA
Author(s): Rafael A Irizarry