Introduction to Data Science



Download free course Introduction to Data Science, pdf file on 722 pages by Rafael A Irizarry.
The demand for skilled data science practitioners in industry, academia, and government is rapidly growing. This book introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression and machine learning. It also helps you develop skills such as R programming, data wrangling with dplyr, data visualization with ggplot2, algorithm building with caret, file organization with UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation with knitr and R markdown. The book is divided into six parts: R, Data Visualization, Data Wrangling, Probability, Inference and Regression with R, Machine Learning, and Productivity Tools. Each part has several chapters meant to be presented as one lecture. The book includes dozens of exercises distributed across most chapters.

Table of contents

  • R
  • Getting Started with R and RStudio
  • R Basics
  • Programming basics
  • The tidyverse
  • Importing data
  • Data Visualization
  • Introduction to data visualization
  • ggplot2
  • Visualizing data distributions
  • Data visualization in practice
  • Data visualization principles
  • Robust summaries
  • Statistics with R
  • Introduction to Statistics with R
  • Probability
  • Random variables
  • Statistical Inference
  • Statistical models
  • Regression
  • Linear Models
  • Association is not causation
  • Data Wrangling
  • Introduction to Data Wrangling
  • Reshaping data
  • Joining tables
  • Web Scraping
  • String Processing
  • Parsing Dates and Times
  • Text mining
  • Machine Learning
  • Introduction to Machine Learning
  • Smoothing
  • Cross validation
  • The caret package
  • Examples of algorithms
  • Machine learning in practice
  • Large datasets
  • Clustering
  • Productivity tools
  • Introduction to productivity tools
  • Organizing with Unix
  • Git and GitHub
  • Reproducible projects with RStudio and R markdown
Pages : 722
Size : 55.8 MB
File type : PDF
Downloads: 210
Created: 2022-02-03
License: CC BY-NC-SA
Author(s): Rafael A Irizarry
Introduction to Data Science

Warning: Trying to access array offset on false in /home/tutovnfz/public_html/article.php on line 233

Others Computer science Tutorials

A Practical Guide to TPM 2.0

Learning LaTeX

SAT/SMT by Example

Computational Thinking Education

Power BI Succinctly

Others related eBooks about Introduction to Data Science

Learning Haskell

Download free course Learning Haskell, pdf file on 296 pages by Stack Overflow Community....

Internet of Things (IoT) in 5 Days: an easy guide to Wireless Sensor Networks (WSN), IPv6, and IoT

This booklet is a quick but thoughtful guide to jump into the Internet of Things (IoT), covering important subjects as IPv6 networking, sensors, wireless protocols and technologies, as well as IoT cloud platforms and its most commonly used protocols, featuring lots of hands-on examples to start work...

Big Data on Real-World Applications

As technology advances, high volumes of valuable data are generated day by day in modern organizations. The management of such huge volumes of data has become a priority in these organizations, requiring new techniques for data management and data analysis in Big Data environments. These environment...

Bash Notes for Professionals

Download free course Bash Notes for Professionals, pdf file on 204 pages by by Stack Overflow Community....

Code the Classics

Download free course Code the Classics, pdf file on 224 pages by David Crookes, Andrew Gillett, Liz Upton, Eben Upton, Sean M. Tracey, Dan Malone, Allister Brimble....

Cryptography and Network security

Download Cryptography and network security PDF tutorial by Chandraskhar Rao intended to Bachelor of Technology in Computer Science and Engineering....

Learning R

Download free course Learning R, pdf file on 619 pages by Stack Overflow Community....

Learning acumatica PDF course

Download free Acumatica tutorial course in PDF, training file in 25 chapters and 116 pages. Free unaffiliated ebook created from Stack OverFlow contributor....

Essential Dart

Dart is a class-based, object-oriented language that simplifies the development of structured modern apps, scales from small scripts to large applications, and can be compiled to JavaScript for use in any modern browser. In this rigorous but readable introductory text, Dart specification lead Gilad ...

Statistics with Julia

Download free course Statistics with Julia, pdf file on 413 pages by Hayden Klok, Yoni Nazarathy....