Text Mining with R



Download free course Text Mining with R, pdf file on 194 pages by Julia Silge, David Robinson.
Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you'll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext and other tidy tools in R can make text analysis easier and more effective.

The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You'll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media.

- Learn how to apply the tidy text format to NLP;
- Use sentiment analysis to mine the emotional content of text;
- Identify a document's most important terms with frequency measurements;
- Explore relationships and connections between words with the ggraph and widyr packages;
- Convert back and forth between R's tidy and non-tidy text formats;
- Use topic modeling to classify document collections into natural groups;
- Examine case studies that compare Twitter archives, dig into NASA metadata, and analyze thousands of Usenet messages.

Table of contents

  • The Tidy Text Format
  • Sentiment Analysis with Tidy Data
  • Analyzing Word and Document Frequency: tf-idf
  • Relationships Between Words: N-grams and Correlations
  • Converting to and from Nontidy Formats
  • Topic Modeling
  • Case Study: Comparing Twitter Archives
  • Case Study: Mining NASA Metadata
  • Case Study: Analyzing Usenet Text
Pages : 194
Size :
File type : PDF
Downloads: 105
Created: 2022-02-03
License: CC BY-NC-SA
Author(s): Julia Silge, David Robinson
Text Mining with R

Others Computer science Tutorials

Version Control with Subversion

LaTeX in 24 Hours

Blockchain and Crypt Currency

The Next.js Handbook

Robotic Process Automation Succinctly

Others related eBooks about Text Mining with R

Building Games for Firefox OS

Download free course Building Games for Firefox OS, pdf file on 125 pages by by Andre Garzia....

Learning Node.js

Download free course Learning Node.js, pdf file on 414 pages by Stack Overflow Community....

Introduction to MATLAB

With this tutorial you will work through the examples to understand the basics of MATLAB programming ,a free training document under 74 pages for download....

Tensorflow 2 Tutorial

TensorFlow is a free and open-source software library for machine learning. It can be used..., download free Tensorflow tutorial in PDF (66 pages) created by Ren Zhang ....

Digital Video Concepts, Methods, and Metrics

Digital Video Concepts, Methods, and Metrics: Quality, Compression, Performance, and Power..., download free Video Concepts tutorial in PDF (368 pages) created by Shahriar Akramullah ....

Cryptography and Network security

Download Cryptography and network security PDF tutorial by Chandraskhar Rao intended to Bachelor of Technology in Computer Science and Engineering....

The Big Data Agenda

Download free course The Big Data Agenda, pdf file on 156 pages by Annika Richterich....

Learn Photo Editing in Photoshop

This is a free Photoshop PDF tutorial in 21 chapters and 23 pages. This course aims to give students tips and tricks in how to use efficiently Photoshop to edit your image. ...

Adobe Photoshop CS6

This pdf tutorial (Adobe Photoshop CS6 Tutorial) will teach you the basics of the most popular image editing software.Free training courses designated for student....

Open Source Systems: Towards Robust Practices

Download free course Open Source Systems: Towards Robust Practices, pdf file on 225 pages by Federico Balaguer, Roberto Di Cosmo, Alejandra Garrido, Fabio Kon, Gregorio Robles, Stefano Zacchiroli....