Text Mining with R



Download free course Text Mining with R, pdf file on 194 pages by Julia Silge, David Robinson.
Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you'll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext and other tidy tools in R can make text analysis easier and more effective.

The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You'll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media.

- Learn how to apply the tidy text format to NLP;
- Use sentiment analysis to mine the emotional content of text;
- Identify a document's most important terms with frequency measurements;
- Explore relationships and connections between words with the ggraph and widyr packages;
- Convert back and forth between R's tidy and non-tidy text formats;
- Use topic modeling to classify document collections into natural groups;
- Examine case studies that compare Twitter archives, dig into NASA metadata, and analyze thousands of Usenet messages.

Table of contents

  • The Tidy Text Format
  • Sentiment Analysis with Tidy Data
  • Analyzing Word and Document Frequency: tf-idf
  • Relationships Between Words: N-grams and Correlations
  • Converting to and from Nontidy Formats
  • Topic Modeling
  • Case Study: Comparing Twitter Archives
  • Case Study: Mining NASA Metadata
  • Case Study: Analyzing Usenet Text
Pages : 194
Size :
File type : PDF
Downloads: 137
Created: 2022-02-03
License: CC BY-NC-SA
Author(s): Julia Silge, David Robinson
Text Mining with R

Warning: Trying to access array offset on false in /home/tutovnfz/public_html/article.php on line 233

Others Computer science Tutorials

PowerShell Notes for Professionals

Seeing Theory

Open Source Systems: Towards Robust Practices

Docker for Developers

Objective-C Notes for Professionals

Others related eBooks about Text Mining with R

Introductory Statistics

This book follows the scope and sequence of a one-semester, introduction to statistics course and is geared toward students majoring in fields other than math or engineering. This text assumes students have been exposed to intermediate algebra, and it focuses on the applications of statistical know...

Compiler Design: Theory, Tools, and Examples

...

Blockchain and Crypt Currency

Download free course Blockchain and Crypt Currency, pdf file on 150 pages by by Makoto Yano, Chris Dai, Kenichi Masuda, Yoshio Kishimoto....

SAT/SMT by Example

Download free course SAT/SMT by Example, pdf file on 585 pages by Dennis Yurichev....

S-BPM in the Wild

Download free course S-BPM in the Wild, pdf file on 284 pages by Albert Fleischmann, Werner Schmidt, Christian Stary....

PowerShell Notes for Professionals

Download free course PowerShell Notes for Professionals, pdf file on 183 pages by Stack Overflow Community....

Azure Durable Functions Succinctly

Download free course Azure Durable Functions Succinctly, pdf file on 101 pages by by Massimo Bonanni....

Flutter Succinctly

Download free course Flutter Succinctly, pdf file on 129 pages by Ed Freitas....

Understanding API Security

Gone are the days when it was acceptable for a piece of software to live in its own little..., download free API Security tutorial in PDF (205 pages) created by ....

Data Protection for the Hybrid Cloud

Download free course Data Protection for the Hybrid Cloud, pdf file on 115 pages by Shreesh Dubey, Vijay Tandra Sistla, Shivam Garg, Aashish Ramdas, Mitch Tulloch....