Text Mining with R



Download free course Text Mining with R, pdf file on 194 pages by Julia Silge, David Robinson.
Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you'll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext and other tidy tools in R can make text analysis easier and more effective.

The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You'll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media.

- Learn how to apply the tidy text format to NLP;
- Use sentiment analysis to mine the emotional content of text;
- Identify a document's most important terms with frequency measurements;
- Explore relationships and connections between words with the ggraph and widyr packages;
- Convert back and forth between R's tidy and non-tidy text formats;
- Use topic modeling to classify document collections into natural groups;
- Examine case studies that compare Twitter archives, dig into NASA metadata, and analyze thousands of Usenet messages.

Table of contents

  • The Tidy Text Format
  • Sentiment Analysis with Tidy Data
  • Analyzing Word and Document Frequency: tf-idf
  • Relationships Between Words: N-grams and Correlations
  • Converting to and from Nontidy Formats
  • Topic Modeling
  • Case Study: Comparing Twitter Archives
  • Case Study: Mining NASA Metadata
  • Case Study: Analyzing Usenet Text
Pages : 194
Size :
File type : PDF
Downloads: 130
Created: 2022-02-03
License: CC BY-NC-SA
Author(s): Julia Silge, David Robinson
Text Mining with R

Warning: Trying to access array offset on false in /home/tutovnfz/public_html/article.php on line 233

Others Computer science Tutorials

Cyber-Physical Systems of Systems

Learning MongoDB

R Notes for Professionals

Rails on Containers

Open Source Systems: Towards Robust Practices

Others related eBooks about Text Mining with R

The Node.js Handbook

Download free course The Node.js Handbook, pdf file on 189 pages by Flavio Copes....

Building Secure and Reliable Systems

Download free course Building Secure and Reliable Systems, pdf file on 557 pages by by Betsy Beyer, Piotr Lewandowski, Ana Oprea, Paul Blankinship, Heather Adkins, Adam Stubblefield....

Intertwingled

Download free course Intertwingled, pdf file on 157 pages by Douglas Dechow, Daniele Struppa....

Learning R

Download free course Learning R, pdf file on 619 pages by Stack Overflow Community....

Cloud Computing - Architecture and Applications

In the era of Internet of Things and with the explosive worldwide growth of electronic data volume, and associated need of processing, analysis, and storage of such humongous volume of data, it has now become mandatory to exploit the power of massively parallel architecture for fast computation. ...

Automating Manufacturing Systems with PLCs

This is a manuscript for a PLC based control system book that is currently being used for teaching an undergraduate controls course - Manufacturing Controls. The course and book focus on the Allen Bradley family of controllers, thus allowing a deeper topic coverage than is normal in PLC books....

Think OCaml

Download free course Think OCaml, pdf file on 142 pages by Allen Downey, Nicholas Monje....

Principles of Management

This book teaches management principles to tomorrow's business leaders by weaving three threads through every chapter: strategy, entrepreneurship and active leadership. ...

How to become an Ethical Hacker

This is a series of lessons that's basic about the networking concepts, networking tools, security and countermeasures....

Elements of Robotics

Download free course Elements of Robotics, pdf file on 311 pages by Mordechai Ben-Ari, Francesco Mondada....