Book description
How can you tap into the wealth of social web data to discover who’s making connections with whom, what they’re talking about, and where they’re located? With this expanded and thoroughly revised edition, you’ll learn how to acquire, analyze, and summarize data from all corners of the social web, including Facebook, Twitter, LinkedIn, Google+, GitHub, email, websites, and blogs.
- Employ the Natural Language Toolkit, NetworkX, and other scientific computing tools to mine popular social web sites
- Apply advanced text-mining techniques, such as clustering and TF-IDF, to extract meaning from human language data
- Bootstrap interest graphs from GitHub by discovering affinities among people, programming languages, and coding projects
- Build interactive visualizations with D3.js, an extraordinarily flexible HTML5 and JavaScript toolkit
- Take advantage of more than two dozen Twitter recipes, presented in O’Reilly’s popular "problem/solution/discussion" cookbook format
The example code for this unique data science book is maintained in a public GitHub repository. It’s designed to be easily accessible through a turnkey virtual machine that facilitates interactive learning with an easy-to-use collection of IPython Notebooks.
Table of contents
- Dedication
- Preface
- I. A Guided Tour of the Social Web
- Prelude
- 1. Mining Twitter: Exploring Trending Topics, Discovering What People Are Talking About, and More
- 2. Mining Facebook: Analyzing Fan Pages, Examining Friendships, and More
- 3. Mining LinkedIn: Faceting Job Titles, Clustering Colleagues, and More
- 4. Mining Google+: Computing Document Similarity, Extracting Collocations, and More
- Overview
- Exploring the Google+ API
- A Whiz-Bang Introduction to TF-IDF
- Querying Human Language Data with TF-IDF
- Closing Remarks
- Recommended Exercises
- Online Resources
- 5. Mining Web Pages: Using Natural Language Processing to Understand Human Language, Summarize Blog Posts, and More
- 6. Mining Mailboxes: Analyzing Who’s Talking to Whom About What, How Often, and More
- 7. Mining GitHub: Inspecting Software Collaboration Habits, Building Interest Graphs, and More
- Overview
- Exploring GitHub’s API
- Modeling Data with Property Graphs
- Analyzing GitHub Interest Graphs
- Closing Remarks
- Recommended Exercises
- Online Resources
- 8. Mining the Semantically Marked-Up Web: Extracting Microformats, Inferencing over RDF, and More
- II. Twitter Cookbook
- 9. Twitter Cookbook
- Accessing Twitter’s API for Development Purposes
- Doing the OAuth Dance to Access Twitter’s API for Production Purposes
- Discovering the Trending Topics
- Searching for Tweets
- Constructing Convenient Function Calls
- Saving and Restoring JSON Data with Text Files
- Saving and Accessing JSON Data with MongoDB
- Sampling the Twitter Firehose with the Streaming API
- Collecting Time-Series Data
- Extracting Tweet Entities
- Finding the Most Popular Tweets in a Collection of Tweets
- Finding the Most Popular Tweet Entities in a Collection of Tweets
- Tabulating Frequency Analysis
- Finding Users Who Have Retweeted a Status
- Extracting a Retweet’s Attribution
- Making Robust Twitter Requests
- Resolving User Profile Information
- Extracting Tweet Entities from Arbitrary Text
- Getting All Friends or Followers for a User
- Analyzing a User’s Friends and Followers
- Harvesting a User’s Tweets
- Crawling a Friendship Graph
- Analyzing Tweet Content
- Summarizing Link Targets
- Analyzing a User’s Favorite Tweets
- Closing Remarks
- Recommended Exercises
- Online Resources
- III. Appendixes
- Index
- About the Author
- Colophon
- Copyright
Product information
- Title: Mining the Social Web, 2nd Edition
- Author(s):
- Release date: October 2013
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781449367619