Book description
Practical methods for analyzing your data with graphs, revealing hidden connections and new insights.Graphs are the natural way to represent and understand connected data. This book explores the most important algorithms and techniques for graphs in data science, with concrete advice on implementation and deployment. You don’t need any graph experience to start benefiting from this insightful guide. These powerful graph algorithms are explained in clear, jargon-free text and illustrations that makes them easy to apply to your own projects.
In Graph Algorithms for Data Science you will learn:
- Labeled-property graph modeling
- Constructing a graph from structured data such as CSV or SQL
- NLP techniques to construct a graph from unstructured data
- Cypher query language syntax to manipulate data and extract insights
- Social network analysis algorithms like PageRank and community detection
- How to translate graph structure to a ML model input with node embedding models
- Using graph features in node classification and link prediction workflows
Graph Algorithms for Data Science is a hands-on guide to working with graph-based data in applications like machine learning, fraud detection, and business data analysis. It’s filled with fascinating and fun projects, demonstrating the ins-and-outs of graphs. You’ll gain practical skills by analyzing Twitter, building graphs with NLP techniques, and much more.
About the Technology
A graph, put simply, is a network of connected data. Graphs are an efficient way to identify and explore the significant relationships naturally occurring within a dataset. This book presents the most important algorithms for graph data science with examples from machine learning, business applications, natural language processing, and more.
About the Book
Graph Algorithms for Data Science shows you how to construct and analyze graphs from structured and unstructured data. In it, you’ll learn to apply graph algorithms like PageRank, community detection/clustering, and knowledge graph models by putting each new algorithm to work in a hands-on data project. This cutting-edge book also demonstrates how you can create graphs that optimize input for AI models using node embedding.
What's Inside
- Creating knowledge graphs
- Node classification and link prediction workflows
- NLP techniques for graph construction
About the Reader
For data scientists who know machine learning basics. Examples use the Cypher query language, which is explained in the book.
About the Author
Tomaž Bratanič works at the intersection of graphs and machine learning.
Arturo Geigel was the technical editor for this book.
Quotes
Undoubtedly the quickest route to grasping the practical applications of graph algorithms. Enjoyable and informative, with real-world business context and practical problem-solving.
- Roger Yu, Feedzai
Brilliantly eases you into graph-based applications.
- Sumit Pal, Independent Consultant
I highly recommend this book to anyone involved in analyzing large network databases.
- Ivan Herreros, talentsconnect
Insightful and comprehensive. The author’s expertise is evident. Be prepared for a rewarding journey.
- Michal Štefaňák, Volke
Table of contents
- inside front cover
- Graph Algorithms for Data Science
- Copyright
- contents
- front matter
- Part 1 Introduction to graphs
- 1 Graphs and network science: An introduction
- 2 Representing network structure: Designing your first graph model
- Part 2 Network analysis
- 3 Your first steps with Cypher query language
- 4 Exploratory graph analysis
- 5 Introduction to social network analysis
- 6 Projecting monopartite networks
- 7 Inferring co-occurrence networks based on bipartite networks
- 8 Constructing a nearest neighbor similarity network
- Part 3 Graph machine learning
- 9 Node embeddings and classification
-
10 Link prediction
- 10.1 Link prediction workflow
- 10.2 Dataset split
- 10.2.1 Time-based split
- 10.2.2 Random split
- 10.2.3 Negative samples
- 10.3 Network feature engineering
- 10.3.1 Network distance
- 10.3.2 Preferential attachment
- 10.3.3 Common neighbors
- 10.3.4 Adamic-Adar index
- 10.3.5 Clustering coefficient of common neighbors
- 10.4 Link prediction classification model
- 10.4.1 Missing values
- 10.4.2 Training the model
- 10.4.3 Evaluating the model
- 10.5 Solutions to exercises
- Summary
- 11 Knowledge graph completion
- 12 Constructing a graph using natural language processing techniques
- Appendix. The Neo4j environment
- references
- index
Product information
- Title: Graph Algorithms for Data Science
- Author(s):
- Release date: February 2024
- Publisher(s): Manning Publications
- ISBN: 9781617299469
You might also like
book
Graph Data Science with Neo4j
Supercharge your data with the limitless potential of Neo4j 5, the premier graph database for cutting-edge …
book
Graph Machine Learning
Build machine learning algorithms using graph data and efficiently exploit topological information within your models Key …
book
Graph-Powered Analytics and Machine Learning with TigerGraph
With the rapid rise of graph databases, organizations are now implementing advanced analytics and machine learning …
book
Bayesian Analysis with Python - Third Edition
Learn the fundamentals of Bayesian modeling using state-of-the-art Python libraries, such as PyMC, ArviZ, Bambi, and …