Book description
Data science unifies statistics, data analysis and machine learning to achieve a better understanding of the masses of data which are produced today, and to improve prediction. Special kinds of data (symbolic, network, complex, compositional) are increasingly frequent in data science. These data require specific methodologies, but there is a lack of reference work in this field.
Advances in Data Science fills this gap. It presents a collection of up-to-date contributions by eminent scholars following two international workshops held in Beijing and Paris. The 10 chapters are organized into four parts: Symbolic Data, Complex Data, Network Data and Clustering. They include fundamental contributions, as well as applications to several domains, including business and the social sciences.
Table of contents
- Cover
- Preface
-
Part 1: Symbolic Data
- 1 Explanatory Tools for Machine Learning in the Symbolic Data Analysis Framework
- 2 Likelihood in the Symbolic Context
-
3 Dimension Reduction and Visualization of Symbolic Interval-Valued Data Using Sliced Inverse Regression
- 3.1. Introduction
- 3.2. PCA for interval-valued data and the sliced inverse regression
- 3.3. SIR for interval-valued data
- 3.4. Projections and visualization in DR subspace
- 3.5. Some computational issues
- 3.6. Simulation studies
- 3.7. A real data example: face recognition data
- 3.8. Conclusion and discussion
- 3.9. References
- 4 On the “Complexity” of Social Reality. Some Reflections About the Use of Symbolic Data Analysis in Social Sciences
-
Part 2: Complex Data
-
5 A Spatial Dependence Measure and Prediction of Georeferenced Data Streams Summarized by Histograms
- 5.1. Introduction
- 5.2. Processing setup
- 5.3. Main definitions
- 5.4. Online summarization of a data stream through CluStream for Histogram data
- 5.5. Spatial dependence monitoring: a variogram for histogram data
- 5.6. Ordinary kriging for histogram data
- 5.7. Experimental results on real data
- 5.8. Conclusion
- 5.9. References
- 6 Incremental Calculation Framework for Complex Data
-
5 A Spatial Dependence Measure and Prediction of Georeferenced Data Streams Summarized by Histograms
- Part 3: Network Data
- Part 4: Clustering
- List of Authors
- Index
- End User License Agreement
Product information
- Title: Advances in Data Science
- Author(s):
- Release date: February 2020
- Publisher(s): Wiley-ISTE
- ISBN: 9781786305763
You might also like
book
Introduction to Statistical and Machine Learning Methods for Data Science
Boost your understanding of data science techniques to solve real-world problems Data science is an exciting, …
book
Managing Data Science
Understand data science concepts and methodologies to manage and deliver top-notch solutions for your organization Key …
book
Cleaning Data for Effective Data Science
Think about your data intelligently and ask the right questions Key Features Master data cleaning techniques …
book
Practical Data Science with Python 3: Synthesizing Actionable Insights from Data
Gain insight into essential data science skills in a holistic manner using data engineering and associated …