Book description
Principles of Big Data helps readers avoid the common mistakes that endanger all Big Data projects. By stressing simple, fundamental concepts, this book teaches readers how to organize large volumes of complex data, and how to achieve data permanence when the content of the data is constantly changing. General methods for data verification and validation, as specifically applied to Big Data resources, are stressed throughout the book. The book demonstrates how adept analysts can find relationships among data objects held in disparate Big Data resources, when the data objects are endowed with semantic support (i.e., organized in classes of uniquely identified data objects). Readers will learn how their data can be integrated with data from other resources, and how the data extracted from Big Data resources can be used for purposes beyond those imagined by the data creators.
- Learn general methods for specifying Big Data in a way that is understandable to humans and to computers
- Avoid the pitfalls in Big Data design and analysis
- Understand how to create and use Big Data safely and responsibly under the laws, regulations, and ethical standards that apply to the acquisition, distribution, and integration of Big Data resources
Table of contents
- Cover image
- Title page
- Table of Contents
- Copyright
- Dedication
- Acknowledgments
- Author Biography
- Preface
- Introduction
- Chapter 1. Providing Structure to Unstructured Data
- Chapter 2. Identification, Deidentification, and Reidentification
- Chapter 3. Ontologies and Semantics
- Chapter 4. Introspection
- Chapter 5. Data Integration and Software Interoperability
- Chapter 6. Immutability and Immortality
- Chapter 7. Measurement
- Chapter 8. Simple but Powerful Big Data Techniques
- Chapter 9. Analysis
- Chapter 10. Special Considerations in Big Data Analysis
- Chapter 11. Stepwise Approach to Big Data Analysis
  - Background
  - Step 1. A Question Is Formulated
  - Step 2. Resource Evaluation
  - Step 3. A Question Is Reformulated
  - Step 4. Query Output Adequacy
  - Step 5. Data Description
  - Step 6. Data Reduction
  - Step 7. Algorithms Are Selected, If Absolutely Necessary
  - Step 8. Results Are Reviewed and Conclusions Are Asserted
  - Step 9. Conclusions Are Examined and Subjected to Validation
  - References
- Chapter 12. Failure
- Chapter 13. Legalities
  - Background
  - Responsibility for the Accuracy and Legitimacy of Contained Data
  - Rights to Create, Use, and Share the Resource
  - Copyright and Patent Infringements Incurred by Using Standards
  - Protections for Individuals
  - Consent
  - Unconsented Data
  - Good Policies Are a Good Policy
  - Use Case: The Havasupai Story
  - References
- Chapter 14. Societal Issues
- Chapter 15. The Future
- Glossary
- References
- Index
Product information
- Title: Principles of Big Data
- Author(s):
- Release date: May 2013
- Publisher(s): Morgan Kaufmann
- ISBN: 9780124047242