Book description
DATA WRANGLINGWritten and edited by some of the world’s top experts in the field, this exciting new volume provides state-of-the-art research and latest technological breakthroughs in data wrangling, its theoretical concepts, practical applications, and tools for solving everyday problems.
Data wrangling is the process of cleaning and unifying messy and complex data sets for easy access and analysis. This process typically includes manually converting and mapping data from one raw form into another format to allow for more convenient consumption and organization of the data. Data wrangling is increasingly ubiquitous at today’s top firms.
Data cleaning focuses on removing inaccurate data from your data set whereas data wrangling focuses on transforming the data’s format, typically by converting “raw” data into another format more suitable for use. Data wrangling is a necessary component of any business. Data wrangling solutions are specifically designed and architected to handle diverse, complex data at any scale, including many applications, such as Datameer, Infogix, Paxata, Talend, Tamr, TMMData, and Trifacta.
This book synthesizes the processes of data wrangling into a comprehensive overview, with a strong focus on recent and rapidly evolving agile analytic processes in data-driven enterprises, for businesses and other enterprises to use to find solutions for their everyday problems and practical applications. Whether for the veteran engineer, scientist, or other industry professional, this book is a must have for any library.
Table of contents
- Cover
- Series Page
- Title Page
- Copyright Page
- 1 Basic Principles of Data Wrangling
-
2 Skills and Responsibilities of Data Wrangler
- 2.1 Introduction
- 2.2 Role as an Administrator (Data and Database)
- 2.3 Skills Required
- 2.4 Responsibilities as Database Administrator
- 2.5 Concerns for a DBA [12]
- 2.6 Data Mishandling and Its Consequences
- 2.7 The Long-Term Consequences: Loss of Trust and Diminished Reputation
- 2.8 Solution to the Problem
- 2.9 Case Studies
- 2.10 Conclusion
- References
- 3 Data Wrangling Dynamics
-
4 Essentials of Data Wrangling
- 4.1 Introduction
- 4.2 Holistic Workflow Framework for Data Projects
- 4.3 The Actions in Holistic Workflow Framework
- 4.4 Transformation Tasks Involved in Data Wrangling
- 4.5 Description of Two Types of Core Profiling
- 4.6 Case Study
- 4.7 Quantitative Analysis
- 4.8 Graphical Representation
- 4.9 Conclusion
- References
-
5 Data Leakage and Data Wrangling in Machine Learning for Medical Treatment
- 5.1 Introduction
- 5.2 Data Wrangling and Data Leakage
- 5.3 Data Wrangling Stages
- 5.4 Significance of Data Wrangling
- 5.5 Data Wrangling Examples
- 5.6 Data Wrangling Tools for Python
- 5.7 Data Wrangling Tools and Methods
- 5.8 Use of Data Preprocessing
- 5.9 Use of Data Wrangling
- 5.10 Data Wrangling in Machine Learning
- 5.11 Enhancement of Express Analytics Using Data Wrangling Process
- 5.12 Conclusion
- References
- 6 Importance of Data Wrangling in Industry 4.0
- 7 Managing Data Structure in R
- 8 Dimension Reduction Techniques in Distributional Semantics: An Application Specific Review
-
9 Big Data Analytics in Real Time for Enterprise Applications to Produce Useful Intelligence
- 9.1 Introduction
- 9.2 The Internet of Things and Big Data Correlation
- 9.3 Design, Structure, and Techniques for Big Data Technology
- 9.4 Aspiration for Meaningful Analyses and Big Data Visualization Tools
- 9.5 Big Data Applications in the Commercial Surroundings
- 9.6 Big Data Insights’ Constraints
- 9.7 Conclusion
- References
- 10 Generative Adversarial Networks: A Comprehensive Review
- 11 Analysis of Machine Learning Frameworks Used in Image Processing: A Review
-
12 Use and Application of Artificial Intelligence in Accounting and Finance: Benefits and Challenges
- 12.1 Introduction
- 12.2 Uses of AI in Accounting & Finance Sector
- 12.3 Applications of AI in Accounting and Finance Sector
- 12.4 Benefits and Advantages of AI in Accounting and Finance
- 12.5 Challenges of AI Application in Accounting and Finance
- 12.6 Suggestions and Recommendation
- 12.7 Conclusion and Future Scope of the Study
- References
- 13 Obstacle Avoidance Simulation and Real-Time Lane Detection for AI-Based Self-Driving Car
- 14 Impact of Suppliers Network on SCM of Indian Auto Industry: A Case of Maruti Suzuki India Limited
- About the Editors
- Index
- Also of Interest
- End User License Agreement
Product information
- Title: Data Wrangling
- Author(s):
- Release date: July 2023
- Publisher(s): Wiley-Scrivener
- ISBN: 9781119879688
You might also like
book
Data for All
Do you know what happens to your personal data when you are browsing, buying, or using …
book
Cleaning Data for Effective Data Science
Think about your data intelligently and ask the right questions Key Features Master data cleaning techniques …
book
The Shape of Data
Whether you’re a mathematician, seasoned data scientist, or marketing professional, you’ll find The Shape of Data …
book
Dive Into Data Science
Dive into the exciting world of data science with this practical introduction. Packed with essential skills …