Book description
The Handbook of Statistical Analysis and Data Mining Applications is a comprehensive professional reference book that guides business analysts, scientists, engineers and researchers (both academic and industrial) through all stages of data analysis, model building and implementation. The Handbook helps one discern the technical and business problem, understand the strengths and weaknesses of modern data mining algorithms, and employ the right statistical methods for practical application. Use this book to address massive and complex datasets with novel statistical approaches and be able to objectively evaluate analyses and solutions. It has clear, intuitive explanations of the principles and tools for solving problems using modern analytic techniques, and discusses their application to real problems, in ways accessible and beneficial to practitioners across industries - from science and engineering, to medicine, academia and commerce. This handbook brings together, in a single resource, all the information a beginner will need to understand the tools and issues in data mining to build successful data mining solutions.
- Written "By Practitioners for Practitioners"
- Non-technical explanations build understanding without jargon and equations
- Tutorials in numerous fields of study provide step-by-step instruction on how to use supplied tools to build models
- Practical advice from successful real-world implementations
- Includes extensive case studies, examples, MS PowerPoint slides and datasets
- CD-DVD with valuable fully-working 90-day software included: "Complete Data Miner - QC-Miner - Text Miner" bound with book
Table of contents
- Cover image
- Title page
- Table of Contents
- Copyright
- Foreword 1
- Foreword 2
- Preface
- Introduction
- List of Tutorials by Guest Authors
-
Part I. History of Phases of Data Analysis, Basic Theory, and the Data Mining Process
- Chapter 1. The Background for Data Mining Practice
-
Chapter 2. Theoretical Considerations for Data Mining
- Preamble
- The Scientific Method
- What Is Data Mining?
- A Theoretical Framework for the Data Mining Process
- Strengths of the Data Mining Process
- Customer-Centric Versus Account-Centric: A New Way to Look at Your Data
- The Data Paradigm Shift
- Creation of the CAR
- Major Activities of Data Mining
- Major Challenges of Data Mining
- Examples of Data Mining Applications
- Major Issues in Data Mining
- General Requirements for Success in a Data Mining Project
- Example of a Data Mining Project: Classify a Bat’s Species by Its Sound
- The Importance of Domain Knowledge
- Postscript
- References
-
Chapter 3. The Data Mining Process
- Preamble
- The Science of Data Mining
- The Approach to Understanding and Problem Solving
- Business Understanding (Mostly Art)
- Data Understanding (Mostly Science)
- Data Preparation (A Mixture of Art and Science)
- Modeling (A Mixture of Art and Science)
- Deployment (Mostly Art)
- Closing the Information Loop* (Art)
- The Art of Data Mining
- Postscript
- References
- Chapter 4. Data Understanding and Preparation
- Chapter 5. Feature Selection
- Chapter 6. Accessory Tools for Doing Data Mining
-
Part II. The Algorithms in Data Mining and Text Mining, the Organization of the Three Most Common Data Mining Tools, and Selected Specialized Areas Using Data Mining
- Chapter 7. Basic Algorithms for Data Mining: A Brief Overview
- Chapter 8. Advanced Algorithms for Data Mining
- Chapter 9. Text Mining and Natural Language Processing
- Chapter 10. The Three Most Common Data Mining Software Tools
- Chapter 11. Classification
-
Chapter 12. Numerical Prediction
- Preamble
- Linear Response Analysis and the Assumptions of the Parametric Model
- Parametric Statistical Analysis
- Assumptions of the Parametric Model
- Linear Regression
- Generalized Linear Models (GLMs)
- Methods for Analyzing Nonlinear Relationships
- Nonlinear Regression and Estimation
- Data Mining and Machine Learning Algorithms Used in Numerical Prediction
- Advantages of Classification and Regression Trees (C&RT) Methods
- Application to Mixed Models
- Neural Nets for Prediction
- Support Vector Machines (SVMs) and Other Kernel Learning Algorithms
- Postscript
- References
- Chapter 13. Model Evaluation and Enhancement
- Chapter 14. Medical Informatics
- Chapter 15. Bioinformatics
- Chapter 16. Customer Response Modeling
-
Chapter 17. Fraud Detection
- Preamble
- Issues with Fraud Detection
- How Do You Detect Fraud?
- Supervised Classification of Fraud
- How Do You Model Fraud?
- How Are Fraud Detection Systems Built?
- Intrusion Detection Modeling
- Comparison of Models with and Without Time-Based Features
- Building Profiles
- Deployment of Fraud Profiles
- Postscript and Prolegomenon
- References
-
Part III. Tutorials—Step-by-step Case Studies as a Starting Point to Learn How to Do Data Mining Analyses
- Guest Authors of the Tutorials
- Tutorial A. How to Use Data Miner Recipe: STATISTICA Data Miner Only
- Tutorial B. Data Mining for Aviation Safety: Using Data Mining Recipe “Automatized Data Mining” from STATISTICA
- Tutorial C. Predicting Movie Box-Office Receipts: Using SPSS Clementine Data Mining Software
- Tutorial D. Detecting Unsatisfied Customers: A Case Study Using SAS Enterprise Miner Version 5.3 for the Analysis
- Tutorial E. Credit Scoring Using STATISTICA Data Miner
- Tutorial F. Churn Analysis with SPSS-Clementine
- Tutorial G. Text Mining: Automobile Brand Review Using STATISTICA Data Miner and Text Miner
- Tutorial H. Predictive Process Control: QC-Data Mining Using STATISTICA Data Miner and QC-Miner
- Tutorials I, J, and K. Three Short Tutorials Showing the Use of Data Mining and Particularly C&RT to Predict and Display Possible Structural Relationships among Data
- Tutorial I. Business Administration in a Medical Industry: Determining Possible Predictors for Days with Hospice Service for Patients with Dementia
- Tutorial J. Clinical Psychology: Making Decisions about Best Therapy for a Client: Using Data Mining to Explore the Structure of a Depression Instrument
- Tutorial K. Education–Leadership Training for Business and Education Using C&RT to Predict and Display Possible Structural Relationships
- Tutorial L. Dentistry: Facial Pain Study Based on 84 Predictor Variables (Both Categorical and Continuous)
- Tutorial M. Profit Analysis of the German Credit Data Using SAS-EM Version 5.3
- Tutorial N. Predicting Self-Reported Health Status Using Artificial Neural Networks
-
Part IV. Measuring Truecomplexity, the “Right Model for the Right Use,” Top Mistakes, and the Future of Analytics
- Chapter 18. Model Complexity (and How Ensembles Help)
- Chapter 19. The Right Model for the Right Purpose: When Less Is Good Enough
-
Chapter 20. Top 10 Data Mining Mistakes
- Preamble
- Introduction
- 0 Lack Data
- 1 Focus on Training
- 2 Rely on One Technique
- 3 Ask the Wrong Question
- 4 Listen (Only) to the Data
- 5 Accept Leaks from the Future
- 6 Discount Pesky Cases
- 7 Extrapolate
- 8 Answer Every Inquiry
- 9 Sample Casually
- 10 Believe the Best Model
- How Shall We Then Succeed?
- Postscript
- References
- Chapter 21. Prospects for the Future of Data Mining and Text Mining as Part of Our Everyday Lives
-
Chapter 22. Summary: Our Design
- Preamble
- Beware of Overtrained Models
- A Diversity of Models and Techniques Is Best
- The Process Is More Important Than the Tool
- Text Mining of Unstructured Data Is Becoming Very Important
- Practice Thinking about Your Organization as Organism Rather Than as Machine
- Good Solutions Evolve Rather Than Just Appear after Initial Efforts
- What You Don’t Do Is Just as Important as What You Do
- Very Intuitive Graphical Interfaces Are Replacing Procedural Programming
- Data Mining Is No Longer a Boutique Operation; It Is Firmly Established in the Mainstream of Our Society
- “Smart” Systems Are the Direction in Which Data Mining Technology Is Going
- Postscript
- References
- Glossary
- Index
Product information
- Title: Handbook of Statistical Analysis and Data Mining Applications
- Author(s):
- Release date: May 2009
- Publisher(s): Elsevier Science
- ISBN: 9780080912035
You might also like
book
Data Mining and Machine Learning Applications
DATA MINING AND MACHINE LEARNING APPLICATIONS The book elaborates in detail on the current needs of …
book
Data Mining and Predictive Analytics, 2nd Edition
Learn methods of data analysis and their application to real-world data sets This updated second edition …
book
Predictive Analytics and Data Mining
Put Predictive Analytics into ActionLearn the basics of Predictive Analysis and Data Mining through an easy …
book
R: Predictive Analysis
Master the art of predictive modeling About This Book Load, wrangle, and analyze your data using …