Book description
What is text mining, and how can it be used? What relevance do these methods have to everyday work in information science and the digital humanities? How does one develop competences in text mining? Working with Text provides a series of cross-disciplinary perspectives on text mining and its applications. As text mining raises legal and ethical issues, the legal background of text mining and the responsibilities of the engineer are discussed in this book. Chapters provide an introduction to the use of the popular GATE text mining package with data drawn from social media, the use of text mining to support semantic search, the development of an authority system to support content tagging, and recent techniques in automatic language evaluation. Focused studies describe text mining on historical texts, automated indexing using constrained vocabularies, and the use of natural language processing to explore the climate science literature. Interviews are included that offer a glimpse into the real-life experience of working within commercial and academic text mining.
- Introduces text analysis and text mining tools
- Provides a comprehensive overview of costs and benefits
- Introduces the topic, making it accessible to a general audience in a variety of fields, including examples from biology, chemistry, sociology, and criminology
Table of contents
- Cover image
- Title page
- Table of Contents
- Copyright
- Contributors
- Preface
- Acknowledgements
-
Chapter 1: Working with Text
- 1.1 Introduction: Portraits of the Past
- 1.2 The Reading Robot
- 1.3 From Data to Text Mining
- 1.4 Definitions of Text Mining
- 1.5 Exploring the Disciplinary Neighbourhood
- 1.6 Prerequisites for Text Mining
- 1.7 Learning Minecraft: What Makes a Text Miner?
- 1.8 Contemporary Attitudes to Text Mining
- 1.9 Conclusions
-
Chapter 2: A Day at Work (with Text): A Brief Introduction
- Abstract
- 2.1 Introduction
- 2.2 Encouraging an Interest in Text Mining
- 2.3 Legal and Ethical Aspects of Text Mining
- 2.4 Manual Annotation: Preparing for Evaluation
- 2.5 Common Text Mining Tasks
- 2.6 Basic Corpus Analysis
- 2.7 Preprocessing a Text
- 2.8 Extracting Features from a Text
- 2.9 Information Extraction
- 2.10 Applications of Indexing and Metadata Extraction
- 2.11 Extraction of Subjective Views
- 2.12 Build, Customise or Apply? Choosing an Appropriate Implementation
- 2.13 Evaluation
- 2.14 The Role of Visualisation in Text Mining
- 2.15 Visualisation Tools and Frameworks
- 2.16 Conclusions
- Chapter 3: If You Find Yourself in a Hole, Stop Digging: Legal and Ethical Issues of Text/Data Mining in Research
- Chapter 4: Responsible Content Mining
- Chapter 5: Text Mining for Semantic Search in Europe PubMed Central Labs
- Chapter 6: Extracting Information from Social Media with GATE
- Chapter 7: Newton: Building an Authority-Driven Company Tagging and Resolution System
- Chapter 8: Automatic Language Identification
- Chapter 9: User-Driven Text Mining of Historical Text
- Chapter 10: Automatic Text Indexing with SKOS Vocabularies in HIVE
- Chapter 11: The PIMMS Project and Natural Language Processing for Climate Science: Extending the ChemicalTagger Natural Language Processing Tool with Climate Science Controlled Vocabularies
- Chapter 12: Building Better Mousetraps: A Linguist in NLP
- Chapter 13: Raúl Garreta, Co-founder of Tryolabs.com, Tells Emma Tonkin About the Journey from Software Engineering Graduate to Startup Entrepreneur
- Appendix A: Resources for Text Mining
- Appendix B: Databases and Vocabularies
- Appendix C: Visualisation Tools and Resources
- Appendix D: Learning Opportunities
- Index
Product information
- Title: Working with Text
- Author(s):
- Release date: July 2016
- Publisher(s): Chandos Publishing
- ISBN: 9781780634302
You might also like
article
Communicate with Teams More Effectively
This selection of shortcuts will enable you to improve your communication, critical thinking, documentation, and networking …
article
Reinventing the Organization for GenAI and LLMs
Previous technology breakthroughs did not upend organizational structure, but generative AI and LLMs will. We now …
article
Learn to Give Constructive Feedback
Communication is a multifaceted skill that facilitates the flow of information and collaboration in the workplace. …
video
This should be faster; what's going on?
Software and applications run on a real physical network. Trying to deliver packets across the data …