Book description
Computer vision is central to many leading-edge innovations, including self-driving cars, drones, augmented reality, facial recognition, and much, much more. Amazing new computer vision applications are developed every day, thanks to rapid advances in AI and deep learning (DL). Deep Learning for Vision Systems teaches you the concepts and tools for building intelligent, scalable computer vision systems that can identify and react to objects in images, videos, and real life. With author Mohamed Elgendy's expert instruction and illustration of real-world projects, you’ll finally grok state-of-the-art deep learning techniques, so you can build, contribute to, and lead in the exciting realm of computer vision!About the Technology
How much has computer vision advanced? One ride in a Tesla is the only answer you’ll need. Deep learning techniques have led to exciting breakthroughs in facial recognition, interactive simulations, and medical imaging, but nothing beats seeing a car respond to real-world stimuli while speeding down the highway.
About the Book
How does the computer learn to understand what it sees? Deep Learning for Vision Systems answers that by applying deep learning to computer vision. Using only high school algebra, this book illuminates the concepts behind visual intuition. You'll understand how to use deep learning architectures to build vision system applications for image generation and facial recognition.
What's Inside
- Image classification and object detection
- Advanced deep learning architectures
- Transfer learning and generative adversarial networks
- DeepDream and neural style transfer
- Visual embeddings and image search
About the Reader
For intermediate Python programmers.
About the Author
Mohamed Elgendy is the VP of Engineering at Rakuten. A seasoned AI expert, he has previously built and managed AI products at Amazon and Twilio.
Quotes
From text and object detection to DeepDream and facial recognition...this book is comprehensive, approachable, and relevant for modern applications of deep learning to computer vision systems!
- Bojan Djurkovic, DigitalOcean
Real-world problem solving without drowning you in details. It elaborates concepts bit by bit, making them easy to assimilate.
- Burhan Ul Haq, Audit XPRT
An invaluable and comprehensive tour for anyone looking to build real-world vision systems.
- Richard Vaughan, Purple Monkey Collective
Shows you what’s behind modern technologies that allow computers to see things.
- Alessandro Campeis, Vimar
Table of contents
- Deep Learning for Vision Systems
- Copyright
- dedication
- contents
- front matter
- Part 1. Deep learning foundation
- 1 Welcome to computer vision
- 2 Deep learning and neural networks
- 3 Convolutional neural networks
-
4 Structuring DL projects and hyperparameter tuning
- 4.1 Defining performance metrics
- 4.2 Designing a baseline model
- 4.3 Getting your data ready for training
- 4.4 Evaluating the model and interpreting its performance
- 4.5 Improving the network and tuning hyperparameters
- 4.6 Learning and optimization
- 4.7 Optimization algorithms
- 4.8 Regularization techniques to avoid overfitting
- 4.9 Batch normalization
- 4.10 Project: Achieve high accuracy on image classification
- Summary
- Part 2. Image classification and detection
- 5 Advanced CNN architectures
-
6 Transfer learning
- 6.1 What problems does transfer learning solve?
- 6.2 What is transfer learning?
- 6.3 How transfer learning works
- 6.4 Transfer learning approaches
-
6.5 Choosing the appropriate level of transfer learning
- 6.5.1 Scenario 1: Target dataset is small and similar to the source dataset
- 6.5.2 Scenario 2: Target dataset is large and similar to the source dataset
- 6.5.3 Scenario 3: Target dataset is small and different from the source dataset
- 6.5.4 Scenario 4: Target dataset is large and different from the source dataset
- 6.5.5 Recap of the transfer learning scenarios
- 6.6 Open source datasets
- 6.7 Project 1: A pretrained network as a feature extractor
- 6.8 Project 2: Fine-tuning
- Summary
- 7 Object detection with R-CNN, SSD, and YOLO
- Part 3. Generative models and visual embeddings
- 8 Generative adversarial networks (GANs)
- 9 DeepDream and neural style transfer
- 10 Visual embeddings
- appendix A. Getting set up
- index
Product information
- Title: Deep Learning for Vision Systems
- Author(s):
- Release date: November 2020
- Publisher(s): Manning Publications
- ISBN: 9781617296192
You might also like
book
Deep Learning for Computer Vision
Learn how to model and train advanced neural networks to implement a variety of Computer Vision …
video
PyTorch for Deep Learning and Computer Vision
PyTorch has rapidly become one of the most transformative frameworks in the field of deep learning. …
book
Practical Machine Learning for Computer Vision
This practical book shows you how to employ machine learning models to extract information from images. …
book
Deep Learning
Ever since computers began beating us at chess, they've been getting better at a wide range …