Book description
Perform genome analysis and sequencing of data with Amazon Web Services
Genomics in the AWS Cloud: Analyzing Genetic Code Using Amazon Web Services enables a person who has moderate familiarity with AWS Cloud to perform full genome analysis and research. Using the information in this book, you'll be able to take a FASTQ file containing raw data from a lab or a BAM file from a service provider and perform genome analysis on it. You'll also be able to identify potentially pathogenic gene sequences.
- Get an introduction to Whole Genome Sequencing (WGS)
- Make sense of WGS on AWS
- Master AWS services for genome analysis
A key advantage of using AWS for genomic analysis is the wide choice of compute services that researchers can use to process diverse datasets in analysis pipelines. Genomic sequencers that generate raw data files are located in on-premises labs, and AWS provides solutions that make it easy for customers to transfer these files to AWS reliably and securely. Storing genomics and medical (e.g., imaging) data at different stages requires enormous amounts of cost-effective storage. Amazon Simple Storage Service (Amazon S3), Amazon Glacier, and Amazon Elastic Block Store (Amazon EBS) provide the necessary solutions to securely store, manage, and scale genomic file storage. Moreover, these storage services can interface with various AWS compute services to process the files.
Whether you're just getting started or have already been analyzing genomics data using the AWS Cloud, this book provides you with the information you need in order to use AWS services and features in the ways that will make the most sense for your genomic research.
Table of contents
- Cover
- Title Page
- Introduction
- CHAPTER 1: Why Do Genome Analysis Yourself When Commercial Offerings Exist?
- CHAPTER 2: A Crash Course in Molecular Biology
- CHAPTER 3: Obtaining Your Genome
- CHAPTER 4: The Bioinformatics Workflow
  - Extraction of DNA
  - FASTA Files
  - FASTQ Files
  - Alignment to a Reference Genome
  - Reference Genomes
  - Quality Control
  - Trimming
  - The Alignment Process
  - Marking Duplicates
  - Recalibrating Base Quality Score
  - Calling SNVs and Indel Variants
  - Annotating SNVs and Indel Variants
  - Prioritizing Variants
  - Inheritance Analysis
  - Identifying SVs and CNVs
  - Bioinformatics Workflow
  - Summary
- CHAPTER 5: AWS Services for Genome Analysis
- CHAPTER 6: Building Your Environment in the AWS Cloud
- CHAPTER 7: Linux and AWS Command-Line Basics for Genomics
  - Selecting a Linux Distribution
  - Accessing Your AWS Linux Instance from Your Local Computer
  - Getting Familiar with the Command Line
  - Transferring Files to and from Your AWS Instance
  - Running Programs in the Background
  - Understanding File Permissions
  - Compressing and Archiving Files
  - Managing Linux
  - The AWS Command-Line Interface
  - AWS CLI Essentials
  - An Alternative Approach: AWS Systems Manager
  - Summary
- CHAPTER 8: Processing the Sequencing Data
- CHAPTER 9: Visualizing the Genome
- CHAPTER 10: Containerizing Your Workflow on the Desktop
- CHAPTER 11: Variants and Applications
- CHAPTER 12: Cancer Genomics
- Index
- Copyright
- Dedication
- Acknowledgments
- About the Authors
- End User License Agreement
Product information
- Title: Genomics in the AWS Cloud
- Author(s):
- Release date: May 2023
- Publisher(s): Wiley
- ISBN: 9781119573371