Video description
Alluxio is the solution of choice for big companies that need to manage data at multi-petabyte scale. In this course, PMC member Calvin Jia offers a full Alluxio tour to any data scientist, developer, or system administrator looking to improve the performance of their workloads, develop applications with Alluxio, or deploy and manage Alluxio clusters.
He offers a high-level view (why Alluxio was developed, the problems it solves, who uses it, etc.) as well as a hands-on practicum. You'll set up your own deployment (locally and in a cluster), using a compute framework on top of Alluxio and connecting it to multiple persistent data stores while preserving one namespace. Take this course and you'll come away knowing the benefits Alluxio brings to big data stacks.
- Understand the features and benefits of Alluxio and master the basics of how to use it
- Discover why companies like Intel, Baidu, and Alibaba use Alluxio for their big data needs
- Learn how the storage unification layer bridges computation frameworks and storage systems
- Gain practical experience deploying Alluxio in local and cluster modes
- Learn how to use Alluxio tools like the command line and the web UI
- Explore the Alluxio open source ecosystem and learn who the players are
Table of contents
Introduction
- About Alluxio And The Course 00:03:38
- About The Author 00:01:24
Using Alluxio Locally
- Downloading Alluxio 00:03:03
- Starting The System Locally 00:05:09
- Interacting Via The Shell 00:02:45
- Browsing The Web UI 00:03:53
Examples With Alluxio
- Setting Up Alluxio With Spark And S3 00:06:15
- Running Spark on Alluxio with S3 00:05:29
- Using Alluxio With Unified Namespace 00:06:05
Deploying Alluxio On A Cluster
- Deploying Alluxio In AWS 00:07:49
- Conclusion
Product information
- Title: Introduction to Alluxio
- Author(s): Calvin Jia
- Release date: June 2016
- Publisher(s): Infinite Skills
- ISBN: 9781771376006