Video description
Neural architecture search (NAS) is a promising research direction that has the potential to replace expert-designed networks with learned, task-specific architectures. Ameet Talwalkar (Carnegie Mellon University | Determined AI) shares work that aims to help ground the empirical results in this field and proposes new NAS baselines that build on two observations: NAS is a specialized hyperparameter optimization problem, and random search is a competitive baseline for hyperparameter optimization.
Leveraging these observations, Ameet evaluates two algorithms, random search with early stopping and a novel random search with weight-sharing, on two standard NAS benchmarks: Penn Treebank (PTB) and CIFAR-10. Results show that random search with early stopping is a competitive NAS baseline that performs at least as well as ENAS, a leading NAS method, on both benchmarks. Additionally, random search with weight-sharing outperforms random search with early stopping, achieving a state-of-the-art NAS result on PTB and a highly competitive result on CIFAR-10.
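To make the idea of random search with early stopping concrete, here is a minimal sketch on a toy problem. It is an illustration only, not the implementation from the talk. The search space, the synthetic `toy_validation_loss` function, and the use of successive halving as the early-stopping rule are all assumptions made for this example.

```python
import random

# Hypothetical toy search space; a real NAS space would describe cell or layer structure.
SEARCH_SPACE = {
    "num_layers": [2, 4, 6, 8],
    "hidden_units": [64, 128, 256],
    "activation": ["relu", "tanh"],
}


def sample_architecture(rng):
    """Draw one architecture uniformly at random from the search space."""
    return {name: rng.choice(options) for name, options in SEARCH_SPACE.items()}


def toy_validation_loss(arch, epochs, rng):
    """Synthetic stand-in for 'train arch for `epochs` epochs, return validation loss'.

    Assumption for illustration: loss shrinks with more epochs, and
    6 layers with 128 hidden units is the best configuration.
    """
    quality = 0.1 * abs(arch["num_layers"] - 6) + abs(arch["hidden_units"] - 128) / 1280
    noise = rng.gauss(0, 0.05) / epochs ** 0.5
    return quality + 1.0 / epochs + noise


def random_search_with_early_stopping(num_candidates=27, min_epochs=1, eta=3, seed=0):
    """Random sampling plus successive halving.

    Each round evaluates the surviving candidates at the current budget,
    keeps the best 1/eta of them, and multiplies the budget by eta.
    """
    rng = random.Random(seed)
    candidates = [sample_architecture(rng) for _ in range(num_candidates)]
    epochs = min_epochs
    while True:
        scored = [(toy_validation_loss(arch, epochs, rng), arch) for arch in candidates]
        scored.sort(key=lambda pair: pair[0])
        if len(scored) == 1:
            best_loss, best_arch = scored[0]
            return best_arch, best_loss, epochs
        keep = max(1, len(scored) // eta)  # stop the worst candidates early
        candidates = [arch for _, arch in scored[:keep]]
        epochs *= eta


if __name__ == "__main__":
    arch, loss, epochs = random_search_with_early_stopping()
    print(arch, round(loss, 3), epochs)
```

With the default settings, 27 random architectures are evaluated for 1 epoch, the best 9 for 3 epochs, the best 3 for 9 epochs, and the final survivor for 27 epochs, so most of the budget goes to the most promising candidates. Fixing the seed makes the run repeatable, which matters for the reproducibility concerns the talk raises.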
Ameet concludes by exploring existing reproducibility issues for published NAS results, noting the lack of source material needed to exactly reproduce these results, and discussing the robustness of published results given the various sources of variability in NAS experimental setups.
All information (code, random seeds, documentation) needed to exactly reproduce the presented results will be shared, along with results from two independent experimental runs of random search with weight-sharing on each benchmark.
This session was recorded at the 2019 O'Reilly Artificial Intelligence Conference in New York.
Product information
- Title: Random search and reproducibility for neural architecture search
- Author(s): Ameet Talwalkar
- Release date: October 2019
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 0636920339373