Disadvantages of Sqoop

Even though Sqoop has very strong advantages to its name, it does have some inherent disadvantages, which can be summarized as:

  • It uses a JDBC connection to connect with RDBMS based data stores, and this can be inefficient and less performant.
  • For performing analysis, it executes various map-reduce jobs and, at times, this can be time consuming when there are lot of joins if the data is in a denormalized fashion.
  • Being used for bulk transfer of data, it could put undue pressure on the source data store, and this is not ideal if these stores are heavily used by the main business application.

Get Data Lake for Enterprises now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.