Tuesday, August 2, 2016

Data synchronization between rdbms and hive using sqoop

Hive can be a great backup environment for RDBMS data or simply as a data warehouse. Hive provides a great architecture for bulk OLAP data. Hive is also a great choice for data charting workspace where hadoop technologies can be employed to crunch data.
Because many organizations still use rdbms and sql technology in their data warehouse, it is easier to export data in hive to perform bulk processing. Sometimes data dumping and reimporting into hive is inefficient therefore a data synchronization strategy using jdbc technology is more logical. Sqoop is designed to replicate data between different databases by speaking the same 'jdbc language'.
Lets see how sqoop works between sql server and hive.



1 comment:

  1. Quraan.pk is Pakistan’s leading online store dedicated to providing authentic and beautifully crafted editions of the Holy Quran. With a wide range of Qurans in different scripts, fonts, and translations, Quraan.pk ensures that every reader finds the perfect copy to enhance their recitation and understanding.https://quraan.pk/

    ReplyDelete