Heartbeat in hadoop
Failed to receive heartbeat
Adding a new host to the cloudera hadoop cluster fails with no heartbeat from the agent
http://www.cloudera.com/ is a Hadoop distribution and helps you handle the complexity of Hadoop platform with a robust … Continue reading
Apache Sentry is a Big data tool used to enforce fine grained role based authorization to data and metadata on your hadoop clusters. Recently I was playing around with Sentry and from the configuration manual on cloudera website, the … Continue reading
How to run sqoop jobs from Oozie
Sqoop is a tool to import/export data from a relaional database to HDFS and vice-versa. It is super-easy to use and uses Map-reduce jobs behind the scenes to move data from source to … Continue reading
How to set up Sqoop incremental imports?
Here’s a step by step guide for Sqoop incremental imports and since it says step-by-step, it’s going to be only that 😉 .
Hive, a data warehousing tool is a solution … Continue reading
Illegal partition exception in sqoop for incremental imports
Sqoop is an amazing tool by apache that is widely used to import/export data between Hadoop and relational databases. I have particularly used this for developing a data-warehouse wherein I was pulling … Continue reading
Horton-works data platform abbreviated as HDP is a completely open source distribution of Hadoop. As much as a breeze it is to provision, manage and monitor clusters each with multiple nodes through Ambari, It’s a pain in a*s as soon … Continue reading
How to run sqoop job through oozie successfully
Sqoop jobs are used to create and save the import and export commands. It helps to automate the sqoop tasks and in re-execution of sqoop actions. I mostly find myself writing sqoop … Continue reading