Apache Hive is the primary data-warehousing tool we use at our workplace for querying and data analysis. On top of that, we use Apache Oozie to schedule our workflows of Sqoop and Pig jobs, apart from Hive … Continue reading
Apache Sentry is a big data tool used to enforce fine-grained, role-based authorization to data and metadata on your Hadoop clusters. Recently I was playing around with Sentry and from the configuration manual … Continue reading
How to set up Sqoop incremental imports?
Here’s a step-by-step guide for Sqoop incremental imports, and since it says step-by-step, it’s going to be only that 😉.
Hive, a data … Continue reading