Apache Hive Replication : Encryption ZonesWe have been working on Apache hive replication for almost 3 years now solving various use cases some of which i have talked about in my…Mar 6, 2020Mar 6, 2020
Apache Hive Replication : Doing right by External TablesOver the past year we have been focusing on building a ground up replication solution for Apache hive, the most popular Data warehousing…Mar 29, 2019Mar 29, 2019
Apache Hive Replication: Overview of changes in ACIDEarlier I had published a post introducing what we have been doing with Replication V2 in Apache Hive. In this post I am going to provide a…Aug 24, 2018Aug 24, 2018
Apache Hive: Introduction to Replication V2I am going to write about the changes that are happening for replication in Apache hive. These will be released in Apache Hive 3.0.0. This…Dec 20, 20171Dec 20, 20171
Apache Hive Beeline : Progress BarApache Hive needs no introduction for any one working in the big data space. Its the default go to SQL on Hadoop solution used in most…Mar 22, 2017Mar 22, 2017
Log4j2 Logging: A PrimerLogging is required in any application that we write. This is one of the most common components that developers across different industries…Jan 2, 20171Jan 2, 20171
@ Scale: Little things matterThere are a lot of tiny optimizations that can be done in large scale systems over time that will make the whole system significantly…Oct 17, 2016Oct 17, 2016
Kappa Architecture : In PracticeMost of us who have been working with big data systems have come across the two most important architecture configurations namely Kappa…Aug 21, 2016Aug 21, 2016
Kafka Log Cleaner IssuesKafka has become the default messaging system in most of the companies now. It has been around for sometime and provides great control to…Jun 21, 20161Jun 21, 20161
High Level Consumer On Kafka 0.8.2.2In this post i am going to discuss the user of high level consumer with kafka 0.8.2.2, a problem we faced and some points / links to make…Mar 10, 2016Mar 10, 2016