How to install Apache ZooKeeper in Ubuntu 20.04

Apache ZooKeeper is a centralized service for maintaining configuration information, naming, providing synchronization, and providing group services. It enables highly reliable distributed coordination.

In this tutorial, we are going to learn how to install Zookeeper in Ubuntu 20.04, see the alternatives to ZooKeeper.

ZooKeeper Alternatives

There are some other products you can use if ZooKeeper isn’t your favorite flavor. These are:

  • Docker
  • Hashicorp Consul
  • Eureka
  • GRPC
  • etcd
  • Traefik

Errors to avoid if you configure ZooKeeper correctly.

If you want to avoid errors from occurring, you need to configure ZooKeeper properly. Here are the errors you can easily avoid:

  • Inconsistent list of servers. Zookeeper clients servers must match Zookeeper servers that each ZooKeeper server has.
  • Inconsistent placement of transaction logs. ZooKeeper sync transactions to media before it returns a response. A dedicated transaction log device is key to consistent good performance.
  • Incorrect java heap size. To avoid this, set java heap size correctly.
  • Publicly accessible deployment. Deploy a zooKeeper behind a firewall

Prerequisites

  • Have Java release 1.8 and above JDK 12
  • Use 3 recommended ZooKeeper servers for ensemble
  • Atleast 2GB RAM

Installing Apache ZooKeeper on Ubuntu 20.04

1. Updating system repositories

We need to make our system repositories up to date in order to avoid running into different errors later during installation.

$ sudo apt update && apt upgrade -y

2. Install Java runtime

Apache ZooKeeper requires java to run because it is written in java language. So to install java run the following command in your terminal. First, you can check the version if you already installed java.

$ java -version
openjdk version "1.8.0_312"
OpenJDK Runtime Environment (build 1.8.0_312-8u312-b07-0ubuntu1~20.04-b07)
OpenJDK 64-Bit Server VM (build 25.312-b07, mixed mode)

For my case I already have java installed but for a fresh installation of java in your system, run the following command to make the installed. Check this article on how to install java on Ubuntu 20.04.

$ sudo apt install openjdk-8-jdk openjdk-8-jre

3. Create user for Apache ZooKeeper

We need to create a user that will run all the ZooKeeper services.

$ sudo useradd nextgen -m

-m will create a dedicated home directory for the zookeeper user.

Next, we need to set bash as the default shell for the zookeeper user.

$ sudo usermod --shell /bin/bash nextgen

set a password for this user

$ sudo passwd nextgen

Next, add a user to sudoers group

$ sudo usermod -aG sudo nextgen 

You can check the created user if it exists.

$ sudo getent group sudo 
sudo:x:27:nextgen

4. Create a data directory for Apache ZooKeeper

ZooKeeper requires a place to read and write data to, this is the reason we need to create one for ZooKeeper.

$ sudo mkdir -p /data/zookeeper

To ensure that the created user can write into the directory, we need to give them permission.

$ sudo chown nextgen:nextgen /data/zookeeper

5. Install Apache Zookeeper on Ubuntu 20.04

Now we can download Zookeeper from the Apache download page. But first, move to /opt/ directory where you will extract the apache ZooKeeper to.

$ cd /opt

Use wget to get the download.

$ wget https://dlcdn.apache.org/zookeeper/zookeeper-3.7.0/apache-zookeeper-3.7.0.tar.gz

Next, we need to extract the binaries from the above download.

$ sudo tar -xvf apache-zookeeper-3.7.0.tar.gz

Next, we need to give permission to extracted binaries.

$ sudo chown nextgen:nextgen -R apache-zookeeper-3.7.0

The next thing is to create a symbolic link so that ZooKeeper can become accessible from all directories in your system.

$ sudo ln -s apache-zookeeper-3.7.0 zookeeper

we now need to change the ownership of that link to nextgen:nextgen

$ sudo chown -h nextgen:nextgen zookeeper

-h specifies you have changed the ownership of the link.

6. Configuring Apache ZooKeeper

Because we are not in a production environment, we are going to configure ZooKeeper in a standalone environment. When you are in a production environment configure replication mode. To configure ZooKeeper in standalone create a zoo.cfg in the Zookeeper directory.

$ sudo vi /opt/zookeeper/conf/zoo.cfg

We need to add the following lines in the zoo.cfg file

tickTime = 2000
dataDir = /data/zookeeper
clientPort = 2181
initLimit = 5
syncLimit = 2

Let’s look at the following in detail

  • tickTime: Sets the length of a tick in milliseconds.
  • dataDir: Specifies the directory used to store snapshots of the in-memory database and the transaction log for updates.
  • clientPort: The port used to listen for client connections.
  • initLimit: The number of ticks that the initial synchronization phase can take.
  • syncLimit: The number of ticks that can pass between sending a request and getting an acknowledgement.

7. Start Zookeeper service

To start the ZooKeeper service run the following command. Run this command inside /opt/zookeeper directory

$ sudo bin/zkServer.sh start

output

/usr/bin/java
ZooKeeper JMX enabled by default
Using config: /opt/apache-zookeeper-3.7.0/bin/../conf/zoo.cfg
Starting zookeeper ... STARTED

8. Connecting to ZooKeeper server

To connect to the local ZooKeeper server run the following command

$ bin/zkCli.sh -server 127.0.0.1:2181

Conclusion

We have learned how to install Apache ZooKeeper on Ubuntu 20.04. I am happy that you have enjoyed the session.

About Mason Kipward

I am a technology enthusiast who loves to share gained knowledge through offering daily tips as a way of empowering others. I am fan of Linux and all other things open source.
View all posts by Mason Kipward →

Leave a Reply

Your email address will not be published.