How HashMap works internally in Java: A debug approach

An interesting approach to learn how HashMap works in Java

Arpit Mandliya
Javarevisited
7 min readMay 3, 2019

--

Most common interview questions are “How HashMap works in Java”, “How to get and put method of HashMap work internally”. Here I am trying to explain internal functionality with an easy example.

HashMap is one of the most used Collections in java. Rather than going through theory, we will start with an example first, so that you will get a better understanding and then we will see how to get and put function work in java.

Let’s take a very simple example. I have a Country class, we are going to use Country class object as key and its capital name(string) as value. Below example will help you to understand, how these key-value pair will be stored in hashmap.

1. Country.java

If you want to understand more about hashcode and equals method of an object, you may refer hashcode() and equals() method in java

2. HashMapStructure.java(main class)

Now put a break-point at line 24 and right click on project->debug as-> java application. The program will stop execution at line 24 then right click on countryCapitalMap then select a watch. You will be able to see structure as below.

Now From the above diagram, you can observe the following points

  1. There is an Entry[] array called table which has size 16.
  2. This table stores the Entry class’s object. HashMap class has an inner class called Entry. This Entry has key value as an instance variable. Let’s see the structure of the entry class Entry Structure.
  1. Whenever we try to put any key-value pair in hashmap, Entry class object is instantiated for key value and that object will be stored in above-mentioned Entry[](table). Now you must be wondering, where will above created Entry object gets stored(exact position in the table). The answer is, the hash code is calculated for a key by calling the hashcode() method. This hashcode is used to calculate the index for the above Entry[] table.
  2. Now, If you see at array index 10 in the above diagram, It has an Entry object named HashMap$Entry.
  3. We have put 4 key-values in hashmap but it seems to have only 2!!!! This is because if two objects have the same hashcode, they will be stored at the same index. Now the question arises how? It stores objects in the form of LinkedList(logically).

So how hashcode of above country key-value pairs is calculated.

Below diagram will explain the LinkedList concept clearly.

So now if you have a good understanding of hash table data structure, Let's go throughput and get method.

Put

Let’s see the implementation of the put method:

now let’s understand above code step by step

  1. The key object is checked for null. If a key is null then it will be stored at the table[0] because hashcode for null is always 0.
  2. Key object’s hashcode() method is called and the hash code is calculated. This hashcode is used to find the index of an array for storing the Entry object. It may happen sometimes that, this hashcode function is poorly written so JDK designer has put another function called hash() which takes the above-calculated hash value as an argument. If you want to learn more about hash() function, you can refer to the hash and indexFor method in hashmap.
  3. indexFor(hash, table.length) is used to calculate exact index in table array for storing the Entry object.
  4. As we have seen in our example, if two key objects have same hashcode(which is known as collision) then it will be stored in the form of the linked list. So here, we will iterate through our linked list.
  • If there is no element present at that index which we have just calculated then it will directly put our Entry object at that index.
  • If There is element present at that index then it will iterate until it gets Entry->next as null.
  • What if we are putting the same key again, logically it should replace old value. Yes, it will do that. While iterating it will check key equality by calling equals() method(key.equals(k)), if this method returns true then it replaces value object with the current Entry’s value object.
  • If it did not find the duplicate key, then the current Entry object will become the first node in linked list and current Entry -> next will become an existing first node on that index.

Get

Let’s see the implementation of get now:

As you got the understanding of put functionality of hashmap. So to understand get functionality is quite simple. If you pass any key to get the value object from a hashmap.

  1. The key object is checked for null. If the key is null then the value of Object resides at the table[0] will be returned.
  2. Key object’s hashcode() method is called and the hash code is calculated.
  3. indexFor(hash, table.length) is used to calculate exact index in table array using generated hashcode for getting the Entry object.
  4. After getting index in table array, it will iterate through the linked list and check for key equality by calling equals() method and if it returns true then it returns the value of Entry object else returns null.

Key points to Remeber:

  • HashMap has an inner class called Entry which stores key-value pairs.
  • Above Entry, the object is stored in Entry[ ](Array) called table
  • An index of the table is logically known as the bucket and it stores the first element of linked list
  • Key object’s hashcode() is used to find the bucket of that Entry object.
  • If two key objects have the same hashcode, they will go in the same bucket of table array.
  • Key object ‘s equals() method is used to ensure the uniqueness of a key object.
  • Value object ‘s equals() and hashcode() method is not used at all

Please go through core java interview questions and java interview questions for more interview questions.

You may also like:

Originally published at https://java2blog.com on May 3, 2019.

--

--