ADS 0x02: Arrays & Linked Lists

Published November 16th, 2021

ADS
Java

Introduction

Linear data structures are by far the most common kind of data structure in computer science. If implemented correctly, they are easy to use, fast, and efficient. The idea of any data structure is to store arbitrary data in a way that makes the data easy to access and manipulate.

We will go over all the important linear data structures with implementations in Java and amortised analysis of the methods they provide.

Arrays

Arrays are the most common linear data structure. Most other linear data structures are implemented using arrays.

As you can see, arrays are simple: they are contiguous blocks of memory that hold data.

Elements of the array are accessed by their index. Indices start at 0, except they start at 1 in some ghoulish programming languages that shall not be named for legal reasons. In Java, which is not a ghoulish language (or is it?), indices start at 0.
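To make the indexing rule concrete, here is a small sketch using Java's built-in arrays. The class name IndexingDemo and its helper methods are made up for illustration and are not part of this post's Array<T> class.

```java
class IndexingDemo
{
	// The first element always lives at index 0.
	static int first(int[] values)
	{
		return values[0];
	}

	// The last element lives at index length - 1, not at index length.
	static int last(int[] values)
	{
		return values[values.length - 1];
	}

	// An index is valid if it lies in the range 0 to length - 1.
	static boolean isValidIndex(int[] values, int index)
	{
		return index >= 0 && index < values.length;
	}

	public static void main(String[] args)
	{
		int[] values = {10, 20, 30};

		System.out.println(first(values));           // prints 10
		System.out.println(last(values));            // prints 30
		System.out.println(isValidIndex(values, 3)); // prints false
	}
}
```

Accessing values[3] in this example would throw an ArrayIndexOutOfBoundsException, because the array only has the indices 0, 1, and 2.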

Pros:

Any element can be accessed easily by its index
The elements are stored contiguously, which means that the caching of elements is very efficient
It is very straightforward to iterate over the elements of the array

Cons:

Arrays are fixed size, so eventually we will run out of space
Arrays don't have a lot of functionality, especially when it comes to searching for elements

Array Implementation

			
public class Array<T>
{
	private T[] array;

	// Java does not allow `new T[size]`, so we allocate an Object[]
	// and cast it. This is safe as long as the array never leaves this class.
	@SuppressWarnings("unchecked")
	public
	Array(int size)
	{
		// Allocate memory for the array.
		array = (T[]) new Object[size];
	}

	public T
	get(int index)
	{
		// Check if the index is valid.
		if (index < 0 || index >= array.length)
		{
			throw new IndexOutOfBoundsException();
		}

		// Return the element at the provided index.
		return array[index];
	}

	public void
	set(int index, T value)
	{
		// Check if the index is valid.
		if (index < 0 || index >= array.length)
		{
			throw new IndexOutOfBoundsException();
		}

		// Set the element at the provided index.
		array[index] = value;
	}
}

In order to support any type of data, we use Java generics. This allows us to use the same array implementation for all types of data.

An array of integers can be written as Array<Integer>, an array of strings can be written as Array<String>, and so on.

As you can see, arrays are very simple and don't have a lot of functionality.
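Here is a short usage sketch for the generic array. To keep the example self-contained, it contains a condensed copy of the class, nested inside a made-up wrapper class called GenericArrayDemo. The cast from Object[] is an assumption about how to allocate the storage, since Java cannot create an array of a type parameter directly.

```java
class GenericArrayDemo
{
	// Condensed copy of the Array<T> class, for illustration only.
	static class Array<T>
	{
		private final T[] array;

		@SuppressWarnings("unchecked")
		Array(int size)
		{
			// An Object[] is allocated and cast, since `new T[size]` is not allowed.
			array = (T[]) new Object[size];
		}

		T get(int index)
		{
			checkIndex(index);
			return array[index];
		}

		void set(int index, T value)
		{
			checkIndex(index);
			array[index] = value;
		}

		private void checkIndex(int index)
		{
			if (index < 0 || index >= array.length)
			{
				throw new IndexOutOfBoundsException();
			}
		}
	}

	public static void main(String[] args)
	{
		// The same class works for integers...
		Array<Integer> numbers = new Array<>(3);
		numbers.set(0, 42);
		System.out.println(numbers.get(0)); // prints 42

		// ...and for strings.
		Array<String> names = new Array<>(2);
		names.set(1, "Ada");
		System.out.println(names.get(1)); // prints Ada

		// Slots that were never set hold null.
		System.out.println(names.get(0)); // prints null
	}
}
```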

Linked Lists

Linked lists are a non-contiguous data structure. Each node in the list contains a reference to the next node.

Linked Lists vs. Arrays

As you can see from the diagram, linked lists are very similar to arrays. The biggest difference is that linked lists are not contiguous, whereas arrays are.

Pros:

It is very easy to add or remove elements from a linked list at any position
It is still easy to iterate over all elements of a linked list

Cons:

Since linked lists are not contiguous, we can't jump straight to an element by its index; we have to walk the list from the head instead
Cache locality of linked lists is really bad because the elements are not stored contiguously
Linked lists waste memory because we need to store a reference to the next node

All in all, linked lists are a trade-off between flexibility and efficiency. Linked lists are more flexible because they allow you to add or remove elements at any position, whereas arrays are more efficient because they are contiguous.

Linked lists are slower because the elements are not contiguously stored. This can lead to a lot of cache misses when accessing elements. This can have a substantial performance impact in programs.

The only good reason to use linked lists is if you really need a container that supports adding or removing elements at any position.
If you store references to elements in a linked list in another data structure, you might be able to add or remove elements in constant time.
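One way to picture this idea is a list that keeps a HashMap from each value to its node. The sketch below is only an illustration under a few assumptions: the class name NodeIndexDemo is made up, the stored values are assumed to be unique, and the nodes carry a prev reference (doubly linked lists are covered later in this post) so that a node can be unlinked without traversing the list.

```java
import java.util.HashMap;

class NodeIndexDemo
{
	static class Node
	{
		String data;
		Node prev;
		Node next;

		Node(String data)
		{
			this.data = data;
		}
	}

	private Node head;
	private Node tail;

	// Maps each stored value to its node (values are assumed to be unique).
	private final HashMap<String, Node> index = new HashMap<>();

	void append(String data)
	{
		Node node = new Node(data);

		if (tail == null)
		{
			head = node;
		}
		else
		{
			tail.next = node;
			node.prev = tail;
		}

		tail = node;
		index.put(data, node);
	}

	// Removes a value without walking the list: the map lookup hands us
	// the node directly, and unlinking it only touches its neighbours.
	boolean remove(String data)
	{
		Node node = index.remove(data);

		if (node == null)
		{
			return false;
		}

		if (node.prev == null) head = node.next;
		else node.prev.next = node.next;

		if (node.next == null) tail = node.prev;
		else node.next.prev = node.prev;

		return true;
	}

	// Returns the elements from head to tail, separated by spaces.
	String contents()
	{
		StringBuilder sb = new StringBuilder();

		for (Node cur = head; cur != null; cur = cur.next)
		{
			sb.append(cur.data).append(' ');
		}

		return sb.toString().trim();
	}
}
```

The price of this design is the extra memory and bookkeeping for the map, which is part of why storing node references elsewhere is not free.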

Linked List Implementation

			
public class Node<T>
{
	public T data;
	public Node<T> next;

	public
	Node(T data)
	{
		this.data = data;
		next = null;
	}
}

public class LinkedList<T>
{
	private Node<T> head;
	private Node<T> tail;
	private int curSize;

	public
	LinkedList()
	{
		// Initialise the linked list.
		head = null;
		tail = null;
		curSize = 0;
	}

	public int
	size()
	{
		return curSize;
	}

	public Node<T>
	getNodeByIndex(int index)
	{
		// Check if the index is valid.
		if (index < 0 || index >= curSize)
		{
			throw new IndexOutOfBoundsException();
		}

		// Iterate over the list until we reach the index.
		Node<T> curNode = head;

		for (int i = 0; i < index; i++)
		{
			curNode = curNode.next;
		}

		return curNode;
	}

	public T
	get(int index)
	{
		return getNodeByIndex(index).data;
	}

	public Node<T>
	getNodeByData(T data)
	{
		// Iterate over the list until we reach
		// the first node with the given data.
		Node<T> curNode = head;

		while (curNode != null)
		{
			// Compare with equals() rather than ==, since == only checks
			// whether both references point to the same object.
			if (data == null ? curNode.data == null : data.equals(curNode.data))
			{
				break;
			}

			curNode = curNode.next;
		}

		// If there was a node with the given data,
		// a reference to it is returned.
		// The reference will be null if a node
		// with the given data was not found.
		return curNode;
	}

	...
}

Above you can see the core structure of a linked list. The head and tail nodes are used to keep track of the first and last elements of the list.

There are some utility functions that can be used to find elements in the list.

We also keep track of the current size of the list. Every time we add an element, we increment the size, and every time we remove an element, we decrement it.

Next, we will add a couple of methods to the linked list and go over how they work.

Linked List: Append Element

Appending an element to the end of a linked list is pretty easy. We just need to create a new node and append it to the tail.

There is one edge case to be aware of though: if the linked list is empty, we need to set the head and tail to the new node instead.

			
public void
append(T data)
{
	// Create a new node.
	// `node.next` is already initialised to null,
	// so we don't need to set it.
	Node<T> node = new Node<T>(data);

	// Check if the list is empty.
	if (head == null)
	{
		// Set the head and tail to the new node.
		head = node;
		tail = node;
	}
	else
	{
		// Add the node to the end of the linked list.
		tail.next = node;

		// Update the tail.
		tail = node;
	}

	// Keep the size in sync with the list.
	curSize++;
}

Linked List: Prepend Element

Prepending an element to the beginning of a linked list is very similar to appending an element to the end of a linked list:

			
public void
prepend(T data)
{
	// Create a new node.
	Node<T> node = new Node<T>(data);

	// Check if the list is empty.
	if (head == null)
	{
		// Set the head and tail to the new node.
		head = node;
		tail = node;
	}
	else
	{
		// Add the node to the beginning of the linked list.
		node.next = head;

		// Update the head.
		head = node;
	}

	// Keep the size in sync with the list.
	curSize++;
}

Removing the first or last element from the linked list is also quite trivial and is left as an exercise for the reader.
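If you want to check your solution, here is one possible sketch for the singly linked list. The method names removeFirst and removeLast and the wrapper class SinglyLinkedDemo are made up for this example, and throwing a NoSuchElementException on an empty list is just one reasonable choice. Note that removing the last element still requires walking the list, because a singly linked node does not know its predecessor.

```java
import java.util.NoSuchElementException;

class SinglyLinkedDemo
{
	static class Node<T>
	{
		T data;
		Node<T> next;

		Node(T data)
		{
			this.data = data;
		}
	}

	static class LinkedList<T>
	{
		Node<T> head;
		Node<T> tail;
		int curSize;

		void append(T data)
		{
			Node<T> node = new Node<>(data);

			if (head == null) head = node;
			else tail.next = node;

			tail = node;
			curSize++;
		}

		// Removing the head only needs to move the head reference.
		T removeFirst()
		{
			if (head == null) throw new NoSuchElementException();

			T data = head.data;
			head = head.next;

			// If the list is now empty, the tail must be cleared as well.
			if (head == null) tail = null;

			curSize--;
			return data;
		}

		// Removing the tail requires finding the node before it first.
		T removeLast()
		{
			if (head == null) throw new NoSuchElementException();

			T data = tail.data;

			if (head == tail)
			{
				head = null;
				tail = null;
			}
			else
			{
				Node<T> cur = head;

				while (cur.next != tail)
				{
					cur = cur.next;
				}

				cur.next = null;
				tail = cur;
			}

			curSize--;
			return data;
		}
	}
}
```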

Doubly Linked Lists

In order to support some of the more advanced operations like adding or removing elements at the middle of the list, we need to add a reference to the previous node.
A linked list with nodes that store references to the previous node is called a doubly linked list and looks like this:

Note that each node now also points back to the previous node, and the previous node of the head is null.

We simply add a reference to the previous node to the Node<T> class and update the constructor:

			
public class Node<T>
{
	public T data;
	public Node<T> next;
	public Node<T> prev;

	public
	Node(T data)
	{
		this.data = data;
		next = null;
		prev = null;
	}
}

That's it! We don't really have to change anything in the LinkedList<T> class, except for the append and prepend methods.
Think about how we would have to change these methods for a doubly linked list.
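Once you have thought about it, you can compare your answer with the following sketch. It is one possible way to do it, not the only one, and the wrapper class DoublyLinkedAppendDemo exists only to keep the example self-contained. The only real change is that the prev reference has to be linked in the non-empty case.

```java
class DoublyLinkedAppendDemo
{
	static class Node<T>
	{
		T data;
		Node<T> next;
		Node<T> prev;

		Node(T data)
		{
			this.data = data;
		}
	}

	static class LinkedList<T>
	{
		Node<T> head;
		Node<T> tail;
		int curSize;

		void append(T data)
		{
			Node<T> node = new Node<>(data);

			if (head == null)
			{
				head = node;
				tail = node;
			}
			else
			{
				// Link the new node and the old tail in both directions.
				node.prev = tail;
				tail.next = node;
				tail = node;
			}

			curSize++;
		}

		void prepend(T data)
		{
			Node<T> node = new Node<>(data);

			if (head == null)
			{
				head = node;
				tail = node;
			}
			else
			{
				// Link the new node and the old head in both directions.
				node.next = head;
				head.prev = node;
				head = node;
			}

			curSize++;
		}
	}
}
```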

Doubly Linked List: Add Element

If you have a reference to the previous node, you can add a new element to the middle of the list by updating a couple of references, as you can see in the picture above.

			
public void
add(T data, Node<T> prevNode)
{
	// Create a new node.
	Node<T> node = new Node<T>(data);

	// Save a reference to the next node.
	Node<T> nextNode = prevNode.next;

	// Set the next and prev references on the new node.
	node.next = nextNode;
	node.prev = prevNode;

	// Link the node with the previous node.
	prevNode.next = node;

	// Link the node with the next node.
	if (nextNode != null)
	{
		nextNode.prev = node;
	}
	// If there is no next node, set the new node as tail.
	else
	{
		tail = node;
	}

	// Keep the size in sync with the list.
	curSize++;
}

Doubly Linked List: Remove Element

Finally, let's create a method that takes a reference to a node and removes it from a doubly linked list.

			
public void
remove(Node<T> node)
{
	// Save a reference to the next node.
	Node<T> nextNode = node.next;

	// Save a reference to the previous node.
	Node<T> prevNode = node.prev;

	// If we're removing the head, update the head.
	// The new head is null if the list becomes empty.
	if (prevNode == null)
	{
		head = nextNode;
	}
	// Else, link the previous node with the next node.
	else
	{
		prevNode.next = nextNode;
	}

	// If we're removing the tail, update the tail.
	// The new tail is null if the list becomes empty.
	if (nextNode == null)
	{
		tail = prevNode;
	}
	// Else, link the next node with the previous node.
	else
	{
		nextNode.prev = prevNode;
	}

	// Keep the size in sync with the list.
	curSize--;

	// Let Java's Garbage Collector delete the node.
}
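To see adding and removing by reference in action, here is a self-contained sketch that walks the list in both directions after each change. The wrapper class DoublyLinkedEditDemo and the helper methods forward and backward are made up for this example. The append and add methods return the new node here, which is an assumption that makes it easy to keep references around.

```java
class DoublyLinkedEditDemo
{
	static class Node<T>
	{
		T data;
		Node<T> next;
		Node<T> prev;

		Node(T data)
		{
			this.data = data;
		}
	}

	static class LinkedList<T>
	{
		Node<T> head;
		Node<T> tail;
		int curSize;

		Node<T> append(T data)
		{
			Node<T> node = new Node<>(data);

			if (head == null) head = node;
			else { node.prev = tail; tail.next = node; }

			tail = node;
			curSize++;
			return node;
		}

		// Inserts a new node directly after prevNode.
		Node<T> add(T data, Node<T> prevNode)
		{
			Node<T> node = new Node<>(data);
			Node<T> nextNode = prevNode.next;

			node.next = nextNode;
			node.prev = prevNode;
			prevNode.next = node;

			if (nextNode != null) nextNode.prev = node;
			else tail = node;

			curSize++;
			return node;
		}

		// Unlinks the given node; works for head, tail, and middle nodes.
		void remove(Node<T> node)
		{
			if (node.prev == null) head = node.next;
			else node.prev.next = node.next;

			if (node.next == null) tail = node.prev;
			else node.next.prev = node.prev;

			curSize--;
		}

		// Walks the list from head to tail using the next references.
		String forward()
		{
			StringBuilder sb = new StringBuilder();
			for (Node<T> cur = head; cur != null; cur = cur.next) sb.append(cur.data).append(' ');
			return sb.toString().trim();
		}

		// Walks the list from tail to head using the prev references.
		String backward()
		{
			StringBuilder sb = new StringBuilder();
			for (Node<T> cur = tail; cur != null; cur = cur.prev) sb.append(cur.data).append(' ');
			return sb.toString().trim();
		}
	}

	public static void main(String[] args)
	{
		LinkedList<String> list = new LinkedList<>();
		Node<String> a = list.append("a");
		list.append("c");

		Node<String> b = list.add("b", a);
		System.out.println(list.forward());  // prints a b c
		System.out.println(list.backward()); // prints c b a

		list.remove(b);
		System.out.println(list.forward());  // prints a c
	}
}
```

Walking the list in both directions is a handy way to check that the next and prev references agree after every change.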

Arrays vs. Linked Lists: Time Complexity

We have gone through some common operations on arrays and linked lists, so try to analyse their time complexity and compare your results to the table below.

Operation	Array	Linked List
Accessing by index	O(1)	O(n)
Appending an element	-	O(1)
Prepending an element	-	O(1)
Adding an element	-	O(1) *
Removing an element	-	O(1) *

* Assuming we already have a reference to the node we want to add after or remove. If we don't have this reference, we will have to traverse the list to find the node. This takes O(n) time.

As you can see in the table, accessing elements from a linked list takes O(n) time, and adding or removing elements takes O(1) time.
The catch is that we can only add or remove elements if we have a reference to the node we want to add to or to remove. We first have to find the node, so we still have to traverse the list to add or remove an element.
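The cost of that traversal can be made visible by counting how many next references we have to follow to reach a given index. This is a minimal sketch with made-up names (TraversalCostDemo, build, hopsToIndex); it assumes the index is valid for the list.

```java
class TraversalCostDemo
{
	static class Node
	{
		int data;
		Node next;

		Node(int data)
		{
			this.data = data;
		}
	}

	// Builds the list 0 -> 1 -> ... -> n - 1 and returns its head.
	static Node build(int n)
	{
		Node head = null;
		Node tail = null;

		for (int i = 0; i < n; i++)
		{
			Node node = new Node(i);

			if (head == null) head = node;
			else tail.next = node;

			tail = node;
		}

		return head;
	}

	// Counts how many `next` references are followed to reach the index.
	static int hopsToIndex(Node head, int index)
	{
		int hops = 0;
		Node cur = head;

		for (int i = 0; i < index; i++)
		{
			cur = cur.next;
			hops++;
		}

		return hops;
	}

	public static void main(String[] args)
	{
		Node head = build(1000);

		System.out.println(hopsToIndex(head, 0));   // prints 0
		System.out.println(hopsToIndex(head, 999)); // prints 999
	}
}
```

The number of hops grows linearly with the index, whereas an array reaches any index in a single step.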

So, linked lists really only make sense if you store references to nodes you might want to change, which is usually not very efficient.

The only operations that a linked list supports efficiently without such references are appending and prepending an element, which we cannot do with arrays, since they are fixed in size.

Also, since arrays are fixed in size, we cannot really do anything with them except accessing and changing elements by index.
This is why dynamic arrays and deques exist, which are arrays that can change in size and support appending and prepending operations.
Dynamic arrays and deques are much more efficient than linked lists, since they do store their elements contiguously.
We will talk about these data structures in the next blog post.
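Until then, here is a brief taste using the classes that ship with Java. ArrayList and ArrayDeque from java.util are used here as stand-ins for a dynamic array and a deque; this only shows how they are used and says nothing about how the next post will implement them. The class name StandardLibraryDemo is made up.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;

class StandardLibraryDemo
{
	// An ArrayList grows on demand, so we never pick a size up front.
	static ArrayList<Integer> growingList()
	{
		ArrayList<Integer> list = new ArrayList<>();

		for (int i = 0; i < 5; i++)
		{
			list.add(i);
		}

		return list;
	}

	// An ArrayDeque supports adding elements at both ends.
	static ArrayDeque<Integer> bothEnds()
	{
		ArrayDeque<Integer> deque = new ArrayDeque<>();

		deque.addLast(2);
		deque.addLast(3);
		deque.addFirst(1);

		return deque;
	}

	public static void main(String[] args)
	{
		System.out.println(growingList()); // prints [0, 1, 2, 3, 4]
		System.out.println(bothEnds());    // prints [1, 2, 3]
	}
}
```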

Final Thoughts

In today's blog post, we've learnt how arrays and linked lists work, and what their differences are. We've also learnt how to implement linked lists and doubly linked lists in Java.

In the next blog post, we'll learn how a dynamic array works and we will use it to implement a stack and a queue.

If you found this post helpful, feel free to share it! Any feedback is very welcome!
