Data Structures and Algorithms

1. Data Structures and Algorithms

Data structures store essential information in the application. They are organized via lists, sets, queues, and associative maps. In Java, the interfaces and classes around data structures are called Collection-API. Since there are so many types to choose from, the purpose of this chapter is to bring order to the confusion and to illustrate the use of the corresponding collections through the exercises.

Prerequisites

be able to distinguish data structures lists, sets, associative memory.
know data types List, ArrayList and LinkedList.
know data types Set, HashSet and TreeSet.
know Map, HashMap and TreeMap data types.
know the difference between queue and deque.
know how to create an order with Comparator.
know how to use and implement iterators.
be able to use data structures in a thread-safe way.
optional: Interest in the open-source library Guava for further data structures.

Data types used in this chapter:

1.1. The types of the Collection API

The list of user types is long this time. However, the design follows a basic principle, so it is not so complicated after all:

Interfaces describe the functionality of a data structure, "what is provided".
Classes use different strategies to implement the specification from the interfaces; they represent the "how it is implemented".

As developers, we need to know interfaces and implementations, and to review, let’s look again at the central types we’ll encounter more often in this chapter:

Figure 1. UML diagram of selected data structures and type relationships

To note:

Iterable is the most general interface, representing what can be traversed; Iterable provides Iterator instances. Not only data structures are Iterable.
Collection is the top interface that really represents data structures. It specifies methods for adding elements to the collection or for deleting them.
Under Collection are the actual abstractions, whether it is a list, set, or queue. Below are the implementations.
Some operations are not with the data types themselves, but are outsourced to a class Collections. Similar applies to arrays, where there is also a utility class Arrays.

We want to build a decision tree for the classes and interfaces java.util.Set, java.util.List, java.util.Map, java.util.HashSet, java.util.TreeSet, java.util.Hashtable, java.util.HashMap and java.util.TreeMap. The following considerations must be made in the selection process:

Access via key
duplicates allowed
fast access
sorted iteration
thread safe

If access is from a key to a value, this is generally an associative map, that is, an implementation of the Map interface. Implementations of Map are HashMap, TreeMap, and the outdated class Hashtable. However, lists are also special associative stores, where the index is an integer starting at 0 and ascending. Lists work quite well whenever the key is a small integer and there are few spaces. The association of arbitrary integers to objects does not work well with a list.

Duplicates are allowed in lists, but not in sets and associative stores. There are indeed requirements that a set should note how often an element occurs, but this must be implemented itself with an associative memory that associates the element with a counter.

All data structures allow fast access. The only question is what to ask for. A list cannot quickly answer whether an element is present or not because the list must be traversed from front to back to do so. With an associative store or set, this query is much faster because of the internal organization of the data. This test of existence can be answered even somewhat faster for data structures that use hashing internally than for data structures that keep elements sorted.

Lists can be sorted, and a traversal returns the elements in the sorted order. A TreeSet and a TreeMap are also sorted by criteria. The data structures with the hashing method have no user-defined order of sorting.

Data structures can be divided into three groups: Data structures since Java 1.0, data structures since Java 1.2, and data structures since Java 5. In the first Java versions the data structures Vector, Hashtable, Dictionary and Stack were introduced. These data structures are all thread-safe, but they are no longer used today. In Java 1.2 the Collection API was introduced, all data structures are not thread safe. In Java 5 the new package java.util.concurrent has been introduced, all data structures in it are safe against concurrent changes.

1.2. Lists

For the exercises, let’s start with the simplest data structure, lists. Lists are sequences of information where the order is maintained when appending new elements, and elements can occur multiple times. Even null is allowed as an element.

1.2.1. Singing and cooking: Traverse lists and check properties ⭐

Captain CiaoCiao is putting together a new crew. Everyone in the crew has a name and a profession:

record CrewMember( String name, Profession profession ) {
  enum Profession { CAPTAIN, NAVIGATOR, CARPENTER, COOK, MUSICIAN, DOCTOR }
}

For each crew, Captain CiaoCiao makes sure that there are as many cooks as musicians.

Exercise:

Write a method areSameNumberOfCooksAndMusicians(List<CrewMember>) that returns true if there are the same number of cooks as musicians, false otherwise.

Example:

CrewMember captain   = new CrewMember( "CiaoCiao", CrewMember.Profession.CAPTAIN );
CrewMember cook1     = new CrewMember( "Remy", CrewMember.Profession.COOK );
CrewMember cook2     = new CrewMember( "The Witch Cook", CrewMember.Profession.COOK );
CrewMember musician1 = new CrewMember( "Mahna Mahna", CrewMember.Profession.MUSICIAN );
CrewMember musician2 = new CrewMember( "Rowlf", CrewMember.Profession.MUSICIAN );

List<CrewMember> crew1 = List.of( cook1, musician1 );
System.out.println( areSameNumberOfCooksAndMusicians( crew1 ) ); // true

List<CrewMember> crew2 = List.of( cook1, musician1, musician2, captain );
System.out.println( areSameNumberOfCooksAndMusicians( crew2 ) ); // false

List<CrewMember> crew3 = List.of( cook1, musician1, musician2, captain, cook2  );
System.out.println( areSameNumberOfCooksAndMusicians( crew3 ) ); // true

1. Data Structures and Algorithms

1.1. The types of the Collection API

1.2. Lists

1.2.1. Singing and cooking: Traverse lists and check properties ⭐

1.2.2. Filter comments from lists ⭐

1.2.3. Shorten lists because there is no downturn ⭐

1.2.4. Eating with friends: Compare elements, find commonalities ⭐

1.2.5. Check lists for same order of elements ⭐

1.2.6. And now the weather: Find repeated elements ⭐

1.2.7. Generate receipt output ⭐

1.2.8. Everything tastes better with cheese: Insert elements into lists ⭐

1.2.9. Search elements with the iterator and find Covid Cough ⭐⭐

1.2.10. Move elements, play musical chairs ⭐

1.2.11. Programming a question game with planets ⭐⭐

1.2.12. Understanding the implementation of the class java.util.ArrayList ⭐⭐

1.3. Sets

1.3.1. Form subsets, find common elements ⭐

1.3.2. Get all words contained in a word ⭐⭐

1.3.3. Exclude duplicate elements with a UniqueIterator ⭐⭐

1.4. Map keys to values

1.4.1. Convert two-dimensional arrays to map ⭐

1.4.2. Convert text to Morse code and vice versa ⭐

1.4.3. Remember word frequency with associative memory ⭐⭐

1.4.4. Read in and read out colors ⭐⭐

1.4.5. Read in names and manage lengths ⭐

1.4.6. Find missing characters ⭐⭐

1.4.7. Calculate number of paths to the three-headed monkey ⭐⭐

1.4.8. Manage holidays in a sorted associative store ⭐

1.4.9. Determine commonality: Party set and souvenir ⭐

1.5. Properties

1.5.1. Develop convenient properties decorator ⭐⭐

1.6. Stack and queues

1.6.1. Program RPN pocket calculator ⭐

1.7. BitSet

1.7.1. Forget no ship ⭐

1.7.2. Find duplicate entries, and solve the animal chaos ⭐

1.8. Thread-safe data structures

1.8.1. Understanding the difference between HashMap, Synchronized-Wrapper, ConcurrentHashMap.

1.8.2. Loading ship ⭐⭐

1.8.3. Handle important messages first ⭐⭐

1.8.4. If used up, create a new one ⭐⭐⭐