Radix Sort in Java

1. Introduction

In this tutorial, we’ll learn about Radix Sort, analyze its performance,
and take a look at its implementation.

Here we focus on using Radix Sort to sort integers, but it’s not
limited to just numbers.
We can use it to sort other types such as
String, too.

In order to keep it simple, we’re gonna focus on the decimal system in
which the numbers are expressed in base (radix) 10.

2. Algorithm Overview

Radix sort is a sorting algorithm that sorts numbers based on the
positions of their digits. Basically, it uses the place value of the
digits in a number. Unlike most of the other sorting algorithms, such
as Merge Sort,
Insertion Sort,
Bubble Sort, it doesn’t
compare the numbers.

Radix sort uses a
stable sorting
algorithm
as a subroutine to sort the digits. We’ve used a variation of
counting sort as a subroutine here that uses the radix to sort the
digits in every position.
Counting sort is a stable
sorting algorithm and it works well in practice.

Radix sort works by sorting digits from the Least Significant Digit
(LSD) to the Most Significant Digit (MSD). We can also implement Radix
sort to process digits from MSD.

3. A Quick Example

Let’s see how it works with an example. Let’s consider the following
array:

Iteration 1:

We’ll sort this array by processing digits from LSD and moving towards
MSD.

So let’s start with the digits in ones place:

After the first iteration, the array now looks like:

Note that the numbers have been sorted according to the digits in ones
place.

Iteration 2:

Let’s move on to the digits in tens place:

Now the array looks like:

We see that the number 7 has occupied the first position in the array
since it doesn’t have any digit in the tens place. We could also think
of this as having a 0 in the tens place.

Iteration 3:

Let’s move on to the digits in the hundreds position:

After this iteration, the array looks like:

And the algorithm stops here, with all elements sorted.

4. Implementation

Let’s now look at the implementation.

void sort(int[] numbers) {
    int maximumNumber = findMaximumNumberIn(numbers);
    int numberOfDigits = calculateNumberOfDigitsIn(maximumNumber);
    int placeValue = 1;
    while (numberOfDigits-- > 0) {
        applyCountingSortOn(numbers, placeValue);
        placeValue *= 10;
    }
}

The algorithm works by finding out the maximum number in the array and
then calculating its length. This step helps us to ensure that we
execute the subroutine for every place value.

For example, in the array, [7, 37, 68, 123, 134, 221, 387, 468, 769],
the maximum number is 769 and its length is 3.

So, we iterate and apply the subroutine thrice on the digits in every
position:

void applyCountingSortOn(int[] numbers, int placeValue) {

    int range = 10 // decimal system, numbers from 0-9

    // ...

    // calculate the frequency of digits
    for (int i = 0; i < length; i++) {
        int digit = (numbers[i] / placeValue) % range;
        frequency[digit]++;
    }

    for (int i = 1; i < range; i++) {
        frequency[i] += frequency[i - 1];
    }

    for (int i = length - 1; i >= 0; i--) {
        int digit = (numbers[i] / placeValue) % range;
        sortedValues[frequency[digit] - 1] = numbers[i];
        frequency[digit]--;
    }

    System.arraycopy(result, 0, numbers, 0, length);

}

In the subroutine, we’ve used the radix (range) to count the
occurrence of each digit and increment its frequency. So, each bin in
the range from 0 to 9 will have some value based on the frequency of
digits. We then use the frequency to position each element in the array.
This also helps us to minimize the space required to sort the array.

Now let’s test our method:

@Test
public void givenUnsortedArray_whenRadixSort_thenArraySorted() {
    int[] numbers = {387, 468, 134, 123, 68, 221, 769, 37, 7};
    RadixSort.sort(numbers);
    int[] numbersSorted = {7, 37, 68, 123, 134, 221, 387, 468, 769};
    assertArrayEquals(numbersSorted, numbers);
}

5. Radix Sort vs Counting Sort

In the subroutine, the length of the frequency array is 10 (0-9). In
the case of Counting Sort, we don’t use the range. The length of the
frequency array will be the maximum number in the array + 1. So we
don’t divide them into bins whereas Radix Sort uses the bins to sort.

Counting Sort is quite efficient when the length of the array is not
much smaller than the maximum value in the array whereas Radix Sort
allows for larger values in the array.

6. Complexity

The performance of Radix Sort depends on the stable sorting algorithm
chosen to sort the digits.

Here we’ve used the Radix Sort to sort an array of n numbers in base
b. In our case, the base is 10. We’ve applied the Counting Sort d
times where d stands for the number of digits. So the time complexity
of Radix Sort becomes O(d * (n + b)).

The space complexity is O(n + b) since we have used a variation of
Counting Sort as a subroutine here.

7. Conclusion

In this article, we described the Radix sort algorithm and illustrated
how to implement it.

As usual, the code implementations are available
over
on Github
.

Leave a Reply

Your email address will not be published.