Let’s code it: Compact tree (part 2)

In the previous post we defined the tree itself and the methods to add and delete a child. This post covers the pack and re-index functionality, this allows the removal of unused space and makes the tree more compact. The pack could be very useful to make the tree easier to load in the cpu cache.

Note that the first post was updated due to the introduction of the new method.

Before starting writing some code let’s see what is the goal of the new method on an example. Let’s say that we a tree with a root and three nodes. We called delete on index 2 child, and pack after:

Arena based tree

As you can see from above the node index 2 is moved to index 3 and the node at index 1 is updated with the new index, moreover the last value is dropped from the memory chunk because is not needed anymore.

Note that the allocated space is not dropped, if we want to add a new node we will not pay the allocation cost associated to the memory chunk allocation.

The pack method will move all the deleted nodes at the end of the memory chunk and pack all the valid nodes near the start. Before introducing the method itself, however, we want to write some helpers to:

retrieve the first deleted node starting from the head of the memory chunk;

/// Returns the next deleted node starting from start
fn find_next_deleted(&self, start: usize) -> Option<usize> {
    for i in start..self.nodes.len() {
        let n = &self.nodes[i];
        // The node is deleted if there is no parent and the next node is pointing to itself
        if n.parent.is_none() && n.next_node == Some(i) {
            return Some(i);
        }
    }
    None
}

retrieve the first non deleted node starting from the end;

/// Returns the next non deleted node starting from start
fn find_next_not_deleted(&self, start: usize) -> Option<usize> {
    for i in (start..self.nodes.len()).rev() {
        let n = &self.nodes[i];
        // The node is not deleted if is not pointing to itself
        if n.next_node != Some(i) {
            return Some(i);
        }
    }
    None
}

update two swapped nodes.

fn update_links(&mut self, old_index: usize, new_index: usize) {
    // Checks if there is a parent, if not the root is deleted and there is not
    // a link to update
    if let Some(p) = self[new_index].parent {
        // Gets the first child
        let first_child = self.nodes[p].first_child;
        // Checks if the first child is the node that needs to be moved
        if first_child == Some(old_index) {
            // Connects the child with the node moved node
            self.nodes[p].first_child = Some(new_index);
        } else {
            // Takes the first sibling
            let mut node = first_child;
            // Until there is a sibling
            while let Some(n) = node {
                // If the next node of this sibling is the swapped node,
                if self.nodes[n].next_node == Some(old_index) {
                    // Updates the index with the new one
                    self.nodes[n].next_node = Some(new_index);
                    // There should be only one instance of the node in the list
                    break;
                }
                // Moves to the next sibling
                node = self.nodes[n].next_node;
            }
        }

        // Updates all the children with the new parent
        let mut child = self.nodes[new_index].first_child;
        // Until there is a child
        while let Some(c) = child {
            // Updates the child parent with the new index of the moved node
            self.nodes[c].parent = Some(new_index);
            // Moves to the next child
            child = self.nodes[c].next_node;
        }
    }
    // Sets the old_index node to itself, this node is the deleted moved
    self.nodes[old_index].next_node = Some(old_index);
}

Using the previous methods we can write the pack one, as you can see the implementation is fairly simple compared to the add and delete child:

pub fn pack_nodes_and_reindex(&mut self) -> Vec<(usize, usize)> {
    // This contains the pair (old index, new index)
    let mut result = Vec::new();
    // Checks if there is a deleted node
    if let Some(first_delete) = self.node_deleted {
        // Starts from the first node, the delete node is not the one closer to the start of the vec
        for i in first_delete..self.nodes.len() - 1 {
            // Searches for the first deleted node from the start
            let deleted = self.find_next_deleted(i);
            // Searches for the first non deleted node from the end
            let not_deleted = self.find_next_not_deleted(i + 1);
            // Checks if there is a deleted node
            if let Some(d) = deleted {
                // Checks if there is a node to move
                if let Some(nd) = not_deleted {
                    // Swaps only if the indices are in the right position.
                    // If delete is higher than not delete the pack is completed
                    if d < nd {
                        // Swaps the nodes
                        self.nodes.swap(d, nd);
                        // Updates the links in the tree
                        self.update_links(nd, d);
                        // Pushes the changed index into the list
                        result.push((nd, d));
                    } else {
                        // Nodes found but already in the correct positions
                        break;
                    }
                } else {
                    // No node to swap found
                    break;
                }
            } else {
                // No delete node to swap found
                break;
            }
        }

        // Finds the first delete node
        if let Some(deleted) = self.find_next_deleted(0) {
            // Drop all the values from that index to the end
            self.nodes.drain(deleted..self.nodes.len());
        }
        // Removes the delete node, all the deleted nodes now are removed
        self.node_deleted = None;
    }
    result
}

Note that we need to save and return all the new indices because we physically moved some nodes. If the result is empty the tree was already packed.

See you next time!