Implemented by templates and template specializations, traits classes make information about types available during compilation. Combining traints with overloading, it is possible to perform compile-time if...else tests on types.

In this item, let’s look at how STL supports the utility template advance, which moves a specified iterator a specified distance:

1
2


template<typename IterT, typename DistT>  
void advance(IterT& iter, DistT d); // move iter d units forward; if d < 0, move iter backward

Before we step into the field of advance implementation, we should be familiar about STL iterator categoryies, because different iterators support different operations.

Five iterator categories

According to the operations they support, there are five categories of iterators¹:

Input iterators can move only one step forward at a time, can only read, and can read what they’re pointing to only once (e.g., istream_iterator)
Output iterators can move only one step forwrad at a time, can only write, and can write what they’re pointing to only once (e.g., ostream_iterator)
Forward iterators can do whatever input and output iterators can do, and they can read or write what they point to momre than once (e.g., singly linked list or iterators into TR1’s hashed containers (item 54) may be in the forward category)
Bidirectional iterators add to forward iterators the ability to move backward as well as forward (e.g., iterators for list, set, multiset, map, multimap in STL)
Random access iterators are the most powerful ones: they add to bidirectional iterators the ability to perform “iterator arithmetic” - to jump forward or backward an arbitrary distance in constant time, which is similar to pointer arithmetic (e.g., iterators for vector, deque, string)

For each of the five iterator categories, C++ has a “tag struct” in the standard library that serves to identify it:

1
2
3
4
5


struct input_iterator_tag {};
struct output_iterator_tag {};
struct forward_iterator_tag: public input_iterator_tag {};
struct bidirectional_iterator_tag: public forward_iterator_tag {};
struct random_access_iterator_tag: public bidirectional_iterator_tag {};

Pseudocode for `advance`

Knowing that different categories support different operations, advance will essentially be implemented like this:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11


template<typename IterT, typename DistT>
void advance(IterT& iter, DistT d)
{
    if (iter is a random access iterator) {
        iter += d; // use iterator arithmetic for random access iters
    }
    else {
        if (d >= 0) { while (d--) ++iter; } // use iterative calls 
        else { while (d++) --iter; }        // to ++ or -- for other iterator categories
    }
}

How to determine whether iter is a random access iterator? Here comes the technique of traits, which allow us to get type information during compilation.

Traits

Traits aren’t a keyword or a predefined construct in C++, but simply a technique and a convention followed by C++ programmers. It has some requirements, one of which asks that it has to work as well for built-in types as it does for user-defined types. This demand leads to the conclusion that the traits information for a type must be external to the type².

For user-defined types

The standard technique is to put the information into a template and one or more specializations of that template. For iterators, the template in the standard library is named iterator_traits³:

1
2
3
4
5
6
7


// the iterator_category for type IterT is whatever IterT says it is
// see item 42 for info on the use of "typedef typename"
template<typename IterT>
struct iterator_traits {
    typedef typename IterT::iterator_category iterator_category;
    ...
};

How does iterator_traits work? Since iterator_traits just parrots back a typedef called iterator_category inside the iterator class, it requires that any user-defined iterator type must contain this nested typedef named iterator_category, which identifies the appropriate tag struct. For example, deque iterators belong to random access category, and list’s iterators are bidirectional:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10


template<...>
class deque {
public:
    class iterator {
    public:
        typedef random_access_iterator_tag iterator_category;
        ...
    };
    ...
};

 1
 2
 3
 4
 5
 6
 7
 8
 9
10


template<...>
class list {
public:
    class iterator {
    public:
        typedef bidirectional_iterator_tag iterator_category;
        ...
    };
    ...
};

For pointer types

The code above works well for user-defined types, but it doasn’t work for iterators that are pointers, since there’s no such thing as a nested typedef inside a built-in pointer. To make it work, iterator_traits need to offer a partial template specialization for pointer types:

1
2
3
4
5
6


template<typename IterT>  // partial template specialization
struct iterator_traits<IterT*> // for built-in pointer types
{
    typedef random_access_iterator_tag iterator_category; // pointers act as random access iterators
    ...
}

Workflow for designing and implementing a traits class

Identify some information about types we’d like to make available (e.g., iterator category for iterator types)
Choose a name to identify that information (e.g., iterator_category)
Provide a template and set of specializations (e.g., iterator_traits) that contain the information for the types we want to support

Implementing `advance`

Given std::iterator_traits, we can refine our pseudocode for advance:

1
2
3
4
5
6
7


template<typename IterT, typename DistT>
void advance(IterT& iter, DistT d)
{
    if (typeid(typename std::iterator_traits<IterT>::iterator_category) ==
        typeid(std::random_access_iterator_tag))
        ...
}

Sadly, this is not what we want: for one thing, it will lead to compilation problems (refer to item 48); anther issue, which is more fundamental, is that if statement is evaluated at runtime but iterator_traits<IterT>::iterator_category can be determined during compilation already. There’s no point to move the evaluation from compile-time to runtime to waste time and bloat the executable.

Template metaprogramming

What we want is a conditional construct for types that is evaluated during compilation. Actually this demand brings us to the realm of template metaprogramming (item 48), but the technique we’ll use turns out to be quit familiar: overloading:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28


template<typename IterT, typename DistT> // use this impl for random access iterators
void doAdvance(IterT& iter, DistT d, std::random_access_iterator_tag)
{
    iter += d;
}

template<typename IterT, typename DistT> // use this impl for bidirectional iterators
void doAdvance(IterT& iter, DistT d, std::bidirectional_iterator_tag)
{
    if (d >= 0) { while(d--) ++iter; }
    else { while(d++) --iter; }
}

template<typename IterT, typename DistT> // use this impl for input iterators, also applies to inherited forward_iterator_tag
void doAdvance(IterT& iter, DistT d, std::input_iterator_tag)
{
    if (d < 0) {
        throw std::out_of_range("Negative distance"); 
    }
    while(d--) ++iter;
}

template<typename IterT, typename DistT>
void advance(IterT& iter, DistT d)
{
    doAdvance(iter, d, 
        typename std::iterator_traits<IterT>::iterator_category());
};

Summary

How to use a traits class:

Create a set of overloaded “worker” functions or function templates (e.g., doAdvance) that differ in a traits parameter. Implement each function in accrod with the traits information passed.
Create a “master” function or function template (e.g., advance) that calls the workers, passing information provided by a traits class.

Input and output iterators, as two least powerful iterator categories, are modeled on the read (into an input file) and write (into an output file) pointer, suitable only for one-pass algorithm. Forward (and higher level iterators) can do multi-pass algoritms. The most powerful random access iterators are modeled on built-in pointers. ↩︎
For example, if advance is called with a pointer const char* and an int, since traits technique must apply to built-in types like const char*, and there’s no way to nesting information inside pointers, thus the traits information for a type must be external to the type. ↩︎
By convention, traits are always implemented as structs, but that structs are called “traits classes” - not joking. ↩︎

Item-47 Use traits classes for information about types