Change _iter_nodes so that it is a generator.
Change _check_remap to use _iter_nodes to eliminate the redundancy.
We also added a 'batch_size' parameter, so that with large fan-outs
we don't page in all the nodes unnecessarily. Which adds both direct
computation overhead, but also causes us to thrash the page cache.