Improved tile performance by using reserve() to alloc the size of the arrays internally all at once instead of on-demand since this is way faster. In my test case this made a significant impact. Also attempting to get the compiler to avoid some other more hidden copying.