I suspect this is worth some perf, but it's probably not enough for us to care. More importantly, it breaks in Go 1.5, where a recent optimization made sync.WaitGroup much smaller.