* minor: use spawn to stream larger output rather than exec which buffers it
* test: verify distinct error code is returned from large output test
* test: breakout additional integration tests to run in parallel
* test: dont pass/fail PRs for coverage yet