The --bench flag uses the testing.B to execute the EVM bytecode many times and get the average exeuction time out of it.