→ Yes.
Consider all the time is spent into memory access and allocation, and not into ‘actual’ computation. The compiler finds the computation nodes then combines chained operations to minimize the temporary data assignments and repeated accesses, (and modernly in a SIMD way).