Hi,
I have two machines with identical hardware: the CPU, RAM, and BIOS configurations are exactly the same. I am running Spark 3.3.1 with Hadoop 3.3.1. The benchmark is also exactly the same. I am not using HDFS at all.
Problem: Spark on Windows runs slower than on Linux.
Any idea why the Windows implementation is slower? What exactly is inside hadoop.dll and winutils.exe?