================================================================================================
Dataset Benchmark
================================================================================================

OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
back-to-back map long:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                5879           6176         420         17.0          58.8       1.0X
DataFrame                                          1159           1189          43         86.3          11.6       5.1X
Dataset                                            1617           1667          70         61.8          16.2       3.6X

OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
back-to-back map:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                6934           6965          44         14.4          69.3       1.0X
DataFrame                                          2836           2893          82         35.3          28.4       2.4X
Dataset                                            6597           6669         102         15.2          66.0       1.1X

OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter Long:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                4186           4227          57         23.9          41.9       1.0X
DataFrame                                           671            703          53        148.9           6.7       6.2X
Dataset                                            1598           1599           1         62.6          16.0       2.6X

OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                2083           2087           6         48.0          20.8       1.0X
DataFrame                                           111            124          11        903.6           1.1      18.8X
Dataset                                            2453           2498          65         40.8          24.5       0.8X

OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
aggregate:                                Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD sum                                            1412           1427          20         70.8          14.1       1.0X
DataFrame sum                                        69             81           9       1447.3           0.7      20.4X
Dataset sum using Aggregator                       2107           2148          58         47.5          21.1       0.7X
Dataset complex Aggregator                         4949           5079         183         20.2          49.5       0.3X


