================================================================================================
Dataset Benchmark
================================================================================================

OpenJDK 64-Bit Server VM 21.0.6+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
back-to-back map long:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                6520           6520           1         15.3          65.2       1.0X
DataFrame                                          1192           1241          68         83.9          11.9       5.5X
Dataset                                            1640           1705          92         61.0          16.4       4.0X

OpenJDK 64-Bit Server VM 21.0.6+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
back-to-back map:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                7453           7459           9         13.4          74.5       1.0X
DataFrame                                          2922           2940          25         34.2          29.2       2.6X
Dataset                                            7021           7024           4         14.2          70.2       1.1X

OpenJDK 64-Bit Server VM 21.0.6+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter Long:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                4428           4492          90         22.6          44.3       1.0X
DataFrame                                           728            736          10        137.5           7.3       6.1X
Dataset                                            1638           1654          23         61.1          16.4       2.7X

OpenJDK 64-Bit Server VM 21.0.6+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                2086           2095          12         47.9          20.9       1.0X
DataFrame                                           110            122          10        912.5           1.1      19.0X
Dataset                                            2367           2398          45         42.3          23.7       0.9X

OpenJDK 64-Bit Server VM 21.0.6+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
aggregate:                                Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD sum                                            1404           1408           5         71.2          14.0       1.0X
DataFrame sum                                        57             70          10       1767.9           0.6      24.8X
Dataset sum using Aggregator                       1944           2023         111         51.4          19.4       0.7X
Dataset complex Aggregator                         4838           4881          60         20.7          48.4       0.3X


