================================================================================================
Dataset Benchmark
================================================================================================

OpenJDK 64-Bit Server VM 21.0.3+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back map long:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                6408           6469          86         15.6          64.1       1.0X
DataFrame                                          1151           1152           2         86.9          11.5       5.6X
Dataset                                            1725           1850         177         58.0          17.2       3.7X

OpenJDK 64-Bit Server VM 21.0.3+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back map:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                7374           7378           6         13.6          73.7       1.0X
DataFrame                                          3111           3119          11         32.1          31.1       2.4X
Dataset                                            6397           6516         168         15.6          64.0       1.2X

OpenJDK 64-Bit Server VM 21.0.3+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter Long:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                4022           4162         197         24.9          40.2       1.0X
DataFrame                                           701            722          19        142.6           7.0       5.7X
Dataset                                            1528           1545          24         65.4          15.3       2.6X

OpenJDK 64-Bit Server VM 21.0.3+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                2065           2095          43         48.4          20.6       1.0X
DataFrame                                           106            120          12        944.9           1.1      19.5X
Dataset                                            2302           2326          34         43.4          23.0       0.9X

OpenJDK 64-Bit Server VM 21.0.3+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
aggregate:                                Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD sum                                            1424           1452          40         70.2          14.2       1.0X
DataFrame sum                                        61             73           9       1636.9           0.6      23.3X
Dataset sum using Aggregator                       1953           2020          94         51.2          19.5       0.7X
Dataset complex Aggregator                         5019           5030          16         19.9          50.2       0.3X


