================================================================================================
Dataset Benchmark
================================================================================================

OpenJDK 64-Bit Server VM 17.0.5+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
back-to-back map long:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                               11190          11293         145          8.9         111.9       1.0X
DataFrame                                          1736           1816         112         57.6          17.4       6.4X
Dataset                                            2567           2579          16         38.9          25.7       4.4X

OpenJDK 64-Bit Server VM 17.0.5+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
back-to-back map:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                               13353          13367          19          7.5         133.5       1.0X
DataFrame                                          4046           4053          10         24.7          40.5       3.3X
Dataset                                           14414          14480          93          6.9         144.1       0.9X

OpenJDK 64-Bit Server VM 17.0.5+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
back-to-back filter Long:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                2579           2639          84         38.8          25.8       1.0X
DataFrame                                          1014           1036          31         98.6          10.1       2.5X
Dataset                                            2498           2507          13         40.0          25.0       1.0X

OpenJDK 64-Bit Server VM 17.0.5+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
back-to-back filter:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                3547           3557          14         28.2          35.5       1.0X
DataFrame                                           161            194          22        622.7           1.6      22.1X
Dataset                                            4029           4085          79         24.8          40.3       0.9X

OpenJDK 64-Bit Server VM 17.0.5+8 on Linux 5.15.0-1023-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
aggregate:                                Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD sum                                            3333           3344          16         30.0          33.3       1.0X
DataFrame sum                                        59             77          12       1708.6           0.6      56.9X
Dataset sum using Aggregator                       3257           3279          31         30.7          32.6       1.0X
Dataset complex Aggregator                         7984           8053          97         12.5          79.8       0.4X


