================================================================================================
Benchmark for performance of JSON parsing
================================================================================================

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
JSON schema inferring:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        2310           2321          12          2.2         461.9       1.0X
UTF-8 is set                                       4784           4795          10          1.0         956.9       0.5X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
count a short column:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        2215           2246          30          2.3         443.0       1.0X
UTF-8 is set                                       4718           4724           7          1.1         943.5       0.5X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
count a wide column:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        4269           4376         151          0.2        4268.9       1.0X
UTF-8 is set                                       4447           4459          15          0.2        4446.8       1.0X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
select wide row:                          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        9025           9101          70          0.0      180505.7       1.0X
UTF-8 is set                                       9983          10004          24          0.0      199666.6       0.9X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
Select a subset of 10 columns:            Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Select 10 columns                                  1585           1588           5          0.6        1584.6       1.0X
Select 1 column                                    1253           1256           4          0.8        1253.2       1.3X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
creation of JSON parser per line:         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Short column without encoding                       644            646           2          1.6         644.2       1.0X
Short column with UTF-8                            1163           1174          18          0.9        1162.6       0.6X
Wide column without encoding                       5178           5191          12          0.2        5177.7       0.1X
Wide column with UTF-8                             5983           5987           4          0.2        5982.7       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
JSON functions:                           Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                            56             59           3         17.9          55.8       1.0X
from_json                                          1039           1057          28          1.0        1038.8       0.1X
json_tuple                                         1036           1040           5          1.0        1035.5       0.1X
get_json_object wholestage off                     1056           1061           8          0.9        1055.9       0.1X
get_json_object wholestage on                       993            999           6          1.0         992.7       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
Dataset of json strings:                  Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                           260            263           4         19.2          52.0       1.0X
schema inferring                                   1765           1768           2          2.8         353.0       0.1X
parsing                                            2756           2775          18          1.8         551.2       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
Json files in the per-line mode:          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                           580            583           2          8.6         116.1       1.0X
Schema inferring                                   2349           2355           8          2.1         469.8       0.2X
Parsing without charset                            2935           2940           8          1.7         587.0       0.2X
Parsing with UTF-8                                 5463           5504          69          0.9        1092.6       0.1X

OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
Write dates and timestamps:               Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Create a dataset of timestamps                      102            106           6          9.8         101.9       1.0X
to_json(timestamp)                                  634            637           3          1.6         634.4       0.2X
write timestamps to files                           676            678           2          1.5         676.2       0.2X
Create a dataset of dates                           114            117           2          8.7         114.4       0.9X
to_json(date)                                       439            440           1          2.3         439.1       0.2X
write dates to files                                435            438           2          2.3         435.2       0.2X

OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
Read dates and timestamps:                                             Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
-----------------------------------------------------------------------------------------------------------------------------------------------------
read timestamp text from files                                                   151            154           2          6.6         151.5       1.0X
read timestamps from files                                                      1049           1051           3          1.0        1048.9       0.1X
infer timestamps from files                                                     1959           1967          10          0.5        1958.7       0.1X
read date text from files                                                        136            141           5          7.4         135.9       1.1X
read date from files                                                             703            713          10          1.4         702.7       0.2X
timestamp strings                                                                141            143           2          7.1         141.1       1.1X
parse timestamps from Dataset[String]                                           1191           1195           7          0.8        1190.9       0.1X
infer timestamps from Dataset[String]                                           2065           2071           9          0.5        2065.3       0.1X
date strings                                                                     203            205           2          4.9         202.6       0.7X
parse dates from Dataset[String]                                                 967            970           4          1.0         967.0       0.2X
from_json(timestamp)                                                            1629           1632           2          0.6        1628.9       0.1X
from_json(date)                                                                 1371           1396          41          0.7        1371.0       0.1X
infer error timestamps from Dataset[String] with default format                 1400           1404           4          0.7        1400.1       0.1X
infer error timestamps from Dataset[String] with user-provided format           1401           1405           5          0.7        1400.8       0.1X
infer error timestamps from Dataset[String] with legacy format                  1429           1431           2          0.7        1429.0       0.1X

OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
Filters pushdown:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
w/o filters                                        6452           6469          19          0.0       64519.3       1.0X
pushdown disabled                                  6245           6253           8          0.0       62445.5       1.0X
w/ filters                                          556            560           4          0.2        5563.4      11.6X

OpenJDK 64-Bit Server VM 17.0.14+7-LTS on Linux 6.8.0-1020-azure
AMD EPYC 7763 64-Core Processor
Partial JSON results:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
parse invalid JSON                                 2255           2376         205          0.0      225510.6       1.0X


