================================================================================================
Benchmark for performance of JSON parsing
================================================================================================

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
JSON schema inferring:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        2406           2422          16          2.1         481.1       1.0X
UTF-8 is set                                       3323           3335          10          1.5         664.5       0.7X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
count a short column:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        2005           2037          29          2.5         401.0       1.0X
UTF-8 is set                                       3138           3143           5          1.6         627.6       0.6X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
count a wide column:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        3337           3406          64          0.3        3336.8       1.0X
UTF-8 is set                                       4383           4411          27          0.2        4383.4       0.8X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
select wide row:                          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        9364           9394          35          0.0      187287.2       1.0X
UTF-8 is set                                      10402          10439          42          0.0      208036.3       0.9X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
Select a subset of 10 columns:            Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Select 10 columns                                  1657           1663           5          0.6        1657.2       1.0X
Select 1 column                                    1328           1331           4          0.8        1327.7       1.2X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
creation of JSON parser per line:         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Short column without encoding                       656            656           1          1.5         655.8       1.0X
Short column with UTF-8                             844            858          23          1.2         843.5       0.8X
Wide column without encoding                       5501           5529          26          0.2        5501.5       0.1X
Wide column with UTF-8                             6440           6460          25          0.2        6440.0       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
JSON functions:                           Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                            51             52           1         19.8          50.6       1.0X
from_json                                          1134           1142           7          0.9        1134.4       0.0X
json_tuple                                         1117           1121           4          0.9        1116.9       0.0X
get_json_object wholestage off                     1036           1042           7          1.0        1036.3       0.0X
get_json_object wholestage on                       944            945           1          1.1         944.3       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
Dataset of json strings:                  Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                           227            230           3         22.0          45.5       1.0X
schema inferring                                   1835           1836           1          2.7         367.1       0.1X
parsing                                            2831           2843          15          1.8         566.3       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
Json files in the per-line mode:          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                           588            594           6          8.5         117.5       1.0X
Schema inferring                                   2444           2449           5          2.0         488.8       0.2X
Parsing without charset                            3046           3052           5          1.6         609.2       0.2X
Parsing with UTF-8                                 3937           3940           4          1.3         787.4       0.1X

OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
Write dates and timestamps:               Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Create a dataset of timestamps                      100            105           6         10.0         100.5       1.0X
to_json(timestamp)                                  815            820           4          1.2         815.2       0.1X
write timestamps to files                           734            745          14          1.4         733.6       0.1X
Create a dataset of dates                           112            118           6          8.9         111.9       0.9X
to_json(date)                                       606            608           3          1.6         606.3       0.2X
write dates to files                                472            480           8          2.1         472.3       0.2X

OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
Read dates and timestamps:                                             Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
-----------------------------------------------------------------------------------------------------------------------------------------------------
read timestamp text from files                                                   154            156           3          6.5         153.9       1.0X
read timestamps from files                                                      1048           1055           6          1.0        1048.1       0.1X
infer timestamps from files                                                     1962           1967           6          0.5        1961.6       0.1X
read date text from files                                                        139            148          12          7.2         139.1       1.1X
read date from files                                                             728            736           7          1.4         727.8       0.2X
timestamp strings                                                                135            140           5          7.4         134.7       1.1X
parse timestamps from Dataset[String]                                           1193           1197           3          0.8        1192.9       0.1X
infer timestamps from Dataset[String]                                           2042           2046           4          0.5        2042.3       0.1X
date strings                                                                     203            204           1          4.9         203.1       0.8X
parse dates from Dataset[String]                                                 991            996           8          1.0         990.6       0.2X
from_json(timestamp)                                                            1670           1679          11          0.6        1669.9       0.1X
from_json(date)                                                                 1459           1460           1          0.7        1458.6       0.1X
infer error timestamps from Dataset[String] with default format                 1393           1400           7          0.7        1392.6       0.1X
infer error timestamps from Dataset[String] with user-provided format           1384           1388           5          0.7        1383.6       0.1X
infer error timestamps from Dataset[String] with legacy format                  1418           1419           2          0.7        1418.3       0.1X

OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
Filters pushdown:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
w/o filters                                        5964           5972          10          0.0       59641.7       1.0X
pushdown disabled                                  5780           5785           6          0.0       57798.8       1.0X
w/ filters                                          701            702           1          0.1        7010.4       8.5X

OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
AMD EPYC 7763 64-Core Processor
Partial JSON results:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
parse invalid JSON                                 2429           2545         138          0.0      242888.8       1.0X


