update docs on super command for SQL/OLAP audience #5481

mccanne · 2024-11-14T23:43:12Z

No description provided.

philrz · 2024-11-15T16:44:19Z

docs/commands/super.md

+super -f arrows file1.json file2.parquet file3.csv > file-combined.arrows
+```
+When `super` is run with a query that has no "from" operator and no input arguments,
+the SuperSQL query is fed a single `null` value analagous to SQL's default


Suggested change

the SuperSQL query is fed a single `null` value analagous to SQL's default

the SuperSQL query is fed a single `null` value analogous to SQL's default

philrz · 2024-11-15T16:44:40Z

docs/commands/super.md

+select value 1+1
+```
+To learn more about shortcuts, refer to the SuperSQL
+[documenation on shortcuts](../language/pipeline-model.md#implied-operators).


Suggested change

[documenation on shortcuts](../language/pipeline-model.md#implied-operators).

[documentation on shortcuts](../language/pipeline-model.md#implied-operators).

philrz · 2024-11-15T16:44:57Z

docs/commands/super.md

+`super` supports a number of [input](#input-formats) and [output](#output-formats) formats, but the super formats
+([Super Binary](../formats/bsup.md),
+[Super Columnar](../formats/csup.md),
+and [Super JSON](../formats/jsup.md)) tend to the most versatile and


Suggested change

and [Super JSON](../formats/jsup.md)) tend to the most versatile and

and [Super JSON](../formats/jsup.md)) tend to be the most versatile and

philrz · 2024-11-15T16:46:28Z

docs/commands/super.md

+...
+wget https://data.gharchive.org/2023-02-08-23.json.gz
+```
+We downloadied these files into a directory called `gharchive_gz`


Suggested change

We downloadied these files into a directory called `gharchive_gz`

We downloaded these files into a directory called `gharchive_gz`

philrz · 2024-11-15T16:47:00Z

docs/commands/super.md

+`super` with Super Binary is substantially faster than the relational systems for
+the search use cases and performs on par with the others for traditional OLAP queries,
+except for the union query, where the super-structured data model trounces the relational
+model (by over 100X!) for stiching together disparate data types for analysis in an aggregation.


Suggested change

model (by over 100X!) for stiching together disparate data types for analysis in an aggregation.

model (by over 100X!) for stitching together disparate data types for analysis in an aggregation.

philrz · 2024-11-15T16:47:13Z

docs/commands/super.md


-We used the Bash `time` command to measure elapsed time.
+For our tests, We diverged a bit from the methodology in the DuckDB blog and wanted


Suggested change

For our tests, We diverged a bit from the methodology in the DuckDB blog and wanted

For our tests, we diverged a bit from the methodology in the DuckDB blog and wanted

philrz · 2024-11-15T16:47:40Z

docs/commands/super.md

+```
+duckdb gha.db -c "CREATE TABLE gha AS FROM read_json('gharchive_gz/*.json.gz', union_by_name=true)"
+```
+We now have the `duckdb` database file for out GitHub Archive data called `gha.db`


Suggested change

We now have the `duckdb` database file for out GitHub Archive data called `gha.db`

We now have the `duckdb` database file for our GitHub Archive data called `gha.db`

philrz

I put up suggestions to fix some obvious typos and such. There are more changes I'd propose, but I'm fine with seeing this merged, and I could put up my proposals in a follow-on PR.

update docs on super command for SQL/OLAP audience

fd8adff

mccanne assigned philrz and unassigned philrz Nov 14, 2024

mccanne requested review from philrz and a team November 14, 2024 23:43

mccanne added 2 commits November 14, 2024 15:44

fix trailing whitespace

29b5e20

fix markdown link

4c76764

philrz reviewed Nov 15, 2024

philrz approved these changes Nov 15, 2024

philrz added skip-autoperf skip-notify-downstream labels Nov 15, 2024

address PR feedback

0321a1c

mccanne merged commit 5f18349 into main Nov 15, 2024
4 checks passed

mccanne deleted the super-doc-updates branch November 15, 2024 19:38

philrz mentioned this pull request Nov 17, 2024

Minor edits to super command doc #5487

Merged
