-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create Overview #2591
base: latest
Are you sure you want to change the base?
Create Overview #2591
Conversation
Allow 10 minutes from last push for the staging site to build. If the link doesn't work, try using incognito mode instead. For internal reviewers, check web-documentation repo actions for staging build status. Link to build for this PR: http://docs-dev.timescale.com/docs-dev-overview-lana |
Signed-off-by: Lana Brindley <[email protected]>
Site build failed. For Timescale internal contributors, check the logs in the web-documentation repo to see the failure reason. For help, contact the docs team. |
Site build failed. For Timescale internal contributors, check the logs in the web-documentation repo to see the failure reason. For help, contact the docs team. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Partial review. Will look a little more on the rest.
Continuous aggregates require a `time_bucket` on the time partitioning column of | ||
the hypertable. | ||
|
||
By default, views are automatically refreshed. You can adjust this by setting |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
By default, views are automatically refreshed. You can adjust this by setting | |
By default, views are automatically refreshed when they are created. You can adjust this by using |
the [WITH NO DATA](#using-the-with-no-data-option) option. Additionally, the | ||
view can not be a [security barrier view][postgres-security-barrier]. | ||
|
||
Continuous aggregates use hypertables in the background, which means that they |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Continuous aggregates use hypertables in the background, which means that they | |
Continuous aggregates use hypertables internally, which means that they |
Research has shown that when data is newly ingested, the queries are more likely | ||
to be shallow in time, and wide in columns. Generally, they are debugging | ||
queries, or queries that cover the whole system, rather than specific, analytic | ||
queries. An example of the kind of query more likely for new data is "show the | ||
current CPU usage, disk usage, energy consumption, and I/O for a particular | ||
server". When this is the case, the uncompressed data has better query | ||
performance, so the native PostgreSQL row-based format is the best option. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hmm... not sure if "debugging queries" is the best categorization of this.
Research has shown that when data is newly ingested, the queries are more likely | |
to be shallow in time, and wide in columns. Generally, they are debugging | |
queries, or queries that cover the whole system, rather than specific, analytic | |
queries. An example of the kind of query more likely for new data is "show the | |
current CPU usage, disk usage, energy consumption, and I/O for a particular | |
server". When this is the case, the uncompressed data has better query | |
performance, so the native PostgreSQL row-based format is the best option. | |
For newly ingested data, the queries are usually | |
shallow in time, and wide in columns. At this stage, the queries delve into details of the system. An example of the kind of query more likely for new data is "show the | |
current CPU usage, disk usage, energy consumption, and I/O for a particular | |
server". When this is the case, the uncompressed data has better query | |
performance, so the native PostgreSQL row-based format is the best option. |
The result is transparent queries across standard PostgreSQL storage and S3 | ||
storage, so your queries fetch the same data as before, with minimal added | ||
latency. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Work focus on the utility, not on performance.
The result is transparent queries across standard PostgreSQL storage and S3 | |
storage, so your queries fetch the same data as before, with minimal added | |
latency. | |
As a result, you can write queries seamlessly reading and involving both tiered and untiered data.``` |
When you create and use a hypertable, it automatically partitions data by time, | ||
and optionally by space. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
When you create and use a hypertable, it automatically partitions data by time, | |
and optionally by space. | |
Hypertables are used to automatically partition data: traditionally using time, but hypertables can also be used to partition data in other dimensions.``` |
Each hypertable is made up of child tables called chunks. Each chunk is assigned | ||
a range of time, and only contains data from that range. If the hypertable is | ||
also partitioned by space, each chunk is also assigned a subset of the space | ||
values. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can partition using multiple time dimensions and multiple space dimensions, so suggest to elaborate a little on this.
Each hypertable is made up of child tables called chunks. Each chunk is assigned | |
a range of time, and only contains data from that range. If the hypertable is | |
also partitioned by space, each chunk is also assigned a subset of the space | |
values. | |
Each hypertable is made up of child tables called chunks. Each chunk is assigned | |
a range of time, and only contains data from that range. If the hypertable is | |
also partitioned by other dimensions, each chunk is also assigned a subset of the values in that dimension. |
Timescale is the database platform built for developers. It's engineered to | ||
deliver speed and scale for your resource-intensive workloads—like those using | ||
time series, event, and analytics data. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think that we also want to push more on the ease of use. Hypertables are performing well, but they are also hassle-free because they can handle automatic partititioning and manage data in different stages of the life-cycle.
Timescale is the database platform built for developers. It's engineered to | |
deliver speed and scale for your resource-intensive workloads—like those using | |
time series, event, and analytics data. | |
Timescale is the database platform built for developers. It's engineered to | |
without hassle deliver speed and scale for your resource-intensive workloads—like those using | |
time series, event, and analytics data. |
* _PostgreSQL++_ - Timescale is the PostgreSQL you know and love, giving you | ||
access to the entire PostgreSQL ecosystem, but Timescale has additional | ||
features like hypertables, compression and continuous aggregates. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wonder if we want to add a bullet around something like "Designed for Data Intensive Applications".
Time-series data grows very quickly, and as the data grows, analyzing it gets | ||
slower and uses more resources. Timescale solves the slow-down with continuous | ||
aggregates. Based on PostgreSQL materialized views, continuous aggregates are | ||
incrementally and continuously updated, to make them lightning fast. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Time-series data grows very quickly, and as the data grows, analyzing it gets | |
slower and uses more resources. Timescale solves the slow-down with continuous | |
aggregates. Based on PostgreSQL materialized views, continuous aggregates are | |
incrementally and continuously updated, to make them lightning fast. | |
For data-intensive applications the amount of of data that needs to be managed grows very quickly, and as the data grows, analyzing it gets | |
slower and uses more resources. Timescale solves the slow-down with continuous | |
aggregates. Based on PostgreSQL materialized views, continuous aggregates are | |
incrementally and continuously updated, to make them lightning fast. |
When you are working with time-series and event data, storage costs can easily | ||
spiral out of control. With Timescale, you never have to worry about hidden |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe talk more about "data-intensive applications that collect large amounts of time-series and events data".
Reopening this PR as it seems to have been closed by accident |
Description
[Short summary of why you created this PR]
Links
Writing help
For information about style and word usage, see the style guide
Review checklists
Reviewers: use this section to ensure you have checked everything before approving this PR:
Subject matter expert (SME) review checklist
Documentation team review checklist
and have they been implemented?