Using EDB Wait States
When EDB Wait States is registered as one of the shared preload libraries, it probes each of the running sessions at regular intervals. For every session, it collects information such as:
- The database to which it's connected.
- The logged-in user of the session.
- The query running in that session.
- The wait events on which it's waiting.
This information is saved in a set of files in a user-configurable path and directory folder given by the edb_wait_states.directory
parameter to add to the postgresql.conf
file. The path must be a full, absolute path, not a relative path.
Exploring data with the interface
Each of the functions in the EDB Wait States interface has common input and output parameters. Those parameters are:
Parameter(s) | Input or output | Description |
---|---|---|
start_ts and end_ts | Input | Together these specify the time interval and the data to read. If you specify only start_ts , the data starting from start_ts is output. If you specify only end_ts , data up to end_ts is output. If you don't specify either, all the data is output. |
query_id | Output | Identifies a normalized query. It's internal hash code computed from the query. |
session_id | Output | Identifies a session. |
ref_start_ts and ref_end_ts | Output | The timestamps of a file containing a particular data point. A data point might be a wait event sample record, a query record, or a session record. |
wait_time | Output | The amount of time in seconds spent waiting for some wait events. |
cpu_time | Output | The amount of time in seconds spent working on the CPU. For this given duration, the query wasn't waiting on any wait event. |
db_time | Output | The sum of the wait_time and the cpu_time . The db_time , wait_time , and the cpu_time don't provide an exact time. They provide an approximate time computed based on number of occurrences and the sampling interval. |
The following examples use a scenario where three queries are executed simultaneously on four different sessions connected to different databases using different users. Those three queries are:
edb_wait_states_data
Use this function to read the data collected by the BGW:
You can use this function to find out the following:
The queries running in the given duration (defined by
start_ts
andend_ts
) in all the sessions, and the wait events, if any, they were waiting on. For example:The progress of a session within a given duration, that is, the queries run in a session (
session_id = 100000
) and the wait events the queries waited on. For example:The duration for which the samples are available. For example:
Parameters
In addition to the common parameters described previously, each row of the output gives the following:
dbname
— The session's database.
username
— The session's logged-in user.
query
— The query running in the session.
query_start_time
— The time when the query started.
sample_time
— The time when wait event data was collected.
wait_event_type
— The type of wait event the session (backend) is waiting on.
wait_event
— The wait event the session (backend) is waiting on.
Example
This example shows sample output from the edb_wait_states_data()
function:
edb_wait_states_directory_size
This function gives the size of the $PGDATA/edb_wait_states
directory.
The function returns the total size of all the files in the edb_wait_states
directory in bytes. Optionally specify the start_ts
and end_ts
timestamps to get the file size of all the files in the specified interval.
Note
This function calculates and gives the size of all the files with prefix ews_*
only. It ignores any other file added to the edb_wait_states
directory manually.
Example
This example shows the sample output from the edb_wait_states_directory_size()
function:
edb_wait_states_queries
This function gives information about the queries sampled by the BGW. For example:
A new queries file is created periodically. Multiple query files can be generated corresponding to specific intervals.
This function returns all the queries in query files that overlap with the given time interval. A query gives all the queries in query files that contained queries sampled between start_ts
and end_ts
:
In other words, the function can output queries that didn't run in the given interval. To do that, use edb_wait_states_data()
.
Parameters
In addition to the common parameters described previously, each row of the output gives the following:
query