Using EDB Wait States
When EDB Wait States is registered as one of the shared preload libraries, it probes each of the running sessions at regular intervals. For every session, it collects information such as:
- The database to which it's connected.
- The logged-in user of the session.
- The query running in that session.
- The wait events on which it's waiting.
This information is saved in a set of files in a user-configurable path and directory folder given by the edb_wait_states.directory
parameter to add to the postgresql.conf
file. The path must be a full, absolute path, not a relative path.
Exploring data with the interface
Each of the functions in the EDB Wait States interface has common input and output parameters. Those parameters are:
Parameter(s) | Input or output | Description |
---|---|---|
start_ts and end_ts | Input | Together these specify the time interval and the data to read. If you specify only start_ts , the data starting from start_ts is output. If you specify only end_ts , data up to end_ts is output. If you don't specify either, all the data is output. |
query_id | Output | Identifies a normalized query. It's internal hash code computed from the query. |
session_id | Output | Identifies a session. |
ref_start_ts and ref_end_ts | Output | The timestamps of a file containing a particular data point. A data point might be a wait event sample record, a query record, or a session record. |
wait_time | Output | The amount of time in seconds spent waiting for some wait events. |
cpu_time | Output | The amount of time in seconds spent working on the CPU. For this given duration, the query wasn't waiting on any wait event. |
db_time | Output | The sum of the wait_time and the cpu_time . The db_time , wait_time , and the cpu_time don't provide an exact time. They provide an approximate time computed based on number of occurrences and the sampling interval. |
The following examples use a scenario where three queries are executed simultaneously on four different sessions connected to different databases using different users. Those three queries are:
SELECT schemaname FROM pg_tables, pg_sleep(15) WHERE schemaname <> 'pg_catalog'; /* ran on 2 sessions */ SELECT tablename FROM pg_tables, pg_sleep(10) WHERE schemaname <> 'pg_catalog'; SELECT tablename, schemaname FROM pg_tables, pg_sleep(10) WHERE schemaname <> 'pg_catalog';
edb_wait_states_data
Use this function to read the data collected by the BGW:
edb_wait_states_data( IN start_ts timestamptz default '-infinity'::timestamptz, IN end_ts timestamptz default 'infinity'::timestamptz, OUT session_id int4, OUT <dbname> text, OUT <username> text, OUT <query> text, OUT <query_start_time> timestamptz, OUT <sample_time> timestamptz, OUT <wait_event_type> text, OUT <wait_event> text )
You can use this function to find out the following:
The queries running in the given duration (defined by
start_ts
andend_ts
) in all the sessions, and the wait events, if any, they were waiting on. For example:SELECT query, session_id, wait_event_type, wait_event FROM edb_wait_states_data(start_ts, end_ts);
The progress of a session within a given duration, that is, the queries run in a session (
session_id = 100000
) and the wait events the queries waited on. For example:SELECT query, wait_event_type, wait_event FROM edb_wait_states_data(start_ts, end_ts) WHERE session_id = 100000;
The duration for which the samples are available. For example:
SELECT min(sample_time), max(sample_time) FROM edb_wait_states_data();
Parameters
In addition to the common parameters described previously, each row of the output gives the following:
dbname
— The session's database.
username
— The session's logged-in user.
query
— The query running in the session.
query_start_time
— The time when the query started.
sample_time
— The time when wait event data was collected.
wait_event_type
— The type of wait event the session (backend) is waiting on.
wait_event
— The wait event the session (backend) is waiting on.
Example
This example shows sample output from the edb_wait_states_data()
function:
edb=# SELECT * FROM edb_wait_states_data();
-[ RECORD 1 ]----+------------------------------------------------------------------------- session_id | 4398 dbname | edb username | enterprisedb query | SELECT schemaname FROM pg_tables, pg_sleep($1) WHERE schemaname <> $2 query_start_time | 17-AUG-18 11:56:05.271962 -04:00 sample_time | 17-AUG-18 11:56:19.700236 -04:00 wait_event_type | Timeout wait_event | PgSleep -[ RECORD 2 ]----+------------------------------------------------------------------------- session_id | 4398 dbname | edb username | enterprisedb query | SELECT schemaname FROM pg_tables, pg_sleep($1) WHERE schemaname <> $2 query_start_time | 17-AUG-18 11:56:05.271962 -04:00 sample_time | 17-AUG-18 11:56:18.699938 -04:00 wait_event_type | Timeout wait_event | PgSleep -[ RECORD 3 ]----+------------------------------------------------------------------------- session_id | 4398 dbname | edb username | enterprisedb query | SELECT schemaname FROM pg_tables, pg_sleep($1) WHERE schemaname <> $2 query_start_time | 17-AUG-18 11:56:05.271962 -04:00 sample_time | 17-AUG-18 11:56:17.700253 -04:00 wait_event_type | Timeout wait_event | PgSleep . . .
edb_wait_states_directory_size
This function gives the size of the $PGDATA/edb_wait_states
directory.
edb_wait_states_directory_size( IN start_ts timestampz default '-inifinity'::timestampz, IN end_ts timestampz default 'infinity'::timestampz );
The function returns the total size of all the files in the edb_wait_states
directory in bytes. Optionally specify the start_ts
and end_ts
timestamps to get the file size of all the files in the specified interval.
Note
This function calculates and gives the size of all the files with prefix ews_*
only. It ignores any other file added to the edb_wait_states
directory manually.
Example
This example shows the sample output from the edb_wait_states_directory_size()
function:
edb=# select edb_wait_states_directory_size();
edb_wait_states_directory_size ------------------------------ 1712256 (1 row)
edb_wait_states_queries
This function gives information about the queries sampled by the BGW. For example:
edb_wait_states_queries( IN start_ts timestamptz default '-infinity'::timestamptz, IN end_ts timestamptz default 'infinity'::timestamptz, OUT query_id int8, OUT <query> text, OUT ref_start_ts timestamptz OUT ref_end_ts timestamptz )
A new queries file is created periodically. Multiple query files can be generated corresponding to specific intervals.
This function returns all the queries in query files that overlap with the given time interval. A query gives all the queries in query files that contained queries sampled between start_ts
and end_ts
:
SELECT query FROM edb_wait_states_queries(start_ts, end_ts);
In other words, the function can output queries that didn't run in the given interval. To do that, use edb_wait_states_data()
.
Parameters
In addition to the common parameters described previously, each row of the output gives the following:
query