Managing Load Streams

You can use system tables to keep track of data being loaded on your cluster.

System Table    Description
LOAD_STREAMS    Monitors load metrics for each load stream on each node.

For details about Vertica system tables, see Vertica System Tables.

When a COPY statement using the DIRECT option is in progress, the ACCEPTED_ROW_COUNT value can increase during parsing, up to the total number of rows in the input file.

If COPY reads input data from multiple named pipes, the PARSE_COMPLETE_PERCENT value remains at zero (0) until all named pipes return an EOF. While COPY awaits an EOF from multiple pipes, it can appear to hang. Before canceling the COPY statement, check CPU and disk activity on your system to determine whether the load is still making progress.

In a typical load, the PARSE_COMPLETE_PERCENT value can either increase slowly or, if you are loading from named pipes or STDIN, jump quickly to 100%. During this time, SORT_COMPLETE_PERCENT remains at 0. After PARSE_COMPLETE_PERCENT reaches 100%, SORT_COMPLETE_PERCENT increases to 100%. Depending on the data size, a significant lag can occur between the time PARSE_COMPLETE_PERCENT reaches 100% and the time SORT_COMPLETE_PERCENT begins to increase.
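
The following query is a minimal sketch of how you might watch the values described above while a load is running. The three columns named in this section (ACCEPTED_ROW_COUNT, PARSE_COMPLETE_PERCENT, SORT_COMPLETE_PERCENT) come from the text. The V_MONITOR schema qualifier and the TABLE_NAME, STREAM_NAME, REJECTED_ROW_COUNT, and IS_EXECUTING columns are assumptions that you should check against the LOAD_STREAMS reference for your Vertica version.

```sql
-- Progress of loads that are still running.
-- Assumed (not named in this section): the v_monitor schema and the
-- table_name, stream_name, rejected_row_count, and is_executing columns.
SELECT table_name,
       stream_name,
       accepted_row_count,
       rejected_row_count,
       parse_complete_percent,
       sort_complete_percent
FROM v_monitor.load_streams
WHERE is_executing
ORDER BY table_name, stream_name;
```

Running the query again after a short interval shows whether the counts and percentages are advancing. If PARSE_COMPLETE_PERCENT stays at 0 for a load that reads from multiple named pipes, that alone does not indicate a stalled load, so check CPU and disk activity before canceling the COPY statement.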