Parallel Load Streams
When you have a large amount of data to load, use parallel load streams to distribute the load operations across the cluster. To do this, create threads from multiple nodes that are connected to the Vertica database and load the data. This approach lets you use vsql, ODBC, ADO.NET, or JDBC. You can load server-side files or client-side files using the COPY from LOCAL statement.
Best practices for parallel load streams are:
- Issue a single multi-node COPY command that loads different files from different nodes. Specify the nodename option for each file.
- Issue a single multi-node COPY command that loads different files from any node using the ON ANY NODE option.
For additional information, see Using Parallel Load Streams in the Administrator's Guide.