Using HDFS URLs

Use the hdfs URL scheme when accessing files in HDFS. With this scheme, Vertica bypasses the slower, less stable WebHDFS service when possible and falls back to it only when necessary. To use WebHDFS for all access, set the HDFSUseWebHDFS configuration parameter and continue to use hdfs URLs; changing the connection method does not require rewriting any URLs. (See Apache Hadoop Parameters.)
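As a minimal sketch of forcing WebHDFS for all access, the statement below sets the parameter at the database level. The value 1 (enabled) and the database-level scope are assumptions; confirm both against Apache Hadoop Parameters for your Vertica version.

```sql
-- Assumption: 1 enables WebHDFS for all HDFS access; the default leaves it off.
ALTER DATABASE DEFAULT SET PARAMETER HDFSUseWebHDFS = 1;

-- Existing statements that use hdfs URLs stay exactly as written;
-- only the underlying connection method changes.
```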

You can use the hdfs scheme with COPY and with CREATE EXTERNAL TABLE AS COPY. When using the hdfs scheme with COPY, you do not need to specify ON ANY NODE.
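The following sketch shows both uses. The table names, column definitions, paths, and the hadoopNS name service are hypothetical; the sales table is assumed to exist already, and the ORC parser is used only as an example of a columnar format.

```sql
-- Load delimited files. hdfs:/// (no authority) relies on the default
-- file system named in the Hadoop configuration. ON ANY NODE is omitted.
COPY sales FROM 'hdfs:///data/sales/2024/*.csv' DELIMITER ',';

-- The same load, naming a (hypothetical) name service explicitly.
COPY sales FROM 'hdfs://hadoopNS/data/sales/2024/*.csv' DELIMITER ',';

-- Define an external table over ORC files that remain in HDFS.
CREATE EXTERNAL TABLE sales_ext (
    sale_id INT,
    sold_on DATE,
    amount  NUMERIC(12,2)
) AS COPY FROM 'hdfs:///data/sales_orc/*.orc' ORC;
```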

Vertica requires access to certain configuration files from your HDFS cluster.
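One way this is typically arranged, shown here as a sketch under stated assumptions: copies of core-site.xml and hdfs-site.xml are placed in the same directory on every Vertica node, and the HadoopConfDir parameter points to that directory. The directory path is hypothetical, and the parameter and function names should be checked against the documentation for your version.

```sql
-- Assumption: /etc/hadoop/conf exists on every Vertica node and holds
-- copies of core-site.xml and hdfs-site.xml from the Hadoop cluster.
ALTER DATABASE DEFAULT SET PARAMETER HadoopConfDir = '/etc/hadoop/conf';

-- Ask Vertica to check that it can find and read the configuration files.
SELECT VERIFY_HADOOP_CONF_DIR();
```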

If the file you want to read resides on an HDFS cluster that uses Kerberos authentication, Vertica uses the current user's principal, session doAs user, or session delegation token. See Accessing Kerberized HDFS Data for more information about these options.
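As a rough illustration of the two session-level options, the statements below set a doAs user or a delegation token for one name service. The HadoopImpersonationConfig parameter name, the JSON layout, the hadoopNS name service, and the analyst1 user are assumptions for illustration; the token value is a placeholder, not a real token. Accessing Kerberized HDFS Data remains the authoritative description.

```sql
-- Option 1 (assumed syntax): act as a named Hadoop user for this session.
ALTER SESSION SET HadoopImpersonationConfig =
    '[{"nameservice":"hadoopNS", "doAs":"analyst1"}]';

-- Option 2 (assumed syntax): supply a delegation token obtained outside
-- Vertica. Replace the placeholder with the actual token string.
ALTER SESSION SET HadoopImpersonationConfig =
    '[{"nameservice":"hadoopNS", "token":"<delegation-token>"}]';
```

If neither option is set, access falls back to the current user's principal, as described above.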
