Working with External Data

An alternative to importing data into Vertica is to query it in place. Querying external data instead of importing it can be advantageous in some cases:

To query external data, you must describe your data as an external table. Like Vertica-managed tables, external tables have table definitions and can be queried. Unlike Vertica-managed tables, external tables have no catalog and Vertica loads selected data from the external source as needed.  For some formats, the query planner can take advantage of partitions and sorting in the data, so querying an external table does not mean you load all of the data at query time. (For more information about Vertica-managed tables, see Working with Vertica-Managed Tables.)

There is one special type of external data not covered in this section. If you are reading data from Hadoop, and specifically from a Hive data warehouse, then instead of defining your own external tables you can read the schema information from Hive. For more information, see Using the HCatalog Connector in Integrating with Apache Hadoop.

In This Section