Native BI, ETL, & Hadoop/MapReduce Integration

As “big data” growth and the importance of monetizing this information continue, more companies are joining a growing ecosystem of data warehousing and analytics solutions. Not every company can provide all the necessary tools to make the most of your business-critical information, so HP partners with best-of-breed tools that integrate with the HP Vertica Analytics Platform.

Through our standard connectors and a flexible API, HP enables you to move data into and out of the HP Vertica Analytics Platform, in creating reports, dashboards, and queries, and in developing applications that offer drastically better query performance and scalability. HP offers this industry-standard connectivity through ODBC, JDBC, ADO.Net, and our rich API as well as native integrations and certifications with a variety of tools like Cognos, MicroStrategy, Tableau and others. HP Vertica also offers an Informatica plug-in.

HP Vertica Hadoop Distributed File System (HDFS) Connector

HP Vertica was the first analytic database company to deliver a Hadoop Connector. HP Vertica now offers two connectors to transfer data seamlessly between Hadoop and HP Vertica:

  • The Hadoop Distributed File System (HDFS) connector enables you to load data from HDFS using the HP Vertica native COPY facility. This mechanism simplifies and accelerates the process of loading data stored in HDFS without any MapReduce coding. The connector also ensures that data is loaded from the Hadoop cluster with the optimal amount of parallelism. By using the connector with the HP Vertica External Tables feature, you can even query data in HDFS without copying data into HP Vertica. To provide you with the highest level of data security and compliance, HP Vertica offers secure access to your HDFS data with Kerberos authentication.
  • The Hadoop & Pig Connector is bidirectional and enables you to move data from Hadoop to HP Vertica or vice versa via either MapReduce or Pig jobs.

With HP Vertica HDFS and Pig Connectors, you have unprecedented flexibility and speed in loading data from HDFS to the HP Vertica Analytics Platform and querying data from the HP Vertica Analytics Platform in Hadoop. The HP Vertica HDFS and Pig Connectors are open source, supported by HP Vertica, and available for download.

HP Vertica provides optimized JDBC and ODBC client drivers for most platforms including Windows, Linux, Solaris, AIX, and others.

HCatalog Integration

The HP Vertica Analytics Platform offers direct integration with HCatalog, Hadoop’s table and storage management layer. This integration enhances your ability to navigate data in Hadoop and quickly and easily load that data into the HP Vertica Analytics Platform for high-performance analytics. Specifically, this feature lets you access your data while dealing with changing schemas in your big data files. You don’t have to spend time recreating schema definitions in HP Vertica when the data changes. In addition, you can more easily explore external tables stored in Hadoop.

Open SQL on Hadoop 

Data stored in the Hadoop offers the potential for extraordinary business insights yet can be challenging to explore in traditional database platforms. The HP Vertica Analytics Platform offers the industry’s most open “SQL-on-Hadoop” capabilities. Unlike other SQL-on-Hadoop solutions, HP Vertica works with major Hadoop distributions, ensuring high-performance analytics across the broadest range of data types and sources.

HP Vertica Flex Zone

HP Vertica Flex Zone is a revolutionary offering based on the patent-pending Flex Tables technology. HP Vertica Flex Zone enables you to ingest and manage structured and, now, semi-structured data – without creating any time-consuming schemas. This offering simplifies data preparation and loading — especially as data sources and file structures change frequently — enabling cost-effective, semi-structured storage without purchasing additional hardware. Best of all, you can achieve a considerably higher performance improvement over file-based systems upon initial upload and preparation for real-time analytics.


Additional Resources

Download the Vertica Connector for Hadoop & Pig