ETL

Jump-Start Your ETL Application Development with Vertica

Interested in exploring the Vertica Analytic Database in the context of data movement and transformation? To get a feel for it, try our new ETL QuickStart sample apps. You’ll find them on the Big Data Marketplace. Our Partner Engineering team develops QuickStart apps using tools from our technology partners. Currently we have ETL QuickStarts for […]

Workload Management Metrics – A Golden Triangle

Modern databases are often required to process many different kinds of workloads, ranging from short, tactical queries to medium-complexity ad hoc queries, long-running batch ETL jobs, and extremely complex data mining jobs. (See my previous blog post on workload classification for more information.) DBAs must ensure that all concurrent workloads, along with their respective Service Level Agreements (SLAs), can coexist while maximizing the system’s overall performance.

A Method for Vertica Workload Classification

Modern analytic databases such as Vertica often need to process a myriad of workloads, ranging from the simplest primary-key lookup to complex analytical queries that include dozens of large tables and joins between them. Different types of load jobs (such as batch-type ETL jobs and near-real-time trickle loads) keep the data up to date in an enterprise data warehouse (EDW). Therefore, an enterprise-class database like Vertica must have a robust yet easy-to-use mixed-workload management capability.

Enter the FlexZone – Let’s talk ETL

In December, HP released version 7 of the Vertica analytics platform, which includes, among other things, a great new feature called Vertica Flex Zone (Flex Zone). Flex Zone enables you to quickly and easily load, explore, and analyze some forms of semi-structured data. It eliminates the need to define or apply coding-intensive schemas before the data is loaded for exploration.