Vertica

Archive for the ‘Vertica 7’ Category

What’s New in Dragline (7.1.0): Resource Pool Routing

Resource Pool Routing from Vertica Systems on Vimeo.

HP Vertica 7.1.x introduces new features that allow you to dynamically reroute queries to secondary resource pools.

Use the new CASCADE TO parameter when creating or altering a resource pool to indicate a secondary resource pool to which queries can cascade and execute when they exceed the RUNTIMECAP of the pool on which they started running. This way, if a query exceeds its initial pool’s RUNTIMECAP, it can cascade to a designated secondary pool with a larger RUNTIMECAP instead of causing an error.

Because grant privileges are not considered on secondary pools, you can use this functionality to designate secondary resource pools to which user queries can cascade, without giving users explicit permission to run queries on those pools.
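
The cascading behavior described above can be sketched as a small Python model. This is only an illustration, not Vertica code: the `ResourcePool` class, the `route_query` function, and the pool names are hypothetical, and the sketch assumes that a query's total runtime is compared against each pool's cap in turn.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ResourcePool:
    name: str
    runtimecap_s: float  # stands in for the pool's RUNTIMECAP, in seconds
    cascade_to: Optional["ResourcePool"] = None  # stands in for CASCADE TO


def route_query(start_pool: ResourcePool, runtime_s: float) -> List[str]:
    """Return the names of the pools a query runs in, in order.

    Assumption: the query's total runtime is checked against each pool's cap.
    """
    visited: List[str] = []
    pool: Optional[ResourcePool] = start_pool
    while pool is not None:
        visited.append(pool.name)
        if runtime_s <= pool.runtimecap_s:
            return visited  # the query finishes within this pool's cap
        pool = pool.cascade_to  # cap exceeded: cascade if a secondary pool exists
    raise TimeoutError(
        f"query exceeded RUNTIMECAP of {visited[-1]} with no pool to cascade to"
    )


# Hypothetical pools: short queries cascade to a pool with a larger cap.
long_pool = ResourcePool("long_queries", runtimecap_s=300)
short_pool = ResourcePool("short_queries", runtimecap_s=60, cascade_to=long_pool)

print(route_query(short_pool, 30))   # ['short_queries']
print(route_query(short_pool, 120))  # ['short_queries', 'long_queries']
```

In this model, a 120-second query that starts in the 60-second pool moves to the 300-second pool rather than failing. Only a query that outlasts the last pool in the chain raises an error.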

Check out this video to learn more about dynamically rerouting queries.
For more information, see the documentation.

What’s New in Dragline (7.1.0): Projections with Expressions

Projections with Expressions from Vertica Systems on Vimeo.

HP Vertica 7.1 introduces the ability to use expressions with projections.

When you create a projection, you now have the option to use an expression as a column definition. For example, you can declare a column that sums two variables. When you load data into a projection with expressions, HP Vertica automatically calculates the values according to the expressions and inserts the data into that new projection.
Once you create a projection with expressions, you can query the projection directly to access your pre-calculated data and eliminate resource-intensive computations at query time.
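
As a rough illustration of the idea, the following Python sketch computes an expression once at load time and stores the result alongside the source values. It is a toy model under stated assumptions, not Vertica's implementation: the `ExpressionProjection` class and its methods are hypothetical names.

```python
from typing import Callable, List, Tuple


class ExpressionProjection:
    """Toy model: stores each loaded row together with a precomputed expression value."""

    def __init__(self, expression: Callable[[int, int], int]) -> None:
        self.expression = expression
        self.rows: List[Tuple[int, int, int]] = []

    def load(self, a: int, b: int) -> None:
        # The expression is evaluated once, at load time.
        self.rows.append((a, b, self.expression(a, b)))

    def query(self) -> List[Tuple[int, int, int]]:
        # Reads return the stored values; nothing is recomputed here.
        return list(self.rows)


# A projection whose third column is the sum of the first two.
sum_projection = ExpressionProjection(lambda a, b: a + b)
for a, b in [(1, 2), (10, 5)]:
    sum_projection.load(a, b)

print(sum_projection.query())  # [(1, 2, 3), (10, 5, 15)]
```

The cost of evaluating the expression is paid when data is loaded, so repeated reads of the derived column do no further computation.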

Watch this video to learn more about using expressions with projections.

Projections with Expressions documentation.

See also:
Live Aggregate Projections
Top-K Projections

What’s New in Dragline (7.1.0): Top-K Projections

Top K Projections from Vertica Systems on Vimeo.

HP Vertica 7.1 introduces Top-K projections. A Top-K projection is a type of live aggregate projection that returns the top k rows from a partition.

Top-K projections are useful when you want to retrieve the top rows from a group of frequently updated, aggregated data. For example, say you want to view a list of the 5 singers who have the most votes in a competition. This list can change every time a vote comes in. But since the data is aggregated as it is loaded into your table and Top-K projection, all you have to do is grab the top 5 rows to see the top 5 singers at any moment. Using a Top-K projection is more efficient than aggregating (counting all the votes) every time you want to find the top 5 singers.
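
The singer example can be sketched in Python as follows. This is a toy model of aggregating at load time, not a description of how Vertica stores Top-K projections: the `VoteTopK` class, its methods, and the singer names are all hypothetical.

```python
import heapq
from collections import Counter
from typing import List, Tuple


class VoteTopK:
    """Toy model: vote counts are aggregated on load, so reading the top k is cheap."""

    def __init__(self, k: int) -> None:
        self.k = k
        self.counts: Counter = Counter()

    def load_vote(self, singer: str) -> None:
        # Aggregate as each vote arrives instead of recounting at query time.
        self.counts[singer] += 1

    def top(self) -> List[Tuple[str, int]]:
        # Pick the k largest running totals; raw votes are never rescanned.
        return heapq.nlargest(self.k, self.counts.items(), key=lambda item: item[1])


board = VoteTopK(k=2)
for singer in ["Ana", "Ben", "Ana", "Cy", "Ana", "Ben"]:
    board.load_vote(singer)

print(board.top())  # [('Ana', 3), ('Ben', 2)]
```

Each read works from the running totals, so its cost depends on the number of singers rather than on the number of votes cast.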

Check out this video to learn more about Top-K projections and stay tuned for our next video, ‘Projections with Expressions’.

Top-K Documentation

What’s New in Dragline (7.1.0): Installing HP Vertica Pulse

Installing Pulse from Vertica Systems on Vimeo.

HP Vertica 7.1.0 introduces the general availability of HP Vertica Pulse, our add-on sentiment analysis package for HP Vertica. Pulse provides a suite of functions that allow you to analyze and extract the sentiment from text, directly from your HP Vertica database. For example, you can use HP Vertica Pulse to analyze sentiment from Tweets or online product reviews to get a feel for how satisfied your customers are with your products or services.

HP Vertica Pulse automatically discovers attributes included in text and scores them using a built-in system dictionary. You can tune user dictionaries to detect certain words or phrases, to determine how words are scored, and to filter out attributes that are of no interest to you. Because of this flexibility, you can tune HP Vertica Pulse to work for your specific business needs.

Currently, HP Vertica Pulse allows you to analyze English language text only. You can download HP Vertica Pulse as an add-on package for your Enterprise Edition or as a trial for your Community Edition, from my.vertica.com. Additionally, the Innovations section of the HP Vertica Marketplace offers a beta version of Pulse for Spanish only. Take a look at this video to learn how to install Pulse and stay tuned for our next video, ‘Using Pulse’.

HP Vertica Pulse documentation.

Big Data Analytics: Blurring the Line Between Brick-&-Mortar and E-Retail Experience

As an online retailer, you face many challenges. You don’t have a physical outlet for customers to browse and see what they like, nor do you have a floor manager to recommend certain products or answer questions your customers might have. However, e-retailers make up for that by letting customers look through a store’s entire inventory before making a purchase, showing suggestions based on their past browsing history, and in many cases keeping an online customer service rep just a few clicks away. But how can e-retailers become just as personal as their brick-and-mortar counterparts, if not more so? The answer for a growing number of e-retailers is Big Data analytics, and, more specifically, the HP Vertica Analytics Platform.

E-retailer Snapfish is a textbook example of using Big Data analytics with HP Vertica to revamp an online business. Every online company these days collects data – some more than others. Yet few actually know what to do with their data or have an analytics system in place to capitalize on it. Prior to Vertica, Snapfish had much of the data they needed right there in front of them, but couldn’t access it or act on it fast enough.

  • Promotions based on customer behaviors were delivered too late
  • Popular card designs weren’t featured properly
  • Customer photos were stored online unsorted instead of automatically placed in books for purchase

In order to create an online environment that is as personal as a physical store, e-retailers have to be able to understand their customers’ goals, see what they do, and pick up on social cues in real time, just like one would in a face-to-face conversation. That’s why Snapfish chose HP Vertica.

Blazing-Fast Speed for Immediate Buyer Insight

Following the implementation of Vertica, Snapfish saw an immediate turnaround. They are now able to finish queries up to 400 times faster, and send out email promotions and offers to capitalize on near real-time trends. Snapfish can now see where customers on the site are going, what items they prefer, and identify overall trends that are taking shape, all of which they are able to do on the very same day instead of months down the line (it doesn’t help to know what kind of Christmas card customers prefer when it’s March, does it?).

One of the more fascinating aspects of implementing HP Vertica, and one that makes customer interactions seem more human, is the perhaps ironically named use of “machine learning techniques.” As I discussed in my last blog post about Vertica and CI, Vertica works very well with R, the open-source language used by many data scientists for predictive analytics and data mining. Machine learning is the brains behind suggesting items to you that you may not have looked at, but are related to ones you have. For example, if you are shopping for a DSLR camera online, it might suggest that you look at an extra battery, or perhaps recommend either an external flash or a shoulder mount, depending on whether you tended to gravitate towards equipment for photo or video use during your browsing. It is Vertica’s tight integration with R that allows these techniques to be put into action swiftly and effectively.

Vertica enabled Snapfish to react quickly to customers’ preferences and patterns, allowing them to market themselves accordingly, boost revenue and increase customer retention. As a highlight: Snapfish DBAs can now perform look-up queries in under 200 milliseconds, enabling them to recommend products to customers much more accurately than before, once again adding to that personal in-store feel that’s so important. Snapfish can now see where their customers on the site are going, what items they prefer, and observe the overall trends that are taking shape – just as if they were running a brick-and-mortar store. Big Data analytics is clearly critical in tailoring customer experiences online to be more personal, and the faster you can execute, the more valuable it will be for both you and your customers.

To learn more, click here to view our recent webinar with e-retailer Etsy, where we discuss how they use HP Vertica to their advantage. Both Etsy and fellow e-retailer One Kings Lane will be represented at our upcoming BDC in Boston to speak about their success stories with HP Vertica. Visit our registration page to learn more if you’re interested.

Introducing HP Vertica “Dragline”

Today, we announced “Dragline,” the code name for the latest release of the HP Vertica Analytics Platform. Focused on the strategic value of all data to every organization, “Dragline” includes a range of industrial-strength features befitting its code name for serious Big Data initiatives.

Our data sheet provides detailed feature descriptions on this release’s full range of capabilities and benefits, so we’ll just focus on three top features that are sure to capture your attention (after all, they came highly requested from our growing customer base).

By the way, learn about these and all of the new “Dragline” features in our upcoming Webinar.


“Project Maverick” – Speed in All Directions

Speed is a given … but what about in every possible direction? “Project Maverick” has multiple elements, including fast, self-service analytics about discrete individuals or devices. Live Aggregate Projections, a key new feature of “Project Maverick,” accelerates these live lookups (more frequent or highly concurrent queries on an individual customer basis) by up to 10x by computing metrics on the data as it arrives, enabling targeted and personalized analytics without programming accelerator layers.

For telecommunications companies as well as utilities and energy providers who, due to deregulation, are pursuing smart metering initiatives to differentiate from competitors, this capability is hot. With Live Aggregate Projections, these providers can deliver smart metering reports that show customers how their consumption rates compare with their neighbors’, promoting a greener planet and establishing a tighter relationship between customer and provider. Interested in learning how HP Vertica easily chews through the enormous volume and scale of smart meter data and the Internet of Things? Check out our newly published white paper, including detailed benchmarks.


Dynamic Mixed Workload Management – Make ’em All Happy

Another major advancement of “Project Maverick” is Dynamic Mixed Workload Management. Commonly found in many data warehouse technologies and requested by some of our largest customers, this powerful new feature identifies and adapts to varying query complexities — simple and ad-hoc queries as well as long-running advanced queries — and dynamically assigns the appropriate amount of resources to meet the needs of all data consumers.

So, now, you can preserve your precious hardware and system resources, while pleasing even your most demanding internal and external constituents – from analysts who seek ad-hoc insights to data scientists who manage complex analytics to business executives who need customized views or dashboards on Monday mornings.


Cost-Optimized Storage – Don’t Break the Bank

Hadoop is commonly used as a data lake to store bulk data. That’s why, with each release, we offer tighter integration that delivers the most open SQL on Hadoop. “Dragline” builds on that strong heritage by helping you to manage multiple storage tiers for cost-effective analytics. You can perform advanced SQL queries on bulk data stored in HDFS using the HP Vertica Analytics Platform without moving the data or using any connectors. You can move data into HP Vertica when your organization requires faster performance for in-depth analytics.

HP Vertica supports all of the major Hadoop distributions, but the power of MapR and HP Vertica on the same cluster is something special. Join our upcoming Webinar with MapR to get the full details behind this solution and to build your business case for SQL on Hadoop.

Try HP Vertica 7 today and stay tuned for more blog posts and materials to support this release.
