Vertica

Author Archive

Backing Up and Restoring Your HP Vertica Database

Backing up and restoring your database is an important part of basic database maintenance. HP Vertica lets you create both comprehensive and object-level backups. This article highlights the HP Vertica 7.x features that enhance the backup and restore process.

Previously, you could not add nodes to your cluster between object-level backup creation and restoration. With HP Vertica 7.x, you can add nodes to the cluster after creating an object-level backup and then successfully restore the objects.

Another feature simplifies the mapping section of your vbr.py configuration file. When creating a backup, you create a configuration file that includes mapping sections that specify all database nodes included in the backup, along with the backup host and directory for each node. Previously, this information was stored in multiple [Mapping] sections, each with their own parameters (dbNode, backupHost, and backupDir). Now, when you use the vbr.py –setupconfig command to create a new configuration file, the file includes a single Mapping section. The section contains entries for each cluster node, with parameters representing each database node (dbNode), its associated backup host (backupHost), and backup directory (backupDir). Having a single [Mapping] section makes your configuration file more compact and easy to read.

The example below shows how the configuration files have been simplified in HP Vertica 7.x

Previous Versions HP Vertica 7.x
[Mapping0]
dbNode = node0
backupHost = clust-1
backupDir = /tmp/backup/


[Mapping1]
dbNode = node1
backupHost = clust-1
backupDir = /tmp/backup/
[Mapping]
node0 = clust-1:/tmp/backup
node1 = clust-1:/tmp/backup

Backing up and restoring is an important part of maintaining a database, and the new HP Vertica 7.x features improve that process. Both the new mapping feature and the ability to add nodes between backup and restore actions contribute to overall enhanced functionality. For more information on backup and restore processes, see Backing Up and Restoring the Database in the Administrator’s Guide.

What’s New in Dragline (7.1.0): Top-K Projections

Top-K Projections Video

Top K Projections from Vertica Systems on Vimeo.

HP Vertica 7.1 introduces Top-K projections. A Top-K projection is a type of live aggregate projection that returns the top k rows from a partition.

Top-K projections are useful when you want to retrieve the top rows from a group of frequently updated, aggregated data. For example, say you want to view a list of the 5 singers who have the most votes in a competition. This value will change every time a vote comes in. But since the data is aggregated as it is loaded into your table and Top-k projection, all you have to do is grab the top 5 rows to see the top 5 singers at any moment. Using a Top-k projection is more efficient than if you had to aggregate (count all the votes) every time you wanted to find the top 5 singers.

Check out this video to learn more about Top-k projections and stayed tuned for our next video ‘Projections with Expressions’.

Top-K Documentation

Meet our Summer 2014 Interns!

Did your 2014 summer internship include rubber bulldozers, ice cream, and bumper boats? Were you able to develop and work on real projects for real customers, while eating free bagels and bananas? If not, then consider applying to HP Vertica for the summer of 2015.

Our 2014 interns had a great time scaling mountains (all right, it was a 635-foot hill) and building bridges (made out of toothpicks and gumdrops), while developing software that makes the HP Vertica database faster, more accurate, and more secure.

Our interns work closely with their mentors to solve hard problems and improve our product. Along the way, we encourage them to collaborate with cross-functional teams, attend technical talks that aren’t necessarily related to their projects, and create funny short videos about our database features. One of those videos appeared in EVP and GM Colin Mahony’s welcome presentation at the HP Vertica Big Data Conference!

Our interns tackled (and solved!) some interesting problems during the summer of 2014, including:

  • Improving trickle loading using Kestrel and Apache Storm.
  • Improving the encoding algorithm for Vmap data in a flex table.
  • Creating an R package for pattern mining.
  • Integrating HP Vertica with Apache Hadoop YARN.
  • Enhancing the documentation about database locks.
  • Implementing key-based client authentication for the HP Vertica Amazon Machine Image (AMI).
  • Adding features to and improve the performance of our test-tracking application.
  • Improving the scalability and performance of HP Vertica Database Designer.
  • Improving query optimizer plans for columns that are guaranteed to be unique.
  • Developing a tool that processes diagnostic information.

Everyone at HP Vertica works hard, but we like to have fun, too. We make sure to include the interns in our company outings and weekly gaming nights, but we also plan extra activities for them: hiking, mini-golf, volleyball, and tubing were some of this summer’s highlights. And our in-work and out-of-work activities this summer usually included copious consumption of ice cream.

Over the years, many of our best employees are former interns. So if you want to improve your technical skills, gain an understanding of our column-store database, make new friends, and have a lot of fun in the process, now is the best time to apply for an internship at HP Vertica.

What’s New in Dragline (7.1.0): Live Aggregate Projections

Live Aggregate Projections Video

Live Aggregate Projections from Vertica Systems on Vimeo.

 

HP Vertica 7.1 introduces live aggregate projections. A live aggregate projection is a projection that contains one or more columns of data that have been aggregated from a table.

If you frequently query data that requires aggregation, you could benefit from using a live aggregate projection. Because data in your live aggregate projection is aggregated at load time, rather than at the time you run a query, you can save time and resources. On subsequent data loads, HP Vertica updates the table and loads the aggregated values into the live aggregate projection. If you query the live aggregate projection any time after that, you’ll not only see the same results you would if you queried the data from the table and then aggregated it, but you’ll also use less resources in the process.

Check out this video to learn more about Live Aggregate Projections. Stay tuned for our next video, ‘Top-K Projections’.

See Also:

 Live Aggregate Projections with HP Vertica

Live Aggregate Projection documentation

Smart Grid Solution Demo

What’s New in Dragline (7.1.0): Using HP Vertica Pulse

Using HP Vertica Pulse Video

Using Pulse from Vertica Systems on Vimeo.

In our previous video, we showed you how to install HP Vertica Pulse, our add-on sentiment analysis package that allows you to analyze and extract the sentiment from text.

Take a look at this video to learn how to use and tune HP Vertica Pulse to work for your specific business needs.

You can download HP Vertica Pulse as an add-on package for your Enterprise Edition of HP Vertica at my.vertica.com.

HP Vertica Pulse documentation.

What’s New in Dragline (7.1.0): Installing HP Vertica Pulse

Installing HP Vertica Pulse Video

Installing Pulse from Vertica Systems on Vimeo.

HP Vertica 7.1.0 introduces the general availability of HP Vertica Pulse, our add-on sentiment analysis package for HP Vertica. Pulse provides a suite of functions that allow you to analyze and extract the sentiment from text, directly from your HP Vertica database. For example, you can use HP Vertica Pulse to analyze sentiment from Tweets or online product reviews to get a feel for how satisfied your customers are about your products or services.

HP Vertica Pulse automatically discovers attributes included in text and scores them using a built-in system dictionary. You can tune user-dictionaries to detect certain words or phrases, to determine how words are scored, and to filter out attributes that are of no interest to you. Because of this flexibility, you can tune HP Vertica Pulse to work for your specific business needs.

Currently, HP Vertica Pulse allows you to analyze English language text only. You can download HP Vertica Pulse as an add-on package for your Enterprise Edition or as a trial for your Community Edition, from my.vertica.com. Additionally, the Innovations section of the HP Vertica Marketplace offers a beta version of Pulse for Spanish only. Take a look at this video to learn how to install Pulse and stay tuned for our next video, ‘Using Pulse’.

HP Vertica Pulse documentation.

Get Started With Vertica Today

Subscribe to Vertica