Vertica Blog

machine learning

Parallel Processing Using Partitions With Vertica UDx

You can add functionality to Vertica using UDx, but what if you need to process more data than can be efficiently processed in a single thread or a single node? Vertica can divide data into partitions defined with the OVER() clause and distribute computing across nodes. This partition processing is "shared-nothing" similar to the Map...
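As a rough illustration of the idea in this teaser, the sketch below groups rows by a key and processes each group independently, so no partition depends on another's state. It is plain Python, not the Vertica UDx API; the column names `sensor` and `value`, the helper functions, and the thread pool standing in for nodes are all assumptions made for the example.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def partition_rows(rows, key):
    """Group rows by a partition key, loosely like OVER(PARTITION BY key)."""
    partitions = defaultdict(list)
    for row in rows:
        partitions[row[key]].append(row)
    return dict(partitions)


def process_partition(rows):
    """Hypothetical per-partition work: sum the 'value' column."""
    return sum(row["value"] for row in rows)


def process_in_parallel(rows, key, workers=4):
    """Process every partition independently; workers share no state."""
    partitions = partition_rows(rows, key)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(process_partition, partitions.values())
        return dict(zip(partitions.keys(), results))


# Hypothetical sample data: two sensors, two readings each.
rows = [
    {"sensor": "a", "value": 1},
    {"sensor": "b", "value": 10},
    {"sensor": "a", "value": 2},
    {"sensor": "b", "value": 20},
]
totals = process_in_parallel(rows, "sensor")
```

Because each partition is handled by a function that sees only its own rows, the same work could in principle be spread across threads or machines without coordination.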

Announcing Vertica Version 9.3 – Ride the Winds of Change

The winds of change have been blowing strong. This week, Vertica version 9.3 made its big splash. Vertica has a tendency to pack a lot of features even into minor releases, and this is not a minor release. Rather than dive deep, I’m going to skim the surface of the ocean of new features. Eon Mode Improvements Eon...

Extending Vertica with Python functions: Adding NumPy FFT as a UDx

User-Defined Transform Function (UDTF) support for Python UDx was added back in Vertica 9.1, allowing you to add a much greater range of existing libraries and functions to Vertica. In this example, I'll add Fast Fourier Transform (FFT) from the NumPy package. FFT is a way to transform time-domain data into frequency-domain data. My test...
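To make the time-domain to frequency-domain idea concrete, here is a minimal sketch of a naive discrete Fourier transform using only the Python standard library. It is not the post's UDx code and not NumPy itself; it computes the same transform that an FFT produces, only more slowly, and the 8-sample cosine test signal is an assumption chosen for illustration.

```python
import cmath
import math


def dft(samples):
    """Naive O(n^2) discrete Fourier transform of a list of samples."""
    n = len(samples)
    return [
        sum(samples[m] * cmath.exp(-2j * cmath.pi * k * m / n) for m in range(n))
        for k in range(n)
    ]


# Hypothetical test signal: one full cosine cycle across 8 samples.
n = 8
signal = [math.cos(2 * math.pi * t / n) for t in range(n)]

spectrum = dft(signal)
magnitudes = [abs(c) for c in spectrum]
```

For this signal the energy lands in frequency bin 1 and its mirror, bin 7, while the other bins are zero up to rounding error.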

Finding the “K” in K-means Clustering With a UDx

You can apply k-means clustering to partition data points into k different groups. Along with the data, the number of clusters "k" is an input to the algorithm. Common examples like the Iris data set tell you upfront how many different groups exist, so you set k=3. What if you don't know how many clusters...

Vertica Big Data Conference Registration is Open!

We’re excited to announce that registration is open for the next Vertica Big Data Conference (BDC), held March 30 – April 2, 2020, at the Encore Boston Harbor Hotel. Don’t miss out on the early bird pricing rate of $695 (a $300 savings), along with buy 3 get 1 free group discounts! What is the Big Data...

7 Critical Capabilities for Embedded Analytics

Software and technology vendors serving a range of industries – from security to healthcare – are increasingly turning to embedded analytics as a way to deliver value-added services and applications to their customers. Enterprises that embrace embedded analytics are changing the dynamics of business in every vertical market. The most disruptive companies in these industries...

The Vertica Academy Is Open for Learning

One of the greatest impacts from digital transformation, both from a personal and professional point of view, is not just the democratization of data - a very popular topic here at Vertica! - but the democratization of knowledge. Today, we are excited to announce the new Vertica Academy, a platform that offers self-paced, technical training and certifications...

In-Database Machine Learning 2: Calculate a Correlation Matrix – A Data Exploration Post

In the previous post, we discussed some of the reasons why it makes sense to do your machine learning inside the database. This will be the first post where we discuss some of the steps involved in the in-database machine learning workflow. Generally, the first thing you need to do is explore your data. This can...
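As a sketch of what a correlation matrix contains, the example below computes pairwise Pearson correlation coefficients in plain Python. It does not use Vertica's in-database functions, and choosing Pearson is an assumption, since the teaser does not name a method; the column names and values are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def correlation_matrix(columns):
    """Return a nested dict holding the correlation of every column pair."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}


# Hypothetical data: y rises with x, z falls as x rises.
data = {"x": [1, 2, 3, 4], "y": [2, 4, 6, 8], "z": [4, 3, 2, 1]}
matrix = correlation_matrix(data)
```

Values near 1 or -1 flag strongly related columns, which is often the first thing to look for when exploring a new data set.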

Strong Winds of Change at Chief Data & Analytics Officers Event

Last week, Vertica sponsored the Chief Data & Analytics Officers event. As a first-time sponsor, we were cautiously optimistic that we could meet with data and analytics leaders who were, according to event organizer IQPC, “…on the forefront of capitalizing on data and analytics in the enterprise as the volume, availability, and complexity of data continues to increase and evolve.”...

Evaluating Classifier Models in Vertica

Co-authored by Elizabeth Michaud. Vertica provides an out-of-the-box machine learning toolset that covers a complete data science workflow. The toolset includes several distributed functions for evaluating both classifier and regressor machine learning models. The goal of this blog post is to demonstrate how you can use the built-in functions for evaluating the prediction performance...

Vertica Ranked #1 Cloud Data Warehouse

July has been quite an exciting month for Vertica. Our three keynote presenters are now confirmed for the Vertica Big Data Conference 2020 – Vertica founder and Turing Award winner Dr. Michael Stonebraker, kingpin of the famous MIT blackjack team Jeffrey Ma, and renowned analyst Ray Wang of Constellation Research. And, in addition to receiving...

Content Analytics and Video Recommendation Systems with Vertica

Analyzing digital content and the behavior of viewing customers in real-time is essential to a lot of companies. Communication operators and content providers are offering many services that enable customers to consume video content using different fixed and mobile technologies through different devices, either in their homes or mobile. For these operators and providers, it...