Vertica Blog

Vertica Blog

Under the Hood

Checking and Improving Column Compression and Encoding

When working with terabytes of data, storage and transfer become major time and cost sinks. Vertica can help minimize storage cost and transfer time with column compression and encoding. How can we identify Vertica tables that might benefit from compression? Information about column size and current compression is stored across column_storage and projection_columns table. The...
Cubes with binary ones and zeroes on them clustered together floating in blue space with light shining out

1 – What are Complex Data Types?

Co-authored by James Clampffer, Deepak Majeti. - First in a series A lot of projects require querying ORC or Parquet files, or other data that may have internal types that are a bit more complex than usual. It’s a little hard to explain what we mean when we say, “complex.” So, this is the first in...
View from above of a sailing ship in the ocean with the sail full and the crew leaning to one side to balance

Announcing Vertica Version 9.3 – Ride the Winds of Change

The winds of change have been blowing strong. This week, Vertica version 9.3 made its big splash. Vertica has a tendency to pack a lot of features even into minor releases, and this is not a minor release. Rather than dive deep, I’m going to skim the surface of the ocean of new features. Eon...
Vertica and Python logos stacked

Extending Vertica with Python functions: Adding NumPy FFT as a UDx

User-Defined Transform Function (UDTF) support for Python UDx were added back in Vertica 9.1, allowing you to add a much greater range of existing libraries and functions to Vertica. In this example, I'll add Fast Fourier Transform (FFT) from the NumPy package. FFT is a way to transform time-domain data into frequency-domain data. My test...
Bullet train rendering with Vertica on side, and zooming effect with "Depot" sign behind

Vertica in Eon Mode Depot – Where Did You Go?

Separating compute from storage is such a hot topic these days. “Instance types” and “S3 Object Storage” are core to the language of the public clouds (unless you’re Azure, then substitute S3 with Blob but leave the Object Storage part). This next-generation architecture enables so many operational and economic advantages – workload isolation, elastic scaling,...
Focused young woman working at laptop in office

The Vertica Academy Is Open for Learning

One of the greatest impacts from digital transformation, both from a personal and professional point of view, is not just the democratization of data - a very popular topic here at Vertica! - but the democratization of knowledge. Today, we are excited to announce the new Vertica Academy, a platform that offers self-paced, technical training...
How to Calculate a Correlation Matrix

2 – How to Calculate a Correlation Matrix – Data Exploration for Machine Learning

In the first machine learning in a database post, we discussed some of the reasons why it makes sense to do your machine learning data analytics inside the database. This will be the first post where we discuss some of the steps involved in the in-database machine learning workflow. Generally, the first thing you need...
Analyze, Design, Develop, Implement, Evaluate cycle illustration

Evaluating Classifier Models in Vertica

Co-authored by Elizabeth Michaud Vertica provides an out-of-the box machine learning toolset that covers a complete data science workflow. The toolset includes several distributed functions for evaluating both classifier and regressor machine learning models. The goal of this blog post is to demonstrate how you can use the built-in functions for evaluating the prediction performance...
Cyber security theme with abstract high speed technology POV motion blur

Vertica Test Results for MDS Security Patches

Several customers have asked us how Intel's recent MDS vulnerability patches will impact the performance of their Vertica databases. Vertica is known for its performance. So, anything that may impact its speed is a major concern for both us and our customers. Vertica engineers have finished a round of testing the OS and microcode fixes...
Digital image of a trash can made of data points on blue background

Watch those Delete Vectors!

Vertica is very good at ingesting data, compressing it, and querying at high speed. The trade-off here is that the data is stored in large block files called ROS containers. These containers can grow to large sizes, sometimes over 10 GB, and this makes it impractical to decompress and edit the files during updates and...
Go programming language logo and little blue beaver mascot

The Vertica SQL Driver for Go

We are happy to announce the alpha release of the Vertica SQL Driver for Go. Go, also called GoLang, is a statically-typed, high performance language, syntactically similar to C. Originating at Google, it has grown rapidly in popularity over the past decade and has become a language of choice for highspeed components. The Vertica SQL...
Connecting grip

JetBrains DataGrip Offers New Vertica Connector

We are pleased to announce that JetBrains has released a Vertica connector for DataGrip! The connector, introduced in DataGrip 2019.1, allows SQL developers to use DataGrip to interact with Vertica. DataGrip uses Vertica’s JDBC driver to connect to Vertica. For details about connecting DataGrip to Vertica, see Vertica Integration with DataGrip: Connection Guide. DataGrip is...