Vertica Blog

Vertica Blog

machine learning

Shorten the Path to Production with In-Database Machine Learning

Moving data, transforming data types, taking small samples so they’ll fit in your sandbox – these are all things every data scientist puts up with as routine. And when you’re finished, a data engineer has to build full production pipelines to reproduce all that work at scale. It can be months or a year or...

OpenText Welcomes Micro Focus Customers, Partners, and Employees

OpenText CEO and CTO Mark Barrenechea blogs on how the Micro Focus acquisition expands OpenText's mission to help enterprise professionals secure their operations, gain more insight into their information, and better manage an increasingly hybrid and complex digital fabric. Read his article here.

Israel AI & Data Summit 2023

Vertica is sponsoring Israel AI & Data Summit 2023! This summit will provide the opportunity to hear about the most advanced technologies and solutions in the fields of DATA, AI & ML, alongside networking of the Israeli industry best human capital – Researchers, Developers, Data Scientists, and Business Decision Makers. Don’t miss Badr Ouali, VerticaPy...

GigaOm Radar for Data Warehouses Recognizes Vertica as Leader

With the excitement in our industry regarding the emergence of data lakehouses, there’s a good chance that you are planning to modernize or even “replatform” your incumbent data warehouse to meet an ever-evolving variety of analytical use cases. Many vendors, like Vertica, have transformed the original concept of data warehouses with groundbreaking innovations like massively...

VerticaPy reaches a milestone at 100 stars

The Vertica team is happy to share a milestone in our “VerticaPy journey”: We just reached 100 stars in our GitHub repo, and it’s growing every day. (Repo: That’s “repository” for those of you unfamiliar with GitHub.) Repos accumulate stars as an indication of user interest – think of them as bookmarks in a user’s...
Vertica plus Domino Data Lab logos

Unleash the Power of Data Science with Vertica and Domino Data Lab

Introducing an end-to-end machine learning solution with Vertica and Domino Data Lab that enables you to explore, analyze, and model your data in Vertica using VerticaPy. Domino Data Lab is a data science platform to build and deploy machine learning models, monitor performance, and collaborate with one another. VerticaPy is a Python library to perform...
SQL Query Optimization

Improving COUNT DISTINCT Performance with Approximate Functions

A common analytic use case is to find the number of distinct items in a data set. Vertica performs well at solving COUNT DISTINCT in a few ways. Since Vertica stores all data in columns, it is possible to optimize for COUNT DISTINCT by building a projection that is tuned for this use case. Vertica...

Break the bias – and predict brake bias

The theme for this year’s International Women’s Day (IWD) was given the name #BreakTheBias to get us to imagine a gender equal world. A world free of bias, stereotypes, and discrimination, a world that is diverse, equitable, and inclusive, where difference is valued and celebrated. Together, we can protect women’s equality. At Micro Focus, as...
spark plus vertica hands touching

Unleash the Power of Vertica and Apache Spark Using the Upgraded Spark Connector

This post is authored by Alex Le What is Apache Spark? Apache Spark is a distributed compute engine that provides a robust API for data science, machine learning, or to work with big data. It is fast, scalable, simple, and supports multiple languages, including Python, SQL, Scala, Java, and R. Backed by the Apache 2.0...

What’s needed for a happy software dev team in data analytics?

For an organization to excel at data analytics, the IT team needs to coordinate a number of different disciplines and personnel with experience in those disciplines. This usually includes data analysts, data engineers, and, increasingly, data scientists. The data engineering discipline is sometimes thought of as the plumbing that, like pipes in a house, delivers...

VerticaPy Unify 2022 Sessions

Vertica Unify 2022 is a great time to learn about Vertica, its new features, and best practices. To complement the many great presentations at Vertica Unify 2022 both in Boston and Paris, I’m very excited to present two sessions: one on VerticaPy best practices, and another general session on VerticaPy and its features. VerticaPy is...

No need to extract data from your database to do your analytics!

While taking a long-awaited, and, IMHO some well-deserved R&R in the sun, I heard that ping notification coming from my backpack, and not being one who can ignore such things, reached for my iPad. With my OOTO email response already set, I should be able to ignore most incoming messages, texts and otherwise, safe in...