Vertica Blog

Best Practices

Default Custom User Resource Pool

In Vertica 12.0.4 and later, the DefaultResourcePoolForUsers configuration parameter lets you set a default user resource pool for all users instead of defaulting them to a general pool. To set it, create a user resource pool as a dbadmin user, then use the pool name as the value of the DefaultResourcePoolForUsers option. In...

Load Balancing on Elastic Kubernetes Clusters

Long-running sessions can fail after you deploy Vertica on Amazon Elastic Kubernetes Service (EKS) with LoadBalancer as the service type. When the load balancer type is not specified in the YAML, EKS creates a classic load balancer by default, with a default connection idle timeout of 60 seconds. The aws-load-balancer-connection-idle-timeout can be set up to...

Restrict Modifying Table Rows (UPDATE/DELETE) Irrespective of User Privileges

Do you have a table with critical information such as SIEM events or credit card transactions? Do you want to prevent users from updating or deleting rows in such tables? Starting in version 12.0.0, you can do this by setting the tables as IMMUTABLE. This means you can only insert data and not modify...

High Availability for a Vertica in Eon Mode Database with Read-Only Mode

When Vertica is deployed in Eon Mode, there are two major requirements for high availability: maintaining quorum and shard coverage. If either of them is lost, the whole cluster goes down. Before going into details, let us understand what quorum and shard coverage mean. Quorum: In Eon Mode, more than half of the primary nodes...
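The two requirements can be sketched as a small check. This is only an illustration, not Vertica's implementation: the function names, the node and shard names, and the data layout are hypothetical. The quorum rule follows the teaser (more than half of the primary nodes must be up); the shard coverage rule assumes that every shard needs at least one up primary node subscribed to it.

```python
def has_quorum(primary_up, primary_total):
    """Quorum: strictly more than half of the primary nodes are up."""
    return 2 * primary_up > primary_total


def has_shard_coverage(subscriptions, up_primaries):
    """Assumed rule: every shard has at least one up primary subscriber.

    subscriptions maps a shard name to the set of primary nodes
    subscribed to that shard (hypothetical layout).
    """
    return all(nodes & up_primaries for nodes in subscriptions.values())


def meets_ha_requirements(primary_nodes, up_nodes, subscriptions):
    """True only if both quorum and shard coverage hold."""
    up_primaries = primary_nodes & up_nodes
    return (
        has_quorum(len(up_primaries), len(primary_nodes))
        and has_shard_coverage(subscriptions, up_primaries)
    )


# Hypothetical 3-node primary subcluster with 3 shards.
primaries = {"n1", "n2", "n3"}
subs = {"s1": {"n1", "n2"}, "s2": {"n2", "n3"}, "s3": {"n3", "n1"}}
print(meets_ha_requirements(primaries, {"n1", "n2"}, subs))
print(meets_ha_requirements(primaries, {"n1"}, subs))
```

With two of three primaries up, both conditions hold in this layout; with only one up, quorum is lost even though that node alone does not decide shard coverage.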

How to Quickly Install Vertica on Large Clusters

Vertica 12.0.2 introduces a new feature that allows you to install Vertica in parallel on all hosts rather than one at a time as in earlier releases. This reduces the installation time in large clusters. To use this feature, pass the --parallel-no-prompts flag to the install_vertica command. This flag can be used only when passwordless...

Improving COUNT DISTINCT Performance with Approximate Functions

A common analytic use case is to find the number of distinct items in a data set. Vertica offers a few ways to compute COUNT DISTINCT efficiently. Since Vertica stores all data in columns, it is possible to optimize for COUNT DISTINCT by building a projection that is tuned for this use case. Vertica...
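To give a feel for why approximate distinct counting can be cheaper than an exact count, here is a minimal sketch of one well-known technique, the k-minimum-values (KMV) estimator. It is not a description of how Vertica's approximate functions work internally; the function names and the choice of k are assumptions made for illustration. The idea is to hash every value to a number in [0, 1), keep only the k smallest hashes, and estimate the distinct count from how small the k-th smallest hash is.

```python
import hashlib
import heapq


def _hash01(value):
    """Map a value to a deterministic pseudo-random float in [0, 1)."""
    digest = hashlib.sha1(str(value).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") / 2**64


def approx_count_distinct(values, k=1024):
    """Estimate the number of distinct values with a KMV sketch.

    Memory use is bounded by k, regardless of how many rows are scanned.
    """
    heap = []        # max-heap (via negation) of the k smallest hashes
    members = set()  # hashes currently in the heap, to skip duplicates
    for value in values:
        h = _hash01(value)
        if h in members:
            continue
        if len(heap) < k:
            heapq.heappush(heap, -h)
            members.add(h)
        elif h < -heap[0]:
            # Replace the current largest of the k smallest hashes.
            evicted = -heapq.heappushpop(heap, -h)
            members.discard(evicted)
            members.add(h)
    if len(heap) < k:
        # Fewer than k distinct hashes seen: the count is exact.
        return len(heap)
    return int((k - 1) / -heap[0])


rows = [i % 20000 for i in range(60000)]  # 20,000 distinct values
print(approx_count_distinct(rows))
```

The estimate trades a small relative error (roughly 1/sqrt(k) for this estimator) for a fixed-size summary, which is the general trade-off behind approximate COUNT DISTINCT functions.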

When the typewriter is broken: Vertica’s Event-Based Windows to the rescue

Those of you who read my blog post “Take nothing but memories. Leave nothing but footprints” may recall my passion for using Raspberry Pi computers to capture IoT data from aeroplane transponders and GPS satellites, feed that streaming data into Apache Kafka, integrate it with Vertica, and perform near real-time analytics and...

What’s needed for a happy software dev team in data analytics?

For an organization to excel at data analytics, the IT team needs to coordinate a number of different disciplines and personnel with experience in those disciplines. This usually includes data analysts, data engineers, and, increasingly, data scientists. The data engineering discipline is sometimes thought of as the plumbing that, like pipes in a house, delivers...

VerticaPy Unify 2022 Sessions

Vertica Unify 2022 is a great time to learn about Vertica, its new features, and best practices. To complement the many great presentations at Vertica Unify 2022 both in Boston and Paris, I’m very excited to present two sessions: one on VerticaPy best practices, and another general session on VerticaPy and its features. VerticaPy is...

Customers in the Spotlight at Vertica Unify ’22

It’s hard to believe that it has been almost a decade since we held our inaugural Vertica Big Data Conference, back in 2013. At that time, our goal was to hold an annual user conference with a few simple yet guiding principles – no sales or marketing pitches, hold the event in Boston, and line...

Vertica Wins the Gold in Bloor Market Update

Know Your Options for Analytical Databases and Data Warehouses

For CDOs and CIOs, there have never been more options to choose from when deciding where to run their analytical workloads. Analytical databases and data warehouses are two popular options – with the data warehouse market experiencing 10% annual growth and many new entrants* – particularly...

New O’Reilly Book: Accelerate Machine Learning with a Unified Analytics Architecture

Every company I’ve ever encountered needs business intelligence (BI). Increasingly, in multiple industries, more and more also need machine learning (ML) or artificial intelligence (AI) to stay competitive or gain an edge. Eventually, it should be as easy for an organization to put an ML model to work as it is to put a new...