What's New in Vertica 9.0.1: Machine Learning

This blog post was authored by Soniya Shah.

Vertica 9.0.1 continues the fast-paced development and enhancement of machine learning in Vertica. This release introduces support for random forest for regression, a new statistical summary function, expanded support for cross validation, and enhancements for data evaluation.

Summary of Enhancements

New Feature	Description
Random Forest for Regression	The new RF_REGRESSOR function allows you to predict numerical values on large data sets using both numerical and categorical predictors.
SUMMARIZE_CATCOL function	This statistical summary function returns key statistical information about categorical columns.
UPGRADE_MODEL function	This functionality automatically upgrades the model format to the latest version during a Vertica upgrade and when you import models. The database administrator can upgrade all eligible models.
Cross validation enhancements	The CROSS_VALIDATE function now also supports Naïve Bayes.
L1 for Logistic Regression	Run logistic regression with L1 regularization, which leads to sparser solutions.

Random Forest for Regression

A random forest model is an ensemble of decision trees. The algorithm constructs the trees during training and then uses them for prediction. For regression, the output is the mean of the individual trees' predictions.

Random forest is a robust regression algorithm that works well on many different types of data sets. A set of function parameters provides good control over how the ensemble model is built, including the number of trees, tree depth, sampling size, and more.
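To make the idea of averaging trees concrete, here is a minimal Python sketch of a random forest for regression. It is an illustration only, not Vertica's implementation and not the RF_REGRESSOR interface: each "tree" is a one-split stump on a single numeric predictor, and the names fit_stump, fit_forest, n_trees, and sample_fraction are hypothetical. It assumes the training data is a list of (x, y) pairs.

```python
import random
from statistics import mean


def fit_stump(rows):
    """Fit a depth-1 regression tree on (x, y) pairs.

    Tries a threshold between each pair of neighboring distinct x values
    and keeps the one with the lowest sum of squared errors.
    Returns (threshold, left_mean, right_mean).
    """
    best = None
    xs = sorted({x for x, _ in rows})
    for lo, hi in zip(xs, xs[1:]):
        t = (lo + hi) / 2
        left = [y for x, y in rows if x <= t]
        right = [y for x, y in rows if x > t]
        lm, rm = mean(left), mean(right)
        sse = sum((y - lm) ** 2 for y in left) + sum((y - rm) ** 2 for y in right)
        if best is None or sse < best[0]:
            best = (sse, t, lm, rm)
    if best is None:
        # Only one distinct x value in the sample: always predict the mean.
        m = mean(y for _, y in rows)
        return (float("inf"), m, m)
    return best[1:]


def predict_stump(stump, x):
    threshold, left_mean, right_mean = stump
    return left_mean if x <= threshold else right_mean


def fit_forest(rows, n_trees=20, sample_fraction=0.8, seed=0):
    """Train n_trees stumps, each on a bootstrap sample of the rows.

    n_trees and sample_fraction are hypothetical stand-ins for the kind of
    controls (number of trees, sampling size) a real implementation exposes.
    """
    rng = random.Random(seed)
    k = max(1, int(len(rows) * sample_fraction))
    return [fit_stump([rng.choice(rows) for _ in range(k)]) for _ in range(n_trees)]


def predict_forest(forest, x):
    """The forest's output is the mean of the individual tree predictions."""
    return mean(predict_stump(stump, x) for stump in forest)


# Toy data: y jumps from 10 to 20 when x reaches 5.
data = [(x, 10 if x < 5 else 20) for x in range(10)]
forest = fit_forest(data, n_trees=20, sample_fraction=0.8, seed=0)
print(predict_forest(forest, 0), predict_forest(forest, 9))
```

Because each stump sees a different bootstrap sample, individual trees can disagree. Averaging their predictions smooths out that variation, which is why the number of trees and the sampling size are useful controls.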

For More Information

For more information, see the following in the Vertica documentation:

• Machine Learning Functions in the SQL Reference Manual

• Machine Learning for Predictive Analytics in the Analyzing Data guide

We are constantly expanding machine learning features in Vertica. You can expect to see expanded functionality in future releases.

About the Author

Soniya Shah
Information Developer

Currently a first-year law student with a background in science and technology. Experienced technical writer, with specializations in software documentation, big data, blog development, and website development. I build user-centered content to communicate complex and technical information more easily.

I used to work for Vertica full time for about 3 years. I still work at Vertica part time while going to law school.

Update: Soniya is now doing her law internship, and no longer working at Vertica. Good luck, Soniya!
