Vertica Blog

R

Finding the “K” in K-means Clustering With a UDx

You can apply k-means clustering to partition data points into k groups. The number of clusters, k, is an input to the algorithm along with the data. Common examples like the Iris data set tell you up front how many groups exist, so you set k=3. What if you don't know how many clusters...

Using Big Data Analytical Tools for Predicting Potential Global Distribution of an Amphibian Pathogen

During the summer of 2015, I participated in an internship program with Vertica. Most interns assisted with software development, but my primary goal was to use Vertica, Vertica Place, and HP Distributed R to address an ecological problem.