HP Distributed R
An Essential Offering of HP Haven Predictive Analytics
Delivering Predictive Big Data Analytics at Scale
- Gain actionable foresight from billions of observations and avoid downsampling
- Use out-of-the-box, standard R parallel algorithms with near-linear scalability
- Turbocharge data access performance by 5x, and ingest and prepare data in seconds
- Enable a broader community of developers and DBAs to put predictive analytics into action with support for all BI and data visualization tools
- Access this new, free offering, fully compatible with the open source R language and tools and backed by enterprise support from HP that is priced per node
The open source R programming language for statistical computing has gained widespread popularity among statisticians and data miners for advanced predictive and prescriptive analysis. HP Vertica allows you to create User-Defined Functions written in R and deploy them in HP Vertica for fast insights.
However, “vanilla” R struggles to handle large data sets, even those that are just hundreds of gigabytes in size or that contain billions of rows.
To overcome this problem, HP provides HP Haven Predictive Analytics, a new offering that accelerates and operationalizes large-scale machine learning and statistical analysis, giving organizations much deeper insight into today’s rapidly growing data volumes.
HP Haven Predictive Analytics is powered by HP Vertica as well as HP Distributed R, an open-source, scalable, and high-performance engine for the R language. Designed for data scientists, HP Distributed R accelerates large-scale machine learning, statistical analysis, and graph processing. The secret is in how HP Distributed R splits tasks across multiple processing nodes, which vastly reduces execution time and enables users to analyze much larger data sets. Best of all, HP Distributed R retains the familiar R look and feel, and data scientists can continue to use their existing statistical packages.
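To make the idea of splitting tasks across processing nodes concrete, here is a minimal sketch of the general split-and-combine pattern. It is written in Python purely for illustration and is not HP Distributed R's API or implementation: the function names (`split_into_partitions`, `partial_stats`, `distributed_mean`) are hypothetical, and threads in a single process stand in for separate processing nodes.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_partitions(values, n_partitions):
    """Split a list into n_partitions contiguous chunks of near-equal size."""
    size, extra = divmod(len(values), n_partitions)
    partitions, start = [], 0
    for i in range(n_partitions):
        # The first `extra` partitions each take one additional element.
        end = start + size + (1 if i < extra else 0)
        partitions.append(values[start:end])
        start = end
    return partitions


def partial_stats(partition):
    """Work done by one stand-in 'node': (count, sum) of its own partition."""
    return len(partition), sum(partition)


def distributed_mean(values, n_workers=4):
    """Compute the mean by combining per-partition partial results."""
    partitions = split_into_partitions(values, n_workers)
    # Assumption: worker threads play the role of processing nodes here.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(partial_stats, partitions))
    total_count = sum(count for count, _ in partials)
    total_sum = sum(subtotal for _, subtotal in partials)
    return total_sum / total_count


if __name__ == "__main__":
    print(distributed_mean(list(range(1, 101))))
```

Each worker only ever touches its own partition, and only the small partial results (a count and a sum) are combined at the end. That is why this style of computation can, in principle, scale to data sets larger than a single machine could process on its own.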

HP Haven Big Data Platform supports all phases of the predictive analytics solution lifecycle
HP Haven Predictive Analytics ensures HP Distributed R integrates seamlessly with HP Vertica to complete the lifecycle of operationalizing predictive analytics. It delivers an enterprise-ready massively parallel processing (MPP) platform for descriptive and predictive analytics, enabling you to:
- Perform analyses in R on Big Data sets that you could not analyze before
- Leverage multiple nodes and multiple cores for vastly improved performance
- Use familiar programming environments such as the R console and RStudio
- Build your own custom parallel algorithms or use out-of-the-box parallel algorithms
Evaluate HP Distributed R today and prepare your organization to better understand future market trends, predict customer behavior, and make informed strategic decisions to drive business performance and improve operational efficiency.



