<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Vertica &#187; Blog</title>
	<atom:link href="http://www.vertica.com/blog/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.vertica.com</link>
	<description>Simply Fast</description>
	<lastBuildDate>Wed, 22 May 2013 21:11:45 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>Boom Times for Boston&#8217;s Biggest Data</title>
		<link>http://www.vertica.com/2013/05/22/boom-times-for-bostons-biggest-data/</link>
		<comments>http://www.vertica.com/2013/05/22/boom-times-for-bostons-biggest-data/#comments</comments>
		<pubDate>Wed, 22 May 2013 21:01:43 +0000</pubDate>
		<dc:creator>cselland@vertica.com</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Community]]></category>
		<category><![CDATA[Customers]]></category>
		<category><![CDATA[HP]]></category>
		<category><![CDATA[HP Discover]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Discover]]></category>
		<category><![CDATA[Open House]]></category>
		<category><![CDATA[User Conference]]></category>
		<category><![CDATA[Vertica]]></category>

		<guid isPermaLink="false">https://www.vertica.com/?p=13809</guid>
		<description><![CDATA[<p>It&#8217;s boom times at HP Vertica &#8211; as the Boston area&#8217;s first and <strong>biggest</strong> Big Data technology provider, we continue to grow and expand our employee base and ecosystem.</p> <p>We had a full house last week for our spring Open House, which gave us a chance to have friends of the company including current (of course) and future employees, strategic partners and even members of our alumni network.</p> <p><a href="http://www.vertica.com/2013/05/22/boom-times-for-bostons-biggest-data/soc_5586/" rel="attachment wp-att-13810"><img class="alignnone wp-image-13810" title="Vertica Open House" src="http://www.vertica.com/wp-content/uploads/2013/05/SOC_5586-1024x683.jpg" alt="" width="717" height="478" /></a></p> <p>As with any growth business in a highly dynamic market, occasionally people decide to move on but it was gratifying to have a number of former colleagues join us &#8211; some of whom have become customers, and a <a href="http://www.vertica.com/2013/05/22/boom-times-for-bostons-biggest-data/">Read More &#187;</a><img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F05%2F22%2Fboom-times-for-bostons-biggest-data%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s boom times at HP Vertica &#8211; as the Boston area&#8217;s first and <strong>biggest</strong> Big Data technology provider, we continue to grow and expand our employee base and ecosystem.</p>
<p>We had a full house last week for our spring Open House, which gave us a chance to have friends of the company including current (of course) and future employees, strategic partners and even members of our alumni network.</p>
<p><a href="http://www.vertica.com/2013/05/22/boom-times-for-bostons-biggest-data/soc_5586/" rel="attachment wp-att-13810"><img class="alignnone  wp-image-13810" title="Vertica Open House" src="http://www.vertica.com/wp-content/uploads/2013/05/SOC_5586-1024x683.jpg" alt="" width="717" height="478" /></a></p>
<p>As with any growth business in a highly dynamic market, occasionally people decide to move on but it was gratifying to have a number of former colleagues join us &#8211; some of whom have become customers, and a few of which have even re-joined the company recently!</p>
<p>Over the next few weeks, HP Vertica will be taking things on the road to <a href="http://h30614.www3.hp.com/discover/home">HP Discover Las Vegas</a> and will be involved in some major strategic announcements regarding HP&#8217;s Big Data strategy, so watch this space for more.</p>
<p>We&#8217;re also going to be on the road for our Discover Performance series and a number of industry events &#8211; <a href="http://www.vertica.com/news/events/">check our website for details</a>.</p>
<p>And last but not at all least, we&#8217;ve got our <a href="http://www.vertica.com/userconference">HP Vertica Big Data Conference</a> in early August &#8211; <a href="http://www.vertica.com/userconference">click here</a> or on the image below to learn more and to register. We&#8217;re expecting another full house including our base of current and future customers, strategic partners and of course many of our colleagues from worldwide HP &#8211; sign up now, since space is limited and Early Bird Pricing expires June 28!</p>
<p><a href="http://www.vertica.com/userconference"><img src="http://hp-vertica.com/wp-content/themes/canvas/functions/thumb.php?src=wp-content/uploads/2012/04/vertica-banner-11.png&amp;w=980&amp;h=275&amp;zc=1&amp;q=90" alt="Register here" width="784" height="220" /></a></p>
<img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F05%2F22%2Fboom-times-for-bostons-biggest-data%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></content:encoded>
			<wfw:commentRss>http://www.vertica.com/2013/05/22/boom-times-for-bostons-biggest-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Case Studies &#8211; Guess? and Kansys</title>
		<link>http://www.vertica.com/2013/05/17/new-case-studies-guess-and-kansys/</link>
		<comments>http://www.vertica.com/2013/05/17/new-case-studies-guess-and-kansys/#comments</comments>
		<pubDate>Fri, 17 May 2013 18:24:25 +0000</pubDate>
		<dc:creator>csmith</dc:creator>
				<category><![CDATA[Customers]]></category>

		<guid isPermaLink="false">https://www.vertica.com/?p=13745</guid>
		<description><![CDATA[<p>There are two new case studies posted on the site &#8211; Guess? and Kansys &#8211; and they offer two very different stories about how these world-class companies are using Big Data as a competitive differentiator in the retail and service provider industries.  But there is one common thread between the two stories &#8211; the <a title="HP Vertica Analytics Platform" href="http://www.vertica.com/the-analytics-platform/" target="_blank">HP Vertica Analytics Platform</a>.</p> <p>You can check out the stories by going to the <a title="HP Vertica Customer Case Studies" href="http://www.vertica.com/customers/case-studies/" target="_blank">Customer Case Studies</a> page, but I wanted to give you a short description of the company and a quote for each story to entice you to check them out:</p> <p><a href="http://www.vertica.com/wp-content/uploads/2013/04/Guess_Vertica_casestudy.pdf" target="_blank"><img class="alignright wp-image-13746" style="border: 1px solid black; margin-left: <a href="http://www.vertica.com/2013/05/17/new-case-studies-guess-and-kansys/">Read More &#187;</a><img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F05%2F17%2Fnew-case-studies-guess-and-kansys%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></description>
			<content:encoded><![CDATA[<p>There are two new case studies posted on the site &#8211; Guess? and Kansys &#8211; and they offer two very different stories about how these world-class companies are using Big Data as a competitive differentiator in the retail and service provider industries.  But there is one common thread between the two stories &#8211; the <a title="HP Vertica Analytics Platform" href="http://www.vertica.com/the-analytics-platform/" target="_blank">HP Vertica Analytics Platform</a>.</p>
<p>You can check out the stories by going to the <a title="HP Vertica Customer Case Studies" href="http://www.vertica.com/customers/case-studies/" target="_blank">Customer Case Studies</a> page, but I wanted to give you a short description of the company and a quote for each story to entice you to check them out:</p>
<p><a href="http://www.vertica.com/wp-content/uploads/2013/04/Guess_Vertica_casestudy.pdf" target="_blank"><img class="alignright  wp-image-13746" style="border: 1px solid black; margin-left: 10px;" title="Guess Case Study" src="http://www.vertica.com/wp-content/uploads/2013/05/GuessCaseStudy.png" alt="" width="140" height="181" /></a><span style="font-size: small;">Guess?, Inc., a leading global retailer, is achieving business and technology benefits with its HP Vertica Analytics Platform that includes a flexible, compelling mobile business analytics iPad application called G-Mobile. The easy-to-use mobile app empowers a range of non-traditional BI users including designers, buyers, planners, and allocators—to analyze sales, inventory and logistical information. The company has improved its insight into customer purchasing behaviors, and can ensure employee resources are allocated appropriately to optimize productivity. With HP Vertica, Guess? has the infrastructure in place to tackle the challenges retailers face—including Big Data and next-generation analytics. The company plans to roll out the analytics platform to Europe and Asia in the near future.</span></p>
<blockquote><p><span style="font-size: small;">“Vertica gives us the flexibility to tackle Big Data. With HP Vertica, our organization is ready for the challenges retailers are facing—from Big Data to next-generation analytics.”</span><br />
<span style="font-size: small;"> <em><strong>– Bruce Yen, director, Business Intelligence, Guess?, Inc.</strong></em></span></p></blockquote>
<p><a href="http://www.vertica.com/wp-content/uploads/2013/05/Kansys_CaseStudy.pdf" target="_blank"><img class="alignright  wp-image-13747" style="border: 1px solid black; margin-left: 10px;" title="Kansys Case Study" src="http://www.vertica.com/wp-content/uploads/2013/05/KansysCaseStudy.png" alt="" width="140" height="181" /></a>Founded in 1997, Kansys brings deep experience and expertise to the business support systems and operations support systems (BSS/OSS) for Communications Service Providers (CSPs) &amp; Multiple System Operators (MSOs). The company is known for an unparalleled ability to align, manage, optimize, and report on huge volumes of data from disparate sources for its clients.With expertise gathered over years of experience, Kansys is unique in its depth of knowledge across the areas that matter to CSPs — Mediation, Process Optimization, Revenue Assurance, Fraud Detection &amp; Analysis, Billing, Audits, Customer Analytics, and Reporting. HP Vertica powers the CSP &amp; MSO Big Data services provided by Kansys to their customers.</p>
<blockquote><p><span style="font-size: small;">“As a consumer of data, time to information is important, and we don’t want to be limited in the amount of data we process. HP Vertica solves that problem.”</span><br />
<span style="font-size: small;"> <em><strong>– Tom Wisnasky, CIO, Kansys</strong></em></span></p></blockquote>
<p>To check out these two case studies, and to learn more about Guess?, Kansys, and some of our other customers who are taking full advantage of the HP Vertica Analytics Platform, please visit the <a href="http://www.vertica.com/customers/">Customer</a> page on our web site.</p>
<img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F05%2F17%2Fnew-case-studies-guess-and-kansys%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></content:encoded>
			<wfw:commentRss>http://www.vertica.com/2013/05/17/new-case-studies-guess-and-kansys/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Join us on Monday for an Open House</title>
		<link>http://www.vertica.com/2013/05/09/join-us-on-monday-for-an-open-house/</link>
		<comments>http://www.vertica.com/2013/05/09/join-us-on-monday-for-an-open-house/#comments</comments>
		<pubDate>Thu, 09 May 2013 21:38:51 +0000</pubDate>
		<dc:creator>cselland@vertica.com</dc:creator>
				<category><![CDATA[vertica]]></category>
		<category><![CDATA[Vertica Customers]]></category>
		<category><![CDATA[Cambridge]]></category>
		<category><![CDATA[Open House]]></category>

		<guid isPermaLink="false">https://www.vertica.com/?p=13642</guid>
		<description><![CDATA[<p>We&#8217;ve been sprucing up our gorgeous office space with help from our friends at <a href="http://www.engagemarketingdesign.com/">Engage Marketing Design</a> and are looking forward to showing it off on Monday.</p> <p>Please join us by <a href="http://bit.ly/YxkCN4">clicking this link and RSVP&#8217;ing</a>. All friends of HP Vertica are welcome though space is limited and filling fast!</p> <p>It&#8217;s coming together nicely &#8211; see below for a few previews &#8211; we&#8217;ll be taking more photos on Monday and want <strong>you</strong> in them. Join us and learn more about Boston&#8217;s Biggest Data!</p><img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F05%2F09%2Fjoin-us-on-monday-for-an-open-house%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve been sprucing up our gorgeous office space with help from our friends at <a href="http://www.engagemarketingdesign.com/">Engage Marketing Design</a> and are looking forward to showing it off on Monday.</p>
<p>Please join us by <a href="http://bit.ly/YxkCN4">clicking this link and RSVP&#8217;ing</a>. All friends of HP Vertica are welcome though space is limited and filling fast!</p>
<p>It&#8217;s coming together nicely &#8211; see below for a few previews &#8211; we&#8217;ll be taking more photos on Monday and want <strong>you</strong> in them. Join us and learn more about Boston&#8217;s Biggest Data!</p>

<a href='http://www.vertica.com/2013/05/09/join-us-on-monday-for-an-open-house/20130509_091654/' title='20130509_091654'><img width="150" height="150" src="http://www.vertica.com/wp-content/uploads/2013/05/20130509_091654-150x150.jpg" class="attachment-thumbnail" alt="20130509_091654" title="20130509_091654" /></a>
<a href='http://www.vertica.com/2013/05/09/join-us-on-monday-for-an-open-house/20130509_091534/' title='20130509_091534'><img width="150" height="150" src="http://www.vertica.com/wp-content/uploads/2013/05/20130509_091534-150x150.jpg" class="attachment-thumbnail" alt="20130509_091534" title="20130509_091534" /></a>

<img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F05%2F09%2Fjoin-us-on-monday-for-an-open-house%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></content:encoded>
			<wfw:commentRss>http://www.vertica.com/2013/05/09/join-us-on-monday-for-an-open-house/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Taking a Moonshot at Big Data Analytics for Everyone</title>
		<link>http://www.vertica.com/2013/04/10/taking-a-moonshot-at-big-data-analytics-for-everyone/</link>
		<comments>http://www.vertica.com/2013/04/10/taking-a-moonshot-at-big-data-analytics-for-everyone/#comments</comments>
		<pubDate>Wed, 10 Apr 2013 17:29:43 +0000</pubDate>
		<dc:creator>cselland@vertica.com</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[HP]]></category>
		<category><![CDATA[vertica]]></category>
		<category><![CDATA[Moonshot]]></category>
		<category><![CDATA[Vertica]]></category>

		<guid isPermaLink="false">https://www.vertica.com/?p=13542</guid>
		<description><![CDATA[<p>HP Vertica is very excited about <a href="http://www.hp.com/go/moonshot">Monday&#8217;s announcement</a> of the HP Moonshot system.</p> <p>Why? Because we believe that the combination of the HP Vertica Analytics Platform running on the HP Moonshot Servers offers a truly game-changing value proposition for a variety of customers, and new segments of the market.</p> <p>Moonshot is, simply put, a groundbreaking system which offers customers the ability to rapidly deploy, scale and manage with dramatically lower space and energy constraints. While traditional IT services that support business functions will continue to be served by general purpose server infrastructure, a new computing platform is required for specialized workloads that can deliver innovative solutions to market at unprecedented speed and scale.</p> <p><a href="http://www.vertica.com/2013/04/10/taking-a-moonshot-at-big-data-analytics-for-everyone/moonshotcartridgesout/" rel="attachment wp-att-13553"><img class="alignnone wp-image-13553" <a href="http://www.vertica.com/2013/04/10/taking-a-moonshot-at-big-data-analytics-for-everyone/">Read More &#187;</a><img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F04%2F10%2Ftaking-a-moonshot-at-big-data-analytics-for-everyone%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></description>
			<content:encoded><![CDATA[<p>HP Vertica is very excited about <a href="http://www.hp.com/go/moonshot">Monday&#8217;s announcement</a> of the HP Moonshot system.</p>
<p>Why? Because we believe that the combination of the HP Vertica Analytics Platform running on the HP Moonshot Servers offers a truly game-changing value proposition for a variety of customers, and new segments of the market.</p>
<p>Moonshot is, simply put, a groundbreaking system which offers customers the ability to rapidly deploy, scale and manage with dramatically lower space and energy constraints. While traditional IT services that support business functions will continue to be served by general purpose server infrastructure, a new computing platform is required for specialized workloads that can deliver innovative solutions to market at unprecedented speed and scale.</p>
<p><a href="http://www.vertica.com/2013/04/10/taking-a-moonshot-at-big-data-analytics-for-everyone/moonshotcartridgesout/" rel="attachment wp-att-13553"><img class="alignnone  wp-image-13553" title="MoonshotCartridgesOut" src="http://www.vertica.com/wp-content/uploads/2013/04/MoonshotCartridgesOut-1024x768.png" alt="" width="717" height="538" /></a></p>
<p>&nbsp;</p>
<p>We’ve already successfully tested the HP Vertica Analytics Platform on HP Moonshot Servers, and achieved very comparable performance to traditional Big Data Analytics hardware across certain performance ranges, which for a large segment of the market is more than sufficient to handle their Big Data Analytics loads – while offering very significant potential cost, space and energy savings.</p>
<p>Running Vertica on Moonshot offers yet another proof point of the unmatched value provided by HP’s combination of Information Optimization solutions, and a great example of the opportunity created by innovation that makes us so excited to be a part of the greater OneHP.</p>
<p>To learn more about HP Project Moonshot, visit <a href="http://www.hp.com/go/moonshot">http://www.hp.com/go/moonshot</a></p>
<img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F04%2F10%2Ftaking-a-moonshot-at-big-data-analytics-for-everyone%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></content:encoded>
			<wfw:commentRss>http://www.vertica.com/2013/04/10/taking-a-moonshot-at-big-data-analytics-for-everyone/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Comparing Pattern Mining on a Billion Records with HP Vertica and Hadoop</title>
		<link>http://www.vertica.com/2013/04/08/comparing-pattern-mining-on-a-billion-records-with-hp-vertica-and-hadoop/</link>
		<comments>http://www.vertica.com/2013/04/08/comparing-pattern-mining-on-a-billion-records-with-hp-vertica-and-hadoop/#comments</comments>
		<pubDate>Mon, 08 Apr 2013 18:20:39 +0000</pubDate>
		<dc:creator>Kyungyong Lee</dc:creator>
				<category><![CDATA[in-database analytics]]></category>
		<category><![CDATA[pattern matching]]></category>

		<guid isPermaLink="false">https://www.vertica.com/?p=13460</guid>
		<description><![CDATA[<p>Pattern mining can help analysts discover hidden structures in data. Pattern mining has many applications—from retail and marketing to security management. For example, from a supermarket data set, you may be able to predict whether customers who buy Lay’s potato chips are likely to buy a certain brand of beer. Similarly, from network log data, you may determine groups of Web sites that are visited together or perform event analysis for security enforcement. In this blog post, we will show you how the HP Vertica Analytics Platform can efficiently find frequent patterns in very large data sets.</p> <p><strong>A pattern mining algorithm</strong></p> <p>Frequent patterns are items that occur often in a data set. After finding frequent patterns, analysts can use methods <a href="http://www.vertica.com/2013/04/08/comparing-pattern-mining-on-a-billion-records-with-hp-vertica-and-hadoop/">Read More &#187;</a><img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F04%2F08%2Fcomparing-pattern-mining-on-a-billion-records-with-hp-vertica-and-hadoop%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></description>
			<content:encoded><![CDATA[<p>Pattern mining can help analysts discover hidden structures in data. Pattern mining has many applications—from retail and marketing to security management. For example, from a supermarket data set, you may be able to predict whether customers who buy Lay’s potato chips are likely to buy a certain brand of beer. Similarly, from network log data, you may determine groups of Web sites that are visited together or perform event analysis for security enforcement. In this blog post, we will show you how the HP Vertica Analytics Platform can efficiently find frequent patterns in very large data sets.</p>
<p><strong>A pattern mining algorithm</strong></p>
<p>Frequent patterns are items that occur often in a data set. After finding frequent patterns, analysts can use methods such as association rule mining to discover rules in the data. A classic example of an association (from Wikipedia) is that customers who buy diapers also tend to buy beer from a supermarket. While there are many frequent pattern mining algorithms in literature, we will use the FP-growth algorithm. FP-growth is considered efficient as it performs fewer database scans and does not require candidate set generation<span style="font-size: x-small;"> [1]</span>.</p>
<p>Instead of describing FP-growth in detail, we list the main steps from a practitioner’s perspective. We need to perform the following steps to obtain frequent patterns using FP-growth:</p>
<ol>
<li>Create transactions of items</li>
<li>Count occurrence of item sets</li>
<li>Sort item sets according to their occurrence</li>
<li>Remove infrequent items</li>
<li>Scan DB and build FP-tree</li>
<li>Recursively grow frequent item sets</li>
</ol>
<p>Let’s use an example to illustrate these steps. We will assume that our data set is a Web proxy log from a corporate network that, among other things, has IP address and Web sites visited as fields. Our goal is to find patterns such as Web sites that are visited together. After step 1, we obtain a set of transaction items shown in Table 1. Each transaction lists the Web sites visited from each IP address. After steps 2 and 3, we get Table 2 that has items sorted by their frequencies. Assuming that an item is considered frequent only if it occurs more than three times, then in step 4 we will discard <em>cnn</em> and yahoo from the table. In step 5 we use the pruned table to create an FP-tree (Figure 1). Finally, in step 6 we grow frequent patterns. The final output is shown in Table 3. The output, for example, shows that many users tend to visit both the Web sites of <em>HP</em> and <em>Amazon</em>.</p>
<table style="vertical-align: text-top;">
<tbody>
<tr>
<td style="&quot;vertical-align: text-top;">
<p><div id="attachment_13469" class="wp-caption alignnone" style="width: 250px"><img class=" wp-image-13469 " title="Table 1 Sites Visited" src="http://www.vertica.com/wp-content/uploads/2013/04/Table-1-Sites-Visited-300x168.jpg" alt="" width="240" height="134" /><p class="wp-caption-text">Table 1: Sites Visited</p></div></td>
<td style="&quot;vertical-align: text-top;">
<p><div id="attachment_13468" class="wp-caption alignnone" style="width: 241px"><img class=" wp-image-13468 " title="Table 2 Sorted Items" src="http://www.vertica.com/wp-content/uploads/2013/04/Table-2-Sorted-Items-289x300.jpg" alt="" width="231" height="240" /><p class="wp-caption-text">Table 2: Sorted Items</p></div></td>
</tr>
<tr>
<td colspan="2"></td>
</tr>
<tr>
<td style="&quot;vertical-align: text-top;">
<p><div id="attachment_13467" class="wp-caption alignnone" style="width: 310px"><img class="size-medium wp-image-13467" title="Figure 1 FP-tree" src="http://www.vertica.com/wp-content/uploads/2013/04/Figure-1-FP-tree-300x208.jpg" alt="" width="300" height="208" /><p class="wp-caption-text">Figure 1: FP-tree</p></div></td>
<td style="&quot;vertical-align: text-top;">
<p><div id="attachment_13466" class="wp-caption alignnone" style="width: 310px"><img class="size-medium wp-image-13466" title="Table 3 Final output of frequent patterns" src="http://www.vertica.com/wp-content/uploads/2013/04/Table-3-Final-output-of-frequent-patterns-300x100.jpg" alt="" width="300" height="100" /><p class="wp-caption-text">Table 3: Final output of frequent patterns</p></div></td>
</tr>
</tbody>
</table>
<p><strong>Parallel pattern mining on the HP Vertica Analytics Platform</strong></p>
<p>Despite the efficiency of the FP-Growth algorithm, single-threaded sequential version of FP-Growth can take very long on large data sets. Fortunately, we can rewrite the algorithm using SQL and HP Vertica user-defined functions (UDFs), and let the HP Vertica Analytics Platform parallelize the implementation. The main issue to resolve is how to map the algorithm to SQL statements and then remove dependencies between UDFs so that they can run independently and in parallel. Below are the statements that we used in the HP Vertica Analytics Platform. Let’s assume that we are still working with the Web proxy log example introduced earlier.</p>
<ol>
<li>Create transaction of items
<ul>
<li>SELECT DISTINCT srcIP, hostname INTO uniqueSipHn FROM networkLog;</li>
</ul>
</li>
<li>Count frequency of occurrence of each host name
<ul>
<li>SELECT count(hostname) INTO hnCnt FROM uniqueSipHn;</li>
</ul>
</li>
<li>List host names visited by each IP and also the frequency of each host name.
<ul>
<li>SELECT a.srcIP, b.hostName, b.frequency into sipHnCnt FROM uniqueSipHn a INNER JOIN hnCnt b ON a.hostName=b.hostName;</li>
</ul>
</li>
<li>Build conditional transactions. Assume an item is frequent if it occurs more than 20,000 times.
<ul>
<li>SELECT t1.hostName, t1.srcIP, t2.hostName AS condItem INTO condTr FROM sipHnCnt t1 JOIN sipHnCnt t2 ON (t1.srcIP=t2.srcIP) and (t1.count&gt;20000 and t2.count&gt;20000) and ((t2.count&gt;t1.count) or (t2.count=t1.count and t2.hostName&gt;t1.hostName))</li>
</ul>
</li>
<li>Generate patterns in parallel using UDF.
<ul>
<li>SELECT FPGrowth(srcIP, condItem, 20000) OVER(PARTITION BY hostName ORDER BY srcIP) INTO frequentItems FROM condTr;</li>
</ul>
</li>
</ol>
<p><strong>The real test: a billion records, and, of course, Hadoop</strong></p>
<p style="text-align: left;">Now that we know how to implement parallel frequent pattern mining in the HP Vertica Analytics Platform, let’s see how the implementation performs a large data set. Our input data is a few days’ worth of Web proxy logs. The log file is 330 GB in size, and has a billion records each with 22 fields. For comparison, we use Mahout’s implementation of parallel frequent pattern mining (Cloudera Hadoop 2.0 and mahout-0.7). We wrote a MapReduce program to create transactions from the log (step 1 of the algorithm). Our test bed consists of 12 HP ProLiant servers, each with 12 cores, 96GB RAM, and 128GB SSD.<br />
<img class="aligncenter" src="http://www.vertica.com/wp-content/uploads/2013/04/Figure-2-Comparison-of-pattern-mining-on-Vertica-and-Hadoop-Lower-is-better-1024x521.jpg" alt="" width="614" height="313" /><br />
Figure 2 depicts our results. On 4 servers, the HP Vertica Analytics Platform can complete the end-to-end pattern mining in fewer than 140 seconds. Hadoop takes 1,250 seconds (20 minutes)—approximately 9x more time than the HP Vertica Analytics Platform. As we increase the number of servers to 12, both the HP Vertica Analytics Platform and Hadoop take less time to complete. However, unlike Hadoop, the HP Vertica Analytics Platform has close to linear scaling for this setup.</p>
<p style="text-align: left;">Are you searching for patterns in your data set? Want a fast and easy-to-use data analytics platform? Evaluate the HP Vertica Community Edition today.</p>
<hr />
<p><span style="font-size: x-small;">[1] Mining frequent patterns without candidate generation. Jiawei Han, Jian Pei, Yiwen Yin. SIGMOD 2000.</span></p>
<img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F04%2F08%2Fcomparing-pattern-mining-on-a-billion-records-with-hp-vertica-and-hadoop%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></content:encoded>
			<wfw:commentRss>http://www.vertica.com/2013/04/08/comparing-pattern-mining-on-a-billion-records-with-hp-vertica-and-hadoop/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Join HP Vertica’s User-Driven Community!</title>
		<link>http://www.vertica.com/2013/03/26/join-hp-verticas-user-driven-community/</link>
		<comments>http://www.vertica.com/2013/03/26/join-hp-verticas-user-driven-community/#comments</comments>
		<pubDate>Tue, 26 Mar 2013 21:49:44 +0000</pubDate>
		<dc:creator>Danielle Sandahl</dc:creator>
				<category><![CDATA[MyVertica]]></category>
		<category><![CDATA[Support]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[vertica]]></category>

		<guid isPermaLink="false">https://www.vertica.com/?p=13420</guid>
		<description><![CDATA[<p>We at HP Vertica are very excited to announce our new user-driven community that will help us better serve our users, partners, and anyone generally interested in learning more about the HP Vertica Analytics Platform. In partnership with GetSatisfaction, community members can now:</p> Start engaging with other customers to establish new and valuable relationships Add context to your past issues and get the best possible answers to your questions Wield your influence as subject matter expert in the community <p>Now, HP Vertica newbies as well as seasoned database/data analytics veterans and everyone in between can post questions, share ideas, report issues, and give praise. To make this information readily available for your convenient access, all questions are cataloged and searchable.</p> <a href="http://www.vertica.com/2013/03/26/join-hp-verticas-user-driven-community/">Read More &#187;</a><img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F03%2F26%2Fjoin-hp-verticas-user-driven-community%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></description>
			<content:encoded><![CDATA[<p>We at HP Vertica are very excited to announce our new user-driven community that will help us better serve our users, partners, and anyone generally interested in learning more about the HP Vertica Analytics Platform. In partnership with GetSatisfaction, community members can now:</p>
<ul>
<li>Start engaging with other customers to establish new and valuable relationships</li>
<li>Add context to your past issues and get the best possible answers to your questions</li>
<li>Wield your influence as subject matter expert in the community</li>
</ul>
<p>Now, HP Vertica newbies as well as seasoned database/data analytics veterans and everyone in between can post questions, share ideas, report issues, and give praise. To make this information readily available for your convenient access, all questions are cataloged and searchable.</p>
<p>The new-and-improved community interface will enable you to easily access more information and better communicate with HP Vertica and other users of the HP Vertica Analytics Platform.</p>
<p>We welcome you to join our community by visiting <a href="http://community.vertica.com" target="_blank" >http://community.vertica.com</a> or by accessing the community tab on the side of the www.vertica.com homepage.</p>
<img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F03%2F26%2Fjoin-hp-verticas-user-driven-community%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></content:encoded>
			<wfw:commentRss>http://www.vertica.com/2013/03/26/join-hp-verticas-user-driven-community/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Big Data Analytics without Big Data Complexity</title>
		<link>http://www.vertica.com/2013/03/25/big-data-analytics-without-big-data-complexity/</link>
		<comments>http://www.vertica.com/2013/03/25/big-data-analytics-without-big-data-complexity/#comments</comments>
		<pubDate>Mon, 25 Mar 2013 20:02:03 +0000</pubDate>
		<dc:creator>Jeff Healey</dc:creator>
				<category><![CDATA[HP Discover]]></category>
		<category><![CDATA[vertica]]></category>

		<guid isPermaLink="false">https://www.vertica.com/?p=13386</guid>
		<description><![CDATA[<p>New analytics deployments can be complex, taking up to 18 months to implement and optimize. The complexity of maintaining and integrating these environments often results in missed deadlines, incomplete projects, increased costs, and lost opportunities. In fact, only 32 percent* of application deployments are rated as “&#8217;successful”&#8217; by organizations.</p> <p>To remove this Big Data complexity, we are pleased to announce the general availability of the <a href="https://www.hp.com/go/appsystems/vertica">HP AppSystem for Vertica</a>. Following through on the initial announcement at <a href="http://h30614.www3.hp.com/Discover/MyEvent">HP Discover</a> as part of the<a href="http://www.hp.com/go/appsystems"> HP AppSystems portfolio</a>, the HP AppSystem for Vertica ensures system performance and reduces implementation time from months to a matter of hours.</p> <p><strong>But what is an AppSystem and is it right for you?</strong></p> <p>Built on the <a href="http://www.vertica.com/2013/03/25/big-data-analytics-without-big-data-complexity/">Read More &#187;</a><img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F03%2F25%2Fbig-data-analytics-without-big-data-complexity%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></description>
			<content:encoded><![CDATA[<p>New analytics deployments can be complex, taking up to 18 months to implement and optimize. The complexity of maintaining and integrating these environments often results in missed deadlines, incomplete projects, increased costs, and lost opportunities. In fact, only 32 percent* of application deployments are rated as “&#8217;successful”&#8217; by organizations.</p>
<p>To remove this Big Data complexity, we are pleased to announce the general availability of the <a href="https://www.hp.com/go/appsystems/vertica">HP AppSystem for Vertica</a>. Following through on the initial announcement at <a href="http://h30614.www3.hp.com/Discover/MyEvent">HP Discover</a> as part of the<a href="http://www.hp.com/go/appsystems"> HP AppSystems portfolio</a>, the HP AppSystem for Vertica ensures system performance and reduces implementation time from months to a matter of hours.</p>
<p><strong>But what is an AppSystem and is it right for you?</strong></p>
<p>Built on the HP Converged Infrastructure, the new HP AppSystem for Vertica is a fully pre-integrated technology stack that includes a specifically optimized hardware configuration, factory pre-loaded OS, and the <a href="www.vertica.com/the-analytics-platform/">HP Vertica Analytics Platform</a> environment.</p>
<p>HP AppSystem for Vertica is ideal for organizations interested in accelerating time-to-business value with high-performance, massively scalable analytics at each layer of IT infrastructure — server, storage, network, and management. As a result, you can scale seamlessly, while adding capacity as your analytics needs for Big Data evolve.<br />
We encourage you to learn more about the <a href="https://www.hp.com/go/appsystems/vertica">HP AppSystem for Vertica</a> — and get started removing complexity to capitalize on your big data analytics initiatives.</p>
<p><span style="font-size: x-small;">* = CHAOS Summary 2009, Jim Johnson, Standish Group, April 2009</span></p>
<img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F03%2F25%2Fbig-data-analytics-without-big-data-complexity%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></content:encoded>
			<wfw:commentRss>http://www.vertica.com/2013/03/25/big-data-analytics-without-big-data-complexity/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A Method to the March Madness?</title>
		<link>http://www.vertica.com/2013/03/20/a-method-to-the-march-madness/</link>
		<comments>http://www.vertica.com/2013/03/20/a-method-to-the-march-madness/#comments</comments>
		<pubDate>Wed, 20 Mar 2013 20:18:03 +0000</pubDate>
		<dc:creator>Jeff Healey</dc:creator>
				<category><![CDATA[big data]]></category>

		<guid isPermaLink="false">https://www.vertica.com/?p=13324</guid>
		<description><![CDATA[<p style="text-align: center;"><a href="http://www.vertica.com/wp-content/uploads/2013/03/ScreenShot001.jpg"><img class="wp-image-13331 aligncenter" title="ScreenShot001" src="http://www.vertica.com/wp-content/uploads/2013/03/ScreenShot001-1024x614.jpg" alt="" width="573" height="344" /></a></p> <p>The NCAA 2013 Men’s Basketball March Madness Tournament officially tips off on Thursday, March 21st.  For those of you unfamiliar with the tournament, 64 teams from colleges and universities across the United States compete for the championship, awarded to just one winner in early April. Buzzer-beating upsets are as common as fan face paint and schools from parts unknown, making it challenging to choose the winner in your office tournament bracket.</p> <p>To give you a sense of the tournament’s popularity and appeal, according to <a href="http://usatoday30.usatoday.com/sports/college/mensbasketball/story/2012-03-09/march-madness-ncaa-workplace/53538598/1 ">USA Today</a> “Last year&#8217;s championship game alone had about 20 million TV viewers. The overall tournament had 52 million visits across March Madness <a href="http://www.vertica.com/2013/03/20/a-method-to-the-march-madness/">Read More &#187;</a><img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F03%2F20%2Fa-method-to-the-march-madness%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://www.vertica.com/wp-content/uploads/2013/03/ScreenShot001.jpg"><img class="wp-image-13331 aligncenter" title="ScreenShot001" src="http://www.vertica.com/wp-content/uploads/2013/03/ScreenShot001-1024x614.jpg" alt="" width="573" height="344" /></a></p>
<p>The NCAA 2013 Men’s Basketball March Madness Tournament officially tips off on Thursday, March 21<sup>st</sup>.  For those of you unfamiliar with the tournament, 64 teams from colleges and universities across the United States compete for the championship, awarded to just one winner in early April. Buzzer-beating upsets are as common as fan face paint and schools from parts unknown, making it challenging to choose the winner in your office tournament bracket.</p>
<p>To give you a sense of the tournament’s popularity and appeal, according to <a href="http://usatoday30.usatoday.com/sports/college/mensbasketball/story/2012-03-09/march-madness-ncaa-workplace/53538598/1 ">USA Today</a> “Last year&#8217;s championship game alone had about 20 million TV viewers. The overall tournament had 52 million visits across March Madness on Demand&#8217;s broadband and mobile platforms.”</p>
<p><span style="font-size: 13px; line-height: 19px;">So, what is the buzz on this year’s tournament on Twitter, and can social sentiment foreshadow ultimate success? A small team of us here — representing Autonomy, HP Vertica, and HP Information Management &amp; Analytics (IM&amp;A) — set out to answer that very question by building a March Madness Sentiment Tracker Demo to track the “sentiment of the crowd.”</span></p>
<p><strong>The Technology Behind the March Madness Sentiment Tracker</strong></p>
<p>Using <a href="https://blog.hpcloud.com/awards-meter-analyzing-big-data-real-time#.UUJjCCCK6B8.twitter">HP Labs’ Academy Awards Meter demo</a> as our guide, we created a framework in roughly a week based on Autonomy, HP Vertica, and Tibco Spotfire.</p>
<p>We unveiled the demonstration at the Sloan MIT Sports Analytics Conference. See Chris Selland’s <a href="http://www.vertica.com/2013/03/04/bdoc-big-data-on-campus/">blog post</a> from that event and his participation on the Big Data in Sports panel.</p>
<p>Since the MIT Sports Analytics Conference was held weeks before the tourney had begun, we first collected roughly half a million Tweets using Autonomy’s data aggregator from February 20<sup>th</sup> to March 1st. The Tweets included anything related to the Top 25 ranked teams at the time as well as the top scorers. Our colleagues at Autonomy also used Autonomy IDOL to structure and sentiment to the data. For example, a Tweet like “I am excited to watch my Jayhawks win #MarchMadness!” would carry a positive sentiment. However, a Tweet like “I hate #MarchMadness – it interrupts my favorite TV shows!” would carry a negative sentiment.</p>
<p>Our very own Will Cairns, who presented on the main stage of the MIT Sloan Sports Analytics Conference, loaded the data into the HP Vertica Analytics Platform, ran some analytical queries and provided an output file for HP IM&amp;A to create the visualization front-end with Tibco SpotFire. That is where the insight (and conversation with the data) began to happen.</p>
<p><strong>Visualizing the Sentiment and Lessons Learned</strong></p>
<p>HP IM&amp;A created impressive visualizations that helped us (and attendees) to explore:</p>
<ul>
<li>Volume of tweets by team</li>
<li>Volume of tweets by player</li>
<li>Positive, negative, and neutral sentiment groupings</li>
<li>Volume of tweets by U.S. city and by worldwide country</li>
<li>Volume of tweets by language (English, French, Spanish, etc.)</li>
</ul>
<div><a href="http://www.vertica.com/wp-content/uploads/2013/03/ScreenShot003.jpg"><img class="alignleft  wp-image-13357" title="ScreenShot003" src="http://www.vertica.com/wp-content/uploads/2013/03/ScreenShot003-1024x536.jpg" alt="" width="737" height="386" /></a></div>
<p><a href="http://www.vertica.com/wp-content/uploads/2013/03/ScreenShot006.jpg"><img class="alignleft  wp-image-13364" title="ScreenShot006" src="http://www.vertica.com/wp-content/uploads/2013/03/ScreenShot006-1024x456.jpg" alt="" width="645" height="287" /></a>They say that a picture is worth a 1,000 words. Well, the visualizations provided for great conversation – some results were not surprising such as NCAA perennial teams steeped with rich history, such as Kansas and Duke, leading the total volume of tweets. Some players ranked higher than others in volume of tweets, leading attendees to observe, “ Well, Trey Burke had a monster game the other night, so that makes sense.”</p>
<p><a href="http://www.vertica.com/wp-content/uploads/2013/03/ScreenShot007.jpg"><img class="alignleft  wp-image-13369" title="ScreenShot007" src="http://www.vertica.com/wp-content/uploads/2013/03/ScreenShot007-1024x559.jpg" alt="" width="645" height="352" /></a> But why did Chicago rank as the U.S. city with the highest number of tweets, despite having no college or university from Illinois team ranked in the top 25 at the time? Well, the Big 12 is one of the more competitive conferences in the country this season, and Chicago area schools (such as the University of Illinois) play Wisconsin, Indiana, Michigan, and Michigan State. It’s also one of the top five major media hubs in the country.</p>
<p>Spirited debates and conversations aside, most importantly, this exercise clearly demonstrated the power of sentiment for a range of use cases in nearly every industry with a major product, brand, or service. In the telecommunications industry, network providers are actively tracking social media channels to measure customer satisfaction. If there is an issue with the service, say in a certain region of the country, you better believe that customer service will soon receive calls to that very point. Using sentiment analysis to quickly address issues by, say, adding more network bandwidth and improving service can help reduce service costs, improve customer satisfaction, and minimize churn.</p>
<p>But can sentiment foreshadow success? I guess you will have to tune into the games to find out, while tracking your favorite social media channel. Better yet, why not use HP Vertica’s tight integration with R to develop a statistical model based on data available from ESPN and the likes on hard basketball statistics, such as field goal percentage, points allowed, head-to-head scoring, and more? You could correlate that statistical data with sentiment data trending from Twitter.</p>
<p>Hmm…that sounds like a perfect complement to our March Madness Sentiment Tracker demo. Stay tuned for more details or share your thoughts on how you could marry sentiment data with statistical data to ultimately predict this year’s winner.</p>
<img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F03%2F20%2Fa-method-to-the-march-madness%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></content:encoded>
			<wfw:commentRss>http://www.vertica.com/2013/03/20/a-method-to-the-march-madness/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The Disruptive Power of Big Data</title>
		<link>http://www.vertica.com/2013/03/19/the-disruptive-power-of-big-data/</link>
		<comments>http://www.vertica.com/2013/03/19/the-disruptive-power-of-big-data/#comments</comments>
		<pubDate>Tue, 19 Mar 2013 19:47:03 +0000</pubDate>
		<dc:creator>cselland@vertica.com</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[use cases]]></category>
		<category><![CDATA[vertica]]></category>
		<category><![CDATA[Vertica Customers]]></category>

		<guid isPermaLink="false">https://www.vertica.com/?p=13290</guid>
		<description><![CDATA[<p>Aside from the sheer quantity of digital data created every day—about 2.5 exabytes1 —there’s more to Big Data than volume. Big Data offers enterprise leaders the opportunity to dramatically change the way their organizations operate to gain competitive advantage and find new revenue opportunities. But realizing the value Big Data promises requires a new approach. Traditional data warehouses and business intelligence tools weren’t built for the scale of Big Data, and can’t provide insight quickly enough to be useful or even keep up.<img class="size-medium wp-image-13296 alignright" title="Graph 1" src="http://www.vertica.com/wp-content/uploads/2013/03/Graph-1-300x227.gif" alt="" width="300" height="227" /></p> <p>But this isn’t just a case of data growth outstripping technology growth. Big Data embodies fundamental differences that necessitate new approaches and new technologies. Big Data takes <a href="http://www.vertica.com/2013/03/19/the-disruptive-power-of-big-data/">Read More &#187;</a><img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F03%2F19%2Fthe-disruptive-power-of-big-data%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></description>
			<content:encoded><![CDATA[<p>Aside from the sheer quantity of digital data created every day—about 2.5 exabytes<sup>1</sup> —there’s more to Big Data than volume. Big Data offers enterprise leaders the opportunity to dramatically change the way their organizations operate to gain competitive advantage and find new revenue opportunities. But realizing the value Big Data promises requires a new approach. Traditional data warehouses and business intelligence tools weren’t built for the scale of Big Data, and can’t provide insight quickly enough to be useful or even keep up.<img class="size-medium wp-image-13296 alignright" title="Graph 1" src="http://www.vertica.com/wp-content/uploads/2013/03/Graph-1-300x227.gif" alt="" width="300" height="227" /></p>
<p>But this isn’t just a case of data growth outstripping technology growth. Big Data embodies fundamental differences that necessitate new approaches and new technologies. Big Data takes many forms, three in particular we’ll discuss here:</p>
<ul>
<li>Transactional data</li>
<li>Sentiment and perceptual data based on conversations taking place in social media</li>
<li>Data from networked sensors—the so-called “Internet of Things”</li>
</ul>
<p><strong>Transactional Data</strong></p>
<p>As businesses have expanded—and expanded onto the Internet—the volume of business transactions has grown. The Economist reported in 2010 that Wal-Mart processes more than 1 million customer transactions every hour and maintains databases exceeding 2.5 petabytes (million gigabytes)<sup>2</sup>. Imagine how those numbers have grown since then.</p>
<p>What’s even more critical is that companies can now capture not just sales transactions, but the detailed histories and clickstreams that lead to the sale. From web-based clickstream analysis to call data records, pre- and post-transaction histories are more robust than ever—and our ability to collect, analyze and act on that data must adjust accordingly.</p>
<p><strong>The social media explosion</strong></p>
<p>Today’s online customer has progressed well beyond accessing information. Today’s consumers are not only interacting and collaborating with each other, but they’re talking about and interacting with your brand. Facebook has more than 1 billion active subscribers<sup>3</sup>, and it’s estimated they share almost 700,000 individual pieces on content every minute. On Twitter, more than a billion tweets go out every two to three days<sup>4</sup>. (You can watch them mapped geographically in real-time at <a href="&quot;http://tweetping.net/">tweetping.net</a>.)</p>
<p><img class="alignleft size-medium wp-image-13299" title="Graph 2" src="http://www.vertica.com/wp-content/uploads/2013/03/Graph-2-300x213.png" alt="" width="300" height="213" />Product reviews, user communities, forums and blogs allow consumers to generate content that contains critical insight for the business. The proliferation of user-generated content in these social channels has lead to new techniques and tools for “sentiment analysis”—the ability to measure emotion to determine how your company and brand are perceived.</p>
<p><strong>The Internet of Things</strong></p>
<p>The amount of information generated by devices rather than people is also growing explosively.<br />
Mobile devices—and the apps people use on them—regularly broadcast individuals’ location, performance and other factors to the network. Retailers and distributors are using radio frequency identification (RFID), bar and QR codes to track inventory and enhance their supply chain and inventory performance. The healthcare industry seeks to improve care and reduce costs through remote patient monitoring. The automotive industry is embedding sensors in vehicles. And utilities are beginning to rely on smart meters to track usage. McKinsey Global Institute reports that more than 30 million networked sensors are in use in the transportation, automotive, industrial, utilities and retail sectors—and the number is growing by 30 percent every year.<sup>5</sup></p>
<p>We recently presented a webinar on the <a href="http://www.vertica.com/resources/video/">Internet of Things and the Power of Sensor Data</a>, which delves into this exciting area in much more detail.</p>
<p><strong>Disrupting conventional analytics &#8211; developing a &#8216;conversational relationship with data&#8217;</strong></p>
<p>Using Big Data to make operations more efficient, improve competitiveness and increase revenue is not about generating traditional statistics or producing standard reports.</p>
<p>Just as important as systems to collect and store data are systems to analyze and extract insight from that data. Without insight, you can’t <strong>gain</strong> new knowledge into your markets, your products and your operations.</p>
<p>When you have this insight at your disposal, you can act faster and with greater probability of success.</p>
<p>Extracting business value from Big Data requires a new approach. We believe that Big Data analytics is an iterative process. We describe it as <em>developing a conversational relationship with your data</em>. Analytics becomes a continuous improvement loop, which uses the results of analyses to frame better, more meaningful analyses, which, in turn, produce more definitive results. When results are available in minutes, analysts can ask, “What if?”</p>
<p style="text-align: center;"><img class="aligncenter  wp-image-13301" title="Graph 3" src="http://www.vertica.com/wp-content/uploads/2013/03/Graph-3-1024x349.png" alt="" width="574" height="195" /></p>
<p>When properly applied, Big Data analytics enables business leaders to:</p>
<ul>
<li>Understand market reaction and brand perception</li>
<li>Identify key buying factors</li>
<li>Segment populations to customize actions</li>
<li>Enable experimentation</li>
<li>Accurately predict outcomes</li>
<li>Reinvent and enhance inventory and supply chain systems and processes</li>
<li>Disrupt their industries, gain an edge over competitors and enable new business models</li>
</ul>
<p>Big Data already proved its game-changing power during the 2012 U.S. presidential election. Obama campaign chairman Jim Messina said: “We were going to demand data on everything, we were going to measure everything…We were going to put an analytics team inside of us to study us the entire time to make sure we were being smart about things.”<br />
And, in fact, Big Data analytics helped the Obama campaign ratchet up the three key levers in any election: voter registration, persuasion and turnout. Rolling Stone magazine singled out Messina and the campaign’s CTO, Harper Reed, as two among a <a href="http://www.rollingstone.com/politics/news/the-obama-campaigns-real-heroes-20121126">handful of unsung heroes in Obama’s victory</a>.</p>
<p>You can hear more about how HP Vertica contributed to the high-tech strategy behind Obama’s reelection in a <a href="http://www.vertica.com/resources/video/">recent webinar</a> featuring Chris Wegrzyn, director of data architecture for the Democratic National Committee.</p>
<p><strong>The traditional data warehouse won’t get it done</strong></p>
<p>The concept of the data warehouse evolved in the 1980s. Then, data warehouses were simply databases into which data from multiple sources was consolidated for the purpose of query and reporting. But today, these systems fall short when confronted with the volume, velocity and variety of Big Data. Why? They fail to enable the conversational approach to data required by Big Data analytics.</p>
<p>Traditional databases and data warehouses don’t easily scale to the hundreds of terabytes or even petabytes needed for many Big Data applications. Data is often not compressed, so huge amounts of storage and I/O bandwidth are needed to load, store and retrieve data. Data is still stored in tables by row, so access to a single data element through many rows—a common operation in business analytics—requires retrieving practically all of the data in a dataset to extract the specific element(s) needed. That strains I/O bandwidth and extends processing time. We have seen cases where the velocity of incoming data exceeds the capacity of the system to load it into the database, and queries produce answers in hours rather than the seconds or minutes needed for iterative business analytics. As a result, systems cost too much to maintain, and they fail to deliver the insight business leaders seek.</p>
<p>Take sentiment analysis, for example. The goal is to extract meaningful information from unstructured data so results can be stored in databases and analyzed. But the formats of resulting data are less predictable, more varied and subject to change during iterative analytics. This requires frequent changes to relational database structure and to processes that load data into them. For IT, it means the iterative approach to extracting business insight from Big Data requires new approaches, new tools and new skills.</p>
<p><strong>Challenges for business leaders</strong></p>
<p>Big Data is not just a technical challenge. Gaining and applying business insight compels business leaders to adopt new and disruptive ways of thinking and working.<br />
Successful leaders we have known in data-driven organizations become more familiar with the sources of data available to them. Rather than asking IT what information is available in the database, they view information as a key competitive asset and explore how insights might be extracted from it to offer immediate and sustainable competitive advantage.</p>
<p><strong>A solution for Big Data analytics</strong></p>
<p>HP Vertica Analytics Platform is a new kind of database designed from the ground up for business analytics at the scale of Big Data. Compared to traditional databases and data warehouses, it drives down the cost of capturing, storing and analyzing data. And it produces answers 50 to 1,000 times faster to enable the iterative, conversational analytics approach needed.</p>
<ul>
<li>HP Vertica Analytics Platform compresses data to reduce storage costs and speed access by up to 90 percent.</li>
<li>It stores data by columns rather than rows and caches data in memory to make analytic queries 50 to 1,000 times faster.</li>
<li>It uses massively parallel processing (MPP) to spread huge data volumes over any hardware, including low-cost commodity servers.</li>
<li>It uses data replication, failover and recovery to achieve automatic high availability.</li>
<li>It includes a pre-packaged, in-database analytics library to handle complex analytics and development framework.</li>
<li>It supports the R statistical programming language so analysts can create user-defined analytics inside the database.</li>
<li>It dynamically integrates with Hadoop to analyze large sets of structured, semi-structured and unstructured data.</li>
</ul>
<p>HP Vertica Analytics Platform means better, faster business insight at less cost.</p>
<hr />
<p>Test drive the HP Vertica Analytics Platform at <a title="http://www.vertica.com/evaluate" href="http://www.vertica.com/evaluate">www.vertica.com/evaluate</a>.</p>
<ul>
<li>HP Vertica White Paper – <a href="http://www.vertica.com/the-disruptive-power-of-big-data/" target="_blank">The Disruptive Power of Big Data</a></li>
<li>HP Vertica Webinar – <a href="http://www.vertica.com/resources/video/" target="_blank">The Disruptive Power of Big Data</a> (featuring guest speaker Chris Wegrzyn, Director of Data Architecture for the DNC/Obama campaign)</li>
<li>HP Vertica Webinar – <a href="http://www.vertica.com/resources/video/" target="_blank">Unlocking the Massive Potential of Sensor Data and the Internet of Things</a></li>
<li>Blog – CIOs: <a href="http://www.enterprisecioforum.com/en/blogs/junem/cio-%E2%80%93-letter-i-your-title-never-meant-so" target="_blank">The letter I in your title never meant so much</a></li>
<li>Blog – <a href="http://www.enterprisecioforum.com/en/blogs/junem/unlocking-value-big-data">Unlocking the value of Big Data</a></li>
</ul>
<hr />
<p><span style="font-size: x-small;">[1] “<a href="http://hbr.org/2012/10/big-data-the-management-revolution/ar/1">Big Data: The Management Revolution</a>,” Andrew McAfee and Erik Brynjolfsson, Harvard Business Review, October, 1012.</span></p>
<p><span style="font-size: x-small;">[2]“<a href="http://www.economist.com/node/15557443?story_id=15557443">Data, data everywhere</a>,” The Economist, Feb 25, 2010.</span></p>
<p><span style="font-size: x-small;">[3]<a href="http://newsroom.fb.com/content/default.aspx?NewsAreaId=22">Facebook key facts</a>.</span></p>
<p><span style="font-size: x-small;">[4] <ins cite="mailto:Leslie%20Ayers" datetime="2013-03-05T14:49"><a href="http://www.mediabistro.com/alltwitter/tweetping_b35247">http://www.mediabistro.com/alltwitter/tweetping_b35247</a></ins></span></p>
<p><span style="font-size: x-small;">[5] “<a href="http://www.mckinsey.com/insights/mgi/research/technology_and_innovation/big_data_the_next_frontier_for_innovation">Big data: The next frontier for innovation, competition, and productivity</a>,” The McKinsey Global Institute, June 2011.</span></p>
<img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F03%2F19%2Fthe-disruptive-power-of-big-data%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></content:encoded>
			<wfw:commentRss>http://www.vertica.com/2013/03/19/the-disruptive-power-of-big-data/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Big Data Value at Mobile World Congress 2013</title>
		<link>http://www.vertica.com/2013/03/11/big-data-value-at-mobile-world-congress-2013/</link>
		<comments>http://www.vertica.com/2013/03/11/big-data-value-at-mobile-world-congress-2013/#comments</comments>
		<pubDate>Mon, 11 Mar 2013 20:14:19 +0000</pubDate>
		<dc:creator>Jeff Healey</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[HP IT]]></category>

		<guid isPermaLink="false">https://www.vertica.com/?p=13197</guid>
		<description><![CDATA[<p>Barcelona, Spain is known for its tapas, futbol, and Gaudi-inspired architecture. However, as host to the world’s largest annual mobile industry event — <a href="http://www.mobileworldcongress.com/" target="_blank">Mobile World Congress</a>  — the city has also become synonymous with all things mobile.</p> <p>Nearly 80,000 attendees (72,000 from 200 countries to be precise — an all-time high) were blanketed with announcements and presentations about the latest gadgets and devices, the wireless enablement of mainly everything, Machine to Machine (M2M), and, largely, the growth and value of managing and analyzing Big Data.</p> <p>It’s no surprise to us, given that 7 of the top 10 <a href="http://www.vertica.com/industries/telecommunications/"> communications and service providers (CSPs)</a> trust the HP Vertica Analytics Platform to manage and analyze terabytes to petabytes of <a href="http://www.vertica.com/2013/03/11/big-data-value-at-mobile-world-congress-2013/">Read More &#187;</a><img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F03%2F11%2Fbig-data-value-at-mobile-world-congress-2013%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></description>
			<content:encoded><![CDATA[<p>Barcelona, Spain is known for its tapas, futbol, and Gaudi-inspired architecture. However, as host to the world’s largest annual mobile industry event — <a href="http://www.mobileworldcongress.com/" target="_blank">Mobile World Congress</a>  — the city has also become synonymous with all things mobile.</p>
<p>Nearly 80,000 attendees (72,000 from 200 countries to be precise — an all-time high) were blanketed with announcements and presentations about the latest gadgets and devices, the wireless enablement of mainly everything, Machine to Machine (M2M), and, largely, the growth and value of managing and analyzing Big Data.</p>
<p>It’s no surprise to us, given that 7 of the top 10 <a href="http://www.vertica.com/industries/telecommunications/"> communications and service providers (CSPs)</a> trust the HP Vertica Analytics Platform to manage and analyze terabytes to petabytes of data (i.e., Big Data) in near-real time.</p>
<p>But why are CSPs managing and analyzing all of this Big Data — in other words, where is the value?</p>
<p>Miguel Carrero, GM, Actionable Customer Intelligence, HP and I covered this very question at the show during a <a href="http://www.youtube.com/watch?v=R9zum1ze9Qw " target="_blank">short video interview</a>. Miguel also covered this topic in more detail in the day three editorial recap of <a href="http://mwdaily.mobileworldlive.com/2013/day3/index.html" target="_blank">Mobile World Daily 2013</a>.</p>
<p>At the HP booth (as well as the transformation workshops), CSPs met with HP CMS (Communications and Media Solutions) to learn how the <a href="http://www8.hp.com/us/en/business-solutions/solution.html?compURI=1321902" target="_blank">HP Smart Profile Server Solution</a> — powered by the HP Vertica Analytics Platform — helps them realize a range of real-world use cases with real business value:</p>
<ul>
<li><strong>Targeted product and marketing offers</strong> &#8211; Gain complete contextual insight into your customers’ needs then take action to improve customer satisfaction and achieve better retention rates.</li>
<p></p>
<li><strong>Network optimization</strong> – Improve your network engineering and planning and user experience via optimized network utilization and real-time response to traffic congestion situations.</li>
<p></p>
<li><strong>“Bill shock” mandate</strong> – Provide pre-paid and subscriber mobile customers with visibility into their mobile usage, including voice, data, and roaming.</li>
<p></p>
<li><strong>New business model enablement</strong> &#8211; Capture the real-time business value of each of your customers and leverage it via new collaborative business models — increasing upsell opportunities and delivering prioritized resolutions.</li>
<p>
</ul>
<p>
What is your organization’s most prevalent use case for managing and analyzing Big Data?</p>
<img src="http://track.hubspot.com/__ptq.gif?a=120019&k=14&bu=http%3A%2F%2Fwww.vertica.com%2Fblog%2F&r=http%3A%2F%2Fwww.vertica.com%2F2013%2F03%2F11%2Fbig-data-value-at-mobile-world-congress-2013%2F&bvt=rss&p=wordpress" style="float:left;" xml:base="http://www.vertica.com/feed/" width="1" height="1" border="0" align="right"/>]]></content:encoded>
			<wfw:commentRss>http://www.vertica.com/2013/03/11/big-data-value-at-mobile-world-congress-2013/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
