<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>InFocus</title>
	<atom:link href="http://infocus.emc.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://infocus.emc.com</link>
	<description>EMC Global Services Blog</description>
	<lastBuildDate>Tue, 15 May 2012 14:39:11 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Big Data MBA: Reading the Annual Report for Big Data Opportunities</title>
		<link>http://infocus.emc.com/william_schmarzo/big-data-mba-reading-the-annual-report-for-big-data-opportunities/</link>
		<comments>http://infocus.emc.com/william_schmarzo/big-data-mba-reading-the-annual-report-for-big-data-opportunities/#comments</comments>
		<pubDate>Mon, 14 May 2012 13:56:49 +0000</pubDate>
		<dc:creator>Bill Schmarzo</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[Annual Reports]]></category>
		<category><![CDATA[financial services]]></category>
		<category><![CDATA[retail]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://infocus.emc.com/?p=5786</guid>
		<description><![CDATA[I just finished presenting at the TDWI &#8220;Cool BI&#8221; Forum in Chicago this week.  I had an interesting conversation after my presentation about identifying where to look for big data opportunities within one’s organization.  I’m a big advocate of starting with the annual report, and I’ll use this blog to discuss 1) where to look in the [...]]]></description>
			<content:encoded><![CDATA[<p>I just finished presenting at the <a href="http://events.tdwi.org/events/chicago-cool-bi-forum-2012/home.aspx">TDWI &#8220;Cool BI&#8221; Forum</a> in Chicago this week.  I had an interesting conversation after my presentation about identifying where to look for big data opportunities within one’s organization.  I’m a big advocate of starting with the annual report, and I’ll use this blog to discuss 1) where to look in the annual report to identify potential big data opportunities and 2) how to do a quick assessment as to how big data might be used to power these opportunities.</p>
<h1><strong>Reading the Annual Report</strong></h1>
<p>I’m always surprised by how few people take the time to read their company’s annual report (you are excused if you work for a government agency).  In particular, the “President’s Letter to the Shareholders” is a gold mine.  It is within this section of the annual report that the President or CEO talks about all the great things he/she did for the company over the past year.  That usually takes up about 3/4<sup>th</sup> of the letter and can largely be ignored (IMHO).  It’s the last 1/4<sup>th</sup> of the letter that is most informative.  Usually in the last part of the Shareholders letter, the President/CEO talks about the key business initiatives for the next year.  Let’s review a couple annual reports to see what I mean.</p>
<h1><strong>Reading the Annual Report:  Financial Services</strong></h1>
<p>Below is the 2010 annual report from a financial services firm (by the way, there is no rhyme or reason behind the annual reports that I have picked.  They are just ones that have potential business impact using big data).</p>
<p style="text-align: center;" align="center">   <img class="size-full wp-image-5798 aligncenter" title="Figure 1" src="http://infocus.emc.com/wp-content/uploads/2012/05/figure-11.jpg" alt="" width="297" height="301" /></p>
<p style="text-align: center;" align="center"><em>Figure 1: 2010 Annual Report</em></p>
<p>The above section of the Shareholder’s Letter highlights a business initiative to improve customer cross-sell effectiveness in order to reach a goal of 10.0 products per banking household (from 5.7 products per banking household today).  While 10.0 may be a BHAG (Big, Hairy, Audacious Goal), it is clear that some executive in the organization (likely in Marketing) has been chartered with increasing cross-sell effectiveness.</p>
<p>Here are some examples of how big data could help their cross-sell effectiveness business initiative:</p>
<ul>
<li>Use detailed customer financial data on the number and types of accounts held by household, combined with key account information (e.g., length of account ownership, account balance, account balance trends) as well as household demographics data, to create more granular household segments.  Run analytic models to “score” these new household segments by their likelihood of buying a specific additional financial product (e.g., households who hold these products and are in this demographic group have an X% likelihood of buying this additional product).  Develop different models for different combinations of products and household demographics.</li>
<li>Use social media data from sites such as Facebook, Pinterest, and Twitter to identify trends in financial products that might be candidates for cross-product promotions (e.g., mortgage refinancing is hot, so look to bundle a mortgage refinancing with a second mortgage).  Run these trends against your household/product cross-sell models to identify direct marketing targets.</li>
</ul>
<h1><strong>Reading the Annual Report:  Retail</strong></h1>
<p>Here’s another example from a retailer’s 2011 Annual Report.  There are at least two sections of the annual report where the company could leverage structured data (e.g., point-of-sale, inventory, returns and orders transactions) with unstructured (social media, web log, consumer comments) data to drive these key business initiatives.</p>
<p style="text-align: center;"><img class="size-full wp-image-5799 aligncenter" title="Figure 2" src="http://infocus.emc.com/wp-content/uploads/2012/05/figure-21.jpg" alt="" width="436" height="84" /></p>
<p style="text-align: center;" align="center"><em>Figure 2: Store Assortment Optimization</em></p>
<p>The first example highlights to an opportunity where big data could help drive store assortment optimization.  In particular, big data could help in the following ways:</p>
<ul>
<li>Integrate social media data, third-party demographics data, and product sales data to forecast optimal store assortment (at individual store level); update optimal store assortments more frequently (weekly?) based upon local events (e.g., Cinco de Mayo, San Francisco Giants home games).</li>
<li>Integrate social media insights with consumer comments (call center, email, web site) with store and product sales data to calculate and track the net promoter scores and consumer sentiment for particular stores (by product category, by season); identify and act on under-performing stores, products and product categories.</li>
<li>Leverage social media data to identify product and market trends (by store and by product category) that can impact pricing, in-store merchandising and store assortment planning.</li>
</ul>
<p style="text-align: center;"><img class="size-full wp-image-5800 aligncenter" title="Figure 3" src="http://infocus.emc.com/wp-content/uploads/2012/05/figure-31.jpg" alt="" width="436" height="140" /></p>
<p style="text-align: center;" align="center"><em>Figure 3:  Private Label Effectiveness</em></p>
<p>The second example highlights driving private label effectiveness (e.g., increase private label sales from 15% of products sold to 30% over the next several years).  Here are some examples of where big data could be used to drive private label effectiveness:</p>
<ul>
<li>Integrate sales and inventory data with social media data to “score” product categories that are the most likely opportunities (highest probabilities of success) to introduce private label products (create scores by store, market, and product category).</li>
<li>Mine social media data to identify consumers’ areas of interest that can be used for direct marketing and in-store promotions and merchandising around promoting private label products.</li>
</ul>
<h1><strong>Reading the Annual Report:  Brokerage Firm</strong></h1>
<p>This third example comes from a brokerage firm’s 2010 annual report.</p>
<p style="text-align: center;" align="center"><img class="size-full wp-image-5801 aligncenter" title="Figure 4" src="http://infocus.emc.com/wp-content/uploads/2012/05/figure-41.jpg" alt="" width="354" height="254" /></p>
<p style="text-align: center;" align="center"><em>Figure 4:  2010 Brokerage Firm Annual Report</em></p>
<p>The annual report highlights the focus on driving their Client Promoter Score (CPS) program.  In 2010, the firm was able to achieve their highest score to-date, a record 37 percent (it would be nice to know if they had a goal and a timeline for their CPS, but maybe that’s something that can be determined via interviews).  Here are some examples where big data can be used to drive the CPS program.</p>
<ul>
<li>Leverage social media sites and blogs to create and continuously refine a new Client Promoter Score that enables the creation of a more timely, more comprehensive Client Promoter Score that is a better predictor of clients&#8217; feelings and perspectives (e.g., likeliness to recommend).</li>
<li>Build analytic models that incorporate social data and different financial transactions to break out and track CPS by most &#8220;valuable&#8221; customer segments; match up customer financial transaction patterns with sentiment analysis to flag potential CPS score drops by customer segment.</li>
<li>Use the CPS to segment key customers; leverage Twitter and Facebook data to monitor sentiment trends across your most valuable customer segments in order to more thoroughly identify and quantify potential customer attrition and their corresponding drivers.</li>
</ul>
<h1><strong>Summary</strong></h1>
<p>We talk about the business transformational power of big data; the power to tease out new customer, product, and market insights that we can use to drive higher-fidelity, more frequent business decisions.  But in order to drive business transformation, we need to “begin with an end in mind” (to steal from Stephen Covey).  We need to invest the time to understand our organization’s key business initiatives, and contemplate the “realm of the possible” with respect to the big data business drivers (e.g., more detailed structured data, new unstructured data sources, high-velocity/low-latency data access, and predictive analytics).  There is no better place to start your big data journey than by targeting the key business initiatives that can be found in your company’s annual report.</p>
]]></content:encoded>
			<wfw:commentRss>http://infocus.emc.com/william_schmarzo/big-data-mba-reading-the-annual-report-for-big-data-opportunities/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Stump the Bloggers: Transitioning to ITaaS; Service Excellence Industry Recognition; BI &amp; Customer Data; &amp; Strategic Planning</title>
		<link>http://infocus.emc.com/community_manager/stump-the-bloggers-transitioning-to-itaas-service-excellence-industry-recognition-bi-strategic-planning/</link>
		<comments>http://infocus.emc.com/community_manager/stump-the-bloggers-transitioning-to-itaas-service-excellence-industry-recognition-bi-strategic-planning/#comments</comments>
		<pubDate>Fri, 11 May 2012 18:32:22 +0000</pubDate>
		<dc:creator>Community Manager</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Intelligence / Data Warehousing]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Enterprise Info. Mgt.]]></category>
		<category><![CDATA[Financial Services]]></category>
		<category><![CDATA[Governance]]></category>
		<category><![CDATA[IT Transformation]]></category>
		<category><![CDATA[Service Excellence]]></category>
		<category><![CDATA[Virtualization]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[financial services]]></category>
		<category><![CDATA[governance]]></category>
		<category><![CDATA[IaaS]]></category>
		<category><![CDATA[IT transformation]]></category>
		<category><![CDATA[ITaaS]]></category>
		<category><![CDATA[PaaS]]></category>
		<category><![CDATA[TSW]]></category>
		<category><![CDATA[virtualization]]></category>

		<guid isPermaLink="false">http://infocus.emc.com/?p=5770</guid>
		<description><![CDATA[InFocus Weekly Roundup May 11, 2012 “Stump the Bloggers” with your questions on Capitalizing on Cloud Services and Transitioning to ITaaS.  Post your questions for InFocus bloggers here – and they will be answered live from EMC World May 21 (simulcast on the EMC Community Network). This week, Wayne Pauley, Choong Keng Leung, and JP Morgenthal [...]]]></description>
			<content:encoded><![CDATA[<p>InFocus Weekly Roundup May 11, 2012</p>
<p><strong>“Stump the Bloggers”</strong> with your questions on Capitalizing on Cloud Services and Transitioning to ITaaS.  <a title="Post your questions for InFocus bloggers here" href="http://infocus.emc.com/community_manager/stump-the-bloggers/">Post your questions for InFocus bloggers here </a>– and they will be answered live from EMC World May 21 (simulcast on the EMC Community Network).</p>
<p>This week, Wayne Pauley, Choong Keng Leung, and JP Morgenthal share their perspectives on <strong>IT Transformation and ITaaS and PaaS</strong>:</p>
<ul>
<li><a href="http://infocus.emc.com/wayne_pauley/itaas-assessment/">ITaaS Assessment</a>:  effective top down assessment process. </li>
<li><a href="http://infocus.emc.com/choong_kengleong/transforming-your-it-infrastructure/">Transforming Your IT infrastructure</a>: from Silos to consolidated infrastructures through the evolution from shared to Cloud and IT-as-a Service</li>
<li><a href="http://infocus.emc.com/jp_morgenthal/the-key-to-private-cloud-is-removing-it-stratification/">Key to Private Cloud is Removing IT Stratification</a>: moving to an agile organization through the use of DevOps culture and other collaborative techniques</li>
<li> <a href="http://infocus.emc.com/jp_morgenthal/iaas-and-paas-say-it-together-naturally/">IaaS and PaaS</a>: distinction between the two is no longer required</li>
</ul>
<p>Congratulations to EMC for being recognized this week at the <a href="http://www.technologyservicesworld.com/about/press-releases/2012-press-releases/2012-05-09-star-awards.html">Technology Services World</a> conference, the premier event in the IT Services industry. EMC was acknowledgement for <strong>commitment to service excellence</strong> in three categories:</p>
<ul>
<li>Innovation in Professional Services</li>
<li>Professional Services Excellence in Enabling Customer Success (co-winner)</li>
<li><a href="http://infocus.emc.com/frank_coleman/emc-wins-the-tsia-star-award-best-use-of-metrics-and-business-intelligence/">Excellence in Use of Metrics and Business Intelligence</a></li>
</ul>
<p>Frank Coleman discusses this award and the increased focus on Big Data and Business Intelligence.</p>
<p><strong>Business Intelligence and Customer Data Quality: </strong></p>
<ul>
<li>Bill Schmarzo explains how Hadoop, as your ODS and data staging area, can deliver new compelling business benefits: <a href="http://infocus.emc.com/william_schmarzo/understanding-the-role-of-hadoop-in-your-bi-environment/">Understanding the Role Of Hadoop In Your BI Environment.</a></li>
<li>Laddie Suk explores several core customer data issues that persist in today’s IT systems and covers <a href="http://infocus.emc.com/laddie_suk/customer-data-quality-how-to-measure-it-improve-it-and-make-it-stick/">Customer Data Quality- How to Measure it, Improve it and Make it Stick</a>.</li>
</ul>
<p><strong>Strategic Planning</strong></p>
<p>“Strategic planning is not something that can be fully programmed into a computer,” offers Dave Bagatelle in <a href="http://infocus.emc.com/dave_bagatelle/financial-planning-static-vs-probabilistic/">Financial Planning: Static vs. Probabilistic</a>, contrasting straight-line assumptions and forecasting to more probabilistic advice and tools that support forward-looking advice.</p>
]]></content:encoded>
			<wfw:commentRss>http://infocus.emc.com/community_manager/stump-the-bloggers-transitioning-to-itaas-service-excellence-industry-recognition-bi-strategic-planning/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IaaS and PaaS: Say It Together, Naturally</title>
		<link>http://infocus.emc.com/jp_morgenthal/iaas-and-paas-say-it-together-naturally/</link>
		<comments>http://infocus.emc.com/jp_morgenthal/iaas-and-paas-say-it-together-naturally/#comments</comments>
		<pubDate>Fri, 11 May 2012 15:59:24 +0000</pubDate>
		<dc:creator>JP Morgenthal</dc:creator>
				<category><![CDATA[Cloud]]></category>
		<category><![CDATA[IT Transformation]]></category>
		<category><![CDATA[Virtualization]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[IaaS]]></category>
		<category><![CDATA[IT organization]]></category>
		<category><![CDATA[NIST]]></category>
		<category><![CDATA[PaaS]]></category>

		<guid isPermaLink="false">http://infocus.emc.com/?p=5758</guid>
		<description><![CDATA[Traditional views on IaaS and PaaS are already outmoded. They present a view of the universe that delineates based on a particular software element called a hypervisor. However, these terms do nothing more than continue to promote a stratified IT organization. As I discussed in my previous blog, stratification of IT has become a primary [...]]]></description>
			<content:encoded><![CDATA[<p>Traditional views on IaaS and PaaS are already outmoded. They present a view of the universe that delineates based on a particular software element called a hypervisor. However, these terms do nothing more than continue to promote a stratified IT organization. As I <a href="http://infocus.emc.com/jp_morgenthal/the-key-to-private-cloud-is-removing-it-stratification/" target="_blank">discussed in my previous blog</a>, stratification of IT has become a primary driver of increasing costs. While no one wants to really deal with the pragmatics of the situation, it will take radical changes in organizational structure to deliver an effective IT organization that can meet the needs of business at a reasonable cost.</p>
<p>The concept of IaaS as a standalone layer made sense at one time; especially in the early days of understanding cloud computing. Before we could move forward, we had to develop the methodologies, processes, technologies and capabilities for aggregating infrastructure resources and delivering them in an agile and elastic manner with the ability to measure and respond to demands. As the IaaS layer matured, it was natural that others would start to focus on the problem domain for delivery of applications that make use of the capabilities now afforded them. Hence, PaaS emerged that enabled rapid and agile delivery of large-scale distributed applications. Consider that prior to the availability of IaaS, large-scale distributed applications needed to be designed in a monolithic manner with the infrastructure requirements as a component of the overall solution. Thus, building and deploying applications that supported billions of users or petabytes of data were highly custom, required proprietary hardware and required specialized skills to build, deploy and operate.</p>
<p>However, the time for distinction between IaaS and PaaS is no longer required. Indeed, we&#8217;ve learned two key lessons through maturing these two environments: 1) how scale and design of the infrastructure impacts the ability to deliver highly-scalable applications through a PaaS operating on that infrastructure; and 2) how to manage the needs of multiple tenants on a single cloud that all have varying infrastructure requirements. The impact of these two lessons is that IaaS cannot live in isolation, but needs intimate knowledge (context) of the PaaS environments that will be operating on that infrastructure and must operate in support of those PaaS environments. Thus, the IaaS is not a statically defined entity, but must be agile and change over time.</p>
<p>Moreover, it has been realized that components that have been redundantly defined within the PaaS layer is actually an element of the infrastructure. By removing this redundancy and delivering them as part of the infrastructure, it reduces the costs and overhead associated with building and delivering applications in the enterprise. Examples of these components include data management, security, queuing, and metering. Additionally, delivering these as part of the infrastructure simplifies the ability to support multiple PaaS tenants in a consistent and supportable manner.</p>
<p>The collapse of these layers begin to form a new layer identified as infrastructure services.</p>
<p>The good news here is that this historic pattern has worked successfully in the past. At one time developing high performance graphic applications for Microsoft Windows was very difficult. It required the application developers to deliver specialized drivers along with the application. Then Microsoft provided developers with the DirectX interface. The results was simpler and faster development for application developers, greater reliability of these applications since code was not being injected into the operating system layer that conflicted and better sharing of machine resources since each application didn&#8217;t use memory and disk to overcome this problem. These are the same benefits for PaaS when we abstract these services within the infrastructure.</p>
<p>The difficulty moving forward will be to bring this understanding forward given the massive instruction to the IT community. <a href="http://www.nist.gov/index.html" target="_blank">The National Institutes of Standards and Technology</a> (NIST) definition has been broadly adopted and is used as a common nomenclature for discussion and communication regarding cloud computing architecture and efforts. However, given the pace of maturity and change surrounding cloud computing, we should not be reticent to let go of early definitions and design goals in favor of adopting emerging methods for communicating design and requirements for cloud computing. Current definitions of IaaS represent compute, storage and networking, and delegate software services to PaaS and SaaS. As we move forward, the lines between these layers will continue to blur even further. Infrastructure services captures the goals and essence of the larger goal of ITaaS and services delivery much more effectively without implying association to the implementation.</p>
<p style="text-align: center;"><a href="http://infocus.emc.com/wp-content/uploads/2012/05/PaaS_IaaS_Layers.gif" target="_blank"><img class="aligncenter  wp-image-5761" title="PaaS_IaaS_Layers" src="http://infocus.emc.com/wp-content/uploads/2012/05/PaaS_IaaS_Layers.gif" alt="" width="453" height="148" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://infocus.emc.com/jp_morgenthal/iaas-and-paas-say-it-together-naturally/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Customer Data Quality – How to Measure it, Improve it, and Make it Stick</title>
		<link>http://infocus.emc.com/laddie_suk/customer-data-quality-how-to-measure-it-improve-it-and-make-it-stick/</link>
		<comments>http://infocus.emc.com/laddie_suk/customer-data-quality-how-to-measure-it-improve-it-and-make-it-stick/#comments</comments>
		<pubDate>Fri, 11 May 2012 15:17:00 +0000</pubDate>
		<dc:creator>Laddie Suk</dc:creator>
				<category><![CDATA[Enterprise Info. Mgt.]]></category>
		<category><![CDATA[Governance]]></category>
		<category><![CDATA[CRM]]></category>
		<category><![CDATA[customer]]></category>
		<category><![CDATA[Data Cleansing]]></category>
		<category><![CDATA[data quality]]></category>
		<category><![CDATA[governance]]></category>

		<guid isPermaLink="false">http://infocus.emc.com/?p=5704</guid>
		<description><![CDATA[Too many years ago, when I was at Verizon, a key problem was lack of consistent customer data – including the basics such as name, address and other customer information. Today’s blog explores some of the key customer data issues that persist in today’s IT systems… a sort of “Back to the Future.” The same [...]]]></description>
			<content:encoded><![CDATA[<p>Too many years ago, when I was at Verizon, a key problem was lack of consistent customer data – including the basics such as name, address and other customer information.</p>
<p>Today’s blog explores some of the key customer data issues that persist in today’s IT systems… a sort of “Back to the Future.” The same techniques used 20 years ago are applicable – aided by the fact that commercial packages are available to assist in reporting and reconciliation.</p>
<p>I’ll approach this topic from the perspective of planning to implement a new customer relationship management (CRM) system and address the key topics, tools and techniques to prepare legacy data to be migrated to the new system.</p>
<p><strong>WHAT IS DATA CLEANSING?</strong></p>
<p><em>Data Cleansing is an organized approach to successfully migrate business data focused on achieving data integrity. </em>Its benefits include increasing data quality. Focusing on data quality is intended to ensure that the new CRM system has clean and valid data. Though data quality at a company may be thought to be good, a formal approach should be used to address data integrity issues prior to migration into the CRM solution. Best practices include:</p>
<ul>
<li>Source to target mapping of existing data for CRM is usually performed for the design phase</li>
<li>Prior to and during this data mapping exercise, data in your legacy repositories should be analyzed to identify potential occurrence of the following types of major data integrity issues:</li>
</ul>
<ol>
<li><strong>Orphaned/Missing Data</strong> &#8211; System and/or program errors, inadequate enforcement of Relational Integrity, data that may have been considered optional in legacy systems maybe mandatory for the new CRM.</li>
<li><strong>Invalid or Unclean data</strong> &#8211; Data entry errors, data that may have been entered in to a system prior to implementation of validation logic. Examples of these would be invalid addresses or potentially missing data.</li>
</ol>
<p><strong>Key Success Factor:</strong>  <em>A cleansing effort should be undertaken to get the data-to-be-converted as clean as possible prior to extraction from legacy repositories.</em></p>
<p>How to cleanse? This depends on the specific problems:</p>
<p><span style="text-decoration: underline;">Orphaned / Missing data</span></p>
<ul>
<li>Custom programs responsible for extracting and transforming the data in to XML (for target application load), would identify these instances</li>
<li>Additionally, conversion programs will generate error reports that would allow you to verify issues were resolved at the source</li>
</ul>
<p><span style="text-decoration: underline;">Invalid or Unclean data</span></p>
<ul>
<li>A variety of COTS tools that specialize in cleansing CRM related, custom programs and/or manual  intervention are generally used to resolved this type of data integrity problems</li>
<li> Other common integrity issues include: Incorrect , Obsolete, and Duplicate data</li>
</ul>
<p>&nbsp;</p>
<p><strong>Typical Data Cleansing Activities to Prepare for Conversion:</strong></p>
<ul>
<li>Data Discovery &#8211; Identification of all data elements within the enterprise data stores</li>
<li>Data Profiling – Null &amp; valid-values Identification, outlier analysis &amp; other advanced profiling analyses</li>
<li>Data Quality – Creating metrics which future data management policies will use to demonstrate improvement.  I have often said ‘you get what you measure’… and this is true for data cleansing activities.  The good news is that today’s tools provide various metrics that can be used for tracking and improvement</li>
<li>Data Matching &#8211; Determination of matching values within columns of differing types, formats and naming convent</li>
<li>Data Cleansing &#8211; Replacement of invalid values, deletion of nulls and bad records</li>
</ul>
<p><strong>Data Collection and Preparation for Loading:</strong></p>
<ul>
<li>Data preparation activities are conducted to reduce the time and effort spent converting, and to ensure that the data that is transferred to the new CRM is current and meaningful to the business</li>
<li>Data preparation requires assistance from various business organizations to identify and confirm the validity of any data to be migrated, and in approving data items (i.e. products) to be removed (or mapped differently)</li>
<li>Data preparation will also involve resolving any data conflicts.</li>
</ul>
<p><strong>Data Auditing is required to perform the following tasks:</strong></p>
<ul>
<li>Exception Testing, including unwanted data, duplicate data, or conflicting data</li>
<li>Exception handling based upon the results from exception testing</li>
<li>Field map validation</li>
<li>Data validation reports</li>
<li>Data comparison reports</li>
</ul>
<p>&nbsp;</p>
<p>So what are the systems needed to support your CRM conversion?  The following architecture provides the ‘best practice’ view of solutions for complex conversions:</p>
<p><a href="http://infocus.emc.com/wp-content/uploads/2012/05/CDQFigure11.gif"><img class="aligncenter size-full wp-image-5707" title="" src="http://infocus.emc.com/wp-content/uploads/2012/05/CDQFigure11.gif" alt="" width="443" height="265" /></a></p>
<p>In the architecture above, the best practices include:</p>
<ol>
<li>Using an Enterprise Application Interface (EAI) bus to develop interfaces to extract data from legacy applications. Once these application interfaces are developed once, they can be used repeatedly. This is particularly important if your legacy system may run in parallel with your new CRM system for an extended period of time.</li>
<li>Implementing a Master Data Management Model that clearly identifies systems that are primary owners of data elements as well as other systems that use the data.</li>
<li>Using Extract, Transform and Load tools (ETL) to extract legacy data, store in a temporary data base where data cleansing is performed.</li>
<li>After cleansing, legacy data is reloaded into legacy systems in order to improve the quality of source data.</li>
<li>Steps 3 and 4 can be repeated multiple times, with quality metric reports (produced by tools) used to assess and validate that quality gates have been met.</li>
<li>The same ETL tools used for cleansing data can be used in the ‘final’ extract, transform and load into the new CRM target system.</li>
</ol>
<p>&nbsp;</p>
<p>A word about tools (okay, actually several words):</p>
<p>Todays’ Information Data Quality tools can provide business-focused metrics that are simple and easy to disseminate to your company. For example, customer address quality has historically been a problem area. The root cause of problems is usually systems with “free form fields” where customer service representatives enter address without validation. The impact on your business is huge:  mailed invoices delayed, lost, or returned—resulting in lower cash flow.</p>
<p>&nbsp;</p>
<p><strong>Sample Tool Output</strong></p>
<p>Below are two standard reports from a commercial tool that analyzed street addresses for 3.5 million customer records.</p>
<p style="text-align: center;"><a href="http://infocus.emc.com/wp-content/uploads/2012/05/CDQFigure2a.gif" target="_blank"><img class="aligncenter  wp-image-5711" src="http://infocus.emc.com/wp-content/uploads/2012/05/CDQFigure2a.gif" alt="" width="444" height="121" /></a></p>
<p style="text-align: center;"><a href="http://infocus.emc.com/wp-content/uploads/2012/05/CDQFigure2b.gif" target="_blank"><img class="aligncenter  wp-image-5712" src="http://infocus.emc.com/wp-content/uploads/2012/05/CDQFigure2b.gif" alt="" width="443" height="112" /></a></p>
<p>In this case, about 8% of the street addresses were in categories 0, 1 and 2 – which indicated that 8% of this company’s customer street address information was of poor quality.</p>
<p>The tool can be run again after performing a number of analyses and algorithms &#8211; including street address matching against a commercial street address database (usually accessed from government postal databases or commercial delivery service sources).</p>
<p>Use these tools as communications vehicles to every part of your organization… to indicate you have coupled a quality improvement strategy with a few metrics that define success.  For example, you could target reducing the poor quality street addresses to below 3%.</p>
<p>&nbsp;</p>
<p>Finally, no Data Quality Improvement should be begun without a Governance program.  Where does your company place on the industry-accepted Data Governance Maturity Model described below?</p>
<p style="text-align: center;"><a href="http://infocus.emc.com/wp-content/uploads/2012/05/CDQFigure3.gif"><img class="aligncenter  wp-image-5713" src="http://infocus.emc.com/wp-content/uploads/2012/05/CDQFigure3.gif" alt="" width="399" height="170" /></a></p>
<p>At the lowest level of maturity, your IT team (with or without business involvement) has developed a “tactical” approach whereby the strategy and framework are defined.</p>
<p>In the next level of “operational” maturity, an organization (IT plus Business representatives) successfully implements scenarios &amp; validation of the framework.</p>
<p>Finally, the “strategic” and highest level of maturity indicates your company focuses on maintaining original goals achieved and fosters a corporate culture to retain implemented best-practices.</p>
<p>&nbsp;</p>
<p><strong>Next Up:  </strong>Unlocking the Power of Big Data to Become a Next-generation Service Provider</p>
<p>I hope you will join me and will pass on the link to your friends and networks.  Please … subscribe, send me feedback, and check back next week for the next installment.   If nothing else, I promise the International Travel tips will be extremely useful!</p>
<p>&nbsp;</p>
<p><strong>Today’s International Travel Tip:   Must-Have Technology Packing Guide (Also Known as ‘Packing for a Potential Natural Disaster’)</strong></p>
<p>I will not insult your intelligence about the basics, but rather provide some helpful hints that provide for your personal safety as well as business continuity.  They are the result of my experience in surviving the Chilean mega-earthquake in February, 2010 (see one of my prior blogs for more info).</p>
<ol>
<li>Spare batteries for your cell phone and computer  (and keep them charged)</li>
<li>Spare power cord to charge your wireless phone</li>
<li>Small Flashlight, spare batteries</li>
<li>Did I mention spare batteries?</li>
<li>Laptop with Microsoft Communicator with voice calling</li>
<li>Laptop with Skype for voice and/or video calling (due to some company’s corporate policies that forbid Skype on corporate laptops, some travelers carry two devices – a corporate laptop and a device with Skype)</li>
<li>Copies (hard copies as well as scanned) of your passport</li>
<li>Emergency travel contacts.  Most companies have contracts with ‘travel assist’ vendors – not just travel agents, but companies that will arrange for quick extractions from countries if circumstances warrant.</li>
<li>Emergency medical assistance contacts.  Beyond the company-provided medical contacts, always obtain names and phones of at least 2 local doctors for fast emergency assistance.</li>
</ol>
<p>&nbsp;</p>
<p>One final note:  while I won’t get into the merits of an iPhone vs. a Blackberry, I will point out one key difference (that is not widely advertised) that can make your life miserable in some countries.   Blackberry smartphones have always had the option to let you manually select networks in foreign countries.  When you select ‘manual networks’ – a list of available networks is displayed and you can select the appropriate network you prefer.  This is extremely useful as you (like me) typically have a good understanding of the best local carriers in the countries where we travel.</p>
<p>On the other hand, an iPhone does NOT have this capability – it will automatically select a carrier in a country.  This limitation has ramifications for countries such as Brazil and Chile – where the iPhone selects a carrier for good voice capabilities – yet that carrier’s data capabilities may be (based on experience) poor or ineffective.  One of my team member spent hours on the phone with Verizon – and was told there was no way around this iPhone limitation.  Buyer beware!</p>
]]></content:encoded>
			<wfw:commentRss>http://infocus.emc.com/laddie_suk/customer-data-quality-how-to-measure-it-improve-it-and-make-it-stick/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Financial Planning: Static vs. Probabilistic</title>
		<link>http://infocus.emc.com/dave_bagatelle/financial-planning-static-vs-probabilistic/</link>
		<comments>http://infocus.emc.com/dave_bagatelle/financial-planning-static-vs-probabilistic/#comments</comments>
		<pubDate>Thu, 10 May 2012 20:20:07 +0000</pubDate>
		<dc:creator>Dave Bagatelle</dc:creator>
				<category><![CDATA[Financial Services]]></category>
		<category><![CDATA[Service Excellence]]></category>
		<category><![CDATA[Asset Management]]></category>
		<category><![CDATA[Financial Planning]]></category>
		<category><![CDATA[financial services]]></category>
		<category><![CDATA[planning tools]]></category>
		<category><![CDATA[Wealth Management]]></category>

		<guid isPermaLink="false">http://infocus.emc.com/?p=5701</guid>
		<description><![CDATA[Financial Planning is a growing need, especially for today’s retiring or retired baby boomers that require planning to sustain their lifestyle and for the Gen X/Y generations who are facing high education costs and less certain retirement prospects.  Planning tools are playing an increasing role in attracting and retaining these clients.  The right tools convey [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;"><img class="aligncenter  wp-image-5748" src="http://infocus.emc.com/wp-content/uploads/2012/05/Piggybank.jpg" alt="" width="360" height="229" /></p>
<p style="text-align: left;">Financial Planning is a growing need, especially for today’s retiring or retired baby boomers that require planning to sustain their lifestyle and for the Gen X/Y generations who are facing high education costs and less certain retirement prospects.  Planning tools are playing an increasing role in attracting and retaining these clients.  The right tools convey to clients that their adviser understands clients’ ever-changing situation, allow them to provide sound, valuable advice, and bring a level of trust and continuity that help advisers maintain relationships.</p>
<p>More than ever, clients want to know (and be able to track) if what their adviser said in one year holds true in the next several.  This requires the industry to move away from straight-line assumptions and forecasting to more probabilistic advice and tools that support forward-looking advice.</p>
<p>Probabilistic forecasting (aka the <a href="http://en.wikipedia.org/wiki/Retirement_planning#The_Monte_Carlo_method" target="_blank">Monte Carlo method</a>) provides two main benefits:</p>
<ul>
<li>Risk/Liability reduction to the firm for forecasts that do not come true</li>
<li>Improved advice to clients (ex. A gifting strategy that makes sense if a portfolio grows might need to be reviewed and revised during times of market volatility; therefore, creating a strategy that factors in that volatility from the start would obviate the need to revise the strategy)</li>
</ul>
<p>Static advice and reporting is no longer acceptable to most clients.  As account values change, clients demand to know the impacts to their goals in future years and what realistic changes are needed to stay on track.  Clients are also more skeptical than ever regarding straight line, non-volatile assumptions that do not account for real world events.  Increasingly, advisers are being required to provide advice that takes into account volatile investment types and markets.</p>
<p>Using straight line assumptions (e.g., US Large Cap stocks will go up 10% each year with no volatility) does not provide a realistic example of the investment world and it will almost undoubtedly provide an overly optimistic outcome as the only way an account value goes down is if spending is higher than the combined income and growth of the portfolio.</p>
<p>Advisers without the ability to respond to client requests for probabilistic financial planning will have a harder time retaining them since static forecasts become less reliable in a volatile environment.</p>
<p>Financial Planning tools that incorporate integrated data and a probability of success for individual goals allow advisers to provide strategic planning advice.  Advisers can track the probability of success when the plan is established as well as automatically on an on-going basis as actual income is realized, actual expenses are paid, and actual investments increase or decrease in value.</p>
<p>While assumptions still need to be made regarding the expected growth of investments, the volatility of those returns and their correlation to each other, the models will output potential long periods of negative returns in the short/mid and long term and create a framework for discussion and consideration.</p>
<p>Monte Carlo models rely on reasonable assumptions regarding the market return, asset class volatility and correlation and not all models are considered equal.  The most accurate models will consider mean reversion, the non-normal distribution of returns (e.g., fat-tails) and the ability to stress test a strategy using actual ‘black-swan’ (extraordinary) events.  A detailed description of these concepts and their application to financial planning models is beyond the scope of this blog.  The point is that some Monte Carlo is better than none, but actual tools and assumptions are important factors.</p>
<p>Advanced tools allow for the prioritization of goals and alert clients when/if material changes impact their plans.  Alerts can be set, among other things, to track the value of accounts, spending vs. budgets and allocations to specific securities or asset classes.</p>
<p>Best-in-breed financial planning tools allow advisers and clients to see how these occurrences impact the initial retirement goals and whether revisions are necessary.  The client can ‘play’ with several options using ‘what-if’ scenarios (ex. save more, assume to spend less in retirement, work longer).</p>
<p>Strategic planning is not something that can be fully programmed into a computer.  An adviser must understand the client’s situation and goals and come up with creative strategies that meet their individual needs.  Software helps this process, but one must never forget or discount the importance of human factors in the financial planning process.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://infocus.emc.com/dave_bagatelle/financial-planning-static-vs-probabilistic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>EMC Wins the TSIA STAR Award Best Use of Metrics and Business Intelligence</title>
		<link>http://infocus.emc.com/frank_coleman/emc-wins-the-tsia-star-award-best-use-of-metrics-and-business-intelligence/</link>
		<comments>http://infocus.emc.com/frank_coleman/emc-wins-the-tsia-star-award-best-use-of-metrics-and-business-intelligence/#comments</comments>
		<pubDate>Thu, 10 May 2012 13:34:20 +0000</pubDate>
		<dc:creator>Frank Coleman</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Intelligence / Data Warehousing]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[metrics]]></category>
		<category><![CDATA[STAR award]]></category>
		<category><![CDATA[thought process]]></category>
		<category><![CDATA[TSIA]]></category>

		<guid isPermaLink="false">http://infocus.emc.com/?p=5670</guid>
		<description><![CDATA[Yesterday I had the privilege of accepting the TSIA STAR Award for Best Use of Metrics and Business Intelligence on behalf of EMC. This is the third year in a row that EMC has won this award. EMC’s commitment to improving our customers’ experience is what this is all about. Our core metrics are designed [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday I had the privilege of accepting the <a href="http://www.technologyservicesworld.com/about/press-releases/2012-press-releases/2012-05-09-star-awards.html" target="_blank">TSIA STAR Award</a> for <strong>Best Use of Metrics and Business Intelligence</strong> on behalf of EMC.</p>
<p style="text-align: center;"><img class="size-full wp-image-5672 aligncenter" title="TSIA_StarAward_2012_sm" src="http://infocus.emc.com/wp-content/uploads/2012/05/TSIA_StarAward_2012_sm.jpg" alt="" width="300" height="86" /></p>
<p>This is the third year in a row that EMC has won this award. EMC’s commitment to improving our customers’ experience is what this is all about. Our core metrics are designed to look at the service experience from the customer point of view, not ours.  This has been invaluable in helping us focus our efforts on areas of the business that will have the greatest customer impact.</p>
<p>Our approach is a <a href="http://infocus.emc.com/frank_coleman/data-analytics-%e2%80%93-systems-thinking/" target="_blank">thought process</a>, not an end state. Because our business is dynamic, we will continue to evolve with our customers and not lose sight of their experience. These are exciting times with the increased focus on Big Data and Business Intelligence; EMC is the place to be!</p>
]]></content:encoded>
			<wfw:commentRss>http://infocus.emc.com/frank_coleman/emc-wins-the-tsia-star-award-best-use-of-metrics-and-business-intelligence/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Transforming Your IT Infrastructure</title>
		<link>http://infocus.emc.com/choong_kengleong/transforming-your-it-infrastructure/</link>
		<comments>http://infocus.emc.com/choong_kengleong/transforming-your-it-infrastructure/#comments</comments>
		<pubDate>Wed, 09 May 2012 15:54:41 +0000</pubDate>
		<dc:creator>Choong Keng Leong</dc:creator>
				<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Virtualization]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[IT transformation]]></category>
		<category><![CDATA[virtual data center]]></category>
		<category><![CDATA[virtualization]]></category>

		<guid isPermaLink="false">http://infocus.emc.com/?p=5636</guid>
		<description><![CDATA[Referring to the IT Transformation Puzzle which we discussed in my previous blog post IT Transformation – Where to Start?, most IT organizations will start their transformation journey with the Infrastructure Layer as shown in Figure 1. Figure 1: Infrastructure Transformation This is because Infrastructure is the foundation that everything else &#8216;rides&#8217; on top of. [...]]]></description>
			<content:encoded><![CDATA[<p>Referring to the IT Transformation Puzzle which we discussed in my previous blog post <a href="http://infocus.emc.com/choong_kengleong/it-transformation-where-to-start/">IT Transformation – Where to Start?</a>, most IT organizations will start their transformation journey with the Infrastructure Layer as shown in Figure 1.</p>
<p style="text-align: center;"><a href="http://infocus.emc.com/wp-content/uploads/2012/05/OppPic.gif" target="_blank"><img class="aligncenter  wp-image-5637" title="OppPic" src="http://infocus.emc.com/wp-content/uploads/2012/05/OppPic.gif" alt="" width="450" height="300" /></a></p>
<p style="text-align: center;" align="center"><em>Figure 1: Infrastructure Transformation</em></p>
<p>This is because Infrastructure is the foundation that everything else &#8216;rides&#8217; on top of. This is analogous to cruising a Mercedes Benz along a smooth flowing tar-surfaced highway versus the same car on a congested and pot-holed muddy road. You can have the best car (application or IT service) but the driving experience (IT service quality) is completely dependent on the road (Infrastructure) itself. However, this is not necessarily the easiest layer to start with. As you can imagine, making changes to the infrastructure requires significant planning, time, resources and investments.</p>
<p>The approach to transformation of the Infrastructure can take place over three stages. This is discussed at a high level in the following paragraphs.</p>
<p>&nbsp;</p>
<p><strong><span style="text-decoration: underline;">Stage 1: From Silos to Consolidated Infrastructure</span></strong></p>
<p>The logical first step to transforming the infrastructure is to consolidate and standardize the infrastructure to reduce complexity and IT costs; and improve efficiency and controls, as shown in Figure 2.</p>
<p style="text-align: center;"><a href="http://infocus.emc.com/wp-content/uploads/2012/05/Presentation1.gif" target="_blank"><img class="aligncenter  wp-image-5641" title="Presentation1" src="http://infocus.emc.com/wp-content/uploads/2012/05/Presentation1.gif" alt="" width="450" height="300" /></a></p>
<p style="text-align: center;" align="center"><em>Figure 2: Stage 1 &#8211; From Silos to Consolidation</em></p>
<p>In this stage, there are three key strategies:</p>
<ul>
<li><strong>Consolidate data centers</strong>: Data center consolidation has been elevated to the top of most CIOs’ agendas as they look to modify their data center facilities due to limits in power, cooling, efficiency and space. At the same time, there is a strong desire to lower facility costs and reduce operational expenditures by eliminating redundant facilities.</li>
</ul>
<ul>
<li><strong>Rationalize infrastructure</strong>: Most organizations spend up to around 70% of their budgets on maintaining existing assets. Rationalization strategies help reduce overlap, cut costs and add business value by aligning IT and investments that will contribute to business results.</li>
</ul>
<ul>
<li><strong>Virtualization</strong>: Virtualizing your data center is the first step towards <a href="http://www.emc.com/collateral/emc-perspective/h8539-cloud-optimiz-ep.pdf" target="_blank">achieving the flexibility and cost benefits of cloud computing</a>. To realize the full benefits of virtualization, IT organizations must have an integrated virtual data center strategy that has integrated management of key operational processes and procedures, a flexible virtualized architecture, and support for mission critical applications.</li>
</ul>
<p>&nbsp;</p>
<p><strong><span style="text-decoration: underline;">Stage 2: From Consolidation to Shared Infrastructure</span></strong></p>
<p>After consolidation, the next stage is to optimize and maximize the ROI and utilization of IT assets and resources. This can be accomplished via three methods as shown in Figure 3.</p>
<p style="text-align: center;"><a href="http://infocus.emc.com/wp-content/uploads/2012/05/Presentation11.gif" target="_blank"><img class="aligncenter  wp-image-5644" title="Presentation1" src="http://infocus.emc.com/wp-content/uploads/2012/05/Presentation11.gif" alt="" width="450" height="300" /></a></p>
<p style="text-align: center;" align="center"><em>Figure 3: Stage 2 &#8211; From Consolidated to shared Infrastructure</em></p>
<ul>
<li><strong>Tier Infrastructure</strong>: Defining service levels and tiering the infrastructure according to the service levels provide businesses freedom of choice – to choose and pay for the quality, service level, performance and capacity they require. Virtualization allows IT to create Compute, Storage and Network resource pools which can be aligned to service levels by abstracting resources from the appropriate infrastructure tiers.</li>
</ul>
<ul>
<li><strong>Optimize Resources</strong>: Increase resource utilization and efficiencies through modern technologies such as automated storage tiering, capacity management, thin provisioning, deduplication and IO virtualization which help to reduce per unit costs by doing more with less.</li>
</ul>
<ul>
<li><strong>Integrate and Automate Management</strong>: Management of a dynamic and fluid infrastructure requires proactive, real-time and “intelligent” management across the multi-lT layers (e.g application, compute, storage, network) and across physical, virtual and legacy platforms (e.g. mainframe, midrange, physical server, virtual server). Management tools with build-in intelligence and analytics helps to cut through the complexity of the next generation infrastructure to reduce time to resolve and improve service availability and performance.</li>
</ul>
<p>&nbsp;</p>
<p><strong><span style="text-decoration: underline;">Stage 3: From Shared Infrastructure to IT-as-a-Service</span></strong></p>
<p>The last stage of the Infrastructure Transformation is to take advantage of cloud computing and evolve towards IT-as-a-Service ((ITaaS). In particular, <a href="http://www.networkworld.com/supp/2012/enterprise2/040912-ecs-iaas-257610.html" target="_blank">IT offering Infrastructure-as-a-Service</a> (IaaS). The IaaS service catalog may include services for compute (or server), storage, backup, disaster recovery, data center and network. As shown in figure 4.</p>
<p style="text-align: center;"><a href="http://infocus.emc.com/wp-content/uploads/2012/05/Presentation12.gif" target="_blank"><img class="aligncenter  wp-image-5646" title="Presentation1" src="http://infocus.emc.com/wp-content/uploads/2012/05/Presentation12.gif" alt="" width="450" height="300" /></a></p>
<p style="text-align: center;" align="center"><em>Figure 4: Stage 3 – From Shared to Cloud and IT-as-a-Service</em></p>
<ul>
<li><em></em><strong>Embrace Cloud Computing</strong>: Enterprises need to review existing IT portfolio and sourcing strategies and take advantage of cloud computing where it makes business sense. Cloud computing allows IT Organizations to build and own their own applications and infrastructure; or to procure these in the form of services from Cloud service providers. By understanding your core competencies and businesses, develop and implement a cloud strategy that optimizes the enterprise’s IT portfolio with a strategic-mix of internal and external vendors and service providers.</li>
</ul>
<ul>
<li><strong>Operate An IT-as-a-Service Model</strong>: IT organizations cannot effectively operate a cloud environment and deliver IT services if it continues to operate as the traditional infrastructure or technology provider. Using the Fast-Food Restaurant as an analogy:</li>
</ul>
<ul>
<li>A customer orders a meal from a menu with clearly indicated pricing (service catalog)</li>
<li>Customer may also go to an internet portal for ordering home delivery (self service portal)</li>
<li>The order is taken and delivered within expected time duration and with consistent quality and service (orchestration, processes and roles and responsibilities)</li>
<li>The customer is focused on the end-result – the meal of his choice is delivered within expected time duration and quality, he pays only for what he ordered and is served with a smile.</li>
</ul>
<p>&nbsp;</p>
<p><strong><span style="text-decoration: underline;">Summary</span></strong></p>
<p>Hopefully this blog has provided you some high level ideas for transforming your infrastructure. More details can be obtained from the EMC Cloud website <a href="http://www.emc.com/microsites/cloud/cloud.htm ">here. </a></p>
]]></content:encoded>
			<wfw:commentRss>http://infocus.emc.com/choong_kengleong/transforming-your-it-infrastructure/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Key to Private Cloud Is Removing IT Stratification</title>
		<link>http://infocus.emc.com/jp_morgenthal/the-key-to-private-cloud-is-removing-it-stratification/</link>
		<comments>http://infocus.emc.com/jp_morgenthal/the-key-to-private-cloud-is-removing-it-stratification/#comments</comments>
		<pubDate>Tue, 08 May 2012 17:24:33 +0000</pubDate>
		<dc:creator>JP Morgenthal</dc:creator>
				<category><![CDATA[Cloud]]></category>
		<category><![CDATA[IT Transformation]]></category>
		<category><![CDATA[IT transformation]]></category>
		<category><![CDATA[IT-as-a-Service]]></category>
		<category><![CDATA[ITaaS]]></category>

		<guid isPermaLink="false">http://infocus.emc.com/?p=5627</guid>
		<description><![CDATA[One of the leading problems plaguing IT organizations is the high costs of operations and maintenance.  The industry average is roughly 70% with some organizations going as high as 90%. Picking apart these costs one often finds a stratified organization focused on narrow bands of computing with little crossover between the bands. Moreover, the weighting [...]]]></description>
			<content:encoded><![CDATA[<p>One of the leading problems plaguing IT organizations is the high costs of operations and maintenance.  The industry average is roughly 70% with some organizations going as high as 90%. Picking apart these costs one often finds a stratified organization focused on narrow bands of computing with little crossover between the bands. Moreover, the weighting of political density between layers often makes it too risky for basic collaboration between the stratified layers. Hence, when problems arise, each layer attempts to solve the problems only with the tools at their disposal. The result is the Operation Petticoat wired together with chewing gum and bras that we call IT.</p>
<blockquote><p><strong>JP’s IT Axiom #124</strong>: Design flaws at the top of the stack will highlight limitations at the bottom of the stack. Likewise, the design at the bottom of the stack impacts performance at the top of the stack.</p></blockquote>
<p>There’s no escaping the fact that a poorly-designed application will put undue burden on the operating infrastructure. A “chatty” application impacts bandwidth. Improperly designed database queries will consume memory and disk capacity. Poorly-designed storage architecture will limit the amount of I/O per second (IOPS) and, thusly, limit the speed of retrieval of data to the application. IT transformation is about moving from a stratified organization to an agile organization through the use of DevOps culture and other collaborative techniques.</p>
<p><em>Read more and comment</em>:  JP&#8217;s <a href="http://www.jpmorgenthal.com/morgenthal/?p=345">The Tech Evangelist Blog</a></p>
]]></content:encoded>
			<wfw:commentRss>http://infocus.emc.com/jp_morgenthal/the-key-to-private-cloud-is-removing-it-stratification/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ITaaS Assessment</title>
		<link>http://infocus.emc.com/wayne_pauley/itaas-assessment/</link>
		<comments>http://infocus.emc.com/wayne_pauley/itaas-assessment/#comments</comments>
		<pubDate>Tue, 08 May 2012 16:39:03 +0000</pubDate>
		<dc:creator>Wayne Pauley</dc:creator>
				<category><![CDATA[Cloud]]></category>
		<category><![CDATA[IT Transformation]]></category>
		<category><![CDATA[Virtualization]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[Cloud Architect]]></category>
		<category><![CDATA[IT]]></category>
		<category><![CDATA[IT transformation]]></category>
		<category><![CDATA[IT-as-a-Service]]></category>
		<category><![CDATA[ITaaS]]></category>
		<category><![CDATA[Service assessments]]></category>

		<guid isPermaLink="false">http://infocus.emc.com/?p=5605</guid>
		<description><![CDATA[In my last post, I discussed why cloud architects need to be business savvy when transforming IT to ITaaS.  Part of that discussion was how to get the information needed to assess the organization and the technology.  In this post I will dig a little deeper into that assessment process, starting with the working definition [...]]]></description>
			<content:encoded><![CDATA[<p>In my last post, I discussed <a href="http://infocus.emc.com/wayne_pauley/why-do-cloud-architects-need-to-be-business-savvy-for-itaas/">why cloud architects need to be business savvy</a> when <a href="http://www.emc.com/campaign/global/cloud/emc-cloud-services.htm">transforming IT to ITaaS</a>.  Part of that discussion was how to get the information needed to assess the organization and the technology.  In this post I will dig a little deeper into that assessment process, starting with the working definition we use in the <a href="http://education.emc.com/guest/certification/framework/ca/itasaservice.aspx">ITaaS class</a>.  While the definition of the assessment may be basic, the outcomes are almost always going to be huge.  How many of you have executed an organizational assessment?  Do you even know what the company’s governance model is, and how it is maintained and enforced?  What about the economic shift?  While everyone talks about the shift from capex to opex, is it really that simple?</p>
<p><a href="http://infocus.emc.com/wayne_pauley/itaas-assessment/wpauleywithorange/" rel="attachment wp-att-5617"><img class="alignnone  wp-image-5617" title="wpauleywithorange" src="http://infocus.emc.com/wp-content/uploads/2012/05/wpauleywithorange.gif" alt="" width="450" height="309" /></a></p>
<p>&nbsp;</p>
<p>The last bullet on this diagram is key: IT must become a service provider.  This is one of the most important parts of the transformation process.  Internal business units have a choice of providers, and all they have to do is pull out their credit card to use some of them.  So if their IT organization becomes a Service Provider and can not only compete with offers from external providers, but can also provide ITaaS as a trusted ‘broker’, it would be a win-win for the business.</p>
<p>Below is a list of some of the most critical areas we believe must be included in the assessment:</p>
<p> <a href="http://infocus.emc.com/wayne_pauley/itaas-assessment/wpauley2/" rel="attachment wp-att-5608"><img class="alignnone  wp-image-5608" title="wpauley2" src="http://infocus.emc.com/wp-content/uploads/2012/05/wpauley2.gif" alt="" width="444" height="361" /></a></p>
<p>At the top are the Enterprise-level assessments that must be completed first, because to be successful, the transformation process must be completed top down.  By assessing the overall organization, the business can evaluate how it is designed from an ownership and accountability (governance) perspective, as well as how it measures success.  The traditional approach of meeting or exceeding deadlines is no longer enough.  Businesses today must be agile and responsive to changes in the market and customer demand, which requires measurement to be shifted to ‘time-to-value’ (TTV).  Purchasing a large software system and spending 3-5 years on implementation is too long a TTV.  By transforming IT to be a broker/provider, IT must make a competitive offer.  The lines of business need a new way of interacting with IT, including accepting a shift to a chargeback model (I’ll cover chargeback more in a future post).</p>
<p>The next section of the slide includes more organizational and governance elements that apply specifically to IT.  EMC’s Global Marketing CTO, Chuck Hollis, posted a couple of great posts along this line (<a href="http://chucksblog.emc.com/chucks_blog/2012/04/in-praise-of-the-business-process-owner.html">In praise of the business process owner</a> and <a href="http://chucksblog.emc.com/chucks_blog/2012/03/good-help-is-here-for-your-it-transformation.html">Good help is here for your IT transformation</a>)  The point being that we have to change the way IT is designed at the technical level (cross-domain expertise) <em>and</em> at the structural level (new functions such as services, sales and marketing, billing help desk, and service provider liaison).</p>
<p>The next section of the graphic covers the GRC and Security assessments.  Does IT have updated information flow diagrams?  Has it developed a taxonomy for what information assets can be placed with a given service provider and under what conditions?  For example, it is fairly common practice to prohibit intellectual property to flow outside the firewall.  But what if IT wants to use SP resources to run a new version of the company’s code, instead of acquiring systems that would be left idle much of the time?  This would require a new governance model, with new security controls, to ensure the IP is adequately protected.</p>
<p>The next section covers the services and Services levels (SLAs).  If IT is going to provide services and be competitive, it needs a self-service portal.  Does the company have a service lifecycle from creation to end-of-life?  Does the company have a service catalog?  A service orchestration engine?  These are just a few of the issues we consider in the ITaaS class that must be part of the assessment process.  SLAs are getting fiercely competitive—can IT provide SLAs that are the right cost with the right metric?  If IT is typically required to deliver Five 9s, but the business only needs and wants to pay for Three 9s, does IT know how to do that?</p>
<p>Next is the applications assessment, which is the most important and most difficult assessment to make.  You’ve heard of Server Huggers but they are nothing compared to App Huggers!  When it ran the applications assessment, EMC’s own IT organization found that it had more than 600 applications, which were eventually reduced to under 400.  The process for using COTS (commercial off the shelf) software also had to be adapted.  Most companies find the notion of changing business processes to be repulsive, so when a COTS package is acquired, the first thing that happens is the stakeholders with the most say start driving for customizations, which is, of course, antithetical to ITaaS.  In order for IT to be competitive with SaaS offerings, there must be an even playing field—the business must choose the provider who can best meet the core objectives of Cost, Functionality, SLA, and Trust as a package deal.  It is important to note, however, that customers often have different weighted values for these dimensions of service based on their business needs.</p>
<p>Last up is the technology assessment.  Some of this is quite simple and it is immediately clear if the necessary pieces are in place to stand up and operate a service-based infrastructure (IaaS), development (PaaS), and application (SaaS) environment.  Key investments start with a “virtualize everything” and “standardize the platforms” basis.  It may also include service catalogs, orchestration engines, federated access and identity, federated directory services, etc.  Some of this may be very hard to implement – for example is your network essentially flat today?  If so, how do you federate with a public provider without exposing unwanted elements to the SP network?  What happens when the SP wants to perform an update to its technology that IT has federated to key applications internally?  Have the systems become agnostic to upgrades or at least do they appear to be agnostic to the lines of business?</p>
<p>So as you can see, the assessment process requires an enormous amount of data gathering.  The architect must wear a lot of hats or at least be fluent enough in several of the new domains to drive the process in the right direction and eventually to the decision-making point.  Once the assessment is complete it is still necessary to evaluate the prioritization and economics of what to do first, and determine the justification and next phases of planning and design.  I will cover those topics soon in a future post.</p>
]]></content:encoded>
			<wfw:commentRss>http://infocus.emc.com/wayne_pauley/itaas-assessment/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Understanding the Role of Hadoop In Your BI Environment</title>
		<link>http://infocus.emc.com/william_schmarzo/understanding-the-role-of-hadoop-in-your-bi-environment/</link>
		<comments>http://infocus.emc.com/william_schmarzo/understanding-the-role-of-hadoop-in-your-bi-environment/#comments</comments>
		<pubDate>Sun, 06 May 2012 16:45:18 +0000</pubDate>
		<dc:creator>Bill Schmarzo</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Intelligence / Data Warehousing]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Data Staging]]></category>
		<category><![CDATA[data warehousing]]></category>

		<guid isPermaLink="false">http://infocus.emc.com/?p=5558</guid>
		<description><![CDATA[In my discussions about the assault of big data technologies on the traditional Business Intelligence (BI) and data warehouse (DW) markets, sometimes it is hard to identify specific use cases where a big data technology – like Hadoop – can help your BI environment.  Many of these big data use cases are really still being [...]]]></description>
			<content:encoded><![CDATA[<p>In my discussions about the assault of big data technologies on the traditional Business Intelligence (BI) and data warehouse (DW) markets, sometimes it is hard to identify specific use cases where a big data technology – like Hadoop – can help your BI environment.  Many of these big data use cases are really still being developed, but here is one that I’m starting to see more and more – using Hadoop to replace or augment your existing ETL, data staging, and operational data store (ODS) architecture.  And as is typical when I get into some technically complex topics, I want to thank Dr. Pedro DeSouza for his help and insights in pulling together this blog.</p>
<h1><strong>Stage 1:  Hadoop for Data Staging and ODS</strong></h1>
<p>I am seeing several client engagements where the BI/DW team is looking to use Hadoop to simplify, accelerate, and enhance their existing ETL and data staging processes.  Hadoop brings at least two significant advantages to your ETL and data staging processes.  The first is the ability to ingest massive amounts of data as-is. That means that you do not need to pre-define the data schema before loading data into Hadoop. This includes both traditional transactional data (e.g., point-of-sale transactions, call detail records, general ledger transactions, call center transactions), but also unstructured internal data (like consumer comments, doctor’s notes, insurance claims descriptions, and web logs) and external social media data (from social media sites such LinkedIn, Pinterest, Facebook and Twitter).  So regardless of the structure of your incoming data, you can rapidly load it all into Hadoop, as-is, where it then becomes available for your downstream ETL, DW, and analytic processes (see Stage 1 in Figure 1 below).</p>
<p><img class="aligncenter size-full wp-image-5582" title="Untitled" src="http://infocus.emc.com/wp-content/uploads/2012/05/Untitled1.jpg" alt="" width="435" height="183" /></p>
<p style="text-align: center;" align="center"><em>Figure 1:  Hadoop as ODS/Data Stage</em></p>
<p>The second advantage that Hadoop brings to your BI/DW architecture occurs once the data is in the Hadoop environment.  Once it’s in your Hadoop ODS, you can leverage the inherently parallel nature of Hadoop to perform your traditional ETL work of cleansing, normalizing, aligning, and creating aggregates for your EDW at massive scale.</p>
<p>Many ETL vendors, like Pentaho, Talend, and Datameer are modifying their products to seamlessly create MapReduce parallel ETL jobs. These vendors provide drag-and-drop UIs for generating ETL MapReduce jobs, removing a great deal of complexity from the data integrator/developer.</p>
<p>Once the raw data is in the Hadoop environment, developers can do data transformations and enrichments that were not easy to do before including:</p>
<ul>
<li>Parse complex, unstructured data feeds (like consumer comments, web logs, and twitter feeds) to capture key data and metrics of importance (e.g., visitor id, session id, site id, display ad id, display ad location) that can then be integrated with the existing structured data in your EDW.  For example, imagine the business possibilities if you were able to mine the wealth of social media data on your customers to identify their interests, passions, associations, and affiliations, and then integrate those new customer insights with the existing customer data that is meticulously maintained in your CRM system.</li>
<li>Create advanced composite metrics that require you to process many days, weeks, or even months of history.  I covered some of these composite metrics in a previous blog, but creating metrics around frequency, recency, and sequencing is possible in a Hadoop environment that is not easily done within your traditional ETL environment.  These composite metrics facilitate more detailed and complex analysis yielding key performance indicators that might be better indicators and/or predictors of performance.</li>
</ul>
<h1><strong>Stage 2:  Hadoop for Feeding Your DW</strong></h1>
<p>After doing all of this parsing of unstructured data for new customer and business metrics and data enrichment to create new composite metrics, the output of the Hadoop ODS can then feed your standard EDW.  The Hadoop ODS has the advantage of being able to create structure out of unstructured data, which can then be integrated with your existing structured, transactional data in the EDW (see Stage 2 in Figure 2).</p>
<p style="text-align: center;" align="center"><img class="aligncenter size-full wp-image-5598" title="stage2" src="http://infocus.emc.com/wp-content/uploads/2012/05/stage21.png" alt="" width="437" height="160" /></p>
<p style="text-align: center;" align="center"><em>Figure 2: Hadoop For Feeding Your DW</em></p>
<p>And because of the massive scalability of Hadoop and the ability to ingest massive amounts of data quickly, you can significantly accelerate your traditional ETL processes and more easily fulfill your EDW SLA’s.  Not only can you shrink the latency between when the data transaction or event occurs and when it’s available in your EDW, but you can also provide more granular, detailed data in your EDW, especially if your EDW is based on an MPP architecture (which reduces the need for indices, aggregate tables and materialized views, thereby saving more even more EDW loading time and management effort).</p>
<h1><strong>Stage 3:  Hadoop for Feeding Your Analytic Sandbox</strong></h1>
<p>The third stage of Hadoop as your ODS affects the self-provisioning and rapid and frequent iterations typically required from your analytics sandbox environment. In this environment, the data scientists can grab what data they need out of the Hadoop ODS without worrying about impacting the EDW environment.  They can select whatever level of granularity they need from whichever data sources they need in order to build, test, refine, and publish their analytic results (see Stage 3 in Figure 3).</p>
<p><img class="aligncenter size-full wp-image-5597" title="stage3" src="http://infocus.emc.com/wp-content/uploads/2012/05/stage31.png" alt="" width="437" height="167" /></p>
<p style="text-align: center;" align="center"><em>Figure 3: Hadoop For Feeding Your Analytic Sandbox</em></p>
<p>With the Hadoop ODS, the data scientist have the ability to store and access data they “might” need from an analytics perspective; data that may never find its way into the EDW.  For example, the Data Scientists might want to store large amounts of social media or web log data, or large varieties of widely available third-party data (from places like data.gov) to enhance analytic modeling.  The Data Scientists would grab this data out of the Hadoop ODS when and as they needed it, depending upon their current analytic needs.</p>
<h1><strong>Business Example of the Power of Hadoop as an ODS</strong></h1>
<p>Let’s walk through an example as to how this Hadoop ODS environment can support the types of on-demand analytics that the EDW is so poorly suited to support.</p>
<p>Recently there were floods in Thailand that had major impact on the supply chains of many manufacturers.  In one case, the CFO asked their BI team to come up with an estimate of the impact on the end of quarter Earnings Per Share (EPS).  To make matters worse, the BI team only had 3 weeks to complete this assessment.  This is the perfect type of analytics request for an analytic sandbox because:</p>
<ul>
<li>The analytics team is going to need to get access to lots of different data sources rapidly – both detailed structured data as well as some unstructured web log and media feed data.  The data requests could change rapidly (e.g., adding third-party market data, local economic, and weather data) as the analysts proceed through the analysis process.</li>
<li>Much of the data being used for this analysis will never find its way into the DW, but needs to be housed in an environment that stores the data as-is and can be rapidly accessed and integrated into the analysts processes just in case they need it.</li>
<li>They need to come up with an estimate – a range of probabilities – and not a precise answer.</li>
<li>This is likely a one-off request, so it’s not critical to build a repeatable process for this analytic request.</li>
</ul>
<p>In this example, the analytics team asked for not only their own supply chain data, but they also asked for the supply chain data for their top 25 suppliers (which represented about 90% of their components, and 100% of their Tier 1 components).  They also pulled in customer, inventory, orders, sales pipeline and manufacturing data from their internal systems, as well as the manufacturing bill of materials from the internal manufacturing system.  This data was then augmented with third-party data on weather forecasts, traffic and road conditions, and local support and recovery information.  All of this data could be stored in the Hadoop ODS waiting for the time when the Data Scientist team would or might need that data.</p>
<p>The analytics team was able to use this data to create models that answered the following questions:</p>
<ul>
<li>Are their forecasted weather problems or local recovery and traffic issues that will exacerbate the supply chain problem?</li>
<li>Which of our products are most impacted by the shortfalls (note: this required looking into their suppliers’ supply chains to identify common subcomponent suppliers across their entire supplier network)?</li>
<li>Which of our top customers are most impacted by the component shortages?</li>
<li>What are the top sales opportunities in the sales pipeline based upon margin impact?</li>
<li>How best do we allocate our existing products to our most important customers and deals?</li>
<li>What is the likely impact on Earnings Per Share with a 95% confidence level?</li>
</ul>
<p>The result was that the analytics team was able to not only give the CFO the estimated EPS impact (at a 95% confidence level range) but was also able to provide insights into how to optimize existing inventory allocation to minimize the EPS impact.</p>
<p>By the way, once this analysis was done, the analytics team packaged up the analysis and archived it.  While they were unlikely to be doing this type of analysis again anytime soon, they needed to archive the analysis for compliance and possibly legal reasons.</p>
<h1><strong>Summary</strong></h1>
<p>As you can see, using Hadoop as your ODS and data staging area can deliver new, compelling business benefits (and not to mention that it’s probably significantly cheaper and more agile then your existing ETL environment).  And I expect that we’re going to start seeing more use cases for Hadoop within our BI and DW environments.  The cost benefits, the processing power, the access to lower-latency data, and the overall simpler architecture are just too compelling to ignore.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://infocus.emc.com/william_schmarzo/understanding-the-role-of-hadoop-in-your-bi-environment/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

