Ffi Rs.indd 01:56:13:PM 03/28/2014 Page Iv

Transcription

ffirs.indd 01:56:13:PM 03/28/2014Page iv

Applied PredictiveAnalyticsPrinciples and Techniques for theProfessional Data AnalystDean Abbottffirs.indd01:56:13:PM 03/28/2014Page i

Applied Predictive Analytics: Principles and Techniques for the Professional Data AnalystPublished byJohn Wiley & Sons, Inc.10475 Crosspoint BoulevardIndianapolis, IN 46256www.wiley.comCopyright 2014 by John Wiley & Sons, Inc., Indianapolis, IndianaPublished simultaneously in CanadaISBN: 978-1-118-72796-6ISBN: 978-1-118-72793-5 (ebk)ISBN: 978-1-118-72769-0 (ebk)Manufactured in the United States of America10 9 8 7 6 5 4 3 2 1No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by any means,electronic, mechanical, photocopying, recording, scanning or otherwise, except as permitted under Sections 107 or108 of the 1976 United States Copyright Act, without either the prior written permission of the Publisher, or authorization through payment of the appropriate per-copy fee to the Copyright Clearance Center, 222 Rosewood Drive,Danvers, MA 01923, (978) 750-8400, fax (978) 646-8600. Requests to the Publisher for permission should be addressedto the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, (201) 748-6011, fax (201)748-6008, or online at http://www.wiley.com/go/permissions.Limit of Liability/Disclaimer of Warranty: The publisher and the author make no representations or warranties withrespect to the accuracy or completeness of the contents of this work and specifically disclaim all warranties, includingwithout limitation warranties of fitness for a particular purpose. No warranty may be created or extended by sales orpromotional materials. The advice and strategies contained herein may not be suitable for every situation. This workis sold with the understanding that the publisher is not engaged in rendering legal, accounting, or other professionalservices. If professional assistance is required, the services of a competent professional person should be sought.Neither the publisher nor the author shall be liable for damages arising herefrom. The fact that an organization orWeb site is referred to in this work as a citation and/or a potential source of further information does not mean thatthe author or the publisher endorses the information the organization or website may provide or recommendationsit may make. Further, readers should be aware that Internet websites listed in this work may have changed or disappeared between when this work was written and when it is read.For general information on our other products and services please contact our Customer Care Department within theUnited States at (877) 762-2974, outside the United States at (317) 572-3993 or fax (317) 572-4002.Wiley publishes in a variety of print and electronic formats and by print-on-demand. Some material included withstandard print versions of this book may not be included in e-books or in print-on-demand. If this book refers tomedia such as a CD or DVD that is not included in the version you purchased, you may download this material athttp://booksupport.wiley.com. For more information about Wiley products, visit www.wiley.com.Library of Congress Control Number: 2013958302Trademarks: Wiley and the Wiley logo are trademarks or registered trademarks of John Wiley & Sons, Inc. and/orits affiliates, in the United States and other countries, and may not be used without written permission. [Insert thirdparty trademark information] All other trademarks are the property of their respective owners. John Wiley & Sons,Inc. is not associated with any product or vendor mentioned in this book.ffirs.indd01:56:13:PM 03/28/2014Page ii

To Barbaraffirs.indd01:56:13:PM 03/28/2014Page iii

ffirs.indd 01:56:13:PM 03/28/2014Page iv

About the AuthorDean Abbott is President of Abbott Analytics, Inc. in San Diego, California.Dean is an internationally recognized data-mining and predictive analyticsexpert with over two decades of experience applying advanced modeling anddata preparation techniques to a wide variety of real-world problems. He isalso Chief Data Scientist of SmarterRemarketer, a startup company focusingon data-driven behavior segmentation and attribution.Dean has been recognized as a top-ten data scientist and one of the top tenmost influential people in data analytics. His blog has been recognized as oneof the top-ten predictive analytics blogs to follow.He is a regular speaker at Predictive Analytics World and other analyticsconferences. He is on the advisory board for the University of California IrvineCertificate Program for predictive analytics and the University of CaliforniaSan Diego Certificate Program for data mining, and is a regular instructor forcourses on predictive modeling algorithms, model deployment, and text mining. He has also served several times on the program committee for the KDDConference Industrial Track.vffirs.indd01:56:13:PM 03/28/2014Page v

ffirs.indd01:56:13:PM 03/28/2014Page vi

About the Technical EditorWilliam J. Komp has a Ph.D. from the University of Wisconsin–Milwaukee,with a specialization in the fields of general relativity and cosmology. He hasbeen a professor of physics at the University of Louisville and Western KentuckyUniversity. Currently, he is a research scientist at Humana, Inc., working in theareas of predictive analytics and data mining.viiffirs.indd01:56:13:PM 03/28/2014Page vii

ffirs.indd01:56:13:PM 03/28/2014Page viii

CreditsExecutive EditorRobert ElliottBusiness ManagerAmy KniesProject EditorAdaobi Obi TultonVice President and ExecutiveGroup PublisherRichard SwadleyTechnical EditorWilliam J. KompSenior Production EditorKathleen WisorCopy EditorNancy RapoportManager of Content Developmentand AssemblyMary Beth WakefieldDirector of Community MarketingDavid MayhewAssociate PublisherJim MinatelProject Coordinator, CoverTodd KlemmeProofreaderNancy CarrascoIndexerJohnna VanHoose DinseCover DesignerRyan SneedMarketing ManagerAshley Zurcherixffirs.indd01:56:13:PM 03/28/2014Page ix

ffirs.indd01:56:13:PM 03/28/2014Page x

AcknowledgmentsThe idea for this book began with a phone call from editor Bob Elliott, who presented the idea of writing a different kind of predictive analytics book gearedtoward business professionals. My passion for more than a decade has beento teach principles of data mining and predictive analytics to business professionals, translating the lingo of mathematics and statistics into a language thepractitioner can understand. The questions of hundreds of course and workshop attendees forced me to think about why we do what we do in predictiveanalytics. I also thank Bob for not only persuading me that I could write thebook while continuing to be a consultant, but also for his advice on the scopeand depth of topics to cover.I thank my father for encouraging me in analytics. I remember him teachingme how to compute batting average and earned run average when I was eightyears old so I could compute my Little League statistics. He brought home reamsof accounting pads, which I used for playing thousands of Strat-O-Matic baseballgames, just so that I could compute everyone’s batting average and earned runaverage, and see if there were significant differences between what I observedand what the players’ actual statistics were. My parents put up with a lot ofpaper strewn across the floor for many years.I would never have been in this field were it not for Roger Barron, my firstboss at Barron Associates, Inc., a pioneer in statistical learning methods anda man who taught me to be curious, thorough, and persistent about data analysis.His ability to envision solutions without knowing exactly how they would besolved is something I reflect on often.I’ve learned much over the past 27 years from John Elder, a friend and colleague, since our time at Barron Associates, Inc. and continuing to this day.I am very grateful to Eric Siegel for inviting me to speak regularly at Predictivexiffirs.indd01:56:13:PM 03/28/2014Page xi

xiiAcknowledgmentsAnalytics World in sessions and workshops, and for his advice and encouragement in the writing of this book.A very special thanks goes to editors Adaobi Obi Tulton and Nancy Rapoportfor making sense of what of I was trying to communicate and making this bookmore concise and clearer than I was able to do alone. Obviously, I was a mathematics major and not an English major. I am especially grateful for technicaleditor William Komp, whose insightful comments throughout the book helpedme to sharpen points I was making.Several software packages were used in the analyses contained in the book,including KNIME, IBM SPSS Modeler, JMP, Statistica, Predixion, and Orange.I thank all of these vendors for creating software that is easy to use. I also wantto thank all of the other vendors I’ve worked with over the past two decades,who have supplied software for me to use in teaching and research.On a personal note, this book project could not have taken place without thesupport of my wife, Barbara, who encouraged me throughout the project, evenas it wore on and seemed to never end. She put up with me as I spent countlessnights and Saturdays writing, and cheered with each chapter that was submitted. My two children still at home were very patient with a father who wasn’tas available as usual for nearly a year.As a Christian, my worldview is shaped by God and how He reveals Himselfin the Bible, through his Son, and by his Holy Spirit. When I look back at thecircumstances and people He put into my life, I only marvel at His handof providence, and am thankful that I’ve been able to enjoy this field for myentire career.ffirs.indd01:56:13:PM 03/28/2014Page xii

ContentsIntroductionChapter 1xxiOverview of Predictive AnalyticsWhat Is Analytics?What Is Predictive Analytics?133Supervised vs. Unsupervised LearningParametric vs. Non-Parametric ModelsBusiness IntelligencePredictive Analytics vs. Business Intelligence68Do Predictive Models Just State the Obvious?Similarities between Business Intelligenceand Predictive Analytics9Predictive Analytics vs. Statistics910Statistics and AnalyticsPredictive Analytics and Statistics ContrastedPredictive Analytics vs. Data MiningWho Uses Predictive Analytics?Challenges in Using Predictive AnalyticsObstacles in ManagementObstacles with DataObstacles with ModelingObstacles in DeploymentChapter 256111213131414141516What Educational Background Is Needed to Become aPredictive Modeler?16Setting Up the ProblemPredictive Analytics Processing Steps: CRISP-DMBusiness Understanding191921The Three-Legged StoolBusiness Objectives2223xiiiftoc.indd 01:52:15:PM 03/28/2014Page xiii

xivContentsDefining Data for Predictive Modeling25Defining the Columns as MeasuresDefining the Unit of AnalysisWhich Unit of Analysis?262728Defining the Target VariableTemporal Considerations for Target VariableDefining Measures of Success for Predictive ModelsSuccess Criteria for ClassificationSuccess Criteria for EstimationOther Customized Success CriteriaDoing Predictive Modeling Out of OrderBuilding Models FirstEarly Model DeploymentCase Study: Recovering Lapsed DonorsOverviewBusiness ObjectivesData for the CompetitionThe Target VariablesModeling ObjectivesModel Selection and Evaluation CriteriaModel DeploymentCase Study: Fraud DetectionOverviewBusiness ObjectivesData for the ProjectThe Target VariablesModeling ObjectivesModel Selection and Evaluation CriteriaModel DeploymentChapter 1Summary42Data UnderstandingWhat the Data Looks LikeSingle Variable Summaries434444MeanStandard DeviationThe Normal DistributionUniform DistributionApplying Simple Statistics in Data UnderstandingSkewnessKurtosisRank-Ordered StatisticsCategorical Variable AssessmentData Visualization in One DimensionHistogramsMultiple Variable Summariesftoc.indd 01:52:15:PM 03/28/2014Page xiv454545464749515255585964

ContentsHidden Value in Variable Interactions: Simpson’s ParadoxThe Combinatorial Explosion of InteractionsCorrelationsSpurious CorrelationsBack to CorrelationsCrosstabsChapter 4Data Visualization, Two or Higher Dimensions69ScatterplotsAnscombe’s QuartetScatterplot MatricesOverlaying the Target Variable in SummaryScatterplots in More Than Two Dimensions6971757678The Value of Statistical SignificancePulling It All Together into a Data AuditSummary808182Data PreparationVariable Cleaning8384Incorrect ValuesConsistency in Data FormatsOutliersMultidimensional OutliersMissing ValuesFixing Missing Data848585899091Feature Creation98Simple Variable TransformationsFixing SkewBinning Continuous VariablesNumeric Variable ScalingNominal Variable TransformationOrdinal Variable TransformationsDate and Time Variable FeaturesZIP Code FeaturesWhich Version of a Variable Is Best?Multidimensional FeaturesVariable Selection Prior to ModelingSamplingExample: Why Normalization Matters for K-Means ClusteringChapter Summary143Itemsets and Association RulesTerminology145146ConditionLeft-Hand-Side, Antecedent(s)Right-Hand-Side, Consequent, Output, ConclusionRule (Item Set)ftoc.indd 01:52:15:PM 03/28/2014147148148148Page xvxv

xviContentsSupportAntecedent SupportConfidence, AccuracyLiftParameter SettingsHow the Data Is OrganizedStandard Predictive Modeling Data FormatTransactional FormatVariable SelectionInteraction Variable Creation157157Problems with Association Rules158158158159Building Classification Rules from Association RulesSummary159161Descriptive ModelingData Preparation Issues with Descriptive ModelingPrincipal Component Analysis163164165Clustering AlgorithmsThe K-Means AlgorithmData Preparation for K-MeansSelecting the Number of ClustersThe Kohonen SOM AlgorithmVisualizing Kohonen MapsSimilarities with 197Interpreting Descriptive ModelsStandard Cluster Model Interpretation199199Problems with Interpretation MethodsIdentifying Key Variables in Forming Cluster ModelsCluster PrototypesCluster OutliersChapter 8151152154156The PCA AlgorithmApplying PCA to New DataPCA for Data InterpretationAdditional Considerations before Using PCAThe Effect of Variable Magnitude on PCA ModelsChapter 7151151Measures of Interesting RulesDeploying Association RulesRedundant RulesToo Many RulesToo Few RulesChapter 6149149150150202203209210Summary212Predictive ModelingDecision Trees213214The Decision Tree LandscapeBuilding Decision Treesftoc.indd 01:52:15:PM 03/28/2014Page xvi215218

ContentsDecision Tree Splitting MetricsDecision Tree Knobs and OptionsReweighting Records: PriorsReweighting Records: Misclassification CostsOther Practical Considerations for Decision TreesLogistic Regression230Interpreting Logistic Regression ModelsOther Practical Considerations for Logistic RegressionNeural NetworksK-Nearest Neighbor242244247249251252253253254The k-NN Learning AlgorithmDistance Metrics for k-NNOther Practical Considerations for k-NNNaïve Bayes254258259264Bayes’ TheoremThe Naïve Bayes ClassifierInterpreting Naïve Bayes ClassifiersOther Practical Considerations for Naïve BayesRegression ModelsLinear Regression264268268269270271Linear Regression AssumptionsVariable Selection in Linear RegressionInterpreting Linear Regression ModelsUsing Linear Regression for Classification274276278279Other Regression AlgorithmsSummary280281Assessing Predictive ModelsBatch Approach to Model Assessment283284Percent Correct ClassificationRank-Ordered Approach to Model AssessmentChapter 10233235240Building Blocks: The NeuronNeural Network TrainingThe Flexibility of Neural NetworksNeural Network SettingsNeural Network PruningInterpreting Neural NetworksNeural Network Decision BoundariesOther Practical Considerations for Neural NetworksChapter 9221222224224229284293Assessing Regression ModelsSummary301304Model EnsemblesMotivation for Ensembles307307The Wisdom of CrowdsBias Variance Tradeoff308309Bagging311ftoc.indd 01:52:15:PM 03/28/2014Page xviixvii

xviiiContentsBoostingImprovements to Bagging and BoostingRandom ForestsStochastic Gradient BoostingHeterogeneous EnsemblesChapter 11323323326Text MiningMotivation for Text MiningA Predictive Modeling Approach to Text MiningStructured vs. Unstructured DataWhy Text Mining Is Hard327328329329330Data Preparation StepsPOS TaggingTokensStop Word and Punctuation FiltersCharacter Length and Number FiltersStemmingDictionariesThe Sentiment Polarity Movie Data SetText Mining FeaturesTerm FrequencyInverse Document FrequencyTF-IDFCosine SimilarityMulti-Word Features: N-GramsReducing Keyword FeaturesGrouping TermsModeling with Text Mining FeaturesRegular ExpressionsUses of Regular Expressions in Text 6346347347347349351Summary352Model DeploymentGeneral Deployment Considerations353354Deployment StepsChapter 13320321321Model Ensembles and Occam’s RazorInterpreting Model EnsemblesSummaryText Mining ApplicationsData Sources for Text MiningChapter 12316320355Summary375Case StudiesSurvey Analysis Case Study: Overview377377Business Understanding: Defining the ProblemData UnderstandingData PreparationModelingftoc.indd 01:52:15:PM 03/28/2014Page xviii378380381385

ContentsDeployment: “What-If” AnalysisRevisit ModelsDeploymentSummary and Conclusions391392401401Help Desk Case Study402Data Understanding: Defining the DataData PreparationModelingRevisit Business UnderstandingDeploymentSummary and ConclusionsIndex403403405407409411413ftoc.indd 01:52:15:PM 03/28/2014Page xixxix

IntroductionThe methods behind predictive analytics have a rich history, combining disciplines of mathematics, statistics, social science, and computer science. The label“predictive analytics” is relatively new, however, coming to the forefront only inthe past decade. It stands on the shoulders of other analytics-centric fields suchas data mining, machine learning, statistics, and pattern recognition.This book describes the predictive modeling process from the perspectiveof a practitioner rather than a theoretician. Predictive analytics is both scienceand art. The science is relatively easy to describe but to do the subject justicerequires considerable knowledge of mathematics. I don’t believe a good practitioner needs to understand the mathematics of the algorithms to be able to applythem successfully, any more than a graphic designer needs to understand themathematics of image correction algorithms to apply sharpening filters well.However, the better practitioners understand the effects of changing modelingparameters, the implications of the assumptions of algorithms in predictions,and the limitations of the algorithms, the more successful they will be, especiallyin the most challenging modeling projects.Science is covered in this book, but not in the same depth you will find inacademic treatments of the subject. Even though you won’t find sections describing how to decompose matrices while building linear regression models, thisbook does not treat algorithms as black boxes.The book describes what I would be telling you if you were looking over myshoulder while I was solving a problem, especially when surprises occur inthe data. How can algorithms fool us into thinking their performance is excellent when they are actually brittle? Why did I bin a variable rather than justtransform it numerically? Why did I use logistic regression instead of a neuralnetwork, or vice versa? Why did I build a linear regression model to predict axxiflast.indd01:56:26:PM 03/28/2014Page xxi

xxiiIntroductionbinary outcome? These are the kinds of questions, the art of predictive modeling, that this book addresses directly and indirectly.I don’t claim that the approaches described in this book represent the only wayto solve problems. There are many contingencies you may encounter where theapproaches described in this book are not appropriate. I can hear the commentsalready from other experienced statisticians or data miners asking, “but whatabout . . . ?” or “have you tried doing this . . . ?” I’ve found over the years thatthere are often many ways to solve problems successfully, and the approach youtake depends on many factors, including your personal style in solving problems and the desires of the client for how to solve the problems. However, evenmore often than this, I’ve found that the biggest gains in successful modelingcome more from understanding data than from applying a more sophisticatedalgorithm.How This Book Is OrganizedThe book is organized around the Cross-Industry Standard Process Model forData Mining (CRISP-DM). While the name uses the term “data mining,” thesteps are the same in predictive analytics or any project that includes the building of predictive or statistical models.This isn’t the only framework you can use, but it a well-documented framework that has been around for more than 15 years. CRISP-DM should not beused as a recipe with checkboxes, but rather as a guide during your predictivemodeling projects.The six phases or steps in CRISP-DM:1. Business Understanding2. Data Understanding3. Data Preparation4. Modeling5. Evaluation6. DeploymentAfter an introductory chapter, Business Understanding, Data Understanding,and Data Preparation receive a chapter each in Chapters 2, 3, and 4. The DataUnderstanding and Data Preparation chapters represent more than one-quarterof the pages of technical content of the book, with Data Preparation the largestsingle chapter. This is appropriate because preparing data for predictive modeling is nearly always considered the most time-consuming stage in a predictivemodeling project.flast.indd01:56:26:PM 03/28/2014Page xxii

IntroductionChapter 5 describes association rules, usually considered a modeling method,but one that differs enough from other descriptive and predictive modelingtechniques that it warrants a separate chapter.The modeling methods—descriptive, predictive, and model ensembles—aredescribed in Chapters 6, 8, and 10. Sandwiched in between these are the modeling evaluation or assessment methods for descriptive and predictive modeling,in Chapters 7 and 9, respectively. This sequence is intended to connect moredirectly the modeling methods with how they are evaluated.Chapter 11 takes a side-turn into text mining, a method of analysis of unstructured data that is becoming more mainstream in predictive modeling. Softwarepackages are increasingly including text mining algorithms built into the software or as an add-on package. Chapter 12 describes the sixth phase of predictivemodeling: Model Deployment.Finally, Chapter 13 contains two case studies: one based on work I did as acontractor to Seer Analytics for the YMCA, and the second for a Fortune 500company. The YMCA case study is actually two case studies built into a singlenarrative because we took vastly different approaches to solve the problem.The case studies are written from the perspective of an analyst as he considersdifferent ways to solve the problem, including solutions that were good ideasbut didn’t work well.Throughout the book, I visit many of the problems in predictive modelingthat I have encountered in projects over the past 27 years, but the list certainly isnot exhaustive. Even with the inherent limitations of book format, the thoughtprocess and principles are more important than developing a complete list ofpossible approaches to solving problems.Who Should Read This BookThis book is intended to be read by anyone currently in the field of predictiveanalytics or its related fields, including data mining, statistics, machine learning, data science, and business analytics.For those who are just starting in the field, the book will provide a descriptionof the core principles every modeler needs to know.The book will also help those who wish to enter these fields but aren’t yet there.It does not require a mathematics background beyond pre-calculus to understand predictive analytics methods, though knowledge of calculus and linearalgebra will certainly help. The same applies to statistics. One can understandthe material in this book without a background in statistics; I, for one, have nevertaken a course in statistics. However, understanding statistics certainly providesadditional intuition and insight into the approaches described in this book.flast.indd01:56:26:PM 03/28/2014Page xxiiixxiii

xxivIntroductionTools You Will NeedNo predictive modeling software is needed to understand the concepts in thebook, and no examples are shown that require a particular predictive analyticssoftware package. This was deliberate because the principles and techniquesdescribed in this book are general, and apply to any software package. Whenapplicable, I describe which principles are usually available in software andwhich are rarely available in software.What’s on the WebsiteAll data sets used in examples throughout the textbook are included on theWiley website at www.wiley.com/go/appliedpredictiveanalytics and in theResources section of www.abbottanalytics.com. These data sets include: KDD Cup 1998 data, simplified from the full version Nasadata data set Iris data setIn addition, baseline scripts, workflows, streams, or other documents specificto predictive analytics software will be included as they become available.These will provide baseline processing steps to replicate analyses from thebook chapters.SummaryMy hope is that this book will encourage those who want to be more effectivepredictive modelers to continue to work at their craft. I’ve worked side-by-sidewith dozens of predictive modelers over the years.I always love watching how effective predictive modelers go about solvingproblems, perhaps in ways I had never considered before. For those with moreexperience, my hope is that this book will describe data preparation, modeling,evaluation, and deployment in a way that even experienced modelers haven’tthought of before.flast.indd01:56:26:PM 03/28/2014Page xxiv

CHAPTER1Overview of Predictive AnalyticsA small direct response company had developed dozens of programs in cooperation with major brands to sell books and DVDs. These affinity programswere very successful, but required considerable up-front work to develop thecreative content and determine which customers, already engaged with the brand,were worth the significant marketing spend to purchase the books or DVDson subscription. Typically, they first developed test mailings on a moderatelysized sample to determine if the expected response rates were high enough tojustify a larger program.One analyst with the company identified a way to help the company becomemore profitable. What if one could identify the key characteristics of those whoresponded to the test mailing? Furthermore, what if one could generate a scorefor these customers and determine what minimum score would result in a highenough response rate to make the campaign profitable? The analyst discoveredpredictive analytics techniques that could be used for both purposes, findingkey customer characteristics and using those characteristics to generate a scorethat could be used to determine which customers to mail.Two decades before, the owner of a small company in Virginia had a compelling idea: Improve the accuracy and flexibility of guided munitions usingoptimal control. The owner and president, Roger Barron, began the process ofderiving the complex mathematics behind optimal control using a techniqueknown as variational calculus and hired a graduate student to assist him in thetask. Programmers then implemented the mathematics in computer code so1c01.indd01:52:36:PM 03/28/2014Page 1

2Chapter 1 Overview of Predictive Analyticsthey could simulate thousands of scenarios. For each trajectory, the variationalcalculus minimized the miss distance while maximizing speed at impact aswell as the angle of impact.The variational calculus algorithm succeeded in identifying the optimalsequence of commands: how much the fins (control surfaces) needed to changethe path of the munition to follow the optimal path to the target. The conceptworked in simulation in the thousands of optimal trajectories that were run.Moreover, the mathematics worked on several munitions, one of which wasthe MK82 glide bomb, fitted (in simulation) with an inertial guidance unit tocontrol the fins: an early smart-bomb.There was a problem, however. The variational calculus was so computationally complex that the small computers on-board could not solve the problemin real time. But what if one could estimate the optimal guidance commands atany time during the flight from observable characteristics of the flight? Afterall, the guidance unit can compute where the bomb is in space, how fast it isgoing, and the distance of the target that was programmed into the unit when itwas launched. If the estimates of the optimum guidance commands were closeenough to the actual optimal path, it would be near optimal and still succeed.Predictive models were built to do exactly this. The system was called OptimalPath-to-Go guidance.These two programs designed by two different companies seemingly couldnot be more different. One program knows characteristics of people, such asdemographics and their level of engagement with a brand, and tries to predicta human decision. The second program knows locations of a bomb in space andtries to predict the best physical action for it to hit a target.But they share something in common: They both need to estimate values thatare unknown but tremendously useful. For the affinity programs, the modelsestimate whether or not an individual will respond to a campaign, and for theguidance program, the models estimate the best guidance command. In thissense, these two programs are very similar because they both involve predicting a value or values that are known historically, but are unknown at the timea decision is needed. Not only are these programs rela

of the top-ten predictive analytics blogs to follow. He is a regular speaker at Predictive Analytics World and other analytics . I am very grateful to Eric Siegel for inviting me to speak regularly at Predictive Acknowledgments. xii Acknowledgments ffi rs.indd 01:5