DIY Data Analytics Dill ID Ind Comm 10-24-19

Transcription

10/21/2019D.I.Y. DATA ANALYTICSCHRISTOPHER DILL – SPECIAL INVESTIGATIONS UNIT MANAGERICW GROUP INSURANCE COMPANYIDAHO INDUSTRIAL COMMISSION – 10/24/19Life can be much broader once you discover one simple fact:Everything around you, that you call life, was made up by peoplethat were no smarter than you and you can change it, you cang that other peoplep p caninfluence it,, yyou can build yyour own thingsuse. Once you learn that, you'll never be the same again.- Steve JobsTHE KNOWLEDGE WORKERKnowledge workers will bethe most valuable assets ofa 21st-century organizationbecause of their high levelof productivity andcreativity- High-level workers- Apply theoretical and analytical knowledge- Acquired through formal training- Develop products and servicesPeter Drucker, The Landmarks of Tomorrow (1959)1

10/21/2019THE INFORMATION AGEInformation has become a commodity that is quickly and widely disseminatedand easily available especially through the use of computer technologyTHE STANDARD:IMMEDIATE ACCESS TO INFORMATIONHey Google Alexa, .Hey Siri 2

10/21/2019WHAT INFORMATION DO WE SEEK?Top GoogleSearches2019WHAT DEEP QUESTIONS DO WE HAVE?“storm area 51”51”“faceapp old faceface””“Las Vegas grasshopper”grasshopper”“meth gators”gators”“INFORMATION INDEPENDENCE” Direct access to information is the norm Questions no longer outpace most available answers AnswersAcommonlyl lleadd to new questionsi Insights are often truly gained many questions, then answers, down the line3

10/21/2019SOME SHORTHAND DEFINITIONS Data information Analytics your questions Database information store“DATA ANALYTICS?”SHORTHAND DEFINITIONS EXPANDED Data information Analytics your questions Database information store Digital stored electronically Big Data more information“Data Analytics” ?than we know what to do with4

10/21/2019BIG DATA A BIG DEAL5

10/21/2019DATA SCIENTISTS Analytical data expert Technical skills to solve complex problems Curiosity to explore what problems needto be solved Advanced degreed Highly sought after and well paidDATA SCIENTISTS Analytical data expert Technical skills to solve complex problems Curiosity to explore what problems needto be solved Advanced degreed Highly sought after and well paid6

10/21/2019 A workforce with questions in need of answers Access to the information to answer those questions A basic digital tool box and skillset Basic IT/Technical support A willingness to learn, try and fail.and try againMICROSOFT EXCEL – SPREADSHEETS Basic spreadsheet program Typically available at most organizations Very large online learning community Essential Elements: Sort, filter, line charts, bar charts, and pivot table Good to Know: VLOOKUP, CONCATENATE, IF, and Conditional Formatting No waiting for IT to deal with your question Incredibly flexible if you know Excel Most organizations use it/marketable skill Excel problems usually because of misuse Excel allows you to do something with dataexported from big systems There is a huge amount of help Improvements in collaboration optionsSource: s-still-used-by-half-a-billion-people-worldwide/7

10/21/2019EXCEL USERS SKILLSETS 51% Basic Level Open and close Excel, save workbooks, and basic formatting 29-34% Intermediate – Advanced Level Charts, tables, IF, VLOOKUP, and Pivot TablesSource: rs/EXCEL BASICS - ROWSEXCEL BASICS – COLUMNS8

10/21/2019EXCEL BASICS – SHEETS/TABSEXCEL BASICS – SORTINGEXCEL BASICS - FILTERING9

10/21/2019ADVANCED EXCEL – CONDITIONAL FORMATTINGADVANCED EXCEL – PIVOT TABLEADVANCED EXCEL – PIVOT TABLE EXPANDED10

10/21/2019ADVANCED EXCEL – ‘VLOOKUP’ FUNCTIONA SNEAK PEEK INTO DATABASE STRUCTUREMatching Field Across 2 SheetsRELATIONAL DATABASES Database think a very large spreadsheet broken up into many tabs Collection of digital informationOrganized into smaller tables Example Customer NamesCustomer AddressesProductsPurchase DetailsTables “relate” to each other through “keys” think VLOOKUP matching fieldAllows for the efficient retrieval of only the relevant information using SQL11

10/21/2019TABLEAU – DATA ANALYSIS & VISUALIZATION Direct and live connection to many data typesAble to handle much larger data sets than ExcelSQL – Structured Query Language not neededCan build tables (Rows and Columns) like ExcelCan export to ExcelSignificantly superior Data Visualization capabilitiesINTERACTIVESELF SERVICETABLEAU – BASIC DRAG AND DROPLEARNING STYLES Visual Learners - Seeing Auditory Learners – Hearing Kinesthetic Learners - Doing65% of Population35% of Population5% of Population12

10/21/2019LEARNING STYLES Visual Learners - Seeing Auditory Learners – Hearing Kinesthetic Learners - Doing65% of Population35% of Population5% of PopulationLEARNING STYLES Visual Learners - Seeing Auditory Learners – Hearing Kinesthetic Learners - Doing65% of Population35% of Population5% of PopulationDATA VISUALIZATION AND STORYTELLING The initial goal of analysis is to discover the answer Pattern, Rank, Trend, Outlier, Impact, etc. Theh next goall off analysislis to makek theh case ffor action We should .do this, stop this, change that, improve this, monitor that, etc. A well crafted data visualization Simply answers complex questionsSparks the curiosity of the viewer13

10/21/201914

10/21/2019DATA ANALYSIS – SAMPLE USE CASE US Dept. of Housing & Urban Development Physical property inspection dataset 2001-2015 Making these inspection details available will enable researchers, advocacygroups and the general public to 1) better understand the physical condition ofthe HUD-assisted housing stock, as well as changes in the stock over time; 2)hold providers accountable for housing quality; and 3) plan for futureaffordable housing needs.SAMPLE DATA SETUNDERSTANDING WHAT IS AND IS NOT IN “DATA”COLUMNS TO CHOOSE FROM INSPECTION ID STATE CODEDEVLOPMENT ID ZIPDEVELPMENT NAME LATITUDE LONGITUDE LOCATION QUALITY PHA CODECOUNTY NAME PHA NAMECOUNTY CODE INSPECTION SCORESTATE NAME INSPECTION DATEADDRESSCITYCBSA NAMECBSA CODE15

10/21/201916

10/21/2019Robust MappingCapabilities17

10/21/2019INJURYPREVENTION &RISK MANAGEMENTINVESTIGATION – TREATMENT PATTERN ANALYSISINVESTIGATION – LOCATION TIMELINE ANALYSIS18

10/21/2019COST CONTAINMENT – CONFUSION TO CLARITYQuestion 1Question 2Question 3Question 4IN CLOSING Data is everywhere, but few can access itQuestions currently outpace answersAnswers should, and often do, lead to new questions an iterative processInformation independence accelerates the path to a data driven cultureBig Data initiatives will be propelled by data savvy/comfortable consumers and advocatesWho currently asks your questions?The Analyst is YouADDITIONAL RESOURCES www.tableau.com www.alteryx.com www.powerbi.microsoft.com www.coursera.org www.youtube.com19

10/21/2019QUESTIONS?THANK YOU20

advanced excel – pivot table advanced excel – pivot table expanded. 10/21/2019 11 advanced excel – ‘vlookup’ function a sneak peek into database structure matching field across 2 sheets re