Advanced Analytics

#SquadGoals






Julia Romero
AXA US
Legacy Analytics

Reference Info

All data presented here is synthetic and has been generated using:

  • the dataset generation tools in scikit learn
  • functions and random number generators

All the code required to perform the examples presented here as well as the presentation itself can be found my a repo on github. The link is availible in the presentation documents shared

Toolbox

Throughout the presentation I will include "toolbox" references highlighting the particular packages I'm using to do something.

All of the examples today are written using python 3 I am presenting using a jupyter notebook and the RISE Reveal.js extension

A note on data validation

CRITICALLY IMPORTANT

Just not a part of this talk

In [3]:
HTML( "<h1>That data science venn diagram ...</h1>")
Image("https://static1.squarespace.com/static/5150aec6e4b0e340ec52710a/t/51525c33e4b0b3e0d10f77ab/1364352052403/Data_Science_VD.png")
Out[3]:

That data science venn diagram ...

Out[3]:
In [4]:
HTML("<h1>I like this one better</h1>")
Image("https://3.bp.blogspot.com/-bvQxcwfqATQ/V-E_uTBc4VI/AAAAAAAAMGQ/Qa1Ntef-rs0E-mWx5pkVu-CPlREdvD0TwCLcB/s1600/VennDiagram2.png")
Out[4]:

I like this one better

Out[4]:
In [5]:
HTML("<h1> ... so why are we here today?</h1><br>")
Out[5]:

... so why are we here today?


Move from hype to can

My goal today is for you to leave this room knowing that you can head back to your offices on Monday and with a few emails have an analytics project up and running

In [6]:
HTML("<h1>One more chart you've seen before</h1><br>")
Image("https://media.licdn.com/mpr/mpr/shrinknp_800_800/AAEAAQAAAAAAAAjIAAAAJGI0NzY3MGM0LTIyMTEtNDYwYy04OWQ2LTgyYmZiNDgzNTlhNw.png")
Out[6]:

One more chart you've seen before


Out[6]:

Descriptive Analytics

In [3]:
Image("https://media.licdn.com/mpr/mpr/shrinknp_800_800/AAEAAQAAAAAAAAjIAAAAJGI0NzY3MGM0LTIyMTEtNDYwYy04OWQ2LTgyYmZiNDgzNTlhNw.png")
Out[3]:

Toolbox

  • numpy
  • pandas
  • matplotlib
In [9]:
chargePlt(synthExp_gf, ['2','4','6','8']) 

Actuaries have been using advance analytics for a long time... we just called it experience

In [10]:
chargePlt(synthExp_ngf, ['2','4','6','8'])

OK fine, but how about something a bit more...

In [6]:
vid = YouTubeVideo("93lrosBEW-Q", start=27, end =40, width = 1067, height = 600, autoplay=0)
display(vid)
In [10]:
HTML("<h2>Remember our trusty old venn diagram</h2><br>")
Image("https://static1.squarespace.com/static/5150aec6e4b0e340ec52710a/t/51525c33e4b0b3e0d10f77ab/1364352052403/Data_Science_VD.png")
Out[10]:

Remember our trusty old venn diagram


Out[10]: