Clients and stakeholders' needs are changing. Questions need to be answered quicker and more accurately for cheaper so we have to be more responsive Many may already be using data warehouses behind the scenes, and so we need to be comfortable with the language and technology in order to meet them where they are, or perhaps even help them get to where they want to be.
You may get a huge number of files to process and you won't be able to do it no matter how many actuarial students you throw at it.
The genie is out of the bottle. Aside from reserved roles, there's nothing technical that we do that couldn't be done by a similarly skilled data scientist:
Visicalc 1979 on the Apple II
Lotus 1-2-3 in 1983 on MS-DOS
Microsoft Excel 1995
LibreOffice
Also I've seen people make art in spreadsheets and perform standup comedy about spreadsheets.
News report from The Verge
We have the most successful, most popular spreadsheet program breaking peoples' data.
Spreadsheets make you squash problems into a format that can be solved by a spreadsheet or make you think your problem isn't solvable because it can't be done in a spreadsheet easily.
diamonds
## # A tibble: 53,940 x 10## carat cut color clarity depth table price x y z## <dbl> <ord> <ord> <ord> <dbl> <dbl> <int> <dbl> <dbl> <dbl>## 1 0.23 Ideal E SI2 61.5 55 326 3.95 3.98 2.43## 2 0.21 Premium E SI1 59.8 61 326 3.89 3.84 2.31## 3 0.23 Good E VS1 56.9 65 327 4.05 4.07 2.31## 4 0.29 Premium I VS2 62.4 58 334 4.2 4.23 2.63## 5 0.31 Good J SI2 63.3 58 335 4.34 4.35 2.75## 6 0.24 Very Good J VVS2 62.8 57 336 3.94 3.96 2.48## 7 0.24 Very Good I VVS1 62.3 57 336 3.95 3.98 2.47## 8 0.26 Very Good H SI1 61.9 55 337 4.07 4.11 2.53## 9 0.22 Fair E VS2 65.1 61 337 3.87 3.78 2.49## 10 0.23 Very Good H VS1 59.4 61 338 4 4.05 2.39## # … with 53,930 more rows
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity))
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity)) + geom_point()
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity)) + geom_point() + facet_wrap(~cut)
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity)) + geom_point() + facet_wrap(~cut) + theme_bw()
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity)) + geom_point() + facet_wrap(~cut) + theme_bw() + theme(legend.position = "bottom")
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity)) + geom_point() + facet_wrap(~cut) + theme_bw() + theme(legend.position = "bottom") + labs(x = "Carat", y = "Price (USD)", title = "Some facts about Diamonds")
ggplot(midwest, aes(x = poptotal))
ggplot(midwest, aes(x = poptotal)) + geom_histogram(bins = 30)
ggplot(midwest, aes(x = poptotal)) + geom_histogram(bins = 30) + facet_wrap(~state)
ggplot(midwest, aes(x = poptotal)) + geom_histogram(bins = 30) + facet_wrap(~state) + scale_x_log10(labels = scales::comma)
ggplot(midwest, aes(x = poptotal)) + geom_histogram(bins = 30) + facet_wrap(~state) + scale_x_log10(labels = scales::comma) + theme_bw()
ggplot(midwest, aes(x = poptotal)) + geom_histogram(bins = 30) + facet_wrap(~state) + scale_x_log10(labels = scales::comma) + theme_bw() + labs(x = "Total Population", y = "Count", title = "Distribution of Total county population in Midwestern States")
Implementations and realisations of models shall be reproducible.
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
o | Tile View: Overview of Slides |
Esc | Back to slideshow |
Clients and stakeholders' needs are changing. Questions need to be answered quicker and more accurately for cheaper so we have to be more responsive Many may already be using data warehouses behind the scenes, and so we need to be comfortable with the language and technology in order to meet them where they are, or perhaps even help them get to where they want to be.
You may get a huge number of files to process and you won't be able to do it no matter how many actuarial students you throw at it.
The genie is out of the bottle. Aside from reserved roles, there's nothing technical that we do that couldn't be done by a similarly skilled data scientist:
Visicalc 1979 on the Apple II
Lotus 1-2-3 in 1983 on MS-DOS
Microsoft Excel 1995
LibreOffice
Also I've seen people make art in spreadsheets and perform standup comedy about spreadsheets.
News report from The Verge
We have the most successful, most popular spreadsheet program breaking peoples' data.
Spreadsheets make you squash problems into a format that can be solved by a spreadsheet or make you think your problem isn't solvable because it can't be done in a spreadsheet easily.
diamonds
## # A tibble: 53,940 x 10## carat cut color clarity depth table price x y z## <dbl> <ord> <ord> <ord> <dbl> <dbl> <int> <dbl> <dbl> <dbl>## 1 0.23 Ideal E SI2 61.5 55 326 3.95 3.98 2.43## 2 0.21 Premium E SI1 59.8 61 326 3.89 3.84 2.31## 3 0.23 Good E VS1 56.9 65 327 4.05 4.07 2.31## 4 0.29 Premium I VS2 62.4 58 334 4.2 4.23 2.63## 5 0.31 Good J SI2 63.3 58 335 4.34 4.35 2.75## 6 0.24 Very Good J VVS2 62.8 57 336 3.94 3.96 2.48## 7 0.24 Very Good I VVS1 62.3 57 336 3.95 3.98 2.47## 8 0.26 Very Good H SI1 61.9 55 337 4.07 4.11 2.53## 9 0.22 Fair E VS2 65.1 61 337 3.87 3.78 2.49## 10 0.23 Very Good H VS1 59.4 61 338 4 4.05 2.39## # … with 53,930 more rows
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity))
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity)) + geom_point()
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity)) + geom_point() + facet_wrap(~cut)
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity)) + geom_point() + facet_wrap(~cut) + theme_bw()
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity)) + geom_point() + facet_wrap(~cut) + theme_bw() + theme(legend.position = "bottom")
diamonds %>% ggplot(aes(x = carat, y = price, colour = clarity)) + geom_point() + facet_wrap(~cut) + theme_bw() + theme(legend.position = "bottom") + labs(x = "Carat", y = "Price (USD)", title = "Some facts about Diamonds")
ggplot(midwest, aes(x = poptotal))
ggplot(midwest, aes(x = poptotal)) + geom_histogram(bins = 30)
ggplot(midwest, aes(x = poptotal)) + geom_histogram(bins = 30) + facet_wrap(~state)
ggplot(midwest, aes(x = poptotal)) + geom_histogram(bins = 30) + facet_wrap(~state) + scale_x_log10(labels = scales::comma)
ggplot(midwest, aes(x = poptotal)) + geom_histogram(bins = 30) + facet_wrap(~state) + scale_x_log10(labels = scales::comma) + theme_bw()
ggplot(midwest, aes(x = poptotal)) + geom_histogram(bins = 30) + facet_wrap(~state) + scale_x_log10(labels = scales::comma) + theme_bw() + labs(x = "Total Population", y = "Count", title = "Distribution of Total county population in Midwestern States")
Implementations and realisations of models shall be reproducible.