New ideas from academia and industry practitioners
Connect ideas to problems
Avoid the downside
Validate your own actions
boundaries (does it even make sense to apply this…)
Wrong / right
Do not reinvent square wheels
What we can get out of such a thing?
Research -> practice
Practice -> publish / capture ideas
Should we always read the latest papers?
“But, The amount of papers generated is humongous … how do I keep up?”
The purpose of research is …
(Matt Might’s illustrated guide to a PhD)
Sources
Conferences
Journals
By Topic
By people / group / organisations (companies.. eg: goog, fb, for specialised interests)
Survey papers
Outcomes of a papers we love session
… not exhaustive or prescriptive …
Name a concept
Explain the concept
Explain how it works
Explain how it is an improvement
Where to use it. (Upside)
Where not to use it. (Downside)
The Paper for today’s session
“Statistical modeling: The two cultures” by Leo Breiman, 2001.
Why I liked this paper
Talks about experience and experience is king
Erudition and story telling is appealing
I do this for fun. not to write an exam.
Not dry and mathy.
Two Cultures
“Stochastic Data Model” vs “Algorithmic Model”
Data modeling (Statistics)
response variables = f(predictor variables, random noise, params)
“… Assume that the data are generated by the following model: ….”
Algorithm modeling (Machine Learning)
Find a function \(f(x)\) – an algorithm that operates on \(x\) to predict the responses to \(y\).
The issue with Data modeling
The statistician, by imagination and by looking at the data, can invent a reasonably good model to explain the complex mechanism of nature.
The conclusion about the model fit are about the model’s mechanism, and not about nature’s mechanism
If the model is a poor emulation of nature, the conclusions may be wrong
Data modeling produces a “simple” and “understandable” picture of the relationship between input and output.
But more complicated data models that have appeared suggests that as data becomes complex, the data models lose the advantage of presenting a clear picture of nature’s mechanism