Data Intelligence for Business Leaders

3. December 2019

SDC Corporate Innovation Days 2019 - House of the Danish Industry Foundation, Huairou, Beijing

Millions are Wasted on failing AI Initiatives Every Year

“...companies exaggerate their use or results of AI by as much as 300%, and often only publicly reveal the AI initiatives that they think will reflect well with their customers. Most AI initiatives are failures, begun with no clear end in mind, or done entirely to appear “innovative”, and with little or no strategic value.”

Emerj.

Roman Jurowetzki, PhD

Assistant Professor in Innovation Studies and Data Science

Research
technological change and many fun/side-projects
Language geek
🇩🇪 🇩🇰 🇬🇧 🇷🇺 🇪🇸
Expertise
Natural Language Processing

Daniel Hain, PhD

Assistant Professor in Quantitative Methods and Industry Dynamics

Research
economics of technological change
What I do
Mapping, understanding, forecasting
Obseesed with
network & system analysis

Some of the things we do

Projects
with policy makers & industry.
Teaching
students, researchers, and professionals in applied ML & AI
Research projects
ML&AI - mainly in IP intelligence (patents), winner of OECD/EUIPO Big Data analytics challange
Building communities
around ML best-practice
EU - China research
research projects on AI ecosystem mapping
Our students
“AI consultants” in companies for final projects on the Social Data Science programme

MSc - Social Data Science Programme

6 months Master Specialization
in applied Machine Learning & AI for business and social science students

Some numbers

1st year 30 students
2nd year 60 students
one of the largest master programmes at AAU SAMF Engineering approach to problem solving

Structure

Applied ML
Working with unstructured data (Networks & Natural-Language-Processing)
Deep Learning and AI
Company project

Outline - Getting started with ML&AI in your company

The process we’ve experienced in several companies and what it needs to get started

Understanding the technology (Supervised and Unsupervised ML)
Identify ML&AI usecases
Establishing a business case and ROIs
Identifying key people and building a team

Creating and sustaining data literacy across the organization

How to start with AI!

Get an AI degree!
Hire an AI wizard!
Pick an awesome algorithm!
Dive into the data!
NOPE - Please don’t!

Defining some terms

Algorithm
In plain: Input ... MAGIC-BOX-OF-MAGIC ... Output
AI
Algorithms, where the magic produces outputs we consider intelligent.
Data Science
Academic/Professional discipline dealing with establishing ML processes and communicating insights

AI hype & FOMO

40% of European “AI startups” don’t use AI (EC report finds same in EU

Emerging technology
Everybody is excited, magical use-cases in the air.
Overhype
in news science, business & technology reporting
Misinformation
e.g. MMC report : 40% of EU AI startups don't use AI (EC report finds same in EU).
Mystification
simple/trivial use-cases portrayed using obscure terms.

Supervised vs. Unsupervised ML

Supervised ML (in 95% of cases)

Thought Experiment: The AI island

We have some 5k “friends” on a remote island that do tasks for us. They are specialized on labeling stuff.
They speak a native language, so we can’t talk. We can only teach them by examples. When they look at enough, they eventually figure out how to label what.
They make some mistakes, and need really a lot of examples to figure it out, but when they have, we can send them new stuff, and they label it for us.
What would you use our friends for?

How to think “predictive”?

Some simple steps:

Can we reframe the problem as a prediction task?
Is this task trivial and can be well designed by simple rules?
Do we have a reasonable amount of historical input-output data?
Is the outcome of interest well defined and measurable?

Is it reasonable to believe that the outcome of interest can be inferred from the input (signal/noise ratio)?
Is this data available in an accessible manner? Can input be mapped to the corresponding output? Is the input available at the point we need it?
Can it be used to improve a well-defined metric of interest?

Baby food ingredient: safe or spoiled?
Patient: ideal medication dosage?
Email: spam or ham?
Recorded phone call to call center: issue topic?
Bottle of wine: will I like it or not?
Stock: tomorrow’s price?
Transaction: legitimate or fraudulent?
Machine: when will it need maintenance?
Inventory: when to restock?

Your turn: Spot prediction problems

Process

Logistics?
QM?
Customer Care?

Product

Toothbrush?
Windmill?
Running shoes?
Kitchen stuff?

Service/Finance

Dating?
Insurance?
Fund Management

Predictive Tasks: Industries & applications

Unsupervised learning

Nothing to do with machines running around without adult supervision

Unsupervised learning

A hands-on example

Data Driven vs. AI driven

Are you a data driven company?

“Many people only use data to feel better about decisions they’ve already made.”

What is the business aim and what do you expect the data to tell?
Set your decision criteria before the analysis?
Review: 4.2: More than 4 but less than 5?

Ready for ML&AI?

If you check all with yes, your are probably data-driven. If not, don’t introduce ML&AI until you are.

Is relevant data gathered on a routine basis, and made accessible throughout the organization?
Is it used to create KPIs, which are continuously monitored.
Are important decisions made based on relevant data (vs. intuition)?

Getting started with AI

Your turn

Think about a cool ML&AI project in your industry to get started!

Pick your project

Drones? Self driving cars?
If you are not a data-company - consider starting with something that is familiar?
Less sexy but you know the business application and can estimate ROIs
Airbus: Looks at predictive maintenance instead of autonomous flying…
Industrial companies have a huge advantage where it comes to data ownership

Setting up a team

Whom to hire?

Let’s imagine you can hire 10 people for your new data science team. What kind of people would you look for?

Who should be operating the microwave?

The ML-ops team

Full-stack data scientist:
with business/domain expertise (= DS unicorn): Extremely rare, hard to hire and retain.
Data Analyst/BI:
Knows how to answer business questions with tabular workflows, probably a but of (inferential) statistics, and dataviz (diagrams & dashboards). Good starting point when not existing so far.
Statisticians:
Help decision-makers come to conclusions safely beyond the data. Carry out inferential (why) analysis, inspect data for bias. Applied ML engineer: Trained in applying ML methods mainly to solve prediction tasks.
ML systems enginer:
Trained in developing the tech infrastructure to bring models to production.
(Academic) ML researcher:
Often trained in developing ML. Makes most sense when already having specific and demanding use-case.

Cases Product: StitchFix

stitchfix

Stitch Fix, established in 2011 in San Francisco
Collaboration between artificial intelligence (AI) and human stylists
Unsupervised learning paired with similarity search.
- What are good combinations?
- What products can be used to create the combination or an alternative?
Takes in customer profile and order outputs combination recommendations for the stylist
Inventory management!

Cases Process: How ML Keeps Shelves Stocked at Home Depot

Shelfout prediction:
not Out-of-stock
ROI
having full shelves? - They experimented.
Building the model:
What could cause stuff not to be in the right shelve (collaborative effort)
Result:
A system warning stockers

Cases Service: Chatbot use at Ryanair

Most people want to know trivial things
How much cabin luggage can I carry? What items are allowed in checked in luggage? What items are allowed in checked in luggage?
A less well known function of a chatbot is to classify requests into those that the chatbot can handle and those that require human support.
Preselection allows to free up support to help with “real issues”

The AI getting-started Checklist

Cassie Kozyrkov

Head of Decision Intelligence, Google.

Correct delegation
Does the person in charge understand your business?
Output-focused ideation
Can you explain the system’s outputs and their value?
Appropriate task for ML/AI
Ar you automating many decisions/labels?
Reasonable expectations:
Can you live with imperfection?
Enough examples:
Is the amount of data you have is enough to learn from?
Team:
Are you sure, you can assemble a team with the necessary skills?
Ground truth:
Do you have access to outputs?

Data Intelligence for Business Leaders

Millions are Wasted on failing AI Initiatives Every Year

Roman Jurowetzki, PhD

Research

Language geek

Expertise

Daniel Hain, PhD

Research

What I do

Obseesed with

Some of the things we do

Projects

Teaching

Research projects

Building communities

EU - China research

Our students

MSc - Social Data Science Programme

6 months Master Specialization

Some numbers

Structure

Outline - Getting started with ML&AI in your company

The process we’ve experienced in several companies and what it needs to get started

Creating and sustaining data literacy across the organization

How to start with AI!

Defining some terms

Algorithm

AI

Data Science

AI hype & FOMO

Emerging technology

Overhype

Misinformation

Mystification

Supervised vs. Unsupervised ML

Supervised ML (in 95% of cases)

What would you use our friends for?

How to think “predictive”?

Your turn: Spot prediction problems

Process

Product

Service/Finance

Predictive Tasks: Industries & applications

Unsupervised learning

Unsupervised learning

Unsupervised learning

A hands-on example

Data Driven vs. AI driven

Ready for ML&AI?

Getting started with AI

Your turn

Think about a cool ML&AI project in your industry to get started!

Pick your project

Setting up a team

Whom to hire?

Who should be operating the microwave?

The ML-ops team

Full-stack data scientist:

Data Analyst/BI:

Statisticians:

ML systems enginer:

(Academic) ML researcher:

Cases Product: StitchFix

Cases Process: How ML Keeps Shelves Stocked at Home Depot

Shelfout prediction:

ROI

Building the model:

Result:

Cases Service: Chatbot use at Ryanair

Most people want to know trivial things

The AI getting-started Checklist

Cassie Kozyrkov

Correct delegation

Output-focused ideation

Appropriate task for ML/AI

Reasonable expectations:

Enough examples:

Team:

Ground truth: