๐ฎ Trust in the model
Story: a model for forecast of New Product sales
- model is better than baseline
- users do not trust the numbers
- they are used to a different process
- does interpretability help?
๐๏ธ Data Generating Process
Story: Data Exploration of Warehouse data
- order data: many more rows for outbound than inbound orders, why?
- pick task data: missing a day in a specific week, why?
- ...
๐๏ธ Data Generating Process
Story: Data Exploration of Warehouse data
- order data: many more rows for outbound than inbound orders, why?
- pick task data: missing a day in a specific week, why?
- ...
Rules of ML
- First, design and implement metrics
- Choose machine learning over a complex heuristic.
"
Google's rules of ML
Talk to experts
Interview
Try to uncover opportunties and risks
for a data-driven tool you might want to build.
Interface
It helps to iterate with an interactive
tool that shows data and visualization.
Document the domain
e.g. Business, Domain and Data essentials in README
Document the domain
e.g. Business, Domain and Data essentials in README
- Company X is investing strategically in 3rd party logistics
- Low costs are key to a successful operation
Document the domain
e.g. Business, Domain and Data essentials in README
- Company X is investing strategically in 3rd party logistics
- Low costs are key to a successful operation
- Main cost component in a Warehouse is picking (time)
- Using a ABC class based positioning of items could help
Document the domain
e.g. Business, Domain and Data essentials in README
- Company X is investing strategically in 3rd party logistics
- Low costs are key to a successful operation
- Main cost component in a Warehouse is picking (time)
- Using a ABC class based positioning of items could help
- Data comes from a WMS
- We have 2 years of data, 1 year of clean data
Random forest vs Xgboost
Context: AutoML for Forecast Initialization
Random forest vs Xgboost
Context: AutoML for Forecast Initialization
- boosted trees may predict outside training range
Random forest vs Xgboost
Context: AutoML for Forecast Initialization
- boosted trees may predict outside training range
- boosted trees more difficult to calibrate than RF
Random forest vs Xgboost
Context: AutoML for Forecast Initialization
- boosted trees may predict outside training range
- boosted trees more difficult to calibrate than RF
- learned: 1) resist the hype; 2) watch your results.
Random forest vs Xgboost
Context: AutoML for Forecast Initialization
- boosted trees may predict outside training range
- boosted trees more difficult to calibrate than RF
- learned: 1) resist the hype; 2) watch your results.
Learn the Domain
Socials
Learn the Domain
Socials
Experts
Get inspired by Domain
5S methodology
- ๆด็ (seiri) Sort
- ๆด้ (seiton) Straighten
- ๆธ
ๆ (seiso) Shine
- ๆธ
ๆฝ (seiketsu) Standardize
- ใใคใ (shitsuke) Sustain