Model Development Strategies for Time Series Forecasting


Jun Kim
Director – Finance Decision Science
American Express

Overview

In this presentation, Jun Kim talks about leading a team of data scientists at American Express to create time series models to predict future financial trends. He focuses more on the practical application of model development rather than actual model theory.

Time Series Background

Time series are a sequence of data points that are spaced equidistant from one another and are represented chronologically over time. Just as in other machine learning models, the original dataset is broken into a training dataset and a testing dataset to be able to measure the margin of error within a models’ predictions. One distinct difference is that this is not randomized and this test-train split will occur at a specific date in time.

“Time series allow you communicate complex scenarios to senior leadership who do not know anything about machine learning or the statistics behind a model.”

Watch Jun Kim’s full presentation here

Jun speaks specifically about how to implement time series in the financial realm relating to client services. As he describes, this can combine metrics that are reported at the daily level by different products and different clients. Normally, this would quickly generate a large, overwhelming amount of data which may be difficult to isolate trends for analysis and predictions. Time series aid in categorizing data in a way that forecasting models can accurately predict future trends and lend the ability to create visualizations that are easily understood.

Time series can be thought of operating on your typical X and Y axis, with the X axis representing your independent variable (“the cause”) and the Y axis representing the dependent variable (“the effect”). The complexity behind time series stems from the fact that you are trying to predict “the effects” when you do not necessarily know how “the causes” are defined. This concept of working with future data can quickly lead to large margin of errors if constraints are no correctly applied to the model features and variables. 

One example of how time series predictions can differ can be compared between a Multiple Forecasting model (figure A below) and a Lagged Variable model (figure B below). Proper time series use a lagged variable approach where a regression equation is used to predict dependent variables based on historical and current values. This allows for variables to capture more recent information, yearly seasonality, and averages over different time periods. Multiple Forecasting on the other hand is an aggregation of predictions based off of other predictions. This can spiral quickly in to a large margin of error as each deviation from absolute truth grows exponentially with each iterative prediction. This is a good, simple example of how aggregating all variables to generate a single prediction is more logical and accurate than a series of smaller predictions.

Figure A
Figure B

Jun speaks to two different time series models that he and team tested. The first, the SARIMAX model, was utilized due to its’ ability to capture trends, seasonality, and exogenous variables. This model benefits in its flexibility to operate using data at different levels of granularity (daily, weekly, monthly) as well as being able to more meticulously feature engineer data prior to model runs. The second model used is the Prophet Model which is a time series model developed by Facebook and has been open source since 2017. This model offers daily forecasting that can easily take holiday effects and seasonality into account while remaining very fast and accurate. However, if there are too many segments to process or there is not clear trend or seasonality this model may not compensate well.

(Difference between trend, seasonality, and noise within a time series)

Perhaps the biggest benefit that time series possess, and which Jun heavily emphasizes, is the ability to visualize the results easily. Both Power BI and Tableau are powerful tools that can take the results from time series predictions and visualize in a manner that the layman can understand. This allows for complicated scenarios to be shown to upper management without the need for an in depth, behind the scenes explanation. Instead, it allows for the focus to remain on the results and what actions are to be taken from them. Additionally, the tools mentioned are interactive. As questions can, and will , arise when presenting results, these visualizations become dynamic and allow the user to target specific sections or features to draw out hidden truths within the results. 

While the tools mentioned are incredibly powerful, it is the structure and build of a time series that allows for such manipulation to be possible. Succinct visualizations and the ability to condense complex topics allow for Jun and his team at American Express to continue to utilize time series to their full potential.


Tags   •   Finance

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Posts

Recent Posts

Does Healthcare AI Meet Basic Ethics Principles? - Ingrid Vasiliu-Feltes Chief Quality and Innovation Officer MEDNAX, Health Solutions Partner Over the past decade we have noticed an exponential…
“Ask Me Anything” with Zappos’s Head of AI/ML Research & Platforms, Ameen Kazerouni - Ameen Kazerouni Head of AI/ML Research & Platforms Zappos Family of Companies Ai4 recently hosted an "Ask Me Anything" session…
Top AI Conferences - Interested in learning the latest in AI this year? We’ve compiled a list of the top artificial intelligence conferences in…
Securing Personal Data for AI/ML Computing in the Cloud - Serge Vilvovsky Cybersecurity & Big Data Engineer MIT Startup Exchange Overview In this presentation, Serge Vilvovsky, a Cyber Security and…
Secure Your Code Via AI - Eliezer Kanal Technical Manager, Cyber Security Foundations, CERT Division Carnegie Mellon University Software Engineering Institute Introduction In this presentation Eliezer…
Leveraging AI in Cybersecurity Risk Modeling & Mitigation - Christopher Novak Director, Threat Advisory Research Verizon Wireless Introduction Originally, there was a poor understanding of why cyber breaches were…
Using AI to Build More Secure Software - Mark Sherman Technical Director, Cyber Security Foundations, CERT Division Carnegie Mellon University Software Engineering Institute Introduction MITRE's Common Vulnerabilities and…
Artificial Intelligence & Cybersecurity: Math Not Magic - Wayne Chung CTO FBI Introduction The field of cybersecurity has slowly progressed from an art to a science. It has…
Life Hacks for Thrilling the Customer with Your Data Science Technical Summary Products - Anne Lifton Manager of Data Science Nordstrom Overview Anne Lifton is a Manager of Data Science at Nordstrom and in…
Machine Learning for Pricing and Inventory Optimization @ Macy’s - Jolene Mork Senior Data Scientist Macy's Iain Stitt Data Scientist Macy's Bhagyesh Phanse VP, Data Science Macy's Overview In this…

Popular Posts

Leveraging AI in Cybersecurity Risk Modeling & Mitigation - Christopher Novak Director, Threat Advisory Research Verizon Wireless Introduction Originally, there was a poor understanding of why cyber breaches were…
Machine Learning for Pricing and Inventory Optimization @ Macy’s - Jolene Mork Senior Data Scientist Macy's Iain Stitt Data Scientist Macy's Bhagyesh Phanse VP, Data Science Macy's Overview In this…
Machine Learning and Artificial Intelligence in Banking - Artit "Art" Wangperawong Distinguished Engineer US Bank Introduction Every company’s AI journey is different. We’re all trying to figure out…
Convergent AI in Reducing Overdiagnosis, Overtreatment, and Misdiagnosis - Stephen Wong Chief Research Information Officer & Chair Professor Houston Methodist The current healthcare situation:  High-cost/low-quality healthcare is now a…
Figuring Out Applied ML: Building Frameworks and Teams to Operationalize ML at Scale - Dr. Nels Lindahl Director of Clinical Systems CVS Health How do you operationalize machine learning at scale? How do you…
The Autonomous Pharmacy: Applying AI and ML to Medication Management Across the Care Continuum - Ken Perez VP of Healthcare Policy Omnicell, Inc. Ken applies artificial intelligence (AI) and machine learning (ML) solutions to medication…
Unleashing the Power of AI on R&D: The Potential, The Pitfalls, The Progress - Dr. Jim Weatherall VP, Data Science & AI Astrazeneca R&D Intro to AstraZeneca R&D Here we see an end-to-end case…
Advancements at Siemens Healthineers in AI for Medical Imaging - Bimba Rao Head of Global Artificial Intelligence Engineering Siemens Healthineers Ultrasound Siemens Healthineers background  Siemens Healthineers builds healthcare products and…
What AI Will Bring to Medicine and Why Human Experts Are Here to Stay - Hakima Ibaroudene Group Leader - Research & Development Southwest Research Institute Overview Hakima Ibaroudene discusses how she and her team…
Does Healthcare AI Meet Basic Ethics Principles? - Ingrid Vasiliu-Feltes Chief Quality and Innovation Officer MEDNAX, Health Solutions Partner Over the past decade we have noticed an exponential…