9th SIGKDD International Workshop on Mining and Learning from Time Series

Introduction

Time series data are ubiquitous. In domains as diverse as finance, retail, entertainment, transportation and health care, we observe a fundamental shift away from parsimonious, infrequent measurement to nearly continuous monitoring and recording. Recent advances in diverse sensing technologies, ranging from remote sensors to wearables and social sensing, are generating a rapid growth in the size and complexity of time series archives. Thus, although time series analysis has been studied extensively, its importance only continues to grow. What is more, modern time series data pose significant challenges to existing techniques (e.g., irregular sampling in hospital records and spatiotemporal structure in climate data). Finally, time series mining research is challenging and rewarding because it bridges a variety of disciplines and demands interdisciplinary solutions. Now is the time to discuss the next generation of temporal mining algorithms. The focus of MiLeTS workshop is to synergize the research in this area and discuss both new and open problems in time series analysis and mining. The solutions to these problems may be algorithmic, theoretical, statistical, or systems-based in nature. Further, MiLeTS emphasizes applications to high impact or relatively new domains, including but not limited to biology, health and medicine, climate and weather, road traffic, astronomy, and energy.
The MiLeTS workshop will discuss a broad variety of topics related to time series, including:

Time series pattern mining and detection, representation, searching and indexing, classification, clustering, prediction, forecasting, and rule mining.
Time series with special structure: spatiotemporal (e.g., traffic speeds at different locations), relational (e.g., patients with similar diseases), hierarchical, etc.
Time series with sparse or irregular sampling, missing values at and not at random, and special types of measurement noise or bias.
Time series that are multivariate, high-dimensional, heterogeneous, etc., or that possess other atypical properties.
Time series analysis using less traditional approaches, such as deep learning and subspace clustering.
Privacy preserving time series mining and learning.
Online, high-speed learning and mining from streaming time series.
Uncertain time series mining.
Applications to high impact or relatively new time series domains, such as health and medicine, road traffic, and air quality.
New, open, or unsolved problems in time series analysis and mining.

Schedule

1:00 PM - 5:00 PM (PT)

Venue: 201A

01:00-01:15 Opening remarks

01:15-2:00 Keynote Talk

Robust and Intelligent Time Series Analysis and Applications , Prof. Wotao Yin

2:50-2:45 Keynote Talk

Modeling Temporal Data via Knowledge-Guided Machine Learning: A New Framework for Accelerating Scientific Discovery and Addressing Global Environmental Challenges , Prof. Vipin Kumar

2:45-3:30 Poster Session + Coffee Break

3:30-4:10 Contributed Talks

Deep Sequence Modeling for Event Log-based Predictive Maintenance. Yun Zhou, Yawei Wang, Huan Song, Tesfagabir Meharizghi, Mohamad Al Jazaery, Denisse Colin-Magana, Aruna Abeyakoon and Panpan Xu.
Dynamic Ensemble for Probabilistic Time-series forecasting via Deep Reinforcement Learning. Yuhao Ding, Youngsuk Park, Karthick Gopalswamy, Hilaf Hasson, Yuyang Wang and Jun Huan.
VQ-TR: Vector Quantized Attention for Time Series Forecasting. Kashif Rasul, Umang Gupta, Hena Ghonia, Anderson Schneider and Yuriy Nevmyvaka.
Long-Range Transformers for Dynamic Spatiotemporal Forecasting. Jake Grigsby, Zhe Wang, Nam H Nguyen and Yanjun Qi.
STNN-SDE: Continuous Stochastic Spatial-Temporal Representation Learning via Latent Stochastic Differential Equations. Yanbo Xu, Piyush Patil, Lingkai Kong and Chao Zhang.

4:10-4:55 pm Keynote Talk

Convolutional kernels for effective and scalable time series analytics, Prof. Geoff Webb

4:55-5:00 pm Concluding Remarks

Speakers

Vipin Kumar

Regents Professor and William Norris Endowed Chair,
Department of Computer Science and Engineering,
Director, CSE Data Science Initiative
University of Minnesota

Modeling Temporal Data via Knowledge-Guided Machine Learning: A New Framework for Accelerating Scientific Discovery and Addressing Global Environmental Challenges

Ability to effectively model complex temporal data is central to addressing global environmental chal-lenges, as it can improve our ability to understand the behavior of environmental systems and how they respond to changing climate and human actions. Process-based models of dynamical systems are often used to study a variety of environmental systems. Despite their extensive use, these models have several well-known limitations due to incomplete or inaccurate representations of the physical processes being modeled. There is a tremendous opportunity to systematically advance modeling in these domains by using state of the art machine learning (ML) methods that have already revolution-ized computer vision and language translation. However, capturing this opportunity is contingent on a paradigm shift in data-intensive scientific discovery since the “black box” use of ML often leads to seri-ous false discoveries in scientific applications. Because the hypothesis space of scientific applications is often complex and exponentially large, an uninformed data-driven search can easily select a highly complex model that is neither generalizable nor physically interpretable, resulting in the discovery of spurious relationships, predictors, and patterns. This problem becomes worse when there is a scarcity of labeled samples, which is quite common in science and engineering domains. This talk makes the case that in real-world systems that are governed by physical processes, there is an opportunity to take advantage of fundamental physical principles to inform the search of a physical-ly meaningful and accurate ML model. While this talk will illustrate the potential of the knowledge-guided machine learning (KGML) paradigm in the context of environmental problems (e.g., Fresh water science, Hydrology, Agronomy), the paradigm has the potential to greatly advance the pace of discov-ery in a diverse set of discipline where mechanistic models are used, e.g., climate science, weather forecasting, and pandemic management.

Biography
Vipin Kumar is a Regents Professor and holds William Norris Chair in the department of Computer Science and Engineering at the University of Minnesota. His research spans data mining, high-performance computing, and their applications in Climate/Ecosystems and health care. He also served as the Director of Army High Performance Computing Research Center (AHPCRC) from 1998 to 2005. He has authored over 400 research articles, and co-edited or coauthored 11 books including two widely used text books ``Introduction to Parallel Computing", "Introduction to Data Mining", and a recent edited collection, “Knowledge Guided Machine Learning”. Kumar's current major research focus is on knowledge-guided machine learning and its applications to understanding the impact of human induced changes on the Earth and its environment. Kumar’s research on this topic has been funded by NSF’s AI Institutes, BIGDATA, INFEWS, HDR, STC, and GCR programs, as well as ARPA-E, DARPA, and USGS. He has recently finished serving as the Lead PI of a 5-year, $10 Million pro-ject, "Understanding Climate Change - A Data Driven Approach", funded by the NSF's Expeditions in Computing program. Kumar is a Fellow of the AAAI, ACM, IEEE, AAAS, and SIAM. Kumar's founda-tional research in data mining and high performance computing has been honored by the ACM SIGKDD 2012 Innovation Award, which is the highest award for technical excellence in the field of Knowledge Discovery and Data Mining (KDD), the 2016 IEEE Computer Society Sidney Fernbach Award, one of IEEE Computer Society's highest awards in high performance computing, and Test-of-time award from 2021 Supercomputing conference (SC21).

Wotao Yin

Director of Decision Intelligence Lab
Damo Academy, Alibaba Group

Robust and Intelligent Time Series Analysis and Applications

In this talk, we present the robust and intelligent approaches toward time series analysis developed by Alibaba DAMO Academy - Decision Intelligence Lab. We will overview our techniques of periodicity detection, trend filtering, and seasonal-trend decomposition, which allow us to handle complex patterns and anomalies with precision. Moving beyond theory, we will discuss how we apply techniques in forecasting and anomaly detection, with an emphasis on explainable electricity forecasting, as well as disucssing our products such as eForecaster and MindOpt, and AIOps solutions. This talk will also highlight how anomaly detection leads to root cause analysis and how forecasting informs decision-making processes, such as autoscaling.

Biography:
Dr. Yin received my Ph.D. degree in operations research from Columbia University in 2006 under Prof. Donald Goldfarb. Before joining Alibaba US in 2019, he was a Professor in the Department of Mathematics, University of California, Los Angeles. During 2006–2013, he was with the Department of Computational and Applied Mathematics at Rice University. Dr. Yin's research interests include computational optimization and its applications in signal processing, machine learning, and other data science problems. He has won the NSF CAREER award in 2008, an Alfred P. Sloan Research Fellowship in 2009, a Morningside Gold Medal in 2016, a Damo Award in 2021, and INFORMS Egon Balas Prize in 2021. He has eight papers co-authored with his students and collaborators that have received the best-paper-kind awards. Since 2018, he have been among the top 1% cited researchers by Clarivate Analytics.

Geoff Webb

Department of Data Science & Artificial Intelligence, Monash Data Futures Institute,
Professor, Monash University, Australia

Convolutional kernels for effective and scalable time series analytics

Time series classification is a fundamental data science task, providing understanding of dynamic processes as they evolve over time. Convolutional kernels provide an effective methods for extracting a wide range of different forms of information from time series data. I present the Rocket family of time series classification technologies that utilize convolutional kernels to achieve state-of-the-art accuracy with many orders of magnitude greater efficiency and scalability than any alternative. These make time series classification feasible at hitherto unattainable scale. The methods also have potential application across many other forms of time series analysis, including extrinsic regression, clustering, anomaly detection, segmentation and forecasting.

Biography:
Professor Geoff Webb is an eminent and highly-cited data scientist. He was editor in chief of the Data Mining and Knowledge Discovery journal, from 2005 to 2014. He has been Program Committee Chair of both ACM SIGKDD and IEEE ICDM, as well as General Chair of ICDM and member of the ACM SIGKDD Executive. He is a Technical Advisor to machine learning as a service startup BigML Inc and to recommender systems startup FROOMLE. He developed many of the key mechanisms of support-confidence association discovery in the 1980s. His OPUS search algorithm remains the state-of-the-art in rule search. He pioneered multiple research areas as diverse as black-box user modelling, interactive data analytics and statistically-sound pattern discovery. He has developed many useful machine learning algorithms that are widely deployed. His many awards include IEEE Fellow, the inaugural Eureka Prize for Excellence in Data Science (2017) and the Pacific-Asia Conference on Knowledge Discovery and Data Mining Distinguished Research Contributions Award (2022).

Call for Papers

Submissions should follow the SIGKDD formatting requirements (unless otherwise stated) and will be evaluated using the SIGKDD Research Track evaluation criteria. Preference will be given to papers that are reproducible, and authors are encouraged to share their data and code publicly whenever possible. Submissions are limited to be no more than 9 pages (suggested 4-8 pages), including references (all in a single pdf). All submissions must be in pdf format using the KDD main conference paper template (see: https://kdd.org/kdd2023/call-for-research-track-papers/ ). Submissions will be managed via the MiLeTS 2023 EasyChair website: https://easychair.org/conferences/?conf=milets2023.

Note on open problem submissions: In order to promote new and innovative research on time series, we plan to accept a small number of high quality manuscripts describing open problems in time series analysis and mining. Such papers should provide a clear, detailed description and analysis of a new or open problem that poses a significant challenge to existing techniques, as well as a thorough empirical investigation demonstrating that current methods are insufficient.

The review process is single-round and double-blind (submission files have to be anonymized). Concurrent submissions to other journals and conferences are acceptable. Accepted papers will be presented as posters during the workshop and list on the website (non-archival/without proceedings). Besides, a small number of accepted papers will be selected to be presented as contributed talks.

Any questions may be directed to the workshop e-mail address: kdd.milets@gmail.com.

Download Call for Papers here.

Introduction

Schedule

1:00 PM - 5:00 PM (PT)

Venue: 201A

Speakers

Modeling Temporal Data via Knowledge-Guided Machine Learning: A New Framework for Accelerating Scientific Discovery and Addressing Global Environmental Challenges

Robust and Intelligent Time Series Analysis and Applications

Convolutional kernels for effective and scalable time series analytics

Accepted Papers

Call for Papers

Key Dates

Workshop Organizers

Steering Committee

Program Committee

Zhengping Che

Feiyang Cai

Jing Dai

Jiechao Gao

Shima Imani

Yushan Jiang

Ming Jin

Xiang Li

Yuxuan Liang

Dongsheng Luo

Yingtao Luo

Minghua Ma

Lili Meng

Zijie Pan

John Paparrizos

Xingjian Shi

An Wang

Dingsu Wang

Pengyang Wang

Haomin Wen

Renjie Wu

Chen Xu

Dongkuan Xu

Yiyuan Yang

Yi-Fan Zhang

Md Rahman

Workshop Sponsors

We are grateful to Morgan Stanley for sponsoring our workshop and supporting the best paper and poster awards.