Time series data are ubiquitous. In domains as diverse as finance, retail, entertainment, transportation and health care, we observe a fundamental shift away from parsimonious, infrequent measurement to nearly continuous monitoring and recording. Recent
advances in diverse sensing technologies, ranging from remote sensors to wearables and social sensing, are generating a rapid growth in the size and complexity of time series archives. Thus, although time series analysis has been
studied extensively, its importance only continues to grow. What is more, modern time series data pose significant challenges to existing techniques (e.g., irregular sampling in hospital records and spatiotemporal structure in
climate data). Finally, time series mining research is challenging and rewarding because it bridges a variety of disciplines and demands interdisciplinary solutions. Now is the time to discuss the next generation of temporal mining
algorithms. The focus of MiLeTS workshop is to synergize the research in this area and discuss both new and open problems in time series analysis and mining. The solutions to these problems may be algorithmic, theoretical, statistical,
or systems-based in nature. Further, MiLeTS emphasizes applications to high impact or relatively new domains, including but not limited to biology, health and medicine, climate and weather, road traffic, astronomy, and energy.
The MiLeTS workshop will discuss a broad variety of topics related to time series, including:
01:00-01:15 Opening remarks
01:15-2:00 Keynote Talk
2:50-2:45 Keynote Talk
2:45-3:30 Poster Session + Coffee Break
3:30-4:10 Contributed Talks
4:10-4:55 pm Keynote Talk
4:55-5:00 pm Concluding Remarks
Regents Professor and William Norris Endowed Chair,
Department of Computer Science and Engineering,
Director, CSE Data Science Initiative
University of Minnesota
Ability to effectively model complex temporal data is central to addressing global environmental chal-lenges, as it can improve our ability to understand the behavior of environmental systems and how they respond to changing climate and human actions. Process-based models of dynamical systems are often used to study a variety of environmental systems. Despite their extensive use, these models have several well-known limitations due to incomplete or inaccurate representations of the physical processes being modeled. There is a tremendous opportunity to systematically advance modeling in these domains by using state of the art machine learning (ML) methods that have already revolution-ized computer vision and language translation. However, capturing this opportunity is contingent on a paradigm shift in data-intensive scientific discovery since the “black box” use of ML often leads to seri-ous false discoveries in scientific applications. Because the hypothesis space of scientific applications is often complex and exponentially large, an uninformed data-driven search can easily select a highly complex model that is neither generalizable nor physically interpretable, resulting in the discovery of spurious relationships, predictors, and patterns. This problem becomes worse when there is a scarcity of labeled samples, which is quite common in science and engineering domains. This talk makes the case that in real-world systems that are governed by physical processes, there is an opportunity to take advantage of fundamental physical principles to inform the search of a physical-ly meaningful and accurate ML model. While this talk will illustrate the potential of the knowledge-guided machine learning (KGML) paradigm in the context of environmental problems (e.g., Fresh water science, Hydrology, Agronomy), the paradigm has the potential to greatly advance the pace of discov-ery in a diverse set of discipline where mechanistic models are used, e.g., climate science, weather forecasting, and pandemic management.
Biography
Vipin Kumar is a Regents Professor and holds William Norris Chair in the department of Computer Science and Engineering at the University of Minnesota. His research spans data mining, high-performance computing, and their applications in Climate/Ecosystems and health care. He also served as the Director of Army High Performance Computing Research Center (AHPCRC) from 1998 to 2005. He has authored over 400 research articles, and co-edited or coauthored 11 books including two widely used text books ``Introduction to Parallel Computing", "Introduction to Data Mining", and a recent edited collection, “Knowledge Guided Machine Learning”. Kumar's current major research focus is on knowledge-guided machine learning and its applications to understanding the impact of human induced changes on the Earth and its environment. Kumar’s research on this topic has been funded by NSF’s AI Institutes, BIGDATA, INFEWS, HDR, STC, and GCR programs, as well as ARPA-E, DARPA, and USGS. He has recently finished serving as the Lead PI of a 5-year, $10 Million pro-ject, "Understanding Climate Change - A Data Driven Approach", funded by the NSF's Expeditions in Computing program. Kumar is a Fellow of the AAAI, ACM, IEEE, AAAS, and SIAM. Kumar's founda-tional research in data mining and high performance computing has been honored by the ACM SIGKDD 2012 Innovation Award, which is the highest award for technical excellence in the field of Knowledge Discovery and Data Mining (KDD), the 2016 IEEE Computer Society Sidney Fernbach Award, one of IEEE Computer Society's highest awards in high performance computing, and Test-of-time award from 2021 Supercomputing conference (SC21).
In this talk, we present the robust and intelligent approaches toward time series analysis developed by Alibaba DAMO Academy - Decision Intelligence Lab. We will overview our techniques of periodicity detection, trend filtering, and seasonal-trend decomposition, which allow us to handle complex patterns and anomalies with precision. Moving beyond theory, we will discuss how we apply techniques in forecasting and anomaly detection, with an emphasis on explainable electricity forecasting, as well as disucssing our products such as eForecaster and MindOpt, and AIOps solutions. This talk will also highlight how anomaly detection leads to root cause analysis and how forecasting informs decision-making processes, such as autoscaling.
Biography:
Dr. Yin received my Ph.D. degree in operations research from Columbia University in 2006 under Prof. Donald Goldfarb. Before joining Alibaba US in 2019, he was a Professor in the Department of Mathematics, University of California, Los Angeles. During 2006–2013, he was with the Department of Computational and Applied Mathematics at Rice University. Dr. Yin's research interests include computational optimization and its applications in signal processing, machine learning, and other data science problems. He has won the NSF CAREER award in 2008, an Alfred P. Sloan Research Fellowship in 2009, a Morningside Gold Medal in 2016, a Damo Award in 2021, and INFORMS Egon Balas Prize in 2021. He has eight papers co-authored with his students and collaborators that have received the best-paper-kind awards. Since 2018, he have been among the top 1% cited researchers by Clarivate Analytics.
Department of Data Science & Artificial Intelligence, Monash Data Futures Institute,
Professor, Monash University, Australia
Time series classification is a fundamental data science task, providing understanding of dynamic processes as they evolve over time. Convolutional kernels provide an effective methods for extracting a wide range of different forms of information from time series data. I present the Rocket family of time series classification technologies that utilize convolutional kernels to achieve state-of-the-art accuracy with many orders of magnitude greater efficiency and scalability than any alternative. These make time series classification feasible at hitherto unattainable scale. The methods also have potential application across many other forms of time series analysis, including extrinsic regression, clustering, anomaly detection, segmentation and forecasting.
Biography:
Professor Geoff Webb is an eminent and highly-cited data scientist. He was editor in chief of the Data Mining and Knowledge Discovery journal, from 2005 to 2014. He has been Program Committee Chair of both ACM SIGKDD and IEEE ICDM, as well as General Chair of ICDM and member of the ACM SIGKDD Executive. He is a Technical Advisor to machine learning as a service startup BigML Inc and to recommender systems startup FROOMLE. He developed many of the key mechanisms of support-confidence association discovery in the 1980s. His OPUS search algorithm remains the state-of-the-art in rule search. He pioneered multiple research areas as diverse as black-box user modelling, interactive data analytics and statistically-sound pattern discovery. He has developed many useful machine learning algorithms that are widely deployed. His many awards include IEEE Fellow, the inaugural Eureka Prize for Excellence in Data Science (2017) and the Pacific-Asia Conference on Knowledge Discovery and Data Mining Distinguished Research Contributions Award (2022).
Deep Sequence Modeling for Event Log-based Predictive Maintenance. Yun Zhou, Yawei Wang, Huan Song, Tesfagabir Meharizghi, Mohamad Al Jazaery, Denisse Colin-Magana, Aruna Abeyakoon and Panpan Xu.
Dynamic Ensemble for Probabilistic Time-series forecasting via Deep Reinforcement Learning. Yuhao Ding, Youngsuk Park, Karthick Gopalswamy, Hilaf Hasson, Yuyang Wang and Jun Huan.
VQ-TR: Vector Quantized Attention for Time Series Forecasting. Kashif Rasul, Umang Gupta, Hena Ghonia, Anderson Schneider and Yuriy Nevmyvaka.
Long-Range Transformers for Dynamic Spatiotemporal Forecasting. Jake Grigsby, Zhe Wang, Nam H Nguyen and Yanjun Qi.
STNN-SDE: Continuous Stochastic Spatial-Temporal Representation Learning via Latent Stochastic Differential Equations. Yanbo Xu, Piyush Patil, Lingkai Kong and Chao Zhang.
Time Series Forecasting with Distortion-Aware Convolutional Neural Networks. Krisztian Buza.
Classification of Sparse and Irregularly Sampled Time Series with Convolutional Neural Networks. Krisztian Buza.
Distribution-Free Multi-Horizon Forecasting and Vending System. Vincent Quenneville-Belair, Malcolm Wolff, Brady Willhelme, Dhruv Madeka and Dean Foster.
Client: Cross-variable Linear Integrated Enhanced Transformer for Multivariate Long-Term Time Series Forecasting. Jiaxin Gao, Wenbo Hu and Yuntian Chen.
On the Consistency and Robustness of Saliency Explanations for Time Series Classification. Bin Li, Chiara Balestra and Emmanuel Müller.
FuseTPT: Fused Transformer Model for Accurate Multivariate Time Series Target Prediction. Imry Kissos, Moshe Unger, Even Glemmestad and Guy Arie.
VQ-AR: Vector Quantized Autoregressive Probabilistic Time Series Forecasting. Kashif Rasul, Young-Jin Park, Max Nihlén Ramström and Kyung-Min Kim.
SageFormer: Series-Aware Graph-Enhanced Transformers for Multivariate Time Series Forecasting. Zhenwei Zhang, Xin Wang and Yuantao Gu.
Bridge the Performance Gap in Peak-hour Series Forecasting: The Seq2Peak Framework . Zhenwei Zhang, Xin Wang, Jingyuan Xie, Heling Zhang and Yuantao Gu.
Novel Physics-Based Machine-Learning Models for Indoor Air Quality Approximations . Ahmad Mohammadshirazi, Aida Nadafian, Amin Karimi Monsefi, Mohammad Rafiei and Rajiv Ramnath
Score Matching-based Pseudolikelihood Estimation of Neural Marked Temporal Point Process. Zichong Li, Qunzhi Xu, Zhenghao Xu, Yajun Mei, Tuo Zhao and Hongyuan Zha.
Learning Behavioral Representations of Routines From Large-scale Unlabeled Wearable Time-series Data Streams using Hawkes Point Process. Tiantian Feng, Brandon M. Booth and Shrikanth Narayanan.
PyPOTS: A Python Toolbox for Data Mining on Partially-Observed Time Series. Wenjie Du.
Submissions should follow the SIGKDD formatting requirements (unless otherwise stated) and will be evaluated using the SIGKDD Research Track evaluation criteria. Preference will be given to papers that are reproducible, and authors are encouraged to share their data and code publicly whenever possible. Submissions are limited to be no more than 9 pages (suggested 4-8 pages), including references (all in a single pdf). All submissions must be in pdf format using the KDD main conference paper template (see: https://kdd.org/kdd2023/call-for-research-track-papers/ ). Submissions will be managed via the MiLeTS 2023 EasyChair website: https://easychair.org/conferences/?conf=milets2023.
Note on open problem submissions: In order to promote new and innovative research on time series, we plan to accept a small number of high quality manuscripts describing open problems in time series analysis and mining. Such papers should provide a clear, detailed description and analysis of a new or open problem that poses a significant challenge to existing techniques, as well as a thorough empirical investigation demonstrating that current methods are insufficient.
The review process is single-round and double-blind (submission files have to be anonymized). Concurrent submissions to other journals and conferences are acceptable. Accepted papers will be presented as posters during the workshop and list on the website (non-archival/without proceedings). Besides, a small number of accepted papers will be selected to be presented as contributed talks.
Any questions may be directed to the workshop e-mail address: kdd.milets@gmail.com.
Download Call for Papers here.
Paper Submission Deadline: May 26th, 2023, 11:59PM Alofi Time
Author Notification: June 13th June 20th, 2023
Camera Ready Version: July 10th, 2023
Workshop: August 7th, 2023
University of Maryland, Baltimore County
AWS AI Labs
University of Virginia
University of Connecticut
Morgan Stanley
DAMO Academy, Alibaba Group (U.S.)
University of California Riverside
University of Southern California
University of New Mexico