The 11th Mining and Learning from Time Series Workshop: From Classical Methods to LLMs

(KDD MILETS Workshop 2025)

Held in conjunction with KDD'25
Aug 3, 2025

Introduction

Time series data are ubiquitous. In domains as diverse as finance, retail, entertainment, transportation and health care, we observe a fundamental shift away from parsimonious, infrequent measurement to nearly continuous monitoring and recording. Recent advances in diverse sensing technologies, ranging from remote sensors to wearables and social sensing, are generating a rapid growth in the size and complexity of time series archives. Thus, although time series analysis has been studied extensively, its importance only continues to grow. What is more, modern time series data pose significant challenges to existing techniques (e.g., irregular sampling in hospital records and spatiotemporal structure in climate data). Finally, time series mining research is challenging and rewarding because it bridges a variety of disciplines and demands interdisciplinary solutions. Now is the time to discuss the next generation of temporal mining algorithms. The focus of MiLeTS workshop is to synergize the research in this area and discuss both new and open problems in time series analysis and mining. The solutions to these problems may be algorithmic, theoretical, statistical, or systems-based in nature. Further, MiLeTS emphasizes applications to high impact or relatively new domains, including but not limited to biology, health and medicine, climate and weather, road traffic, astronomy, and energy.
The MiLeTS workshop will discuss a broad variety of topics related to time series, including:

Time series forecasting and prediction using classical approaches
Time series forecasting and prediction using LLMs
Time series pattern mining and detection, representation, searching and indexing, classification, clustering, prediction, forecasting, and rule mining.
Time series with special structure: spatiotemporal (e.g., traffic speeds at different locations), relational (e.g., patients with similar diseases), hierarchical, etc.
Time series with sparse or irregular sampling, missing values at and not at random, and special types of measurement noise or bias.
Time series that are multivariate, high-dimensional, heterogeneous, etc., or that possess other atypical properties.
Time series analysis using less traditional approaches, such as deep learning and subspace clustering.
Privacy preserving time series mining and learning.
Online, high-speed learning and mining from streaming time series.
Uncertain time series mining.
Applications to high impact or relatively new time series domains, such as health and medicine, road traffic, and air quality.
New, open, or unsolved problems in time series analysis and mining.

Schedule

08:00-08:10 AM Opening remarks

08:10-08:50 AM Keynote Talk

Conformal Prediction for Time Series: Beyond Exchangeability and Towards Multivariate Dependence , Prof. Yao Xie

8:50-9:30 AM Breakout Session

AI Agents in Time Series Forecasting: Replacement, Conjunction, or Revolution? , Madhura and Veer

9:30-10:15 AM Coffee Break + Poster Session

10:15-10:40 AM Keynote Talk

Towards Deep Time Series Modeling via Data-Centric AI , Prof. Yanjie Fu

10:55-11:20 Poster Session

11:20 AM-12:00 PM Keynote Talk

Frontiers of Foundation Models for Time Series , Prof. Yan Liu

12:00-12:05 PM Concluding Remarks

Speakers

Yao Xie

Coca-Cola Foundation Chair and Professor
H. Milton Stewart School of Industrial and Systems Engineering (ISyE)
Associate Director, Machine Learning Center
Adjunct Professor, School of Electrical and Computer Engineering (ECE)
Georgia Institute of Technology

Conformal Prediction for Time Series: Beyond Exchangeability and Towards Multivariate Dependence

Conformal prediction (CP) provides a powerful, distribution-free framework for constructing prediction intervals with finite-sample coverage guarantees. However, most classical CP methods rely on the assumption of data exchangeability, which is typically violated in time series due to temporal dependence. This has motivated a growing body of work aimed at extending CP to the time-series setting. Another computational challenge comes from handling multi-dimensional time-series. In this talk, I will review recent advances in conformal prediction for time series, including our own contributions to a general framework for constructing distribution-free prediction intervals for time series that wrap around a given black-box algorithm. The new approach relaxes the exchangeability assumption and allows for general forms of temporal and multi-variate dependence across dimensions. Theoretically, we establish asymptotic marginal and conditional coverage guarantees, which comes with certain width optimality in some cases. Methodologically, we propose computationally efficient procedures based on ensemble predictors that are closely related to standard CP, yet tailored for time series.

Biography
Yao Xie is the Coca-Cola Foundation Chair and Professor in the H. Milton Stewart School of Industrial and Systems Engineering (ISyE) at the Georgia Institute of Technology. She also serves as Associate Director of the Machine Learning Center and holds an adjunct faculty appointment in the School of Electrical and Computer Engineering (ECE). Her research focuses on the intersection of statistics, optimization, and machine learning, with an emphasis on developing computationally efficient and statistically rigorous algorithms for solving engineering challenges drawn from real-world applications. Her specific areas of interest include sequential data analysis, change-point detection, high-dimensional data analysis, spatio-temporal modeling, and optimization for robust learning and inference. Her work has contributed to societal and policy advancements, including data-driven policing initiatives. Her research has been featured in outlets such as Inside Higher Ed and Georgia Tech's "In Pursuit of Better Data Forecasting" and "Driving Data Science Innovation as Coca-Cola Foundation Chair." Yao Xie holds an Erdős number of 3. In recognition of her research and impact, she received the INFORMS Donald P. Gaver, Jr. Early Career Award for Excellence in Operations Research in 2022. This prestigious award, given to a single recipient annually across INFORMS, cited her "outstanding research contributions at the interface of operations research, statistics, machine learning, and optimization; for successfully applying her research talent to applications of societal importance; and for contributions to the education and mentoring of students at all levels." In 2024, she was further honored with the CWS Woodroofe Award, presented at the Joint Statistical Meeting (JSM) 2024.

Yanjie Fu

Associate Professor
School of Computing and AI,
Ira A. Fulton Schools of Engineering,
Arizona State University

Towards Deep Time Series Modeling via Data-Centric AI

Time series data, as a pervasive kind of data format, have played an important role in numerous real-world scenarios such as energy, transportation, healthcare, etc. Effective time series modeling can help with accurate forecasting, resource optimization, and risk management. Given the great importance, how can we effectively model the nature of the pervasive time series data? This talk will focus on deep learning for time series modeling from a novel data-centric AI perspective. This perspective originally categorizes data-centric time series modeling into “learning” and “manipulating” time series data distribution. In “learning” time series data, I will investigate how to extract the distribution regularity to expand dependencies for forecasting. In “manipulating” time series data, I will discuss how to overcome the distribution shift for advancing generalization. By incorporating deep learning-based time series analysis with data-centric AI techniques, this talk aims to achieve more effective and efficient time series modeling and subsequent decision-making.

Biography:
Dr. Yanjie Fu is an associate professor in the School of Computing and AI at the Arizona State University. He received his Ph.D. degree from the Rutgers University in 2016, the B.E. degree from the University of Science and Technology of China (USTC), and the M.E. degree from the Chinese Academy of Sciences (CAS). His teaching and research have been recognized by various awards, including: US NAE Frontiers of Engineering early career engineer (2023), US NSF CAREER (2021), 2024 Stanford Elsevier World’s Top 2% Scientists, Fulton Engineering Top 5% Teaching Recognition Award, University Reach the Star award. He is committed to data science education. His graduated Ph.D. students have joined academia as tenure-track faculty members. His recent focuses are machine learning with geospatial, time series data, data-centric AI (AI4data), AI for simulation and decisions, multimodal AI and foundation models, LLM and agentic AI (routing, teaming, reasoning). He currently serves as an Associate Editor of ACM Transactions on Knowledge Discovery from Data.

Yan Liu

Professor,
Thomas Lord Department of Computer Science
Director, USC Machine Learning Center,
Viterbi School of Engineering
Ming Hsieh Department of Electrical Engineering (by courtesy)
Quantitative and Computational Biology Department (by courtesy)
University of Southern California

Frontiers of Foundation Models for Time Series

Recent development in deep learning has spurred research advances in time series modeling and analysis. Practical applications of time series raise a series of new challenges in science applications, such as multi-resolution, multimodal, missing value, distributeness, and interpretability. In this talk, I will discuss possible paths to foundation models for time series data and future directions for time series research.

Biography:
Yan Liu is a full professor in the Computer Science Department of the Viterbi School of Engineering at the University of Southern California (USC), where she has been a faculty member since August 2010. Prior to joining USC, she worked as a research staff member in the Data Analytics Group at the IBM T.J. Watson Research Center from November 2006. She earned both her M.S. and Ph.D. degrees from Carnegie Mellon University. Her research focuses on machine learning and artificial intelligence, with applications spanning healthcare, sustainability—including climate science and traffic—and social media analysis. Her specific areas of interest include the development of novel machine learning models for time series analysis and modeling, explainable machine learning techniques, and efficient algorithms for sampling and optimization. Over the years, she has received numerous honors, including being named a 2021 ACM Distinguished Member and a 2020 National Academy of Inventors Senior Member. She was also recognized as a member of the 2018 New Voices of the National Academies of Science, Engineering, and Medicine and received the 2017 Biocom Catalyst Award. Additional accolades include the 2013 Okawa Foundation Research Award, the 2013 NSF CAREER Award, the 2011 James H. Zumberge Research and Innovation Fund Individual Award, the 2011 Yahoo Faculty Research & Engagement Award, an ACM Doctoral Dissertation Award Honorable Mention in 2007, and a Best Paper Award at the 2007 SIAM Conference on Data Mining.

Breakout Session Moderators

Madhura Raut

Principal Data Scientist,
Workday

Madhura Raut is a Principal Data Scientist at Workday, where she leads the design of large-scale machine learning systems for labor demand forecasting. Her work integrates time series modeling, AI, and reinforcement learning to enable real-time workforce optimization. She is the lead inventor on two U.S. patents related to advanced time series techniques, and her ML product has been recognized as a Top HR Product of the Year by Human Resource Executive. With over nine years of experience spanning enterprise machine learning and applied research, Madhura is passionate about building intelligent, production-grade systems that address complex operational challenges. Besides her professional work, she also serves as an IEEE Senior Member and mentor, actively contributing to the broader AI and data science community.

Veer Lade

Lead Applied Scientist,
Uber

Veer Lade is a Senior Applied Scientist at UberEats leading the Item Intelligence team focusing on developing ranking and recommender solutions for the item ranking space. He has been the seed scientist in the team helping UberEats expand into the Grocery & New Verticals domain. His work focuses on Recommender model development, experimentation, developing ranking simulations and deriving key data insights to guide Business decisions. Veer is passionate about data foundations, objective functions and deploying production grade ML systems. .

Accepted Papers

Multi-Modal Interpretable Graph for Competing Risk Prediction with Electronic Health Records . Munib Mesinovic, Peter Watkinson and Tingting Zhu.

ICeTEA: Mixture of Detectors for Metric-Log Anomaly Detection . Junxiang Wang, Xu Zheng, Zhengzhang Chen, Masanao Natsumeda, Nishioka Jun, Dongsheng Luo and Haifeng Chen.

L-GTA: Latent Generative Modeling for Time Series Augmentation. Luis Roque, Vitor Cerqueira, Carlos Soares and Luis Torgo.

A Comparative Benchmark of MSET and Contemporary Anomaly Detection Methods in Time Series Prognostics .Matthew Gerdes and Guang Chao Wang

Effectively Designing 2-Dimensional Sequence Models for Multivariate Time Series .Daniel Yiming Cao, Ali Behrouz, Ali Parviz, Mahdi Karami, Michele Santacatterina and Ramin Zabih

MedTPE: Compressing Long EHR Sequence for LLM-based Clinical Prediction with Token-Pair Encoding .Mingcheng Zhu, Zhiyao Luo, Yu Liu and Tingting Zhu

Learnable Continuous Wavelet Transform Network .Masaharu Yamamoto and Shigeru Maya

M$^2$Traj: A Data-Driven Framework for Multimorbidity Trajectory Identification from Multi-Disease Diagnosis Time Series Yu Liu and Tingting Zhu

Simulated Annealing-Based Imputation for Large Gaps in Big Science Network Time Series .Sanjay Chari, Kevin A. Brown, Andrew Norman and Christopher Carothers

ROCKET-LRP: Explainable Time Series Classification with Application to Anomaly Prediction in Manufacturing Zhijian Ling, Eldan Cohen, Takuya Aoyama and Keijiro Yano

Leto: Modeling Multivariate Time Series with Memorizing at Test Time .Ali Behrouz, Ali Parviz, Daniel Cao, Michele Santacatterina and Ramin Zabih

Using Supervised Anomaly Detection Algorithms to Localize Anomalies in Unlabeled Time Series Training Data .Matthew Gerdes and Guang Chao Wang

Scaled FP32 and Quantization-aware Training of PatchTST for Efficient Time Series Forecasting .Lorson Blair, Jeremy Buhler, Kaoutar El Maghraoui, Christopher Carothers, Naigang Wang and Jordan Murray

Call for Papers

Submissions should follow the SIGKDD formatting requirements (unless otherwise stated) and will be evaluated using the SIGKDD Research Track evaluation criteria. Preference will be given to papers that are reproducible, and authors are encouraged to share their data and code publicly whenever possible. Submissions are limited to be no more than 9 pages (suggested 4-8 pages), including references (all in a single pdf). All submissions must be in pdf format using the KDD main conference paper template (see: https://kdd2025.kdd.org/research-track-call-for-papers/ ). Submissions will be managed via the MiLeTS 2025 EasyChair website: https://easychair.org/conferences/?conf=milets2025.

Note on open problem submissions: In order to promote new and innovative research on time series, we plan to accept a small number of high quality manuscripts describing open problems in time series analysis and mining. Such papers should provide a clear, detailed description and analysis of a new or open problem that poses a significant challenge to existing techniques, as well as a thorough empirical investigation demonstrating that current methods are insufficient.

The review process is single-round and double-blind (submission files have to be anonymized). Concurrent submissions to other journals and conferences are acceptable. Accepted papers will be presented as posters during the workshop and list on the website (non-archival/without proceedings). Besides, a small number of accepted papers will be selected to be presented as contributed talks.

Any questions may be directed to the workshop e-mail address: kdd.milets@gmail.com.

Download Call for Papers here.

Key Dates

Paper Submission Deadline: May 8 May 31, 2025, 11:59PM Alofi Time

Author Notification: June 8 July 7, 2025, 11:59PM Alofi Time

Camera Ready Version: June 22, 2025 July 15, 2025, 11:59PM Alofi Time

Workshop: August 3, 2025

The 11th Mining and Learning from Time Series Workshop: From Classical Methods to LLMs

(KDD MILETS Workshop 2025)

Held in conjunction with KDD'25 Aug 3, 2025

Introduction

Schedule

Speakers

Conformal Prediction for Time Series: Beyond Exchangeability and Towards Multivariate Dependence

Towards Deep Time Series Modeling via Data-Centric AI

Frontiers of Foundation Models for Time Series

Breakout Session Moderators

Accepted Papers

Call for Papers

Key Dates

Workshop Organizers

Steering Committee

Program Committee

Zhengping Che

Jing Dai

Wei Chen

Zahid Hasan Tushar

Sultan Ahmed

Weiding Fan

Md Mahmudur Rahman

Yutong Feng

Ruiquan Huang

Yushan Jiang

John Paparrizos

Yingtao Luo

Xingjian Shi

Yiyuan Yang

Renjie Wu

Yangyu Wu

Haomin Wen

Held in conjunction with KDD'25
Aug 3, 2025