Publications
A full list of publications can be found on Google Scholar.
(* Equal Contribution, † Corresponding author)
- Streaming LLMs/MLLMs (Demo)
- From Static Inference to Dynamic Interaction: A Survey of Streaming Large Language Models. [PDF] [Repository]
Junlong Tong, Zilong Wang, YuJie Ren, Peiran Yin, Hao Wu, Wei Zhang, Xiaoyu Shen†.
Findings of ACL 2026. - StreamingThinker: Large Language Models Can Think While Reading. [PDF] [Code] [Project]
Junlong Tong, Yingqi Fan, Anhao Zhao, Yunpu Ma, Xiaoyu Shen†.
ICLR 2026. - Think-as-You-See: Streaming Chain-of-Thought Reasoning for Large Vision-Language Models. [PDF] [Code] [Project]
Jialiang Zhang*, Junlong Tong*, Junyan Lin, Hao Wu, Yunpu Ma, Xiaoyu Shen†. (* Equal Contribution)
CVPR 2026. - LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding. [PDF] [Code]
Junlong Tong, Jinlan Fu, Zixuan Lin, Yingqi Fan, Anhao Zhao, Hui Su, Xiaoyu Shen†.
Findings of ACL 2025. - ProactiveLLM: Learning Active Interaction for Streaming Large Language Models.
Junlong Tong, Yao Zhang, Anhao Zhao, Yingqi Fan, Yunpu Ma, Xiaoyu Shen†.
Under review - Speak While Watching: Unleashing TRUE Real-Time Video Understanding Capability of Multimodal Large Language Models. [PDF][Code]
Junyan Lin*, Junlong Tong*, Hao Wu, Jialiang Zhang, Jinming Liu, Xin Jin, Xiaoyu Shen†.
Under review
- Efficient LLMs/MLLMs
- Context Guided Transformer Entropy Modeling for Video Compression. [PDF]
Junlong Tong, Wei Zhang, Yaohui Jin, Xiaoyu Shen†.
ICCV 2025. - What Do Visual Tokens Really Encode? Uncovering Sparsity and Redundancy in Multimodal Large Language Models. [PDF][Code]
Yingqi Fan, Junlong Tong, Anhao Zhao, Xiaoyu Shen†.
CVPR 2026. (Highlight) - HiDrop: Hierarchical Vision Token Reduction in MLLMs via Late Injection, Concave Pyramid Pruning, and Early Exit [PDF][Code]
Hao Wu, Yingqi Fan, Jinyang Dai, Junlong Tong, Yunpu Ma, Xiaoyu Shen†
ICLR 2026. - VisiPruner: Decoding Discontinuous Cross-Modal Dynamics for Efficient Multimodal LLMs. [PDF]
Yingqi Fan, Anhao Zhao, Jinlan Fu, Junlong Tong, Hui Su, Yijie Pan, Wei Zhang, Xiaoyu Shen†.
EMNLP 2025. - SkipGPT: Each Token is One of a Kind. [PDF] [Code]
Anhao Zhao, Fanghua Ye, Yingqi Fan, Junlong Tong, Jing Xiong, Zhiwei Fei, Hui Su, Xiaoyu Shen†.
ICML 2025. - From Data to Model: A Survey of the Compression Lifecycle in MLLMs. [PDF] [Repository]
Hao Wu*, Junlong Tong*, Xudong Wang, Yang Tan, Changyu Zeng, Anastasia Antsiferova, Xiaoyu Shen†.
Under review
- LLM for Sequence Modeling
- Rethinking the Role of LLMs in Time Series Forecasting. [PDF] [Code]
Xin Qiu*, Junlong Tong*, Yirong Sun, Yunpu Ma, Wei Zhang, Xiaoyu Shen†.
Under review - The Few Govern the Many: Unveiling Few-Layer Dominance for Time Series Models. [PDF] [Code]
Xin Qiu*, Junlong Tong*, Yirong Sun, Yunpu Ma, Xiaoyu Shen†.
Under review - Probabilistic Decomposition Transformer for Time Series Forecasting. [PDF] [Code]
Junlong Tong, Liping Xie, Kanjian Zhang.
SIAM International Conference on Data Mining (SDM 2023).
- Enhancing Time Series Forecasting: A Hierarchical Transformer with Probabilistic Decomposition Representation. [PDF]
Junlong Tong, Liping Xie, Wankou Yang, Kanjian Zhang, Junsheng Zhao.
Information Sciences 2023. - Hourly Solar Irradiance Forecasting Based on Encoder–decoder Model Using Series Decomposition and Dynamic Error Compensation. [PDF]
Junlong Tong, Liping Xie, Shixiong Fang, Wankou Yang, Kanjian Zhang.
Energy Conversion and Management 2022.
