Publications

A full list of publications can be found on Google Scholar.
(* Equal Contribution, Corresponding author)

  • Streaming LLMs/MLLMs (Demo)
  1. From Static Inference to Dynamic Interaction: A Survey of Streaming Large Language Models. [PDF] [Repository]
    Junlong Tong, Zilong Wang, YuJie Ren, Peiran Yin, Hao Wu, Wei Zhang, Xiaoyu Shen.
    Findings of ACL 2026.
  2. StreamingThinker: Large Language Models Can Think While Reading. [PDF] [Code] [Project]
    Junlong Tong, Yingqi Fan, Anhao Zhao, Yunpu Ma, Xiaoyu Shen.
    ICLR 2026.
  3. Think-as-You-See: Streaming Chain-of-Thought Reasoning for Large Vision-Language Models. [PDF] [Code] [Project]
    Jialiang Zhang*, Junlong Tong*, Junyan Lin, Hao Wu, Yunpu Ma, Xiaoyu Shen. (* Equal Contribution)
    CVPR 2026.
  4. LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding. [PDF] [Code]
    Junlong Tong, Jinlan Fu, Zixuan Lin, Yingqi Fan, Anhao Zhao, Hui Su, Xiaoyu Shen.
    Findings of ACL 2025.
  5. ProactiveLLM: Learning Active Interaction for Streaming Large Language Models.
    Junlong Tong, Yao Zhang, Anhao Zhao, Yingqi Fan, Yunpu Ma, Xiaoyu Shen.
    Under review
  6. Speak While Watching: Unleashing TRUE Real-Time Video Understanding Capability of Multimodal Large Language Models. [PDF][Code]
    Junyan Lin*, Junlong Tong*, Hao Wu, Jialiang Zhang, Jinming Liu, Xin Jin, Xiaoyu Shen.
    Under review
  • Efficient LLMs/MLLMs
  1. Context Guided Transformer Entropy Modeling for Video Compression. [PDF]
    Junlong Tong, Wei Zhang, Yaohui Jin, Xiaoyu Shen.
    ICCV 2025.
  2. What Do Visual Tokens Really Encode? Uncovering Sparsity and Redundancy in Multimodal Large Language Models. [PDF][Code]
    Yingqi Fan, Junlong Tong, Anhao Zhao, Xiaoyu Shen.
    CVPR 2026. (Highlight)
  3. HiDrop: Hierarchical Vision Token Reduction in MLLMs via Late Injection, Concave Pyramid Pruning, and Early Exit [PDF][Code]
    Hao Wu, Yingqi Fan, Jinyang Dai, Junlong Tong, Yunpu Ma, Xiaoyu Shen
    ICLR 2026.
  4. VisiPruner: Decoding Discontinuous Cross-Modal Dynamics for Efficient Multimodal LLMs. [PDF]
    Yingqi Fan, Anhao Zhao, Jinlan Fu, Junlong Tong, Hui Su, Yijie Pan, Wei Zhang, Xiaoyu Shen.
    EMNLP 2025.
  5. SkipGPT: Each Token is One of a Kind. [PDF] [Code]
    Anhao Zhao, Fanghua Ye, Yingqi Fan, Junlong Tong, Jing Xiong, Zhiwei Fei, Hui Su, Xiaoyu Shen.
    ICML 2025.
  6. From Data to Model: A Survey of the Compression Lifecycle in MLLMs. [PDF] [Repository]
    Hao Wu*, Junlong Tong*, Xudong Wang, Yang Tan, Changyu Zeng, Anastasia Antsiferova, Xiaoyu Shen.
    Under review
  • LLM for Sequence Modeling
  1. Rethinking the Role of LLMs in Time Series Forecasting. [PDF] [Code]
    Xin Qiu*, Junlong Tong*, Yirong Sun, Yunpu Ma, Wei Zhang, Xiaoyu Shen.
    Under review
  2. The Few Govern the Many: Unveiling Few-Layer Dominance for Time Series Models. [PDF] [Code]
    Xin Qiu*, Junlong Tong*, Yirong Sun, Yunpu Ma, Xiaoyu Shen.
    Under review
  3. Probabilistic Decomposition Transformer for Time Series Forecasting. [PDF] [Code]
    Junlong Tong, Liping Xie, Kanjian Zhang.
    SIAM International Conference on Data Mining (SDM 2023).
  4. Enhancing Time Series Forecasting: A Hierarchical Transformer with Probabilistic Decomposition Representation. [PDF]
    Junlong Tong, Liping Xie, Wankou Yang, Kanjian Zhang, Junsheng Zhao.
    Information Sciences 2023.
  5. Hourly Solar Irradiance Forecasting Based on Encoder–decoder Model Using Series Decomposition and Dynamic Error Compensation. [PDF]
    Junlong Tong, Liping Xie, Shixiong Fang, Wankou Yang, Kanjian Zhang.
    Energy Conversion and Management 2022.