Publications

* denotes equal contribution. Full list on Google Scholar.

2026

A. Kembay, S. Gunasekaran, R.-J. Zhu, Y. Zhang, J.K. Eshraghian
Efficient Knowledge Distillation via Salient Feature Masking
APL Machine Learning, 2026

W.-J. Shu, X. Qiu, R.-J. Zhu, H.-H. Chen, Y. Liu, H. Yang
LoopViT: Scaling Visual ARC with Looped Transformers
Preprint, 2026 [pdf]

2025

H. Xu, X. Qiu, Y. Xu, M.E. Elbtity, P. Zhou, Y. Tian, R.-J. Zhu, J. Zhang, S. Gu, et al.
Neuromorphic Spike-Based Large Language Model
National Science Review, 2025

S. Gunasekaran, A. Kembay, H. Ladret, R.-J. Zhu, L. Perrinet, O. Kavehei, et al.
A Predictive Approach to Enhance Time-Series Forecasting
Nature Communications 16(1), 8645, 2025

S. Yuan, J. Huang, Y. Shi, Y. Xu, R.-J. Zhu, B. Lin, X. Cheng, L. Yuan, J. Luo
MagicTime: Time-Lapse Video Generation Models as Metamorphic Simulators
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025

Y. Zhang, S. Yang, R.-J. Zhu, Y. Zhang, L. Cui, Y. Wang, B. Wang, F. Shi, B. Wang, et al.
Gated Slot Attention for Efficient Linear-Time Sequence Modeling
ICLR, 2025

X. Qiu, J. Zhang, W. Wei, H. Cao, J. Guo, R.-J. Zhu, Y. Shan, Y. Yang, M. Zhang, et al.
Quantized Spike-Driven Transformer
ICLR, 2025

Y. Shan, M. Zhang, R.-J. Zhu, X. Qiu, J.K. Eshraghian, H. Qu
Advancing Spiking Neural Networks towards Multiscale Spatiotemporal Interaction Learning
AAAI, 2025

R.-J. Zhu*, Z. Wang*, K. Hua, T. Zhang, Z. Li, H. Que, B. Wei, Z. Wen, F. Yin, et al.
Scaling Latent Reasoning via Looped Language Models
Preprint, 2025 [pdf]

D. Wang*, R.-J. Zhu*, S. Abreu, Y. Shan, T. Kergan, Y. Pan, Y. Chou, Z. Li, et al.
A Systematic Analysis of Hybrid Linear Attention
Preprint, 2025 [pdf]

Y. Pan, Y. An, Z. Li, Y. Chou, R.-J. Zhu, X. Wang, M. Wang, J. Wang, G. Li
Scaling Linear Attention with Sparse State Expansion
Preprint, 2025 [pdf]

Y. Chou, Z. Liu, R.-J. Zhu, X. Wan, T. Li, C. Chu, Q. Liu, J. Wu, Z. Ma
ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention
Preprint, 2025 [pdf]

M. Hui*, R.-J. Zhu*, S. Yang*, Y. Zhang, Z. Wang, Y. Zhou, J.K. Eshraghian, C. Xie
ARFlow: Autoregressive Flow with Hybrid Linear Attention
Preprint, 2025 [pdf]

J. Liu, D. Zhu, Z. Bai, Y. He, H. Liao, H. Que, Z. Wang, C. Zhang, G. Zhang, R.-J. Zhu, et al.
A Comprehensive Survey on Long Context Language Modeling
Preprint, 2025 [pdf]

Q. Ma, R.-J. Zhu, P. Liu, R. Yan, F. Zhang, L. Liang, M. Li, Z. Yu, Z. Wang, Y. Cai, et al.
Inner-Probe: Discovering Copyright-Related Data Generation in LLM Architecture
IEEE Transactions on Artificial Intelligence, 2025

X. Hao, R.-J. Zhu, G. Zhang, K. Shen, C. Li
Reformulation for Pretraining Data Augmentation
Preprint, 2025 [pdf]

X. Qu, S. Wang, Z. Huang, K. Hua, F. Yin, R.-J. Zhu, J. Zhou, Q. Min, Z. Wang, et al.
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
Preprint, 2025 [pdf]

S. Abreu, S.B. Shrestha, R.-J. Zhu, J.K. Eshraghian
Neuromorphic Principles for Efficient Large Language Models on Intel Loihi 2
Workshop on Scalable Optimization for Efficient and Adaptive Foundation Models (NeurIPS), 2025

Y. Shan, Z. Ren, H. Wu, W. Wei, R.-J. Zhu, S. Wang, D. Zhang, Y. Xiao, J. Zhang, et al.
SDTrack: A Baseline for Event-Based Tracking via Spiking Neural Networks
Preprint, 2025 [pdf]

J. Nardone, R.-J. Zhu, J. Callenes, M.E. Elbtity, R. Zand, J.K. Eshraghian
Learnable Sparsification of Die-to-Die Communication via Spike-Based Encoding
Preprint, 2025 [pdf]

2024

R.-J. Zhu, S. Gunasekaran, J.K. Eshraghian
Bridging the Gap between Artificial Intelligence and Natural Intelligence
Nature Computational Science, 2024 [pdf]

R.-J. Zhu, Y. Zhang, S. Abreu, E. Sifferman, T. Sheaves, Y. Wang, D. Richmond, et al.
Scalable MatMul-Free Language Modeling
Preprint, 2024 [pdf] [code]

Y. Chou, M. Yao, K. Wang, Y. Pan, R.-J. Zhu, J. Wu, Y. Zhong, Y. Qiao, B. Xu, et al.
MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map
NeurIPS, 2024 (Oral)

S. Yuan, J. Huang, Y. Xu, Y. Liu, S. Zhang, Y. Shi, R.-J. Zhu, X. Cheng, J. Luo, et al.
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-Lapse Video Generation
NeurIPS, 2024

R.-J. Zhu, T. Peng, T. Cheng, X. Qu, J. Huang, D. Zhu, H. Wang, K. Xue, et al.
Autonomous Driving with Spiking Neural Networks
NeurIPS, 2024

B. Peng, D. Goldstein, Q. Anthony, A. Albalak, E. Alcaide, S. Biderman, et al.
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
COLM, 2024 [pdf] [code]

X. Qiu, R.-J. Zhu, Y. Chou, Z. Wang, L.J. Deng, G. Li
Gated Attention Coding for Training High-Performance and Efficient Spiking Neural Networks
AAAI, 2024 [pdf]

R.-J. Zhu, M. Zhang, Q. Zhao, H. Deng, Y. Duan, L.J. Deng
TCJA-SNN: Temporal-Channel Joint Attention for Spiking Neural Networks
IEEE Transactions on Neural Networks and Learning Systems, 2024

H. Deng, R.-J. Zhu, X. Qiu, Y. Duan, M. Zhang, L.J. Deng
Tensor Decomposition Based Attention Module for Spiking Neural Networks
Knowledge-Based Systems, 2024 [pdf]

S.K. Nath, S.K. Das, S.K. Nandi, C. Xi, C.V. Marquez, A. Rúa, M. Uenuma, et al.
Optically Tunable Electrical Oscillations in Oxide-Based Memristors for Neuromorphic Computing
Advanced Materials, 2024 [pdf]

S. Kundu, R.-J. Zhu, A. Jaiswal, P.A. Beerel
Recent Advances in Scalable Energy-Efficient and Trustworthy Spiking Neural Networks
ICASSP, 2024 [pdf]

R.-J. Zhu, Q. Zhao, G. Li, J.K. Eshraghian
SpikeGPT: Generative Pre-Trained Language Model with Spiking Neural Networks
Transactions on Machine Learning Research, 2024 [pdf] [code]

X. Qiu, Z. Wang, Z. Luan, R.-J. Zhu, X. Wu, M. Zhang, L.J. Deng
Both Efficiency and Effectiveness! A Large Scale Pre-Ranking Framework in Search System
Preprint, 2024

2023

B. Peng, E. Alcaide, Q. Anthony, A. Albalak, S. Arcadinho, H. Cao, X. Cheng, et al.
RWKV: Reinventing RNNs for the Transformer Era
EMNLP Findings, 2023 [pdf] [code]

C. Jin, R.-J. Zhu, X. Wu, L.J. Deng
VTSNN: A Virtual Temporal Spiking Neural Network
Frontiers in Neuroscience, 2023 [pdf]

X. Feng, Q. Zhao, R.-J. Zhu
Towards Popularity Prediction of Information Cascades via Degree Distribution and Deep Neural Networks
Journal of Informetrics, 2023 [pdf]

X. Qiu, Z. Luan, Z. Wang, R.-J. Zhu
When Spiking Neural Networks Meet Temporal Attention Image Decoding and Adaptive Spiking Neuron
ICLR Tiny Paper Track, 2023 [pdf]

Z. Zhu, R.-J. Zhu, Y. Ge, Q. Zhao
Uni-Match: A Semantic Unified Model for Query-Product Retrieval
ICLR Tiny Paper Track, 2023 [pdf]