* denotes equal contribution. Full list on Google Scholar.
Efficient Knowledge Distillation via Salient Feature Masking
APL Machine Learning, 2026
LoopViT: Scaling Visual ARC with Looped Transformers
Preprint, 2026
[pdf]
Neuromorphic Spike-Based Large Language Model
National Science Review, 2025
A Predictive Approach to Enhance Time-Series Forecasting
Nature Communications 16(1), 8645, 2025
MagicTime: Time-Lapse Video Generation Models as Metamorphic Simulators
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025
Gated Slot Attention for Efficient Linear-Time Sequence Modeling
ICLR, 2025
Quantized Spike-Driven Transformer
ICLR, 2025
Advancing Spiking Neural Networks towards Multiscale Spatiotemporal Interaction Learning
AAAI, 2025
Scaling Latent Reasoning via Looped Language Models
Preprint, 2025
[pdf]
A Systematic Analysis of Hybrid Linear Attention
Preprint, 2025
[pdf]
Scaling Linear Attention with Sparse State Expansion
Preprint, 2025
[pdf]
ZeCo: Zero Communication Overhead Sequence Parallelism for Linear Attention
Preprint, 2025
[pdf]
ARFlow: Autoregressive Flow with Hybrid Linear Attention
Preprint, 2025
[pdf]
A Comprehensive Survey on Long Context Language Modeling
Preprint, 2025
[pdf]
Inner-Probe: Discovering Copyright-Related Data Generation in LLM Architecture
IEEE Transactions on Artificial Intelligence, 2025
Reformulation for Pretraining Data Augmentation
Preprint, 2025
[pdf]
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
Preprint, 2025
[pdf]
Neuromorphic Principles for Efficient Large Language Models on Intel Loihi 2
Workshop on Scalable Optimization for Efficient and Adaptive Foundation Models (NeurIPS), 2025
SDTrack: A Baseline for Event-Based Tracking via Spiking Neural Networks
Preprint, 2025
[pdf]
Learnable Sparsification of Die-to-Die Communication via Spike-Based Encoding
Preprint, 2025
[pdf]
Bridging the Gap between Artificial Intelligence and Natural Intelligence
Nature Computational Science, 2024
[pdf]
Scalable MatMul-Free Language Modeling
Preprint, 2024
[pdf]
[code]
MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map
NeurIPS, 2024 (Oral)
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-Lapse Video Generation
NeurIPS, 2024
Autonomous Driving with Spiking Neural Networks
NeurIPS, 2024
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
COLM, 2024
[pdf]
[code]
Gated Attention Coding for Training High-Performance and Efficient Spiking Neural Networks
AAAI, 2024
[pdf]
TCJA-SNN: Temporal-Channel Joint Attention for Spiking Neural Networks
IEEE Transactions on Neural Networks and Learning Systems, 2024
Tensor Decomposition Based Attention Module for Spiking Neural Networks
Knowledge-Based Systems, 2024
[pdf]
Optically Tunable Electrical Oscillations in Oxide-Based Memristors for Neuromorphic Computing
Advanced Materials, 2024
[pdf]
Recent Advances in Scalable Energy-Efficient and Trustworthy Spiking Neural Networks
ICASSP, 2024
[pdf]
SpikeGPT: Generative Pre-Trained Language Model with Spiking Neural Networks
Transactions on Machine Learning Research, 2024
[pdf]
[code]
Both Efficiency and Effectiveness! A Large Scale Pre-Ranking Framework in Search System
Preprint, 2024
RWKV: Reinventing RNNs for the Transformer Era
EMNLP Findings, 2023
[pdf]
[code]
VTSNN: A Virtual Temporal Spiking Neural Network
Frontiers in Neuroscience, 2023
[pdf]
Towards Popularity Prediction of Information Cascades via Degree Distribution and Deep Neural Networks
Journal of Informetrics, 2023
[pdf]
When Spiking Neural Networks Meet Temporal Attention Image Decoding and Adaptive Spiking Neuron
ICLR Tiny Paper Track, 2023
[pdf]
Uni-Match: A Semantic Unified Model for Query-Product Retrieval
ICLR Tiny Paper Track, 2023
[pdf]