About Me 👨‍🔬

I am currently an Assistant Professor at the Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), affiliated with Shenzhen University.

My current research interests 🔬 focus on:

  • RWKV: a novel neural network architecture for language modeling
  • Visual-Language Models combining visual and textual understanding

If you’re interested in academic collaboration, feel free to email me — I’d love to connect! We’re also recruiting interns — reach out if you’re passionate about AI! ✉️🚀

I have published 20+ papers at top international AI conferences such as EMNLP, COLING, and COLM.

Work Experience

  • 2023.08 - Now, Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Assistant Professor
  • 2017.08 - 2022.11, Tencent ML Platform (Shenzhen, China), Applied Research Scientist

News 🔥

  • 2025.03: 🎉 RWKV-UI is accepted by ICME 2025
  • 2024.12: 🎉 VisualRWKV is accepted by COLING 2025. Check out the Chinese introduction video on Bilibili 🎬
  • 2023.08: I joined the Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ) as an Assistant Professor

Selected Publications 📝

COLING 2025

VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models
Haowen Hou, Peigen Zeng, Fei Ma, Fei Richard Yu

Code | Paper | Video | BibTeX

COLM 2025

RWKV-7 “Goose” with Expressive Dynamic State Evolution
Bo Peng, Ruichong Zhang, Daniel Goldstein, Eric Alcaide, Haowen Hou, et al.

Paper | HuggingFace | Code

IEEE SPL

MLLM-TA: Leveraging Multimodal Large Language Models for Precise Temporal Video Grounding
Yi Liu, Haowen Hou (co-first author), Fei Ma, Shiguang Ni, Fei Richard Yu

Paper

EAAI
COLM 2024

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
Bo Peng, Eric Alcaide, Quentin Anthony, …, Haowen Hou, et al.

Paper | HuggingFace | Code

EMNLP 2023

RWKV: Reinventing RNNs for the Transformer Era
Bo Peng, Eric Alcaide, Quentin Anthony, …, Haowen Hou, et al.

Paper | HuggingFace | Code

Other Publications 📚

Projects 🏭

2022 E-Commerce Visual Language Pre-training

  • Developed a large-scale (100M pairs) e-commerce visual-language pre-training model for enhanced product understanding.
  • Proposed BagFormer, a dual encoder for cross-modal retrieval using bag-wise interaction and entity-aware text granularity. Achieved top recall with low latency.
  • Ranked 1st in MUGE Challenge with BagFormer.

2021 WeChat Video Product Identification

  • Led design and development of a scalable video product identification pipeline: data processing, model training, inference acceleration, and deployment (TensorRT, Kubernetes, Pulsar).
  • Led the development of major modules: multimodal video classification (ResNet-50, BERT, and NeXtVLAD), product detection (YOLO-v5), product retrieval (Siamese Network), and product tagging (ALBEF + XGBoost).
  • Boosted overall GSB by 12.37%, Top-3 GSB by 26%, contributing to WeChat Video reaching 500M DAU.
  • Won 2nd place in the AAAI 2021 Watch and Buy Challenge.

2020 WeChat Pay Merchant Understanding

  • Tagged 7.7M merchants with 370M high-quality tags using distant supervision and active learning.
  • Modeled merchant-tag relations as a reading comprehension task using BERT + fine-grained entity types.
  • Achieved 96.6% top-1 accuracy on head merchants and 89.0% overall.
  • Improved pickup rate by 6.58% and usage rate by 0.28% (via A/B testing).

2019 WeChat Search Query Improvement

  • Built a relevant-query recommendation system to improve search satisfaction.
  • Constructed co-click query graphs (Hadoop), filtered them with FastText, trained DSSM with hard negatives, and deployed a FAISS PQ index for real-time similarity search (latency < 100 ms).
  • Led to a 4.5% increase in daily query views, raising search ad revenue by 3%.

Honors and Awards 🎖

  • 2018, Shenzhen High-Level Overseas Talent
  • 2013, NUS SMA3 Scholarship
  • 2012, NUS NGS Scholarship
  • 2007, First Prize, National Olympiad in Chemistry in Provinces (Top 1%)

Education 📖

  • 2013.01 - 2017.05, Ph.D., National University of Singapore, Singapore.
  • 2008.08 - 2012.07, B.Eng., Harbin Institute of Technology, Harbin.

Invited Talks 💬

  • 2025.03, RWKV: Next-Gen Model Architecture, NVIDIA GTC 2025
  • 2024.12, RWKV: Next-Gen Model Architecture, Future Medicine Conference
  • 2024.10, RWKV: Next-Gen Model Architecture, China National Conference on Social Media Processing 2024

Teaching 🧑‍🏫

  • 2024 Fall, Advanced Algorithms

Academic Service 🎓

  • 2024.06 - Now, Journal Reviewer, IEEE Signal Processing Letters
  • 2025.01 - Now, Conference Reviewer, IEEE International Conference on Multimedia & Expo (ICME)