Biography

Hao Zhang (张豪) is a Principal Engineer at Noah’s Ark Lab, Singapore Research Center, Huawei International, working on large language models and their application to retrieval-augmented generation (RAG). He obtained his Ph.D. from the College of Computing and Data Science at Nanyang Technological University (NTU), Singapore, in July 2022, advised by Prof. Aixin SUN. Before that, he received his Master’s degree from the School of Electrical and Electronic Engineering at NTU in June 2016 and his Bachelor’s degree from Dalian University of Technology (DLUT) in July 2015.

His broad research interests include natural language processing, vision-language learning, and machine learning. Specifically, his research topics include large language models, retrieval-augmented generation, and multimodal LLMs.

Research Community Services:
Reviewer: ACL Rolling Review (2023-), ICLR 2024-2025, WWW 2024, KDD 2024, ACL 2019-2023, IJCAI 2023-2024, EMNLP 2023, MM 2023, COLING 2022-2023, SIGIR 2021-2022, AAAI 2021
Co-organizer: Interactive Recommendation System Workshop, WSDM 2023

Interests
  • Large Language Models
  • Multimodal LLMs
  • Retrieval-Augmented Generation
  • Vision-Language Learning

Education
  • PhD in Computer Science, 2019-2022

    Nanyang Technological University, Singapore

  • MSc in Communications Engineering, 2015-2016

    Nanyang Technological University, Singapore

  • BEng in Communications Engineering, 2011-2015

    Dalian University of Technology, China

Experience

Principal Engineer
July 2022 – August 2024, Singapore
Working on supervised fine-tuning (SFT) of the Pangu LLM, retrieval-augmented generation, agents, LLMs for recommendation, etc.

Senior Research Engineer
June 2018 – May 2022, Singapore

Senior Research Engineer/Principal Investigator, Centre for Frontier AI Research, Jan 2022 – May 2022

Research Engineer, Institute of High Performance Computing, Jun 2018 – Dec 2021

Research Associate
July 2016 – May 2018, Singapore

Publications

(2024). DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering. In EMNLP.

(2024). LONG2RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall. In EMNLP.

(2024). Multi-view Content-aware Indexing for Long Document Retrieval. In EMNLP.

(2024). Collaborative Cross-modal Fusion with Large Language Model for Recommendation. In CIKM.

(2024). Retrieval-Oriented Knowledge for Click-Through Rate Prediction. In CIKM.
