Biography

Hi! My name is Hao Zhang (张豪). I am currently a senior research scientist at the Language & Science AI Lab (LASA), Alibaba DAMO Academy, working on multimodal large language models (LLMs) and their applications in the biomedical field. My research focuses on developing novel techniques, including RAG, agents, complex reasoning and data synthesis, to enhance the capabilities of multimodal LLMs and apply them to address real-world challenges.

Prior joining DAMO Academy, I served as a principal engineer at Huawei Noah’s Ark Lab for two years. I received Ph.D degree at the College of Computing and Data Science at Nanyang Technological University (NTU), Singapore in July 2022, under the supervision of Professor Aixin SUN. Before that, I obtained my M.Sc degree from the School of Electrical and Electronic Engineering at NTU in June 2016 and B.Eng degree from Dalian University of Technology (DLUT) in July 2015.

Research Community Services:
Reviewer: ACL Rolling Review, ACL, EMNLP, COLING, ICLR, AAAI, IJCAI, WWW, KDD, SIGIR, MM
Co-organizer: Interactive Recommendation System Workshop, WSDM 2023

Interests
  • Large Language Models
  • Multimodal LLMs
  • AI for Science
  • Retrieval-Augmented Generation
Education
  • PhD in Computer Science, 2019-2022

    Nanyang Technological University, Singapore

  • MSc in Communications Engineering, 2015-2016

    Nanyang Technological University, Singapore

  • BEng in Communications Engineering, 2011-2015

    Dalian University of Technology, China

Experience

 
 
 
 
 
Staff Algorithm Engineer
September 2024 – Present Singapore
Working on LLMs, multimodal LLMs and their applications in the medical field
 
 
 
 
 
Principal Engineer
July 2022 – August 2024 Singapore
Working on Pangu LLM SFT, Retrieval-augmented Generation, Agent, LLM for Rec, etc.
 
 
 
 
 
Senior Research Engineer
June 2018 – May 2022 Singapore

Senior Research Engineer/Principal Investigator, Centre for Frontier AI Research, Jan 2022 - May 2022

Research Engineer, Institute of High Performance Computing, Jun 2018 - Dec 2021

 
 
 
 
 
Research Associate
July 2016 – May 2018 Singapore

Publications

Quickly discover relevant content by filtering publications.
(2024). DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering. In EMNLP.

PDF Cite Code 📌EMNLP'24

(2024). LONG2RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall. In EMNLP.

Preprint PDF Cite Code 📌EMNLP'24

(2024). MC-indexing: Effective Long Document Retrieval via Multi-view Content-aware Indexing. In EMNLP.

Preprint PDF Cite 📌EMNLP'24

(2024). Collaborative Cross-modal Fusion with Large Language Model for Recommendation. In CIKM.

Preprint PDF Cite DOI 📌CIKM'24

(2024). Retrieval-Oriented Knowledge for Click-Through Rate Prediction. In CIKM.

Preprint PDF Cite Code DOI 📌CIKM'24