Yuhao Zhang

张宇浩

Hey there, welcome!

I am currently an NLP scientist at Amazon AWS AI, on a mission to deliver innovative and state-of-the-art AI/NLP services to enterprises and end users.

Before Amazon, I obtained my PhD degree from Stanford University, where I was jointly advised by Prof. Chris Manning in the Stanford NLP Group and Prof. Curtis Langlotz in the Stanford AIMI Center. My PhD work focused on natural language processing and its applications in medicine.

Before that, I obtained an M.S. degree from the Computer Science Department at Stanford University, and a bachelor’s degree from the Department of Electronic Engineering at Tsinghua University, China.

research interest

I care about NLP systems and their impact in real-world applications. My research work has focused on four key areas:

  • information extraction, with a focus on understanding entities and relations in both general newswire and biomedical text;
  • text summarization, with a focus on improving the factual correctness of summarization systems;
  • multimodal learning, with a focus on joint modeling of medical images and text;
  • syntactic analysis and open-source NLP toolkits, including the widely used Stanza NLP library, which I co-authored.

contact

You can reach me now at {first-name} ~at~ cs.stanford.edu. You can also find my various social accounts at the bottom of this page.

selected publications

For a complete list, see the publications page or my Google Scholar page.

(*=equal contribution)

  1. ACL Findings
    RobustQA: Benchmarking the Robustness of Domain Adaptation for Open-domain Question Answering
    Rujun Han, Peng Qi, Yuhao Zhang, Lan Liu, Juliette Burger, William Yang Wang, Zhiheng Huang, Bing Xiang, and Dan Roth
    In Findings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2023.
  2. ACL Findings
    Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations
    Jifan Chen, Yuhao Zhang, Lan Liu, Rui Dong, Xinchi Chen, Patrick Ng, William Yang Wang, and Zhiheng Huang
    In Findings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2023.
  3. ACL Findings
    Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
    Xingyu Fu, Sheng Zhang, Gukyeong Kwon, Pramuditha Perera, Henghui Zhu, Yuhao Zhang, Alexander Hanbo Li, William Yang Wang, Zhiguo Wang, Vittorio Castelli, Patrick Ng, Dan Roth, and Bing Xiang
    In Findings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2023.
  4. arXiv
    Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks
    Kaiser Sun, Peng Qi, Yuhao Zhang, Lan Liu, William Yang Wang, and Zhiheng Huang
    arXiv preprint arXiv:2212.09912, 2022.
  5. MLHC
    Contrastive Learning of Medical Visual Representations from Paired Images and Text
    Yuhao Zhang, Hang Jiang, Yasuhide Miura, Christopher D Manning, and Curtis P Langlotz
    In Proceedings of the 7th Machine Learning for Healthcare Conference, 2022.
  6. Thesis
    Deep Understanding and Generation of Medical Text and Beyond
    Yuhao Zhang
    Stanford University PhD Thesis, 2021.
  7. JAMIA
    Biomedical and Clinical English Model Packages for the Stanza Python NLP Library
    Yuhao Zhang, Yuhui Zhang, Peng Qi, Christopher D Manning, and Curtis P Langlotz
    Journal of the American Medical Informatics Association, 2021.
  8. NAACL
    Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation
    Yasuhide Miura, Yuhao Zhang, Emily Tsai, Curtis Langlotz, and Dan Jurafsky
    In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2021.
  9. EACL
    Do Syntax Trees Help Pre-trained Transformers Extract Information?
    Devendra Sachan, Yuhao Zhang, Peng Qi, and William L Hamilton
    In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021.
  10. ACL
    Stanza: A Python Natural Language Processing Toolkit for Many Human Languages
    Peng Qi*, Yuhao Zhang*, Yuhui Zhang, Jason Bolton, and Christopher D Manning
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL): System Demonstrations, 2020.
  11. ACL
    Optimizing the Factual Correctness of a Summary: A Study of Summarizing Radiology Reports
    Yuhao Zhang, Derek Merck, Emily Tsai, Christopher D Manning, and Curtis Langlotz
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), 2020.
  12. EMNLP
    Graph Convolution over Pruned Dependency Trees Improves Relation Extraction
    Yuhao Zhang*, Peng Qi*, and Christopher D Manning
    In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018.
  13. EMNLP-CoNLL
    Universal Dependency Parsing from Scratch
    Peng Qi*, Timothy Dozat*, Yuhao Zhang*, and Christopher D Manning
    In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, 2018.
  14. EMNLP
    Position-aware Attention and Supervised Data Improve Slot Filling
    Yuhao Zhang, Victor Zhong, Danqi Chen, Gabor Angeli, and Christopher D Manning
    In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2017.