CV

Basics

Name Natalia (Shuixiunan) Zhang
Email zsxn21@mails.tsinghua.edu.cn
URL https://nataliazhang.github.io/
Summary Undergraduate student majoring in computer science at IIIS, Tsinghua University

Education

  • 2021.09 - now
    Undergraduate
    Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University
    Computer Science
  • 2018.09 - 2021.06
    High School
    Hangzhou Xuejun High School
    Physics Olympiad
  • 2012.09 - 2015.06
    Middle School
    Hangzhou Wenlan Middle School

Work

  • 2024.10 - now
    LLM-Assisted Dataset Generation with Consistency Evaluation for RLHF
    Generate a consistency-weighted dataset with LLMs to augment limited human-labeled preferences, improving reward model training in offline RLHF tasks
    • Instructor: Prof. Zhixuan Fang at IIIS
    • Collaborator: Jiaxuan Jiang
  • 2024.08 - now
    Stabilizing Offline Reinforcement Learning with Imitation-Based Assistant Rewards
    Introduce an imitation-based assistant reward to improve early-stage training stability in offline reinforcement learning on the D4RL benchmark
    • Instructor: Prof. Simon Du at UW
    • Collaborator: Xinqi Wang
  • 2024.06 - 2024.08
    Reward Optimization of Imitation Learning in Offline Multi-Agent Games from Human Feedback
    Integrated human feedback into imitation learning for offline multi-agent tasks, using a reward model to filter and optimize offline datasets and thereby improve performance
    • Instructor: Prof. Zhixuan Fang at IIIS
    • Paper: PDF
  • 2024.03 - 2024.08
    Multi-Agent Reinforcement Learning from Human Feedback
    Initiated the study of Multi-Agent Reinforcement Learning from Human Feedback (MARLHF), combining theoretical analysis with empirical experiments on preference-only offline datasets and proposing new algorithmic techniques
    • Instructors: Prof. Simon Du and Prof. Sham Kakade
    • Collaborators: Xinqi Wang, Qiwen Cui, and Runlong Zhou
    • Repo: Jax-MARLHF
    • Paper: arXiv
    • Under review at ICLR 2025
  • 2023.09 - 2024.01
    Research on Communication in Multi-Agent Reinforcement Learning
    Introduced agent truthfulness and large-language-model-based communication in decentralized, partially observable multi-agent reinforcement learning, aiming to improve performance and transferability
    • Instructor: Prof. Zhixuan Fang at IIIS
    • Collaborators: Jiaxuan Jiang and Kaidi Fu
  • 2023.04 - 2023.07
    Research on a Self-Developed Information Eliciting Model
    Researched and developed a novel Information Eliciting Model incorporating a thinking hierarchy framework to uncover the truth within diverse perspectives, introducing metrics for measuring response hierarchy
    • Instructor: Prof. Zhixuan Fang at IIIS
    • Collaborator: Weiliang Wang
    • Paper: PDF
  • 2022.07 - 2022.10
    New Star Academic Program on Real-Time Translucent Neural Rendering
    Developed real-time neural rendering techniques for translucent objects, using deep learning to model subsurface scattering and achieving better results than existing approximation methods
    • Instructor: Prof. Kun Xu at the Department of Computer Science and Technology
    • Collaborators: Di An and Shuzhen Li

Awards

  • 2024
    Comprehensive Scholarship, Tsinghua University
  • 2023
    Comprehensive Scholarship, Tsinghua University
  • 2022
    Scholarship of Institute for Interdisciplinary Information Core Technology, First Prize
  • 2021
    Freshman Scholarship, Second Prize
  • 2021
    Address at IIIS Opening Ceremony
  • 2019
    National Physics Olympiad, Second Prize
  • 2019
    Tsinghua Physics Camp, First Prize
  • 2018
    National English Proficiency Competition, Second Prize
  • 2018
    National Oral English Innovative Proficiency Contest, Second Prize

Projects

  • 2023.07 - 2023.09
    Implementation and Improvement of a 3D Hierarchical Torus Network Topology and Its Corresponding Deadlock-Free Routing Algorithm
    • Collaborator: Ran Tao
    • Instructor: Prof. Kaisheng Ma
  • 2023.03 - 2023.07
    Distributed Computation on Matrix Multiplication with Concurrency
    • Collaborator: Xingchen Miu
    • Instructors: Prof. Wei Xu and Prof. Mingyu Gao
  • 2022.10 - 2022.12
    Implementation of a RISC-V Processor Model with a High-Performance Cache
    • Instructor: Prof. Mingyu Gao
  • 2022.03 - 2022.06
    Implementation of Several Machine Learning Models, Including Random Forest and XGBoost
    • Instructor: Prof. Yang Yuan
  • 2022.03 - 2022.05
    Implementation of Stochastic Algorithms for the 2D Ising Model
    • Instructor: Prof. Yukai Wu
  • 2022.04 - 2022.05
    Implementation of a Distributed Database Model
    • Instructor: Prof. Huanchen Zhang
  • 2022.05 - 2022.07
    Multimedia Processing Based on FFmpeg
    • Instructor: Prof. Huanchen Zhang

Languages

English
TOEFL best score 109 (Speaking 26)
Programming
C/C++, Python, PyTorch, MATLAB, LaTeX, SQL, etc.

Interests

Baseball and Softball
Poetry