cv | Natalia Zhang

Basics

Name	Natalia (Shuixiunan) Zhang
Email	zsxn21@mails.tsinghua.edu.cn
Url	https://nataliazhang.github.io/
Summary	An undergraduate student majoring in computer science at IIIS, Tsinghua University

Education

2021.09 - now
Undergraduate

Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University

Computer Science
2018.09 - 2021.06
High School

Hangzhou Xuejun High School

Physics Olympiad
2012.09 - 2015.06
Middle School

Hangzhou Wenlan Middle School

Work

2024.10 - now
LLM-Assisted Dataset Generation with Consistency Evaluation for RLHF

Generate a consistency-weighted dataset using LLMs to augment limited human-labeled preferences for improved reward model training in offline RLHF tasks.
- Instructor: Prof. Zhixuan Fang in IIIS
- Cooperator: Jiaxuan Jiang
2024.08 - now
Stabilizing Offline Reinforcement Learning with Imitation-Based Assistant Rewards

Introduce an imitation-based assistant reward to improve early-stage training stability in offline reinforcement learning on the D4RL benchmark
- Instructor: Prof. Simon Du in UW
- Cooperator: Xinqi Wang
2024.06 - 2024.08
Reward Optimization of Imitation Learning in Offline Multi-Agent Games from Human Feedback

Integrate human feedback into imitation learning for offline multi-agent tasks, using a reward model to improve performance by filtering and optimizing offline datasets
- Instructor: Prof. Zhixuan Fang in IIIS
- Paper: PDF
2024.03 - 2024.08
Multi-Agent Reinforcement Learning from Human Feedback

Initiated the study of Multi-Agent Reinforcement Learning from Human Feedback (MARLHF), combining theoretical analysis with empirical experiments on preference-only offline datasets and proposing new algorithmic techniques
- Instructor: Prof. Simon Du, Prof. Sham Kakade
- Cooperator: Xinqi Wang, Qiwen Cui, Runlong Zhou
- Repo: Jax-MARLHF
- Paper: arXiv
- Under review at ICLR 2025
2023.09 - 2024.01
Research on Communication in Multi-Agent Reinforcement Learning

Introduced agent truthfulness and large-language-model-based communication into decentralized, partially observable multi-agent reinforcement learning, with the aim of improving performance and transferability
- Instructor: Prof. Zhixuan Fang in IIIS
- Cooperator: Jiaxuan Jiang and Kaidi Fu
2023.04 - 2023.07
Research on a Self-Developed Information Eliciting Model

Researched and developed a novel Information Eliciting Model incorporating a thinking hierarchy framework to uncover the truth within diverse perspectives, introducing metrics for measuring response hierarchy
- Instructor: Prof. Zhixuan Fang in IIIS
- Cooperator: Weiliang Wang
- Paper: PDF
2022.07 - 2022.10
New Star Academic Program on Real-Time Translucent Neural Rendering

Developed real-time neural rendering techniques for translucent objects, using deep learning to model subsurface scattering, with superior results compared to existing approximation methods
- Instructor: Prof. Kun Xu in the Department of Computer Science and Technology
- Cooperator: Di An and Shuzhen Li

Awards

2024

Comprehensive Scholarship in Tsinghua University
2023

Comprehensive Scholarship in Tsinghua University
2022

Scholarship of Institute for Interdisciplinary Information Core Technology, First Prize
2021

Freshman Scholarship, Second Prize
2021

Address at IIIS Opening Ceremony
2019

National Physics Olympiad, Second Prize
2019

Tsinghua Physics Camp, First Prize
2018

National English Proficiency Competition, Second Prize
2018

National Oral English Innovative Proficiency Contest, Second Prize

Projects

2023.07 - 2023.09
Implementation and Improvement of a 3D Hierarchical Torus Network Topology and Its Corresponding Deadlock-Free Routing Algorithm
- Cooperator: Ran Tao
- Instructor: Prof. Kaisheng Ma
2023.03 - 2023.07
Distributed Computation on Matrix Multiplication with Concurrency
- Cooperator: Xingchen Miu
- Instructor: Prof. Wei Xu and Prof. Mingyu Gao
2022.10 - 2022.12
Implementation of RISC-V Processor Model with High-Performance Cache
- Instructor: Prof. Mingyu Gao
2022.03 - 2022.06
Implementation of a Group of Machine Learning Models Including Random Forest and XGBoost
- Instructor: Prof. Yang Yuan
2022.03 - 2022.05
Implementation of Stochastic Algorithms for 2D Ising Model
- Instructor: Prof. Yukai Wu
2022.04 - 2022.05
Implementation of a Distributed Database Model
- Instructor: Prof. Huanchen Zhang
2022.05 - 2022.07
Multimedia Processing Based on FFmpeg
- Instructor: Prof. Huanchen Zhang

Languages

	English
	TOEFL best score 109 (Speaking 26)

	Programming
	C/C++, Python, PyTorch, MATLAB, LaTeX, SQL, etc.

Interests

Baseball and Softball

Poems
