cv
Basics
| Name | Natalia (Shuixiunan) Zhang | 
| zsxn21@mails.tsinghua.edu.cn | |
| Url | https://nataliazhang.github.io/ | 
| Summary | An undergraduate student of computer science major in IIIS, Tsinghua University | 
Education
-  
2021.09 - now Undergraduate
Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University
Computer Science
 -  
2018.09 - 2021.06  -  
2012.09 - 2015.06  
Work
-  2024.10 - now
LLM-Assisted Dataset Generation with Consistency Evaluation for RLHF
Generate a consistency-weighted dataset using LLMs to augment limited human-labeled preferences for improved reward model training in offline RLHF tasks.
- Instructor: Prof. Zhixuan Fang in IIIS
 - Cooperator: Jiaxuan Jiang
 
 -  2024.08 - now
Stabilizing Offline Reinforcement Learning with Imitation-Based Assistant Rewards
Introduce an imitation-based assistant reward to improve early-stage training stability in offline reinforcement learning on the D4RL benchmark
- Instructor: Prof. Simon Du in UW
 - Cooperator: Xinqi Wang
 
 -  2024.06 - 2024.08
Reward Optimization of Imitation Learning in Offline Multi-Agent Games from Human Feedback
Integrate human feedback into imitation learning for offline multi-agent tasks, using a reward model to improve performance by filtering and optimizing offline datasets
- Instructor: Prof. Zhixuan Fang in IIIS
 - Paper: PDF
 
 -  2024.03 - 2024.08
Multi-Agent Reinforcement Learning from Human Feedback
Initiated the work of Multi-Agent Reinforcement Learning from Human Feedback (MARLHF), combining theoretical analysis and empirical experiments on preference-only offline datasets and proposing new algorithmic techniques
- Instructor: Prof. Simon Du, Prof. Sham Kakade
 - Cooperator: Xinqi Wang, Qiwen Cui, Runlong Zhou
 - Repo: Jax-MARLHF
 - Paper: Arxiv
 - ICLR 2025 Under Review
 
 -  2023.09 - 2024.01
Research on Communication in Multi-Agent Reinforcement Learning
Introduced truthfulness for agents and large language model for communication in decentralized, partially observable multi-agent reinforcement learning, aimed at improving performance and transferability
- Instructor: Prof. Zhixuan Fang in IIIS
 - Cooperator: Jiaxuan Jiang and Kaidi Fu
 
 -  2023.04 - 2023.07
Research on a Self-Developed Information Eliciting Model
Researched and developed a novel Information Eliciting Model incorporating a thinking hierarchy framework to uncover the truth within diverse perspectives, introducing metrics for measuring response hierarchy
- Instructor: Prof. Zhixuan Fang in IIIS
 - Cooperator: Weiliang Wang
 - Paper: PDF
 
 -  2022.07 - 2022.10
New Star Academic Program on Real-Time Translucent Neural Rendering
Developed real-time neural rendering techniques for translucent objects by utilizing deep learning to model subsurface scattering, with superior results compared to existing approximation methods
- Instructor: Prof. Kun Xu in Department of Computer Science and Technology
 - Cooperator: Di An and Shuzhen Li
 
 
Awards
-  2024
Comprehensive Scholarship in Tsinghua University
 -  2023
Comprehensive Scholarship in Tsinghua University
 -  2022
Scholarship of Institute for Interdisciplinary Information Core Technology, First Prize
 -  2021
Freshman Scholarship, Second Prize
 -  2021
Address at IIIS Opening Ceremony
 -  2019
National Physics Olympiad, Second Prize
 -  2019
Tsinghua Physics Camp, First Prize
 -  2018
National English Proficiency Competition, Second Prize
 -  2018
National Oral English Innovative Proficiency Contest, Second Prize
 
Projects
-  2023.07 - 2023.09
Implementation and Improvement on a 3D Hierarchical Torus Network Topology and its Corresponding Deadlock-Free Routing Algorithm
- Cooperator: Ran Tao
 - Instructor: Prof. Kaisheng Ma
 
 -  2023.03 - 2023.07
Distributed Computation on Matrix Multiplication with Concurrency
- Cooperator: Xingchen Miu
 - Instructor: Prof. Wei Xu and Prof. Mingyu Gao
 
 -  2022.10 - 2022.12
Implementation of RISC-V Processor Model with High-Performance Cache
- Instructor: Prof. Mingyu Gao
 
 -  2022.03 - 2022.06
Implementation of a Group of Machine Learning Models Including Random Forest and XGBoost
- Instructor: Prof. Yang Yuan
 
 -  2022.03 - 2022.05
Implementation of Stochastic Algorithms for 2D Ising Model
- Instructor: Prof. Yukai Wu
 
 -  2022.04 - 2022.05
Implementation of a Distributed Database Model
- Instructor: Prof. Huanchen Zhang
 
 -  2022.05 - 2022.07
Multimedia Processing Based on FFmpeg
- Instructor: Prof. Huanchen Zhang
 
 
Languages
| English | |
| Toefl Best Score 109, Speaking 26 | 
| Programming | |
| C/C++, Python, Pytorch, Matlab, Latex, SQL, etc. | 
Interests
| Baseball and Softball | 
| Poems |