cv
Basics
Name | Natalia (Shuixiunan) Zhang |
Email | zsxn21@mails.tsinghua.edu.cn |
Url | https://nataliazhang.github.io/ |
Summary | An undergraduate student majoring in computer science at IIIS, Tsinghua University |
Education
-
2021.09 - now Undergraduate
Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University
Computer Science
-
2018.09 - 2021.06 -
2012.09 - 2015.06
Work
- 2024.10 - now
LLM-Assisted Dataset Generation with Consistency Evaluation for RLHF
Generate a consistency-weighted dataset using LLMs to augment limited human-labeled preferences for improved reward model training in offline RLHF tasks.
- Instructor: Prof. Zhixuan Fang in IIIS
- Cooperator: Jiaxuan Jiang
- 2024.08 - now
Stabilizing Offline Reinforcement Learning with Imitation-Based Assistant Rewards
Introduce an imitation-based assistant reward to improve early-stage training stability in offline reinforcement learning on the D4RL benchmark
- Instructor: Prof. Simon Du in UW
- Cooperator: Xinqi Wang
- 2024.06 - 2024.08
Reward Optimization of Imitation Learning in Offline Multi-Agent Games from Human Feedback
Integrate human feedback into imitation learning for offline multi-agent tasks, using a reward model to improve performance by filtering and optimizing offline datasets
- Instructor: Prof. Zhixuan Fang in IIIS
- Paper: PDF
- 2024.03 - 2024.08
Multi-Agent Reinforcement Learning from Human Feedback
Initiated the study of Multi-Agent Reinforcement Learning from Human Feedback (MARLHF), combining theoretical analysis with empirical experiments on preference-only offline datasets and proposing new algorithmic techniques
- Instructor: Prof. Simon Du, Prof. Sham Kakade
- Cooperator: Xinqi Wang, Qiwen Cui, Runlong Zhou
- Repo: Jax-MARLHF
- Paper: arXiv
- ICLR 2025 Under Review
- 2023.09 - 2024.01
Research on Communication in Multi-Agent Reinforcement Learning
Introduced agent truthfulness and large-language-model-based communication in decentralized, partially observable multi-agent reinforcement learning, aiming to improve performance and transferability
- Instructor: Prof. Zhixuan Fang in IIIS
- Cooperator: Jiaxuan Jiang and Kaidi Fu
- 2023.04 - 2023.07
Research on a Self-Developed Information Eliciting Model
Researched and developed a novel Information Eliciting Model incorporating a thinking hierarchy framework to uncover the truth within diverse perspectives, introducing metrics for measuring response hierarchy
- Instructor: Prof. Zhixuan Fang in IIIS
- Cooperator: Weiliang Wang
- Paper: PDF
- 2022.07 - 2022.10
New Star Academic Program on Real-Time Translucent Neural Rendering
Developed real-time neural rendering techniques for translucent objects by utilizing deep learning to model subsurface scattering, with superior results compared to existing approximation methods
- Instructor: Prof. Kun Xu in Department of Computer Science and Technology
- Cooperator: Di An and Shuzhen Li
Awards
- 2024
Comprehensive Scholarship in Tsinghua University
- 2023
Comprehensive Scholarship in Tsinghua University
- 2022
Scholarship of Institute for Interdisciplinary Information Core Technology, First Prize
- 2021
Freshman Scholarship, Second Prize
- 2021
Address at IIIS Opening Ceremony
- 2019
National Physics Olympiad, Second Prize
- 2019
Tsinghua Physics Camp, First Prize
- 2018
National English Proficiency Competition, Second Prize
- 2018
National Oral English Innovative Proficiency Contest, Second Prize
Projects
- 2023.07 - 2023.09
Implementation and Improvement of a 3D Hierarchical Torus Network Topology and Its Corresponding Deadlock-Free Routing Algorithm
- Cooperator: Ran Tao
- Instructor: Prof. Kaisheng Ma
- 2023.03 - 2023.07
Distributed Computation on Matrix Multiplication with Concurrency
- Cooperator: Xingchen Miu
- Instructor: Prof. Wei Xu and Prof. Mingyu Gao
- 2022.10 - 2022.12
Implementation of RISC-V Processor Model with High-Performance Cache
- Instructor: Prof. Mingyu Gao
- 2022.03 - 2022.06
Implementation of a Group of Machine Learning Models Including Random Forest and XGBoost
- Instructor: Prof. Yang Yuan
- 2022.03 - 2022.05
Implementation of Stochastic Algorithms for 2D Ising Model
- Instructor: Prof. Yukai Wu
- 2022.04 - 2022.05
Implementation of a Distributed Database Model
- Instructor: Prof. Huanchen Zhang
- 2022.05 - 2022.07
Multimedia Processing Based on FFmpeg
- Instructor: Prof. Huanchen Zhang
Languages
English | |
TOEFL best score 109, Speaking 26 |
Programming | |
C/C++, Python, PyTorch, MATLAB, LaTeX, SQL, etc. |
Interests
Baseball and Softball |
Poems |