Yueqin Yin (殷月琴)

Yueqin Yin (殷月琴)

Third-year Ph.D. student

McCombs School of Business, The University of Texas at Austin

I am a third-year Ph.D. student at the McCombs School of Business, The University of Texas at Austin, advised by Prof. Mingyuan Zhou.

Previously, I received my M.S. from the Institute of Automation, Chinese Academy of Sciences (CASIA) in 2023, and my B.S. in Software Engineering from Dalian University of Technology in 2020.

My research interests include large language model alignment, reasoning, and post-training optimization.

If you are interested in my work and would like to collaborate, please feel free to reach out!

Recent News

Sep 2025 Segment-PPO was accepted to TMLR 2025. Paper.
May 2025 KODCODE was accepted to ACL 2025 Findings. Paper.
May 2025 Started summer research internship at Zoom GenAI (hallucination detection and verification).
May 2024 Started summer research internship at Microsoft GenAI (LLM alignment and reasoning).
Aug 2023 Started Ph.D. program at The University of Texas at Austin.

Selected Publications

Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model

TMLR25 Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model

Yueqin Yin*, Shentao Yang*, Yujia Xie, Ziyi Yang, Yuting Sun, Hany Awadalla, Weizhu Chen, Mingyuan Zhou

KODCODE: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

ACL Findings25 🏆 Best Paper Award at DataWorld @ ICML 2025 KODCODE: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

Zhangchen Xu, Yang Liu, Yueqin Yin, Mingyuan Zhou, Radha Poovendran

ContextCheck: Sentence-Level Faithfulness Verification with Context-Aware Disambiguation

Under Review ContextCheck: Sentence-Level Faithfulness Verification with Context-Aware Disambiguation

Yueqin Yin, Yaxi Li, Xin Liu, Xun Wang, Kaiqiang Song, Simin Ma, Shujian Liu, Sathish Reddy Indurthi, Haoyun Deng, Pengcheng He, Mingyuan Zhou, Song Wang

Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts

arXiv'2402 Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts

Yueqin Yin*, Zhendong Wang*, Yi Gu, Hai Huang, Weizhu Chen, Mingyuan Zhou

Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment

arXiv'2405 Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment

Yueqin Yin*, Zhendong Wang*, Yujia Xie, Weizhu Chen, Mingyuan Zhou

Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization

arXiv'2406 Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization

Yi Gu, Zhendong Wang, Yueqin Yin, Yujia Xie, Mingyuan Zhou

Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers

ICCV23 Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers

Shiyue Cao, Yueqin Yin, Lianghu Huang, Yu Liu, Xin Zhao, Deli Zhao, Kaiqi Huang

TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

CVPR22 TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

Yanbo Xu*, Yueqin Yin*, Liming Jiang, Qianyi Wu, Chengyao Zheng, Chen Change Loy, Bo Dai, Wayne Wu

Experience

Summer Research Intern — Hallucination Detection and Verification

Zoom Video Communications, GenAI Research Group

Jun 2025 - Aug 2025

ContextCheck: sentence-level faithfulness verification with context-aware disambiguation.

Summer Research Intern — LLM Alignment and Reasoning

Microsoft Research, GenAI

Jun 2024 - Aug 2024

Segment-PPO for RLHF; KODCODE dataset for coding.

Research Intern — Diffusion Models

Alibaba DAMO Academy

Mar 2022 - Mar 2023

DiffGAR; Efficient-VQGAN.

Research Intern — GANs

Shanghai AI Laboratory

Mar 2021 - Dec 2021

TransEditor: controllable facial editing via dual-space GAN and Transformer.

Academic Service

Reviewer

Conferences: CVPR 2024, ICCV 2025, ICLR 2025, ICLR 2026

Education

Sept. 2023 - June. 2028 (expected)

Ph.D. in LLM

The University of Texas at Austin

Sept. 2020 - June. 2023

M.S. in Computer Vision and Deep Learning

Institute of Automation, Chinese Academy of Sciences (CASIA)

Sept. 2016 - June. 2020

B.S. in Software Engineering

Dalian University of Technology