Wangyang Ying

Ph.D. Candidate | Arizona State University

Data Mining, Machine Learning, Data-Centric AI

Currently on the job market for positions starting Spring 2026!

About Me

I am a Ph.D. candidate at Arizona State University in Tempe, where I began my studies in 2023. Before that, I received my Bachelor's (2016) and Master's (2019) degrees from Sichuan University. After my Master's, I worked at Alibaba on video recommendation and then at Tencent on news search algorithms.

My research interests include data mining, machine learning, and interdisciplinary applications. Currently, I focus on Data-Centric AI, learning from unlabeled data, and AI for scientific discovery.

Education

  • Arizona State University - Ph.D. Candidate (2023 - Present)
  • Sichuan University - M.S. (2016 - 2019)
  • Sichuan University - B.S. (2012 - 2016)

Research Interests

  • Data Mining
  • Machine Learning
  • Data-Centric AI
  • Learning from Unlabeled Data
  • AI for Scientific Discovery

Research Areas

Data-Centric AI

Focusing on learning from unlabeled data, feature transformation, and robust data representations for scientific discovery and real-world applications.

Machine Learning

Developing algorithms for large-scale data mining, representation learning, and interdisciplinary AI research.

AI for Science

Applying AI and machine learning to scientific problems, including biomarker discovery and materials science.

Selected Publications

TIST 2025

Neuro-Symbolic Embedding for Short and Effective Feature Selection via Autoregressive Generation

Wangyang Ying, Nanxu Gong, Dongjie Wang, Yanjie Fu

TKDD 2024

Feature Selection as Deep Sequential Generative Learning

Wangyang Ying, Dongjie Wang, Haifeng Chen, Yanjie Fu

CIKM 2024

Revolutionizing Biomarker Discovery: Leveraging Generative AI for Bio-Knowledge-Embedded Continuous Space Exploration

Wangyang Ying, Dongjie Wang, Xuanming Hu, Ji Qiu, Jin Park, Yanjie Fu

KDD 2024

Unsupervised Generative Feature Transformation via Graph Contrastive Pre-training and Multi-objective Fine-tuning

Wangyang Ying, Dongjie Wang, Xuanming Hu, Yuanchun Zhou, Charu C. Aggarwal, Yanjie Fu

ICDM 2023

Self-optimizing Feature Generation via Categorical Hashing Representation and Hierarchical Reinforcement Crossing

Wangyang Ying, Dongjie Wang, Kunpeng Liu, Leilei Sun, Yanjie Fu

FCS 2020

Sichuan Dialect Speech Recognition with Deep LSTM Network

Wangyang Ying, Lei Zhang, Hongli Deng

NeurIPS 2025

Sculpting Features from Noise: Reward-Guided Hierarchical Diffusion for Task-Optimal Feature Transformation

Nanxu Gong, Zijun Li, Sixun Dong, Haoyue Bai, Wangyang Ying, Xinyuan Wang, Yanjie Fu

WSC 2025

Supply Chain Optimization via Generative Simulation and Iterative Decision Policies

Haoyue Bai, Haoyu Wang, Nanxu Gong, Xinyuan Wang, Wangyang Ying, Haifeng Chen, Yanjie Fu

AAAI 2025

Evolutionary Large Language Model for Automated Feature Transformation

Nanxu Gong, Chandan K Reddy, Wangyang Ying, Haifeng Chen, Yanjie Fu

npj AI 2025

Privacy-preserving Data Reprogramming

Haoyue Bai, Wangyang Ying, Nanxu Gong, Xinyuan Wang, Yanjie Fu

IJCAI 2025

Unsupervised Feature Transformation via In-context Generation, Generator-critic LLM Agents, and Duet-play Teaming

Nanxu Gong, Xinyuan Wang, Wangyang Ying, Haoyue Bai, Sixun Dong, Haifeng Chen, Yanjie Fu

CIKM 2024

Reinforcement Feature Transformation for Polymer Property Performance Prediction

Xuanming Hu, Dongjie Wang, Wangyang Ying, Yanjie Fu

EMNLP 2022

Title2event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset

Haolin Deng, Yanan Zhang, Yangfan Zhang, Wangyang Ying, Changlong Yu, Jun Gao, Wei Wang, Xiaoling Bai, Nan Yang, Jin Ma, et al.

Preprints

arXiv 2025

A Survey on Data-Centric AI: Tabular Learning from Reinforcement Learning and Generative AI Perspective

Wangyang Ying, Cong Wei, Nanxu Gong, Xinyuan Wang, Haoyue Bai, Arun Vignesh Malarkkan, Sixun Dong, Dongjie Wang, Denghui Zhang, Yanjie Fu

arXiv 2025

Data-Efficient Symbolic Regression via Foundation Model Distillation

Wangyang Ying, Jinghan Zhang, Haoyue Bai, Nanxu Gong, Xinyuan Wang, Kunpeng Liu, Chandan K Reddy, Yanjie Fu

arXiv 2025

Distribution Shift Aware Neural Tabular Learning

Wangyang Ying, Nanxu Gong, Dongjie Wang, Xinyuan Wang, Arun Vignesh Malarkkan, Vivek Gupta, Chandan K Reddy, Yanjie Fu

arXiv 2025

Bridging the Domain Gap in Equation Distillation with Reinforcement Feedback

Wangyang Ying, Haoyue Bai, Nanxu Gong, Xinyuan Wang, Sixun Dong, Haifeng Chen, Yanjie Fu

arXiv 2024

Topology-aware Reinforcement Feature Space Reconstruction for Graph Data

Wangyang Ying, Haoyue Bai, Kunpeng Liu, Yanjie Fu

arXiv 2025

LLM-ML Teaming: Integrated Symbolic Decoding and Gradient Search for Valid and Stable Generative Feature Transformation

Xinyuan Wang, Haoyue Bai, Nanxu Gong, Wangyang Ying, Sixun Dong, Xiquan Cui, Yanjie Fu

arXiv 2025

Agentic Feature Augmentation: Unifying Selection and Generation with Teaming, Planning, and Memories

Nanxu Gong, Sixun Dong, Haoyue Bai, Xinyuan Wang, Wangyang Ying, Yanjie Fu

arXiv 2025

Brownian Bridge Augmented Surrogate Simulation and Injection Planning for Geological CO₂ Storage

Haoyue Bai, Guodong Chen, Wangyang Ying, Xinyuan Wang, Nanxu Gong, Sixun Dong, Giulia Pedrielli, Haoyu Wang, Haifeng Chen, Yanjie Fu

arXiv 2025

Towards Data-Centric AI: A Comprehensive Survey of Traditional, Reinforcement, and Generative Approaches for Tabular Data Transformation

Dongjie Wang, Yanyong Huang, Wangyang Ying, Haoyue Bai, Nanxu Gong, Xinyuan Wang, Sixun Dong, Tao Zhe, Kunpeng Liu, Meng Xiao, et al.

arXiv 2025

Efficient Post-Training Refinement of Latent Reasoning in Large Language Models

Xinyuan Wang, Dongjie Wang, Wangyang Ying, Haoyue Bai, Nanxu Gong, Sixun Dong, Kunpeng Liu, Yanjie Fu

arXiv 2024

Knockoff-Guided Feature Selection via A Single Pre-trained Reinforced Agent

Xinyuan Wang, Dongjie Wang, Wangyang Ying, Rui Xie, Haifeng Chen, Yanjie Fu

Experience

Research Intern

NEC Laboratories America, Princeton

05/2025 - 08/2025

Data Science & System Security

Research Intern

Institute of High Performance Computing, A*STAR, Singapore

05/2024 - 08/2024

Full Time

Platform and Content Group, Tencent, Beijing

11/2020 - 08/2022

Full Time

Digital Media & Entertainment Group, Alibaba, Beijing

06/2019 - 10/2020

Contact

yingwangyang@gmail.com
Tempe, Arizona, USA
Arizona State University