Home

Wangyang Ying

Wangyang Ying

Ph.D. | SCAI, Arizona State University

Data-Centric AI, LLM Reasoning, Search Ranking, Recommendation System

yingwangyang@gmail.com

I have completed my Ph.D. defense in Computer Science at Arizona State University, with degree conferral expected in May 2026. I have industry experience building large-scale search and recommendation systems at Tencent and Alibaba. My research focuses on LLM reasoning, agent systems, and representation learning, with an emphasis on improving accuracy, stability, and efficiency under practical constraints. I have also worked on LLM reliability and structured knowledge extraction during internships at NEC Laboratories America and A*STAR Singapore, and published in venues such as TKDD, KDD, NeurIPS, AAAI, IJCAI, CIKM, and EMNLP.

Education

  • Arizona State University - Ph.D. (2023 - 2026)
  • Sichuan University - M.S. (2016 - 2019)
  • Sichuan University - B.S. (2012 - 2016)

Work Experience

  • NEC Laboratories America - Research Intern (05/2025 - 08/2025): Multi-agent LLM
  • A*STAR, Singapore - Research Intern (05/2024 - 08/2024): LLM Safety
  • Tencent - Machine Learning Engineer (10/2020 - 11/2022): Search Ranking
  • Alibaba - Machine Learning Engineer (06/2019 - 10/2020): Video Recommendation

News

  • [2026-02] Good News! One paper has been accepted by ACM TKDD.
  • [2026-02] Good News! One paper has been accepted by EACL.
  • [2025-12] Good News! I have passed my PhD defense.
  • [2025-11] Good News! Two papers have been accepted by AAAI 2026.
  • [2025-11] Good News! One paper has been accepted by IEEE TBD.
  • [2025-09] Good News! One paper has been accepted by ACM TKDD.
  • [2025-09] Good News! One paper has been accepted by NeuraIPS 2025.
  • [2025-04] Good News! One paper has been accepted by IJCAI 2025.

Research Areas

Data-Centric AI

Developing data-centric methods to enhance the robustness and effectiveness of machine learning through feature transformation, robust data representations, and learning from unlabeled data.

Multi-Agent Reasoning

Creating multi-agent frameworks for structured knowledge extraction and reasoning, enabling collaborative AI systems to solve complex problems through distributed intelligence.

Scientific Equation Discovery

Developing interpretable methods for equation discovery to uncover scientific patterns from data, bridging AI and scientific discovery for automated hypothesis generation.

Selected Publications

I have published papers in top-tier venues, including KDD, NeurIPS, EMNLP, AAAI, CIKM, IJCAI, and TKDD. A complete list of publications is available on Google Scholar.

  1. [TKDD] Feature Selection as Deep Sequential Generative Learning.
    • Formulates feature selection as a sequential generative process, enabling structured and controllable selection of informative features for downstream prediction tasks.
  2. [KDD] Unsupervised Generative Feature Transformation via Graph Contrastive Pre-training and Multi-objective Fine-tuning.
    • Transforms features into simpler and more discriminative representation spaces, enabling strong downstream performance with lightweight models.
  3. [NeurIPS] Sculpting Features from Noise: Reward-Guided Hierarchical Diffusion for Task-Optimal Feature Transformation.
    • Proposed a reward-guided latent diffusion framework that reformulates discrete feature transformation as a generative process, enabling stable global exploration and task-optimal feature construction beyond local continuous search.
  4. [AAAI] Efficient Post-Training Refinement of Latent Reasoning in Large Language Models.
    • Proposed a training-free, post-training approach to improve the stability and accuracy of multi-step reasoning in large language models by refining latent reasoning trajectories.
  5. [IJCAI] Unsupervised Feature Transformation via In-context Generation, Generator-critic LLM Agents, and Duet-play Teaming.
    • Introduced an agent-based, generative formulation for discrete search and feature transformation, enabling effective learning from unlabeled data through iterative feedback.
  6. [CIKM] Revolutionizing Biomarker Discovery: Leveraging Generative AI for Bio-Knowledge-Embedded Continuous Space Exploration.
    • Applied generative feature selection to high-dimensional biological data to identify a compact set of informative features, improving disease prediction performance.
  7. [EMNLP] Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset.
    • Event extraction from short news titles, enabling event-level representation, aggregation, and event-driven news search.

Work Experience

Research Intern

Data Science & System Security,NEC Laboratories America, Princeton

05/2025 - 08/2025

Developed multi-agent LLM frameworks for structured knowledge extraction (procedural graph representation), supporting downstream retrieval-augmented generation (RAG). Explored how structured knowledge enables personalized LLM training by grounding user-specific workflows into structured representations

Research Intern

Institute of High Performance Computing, A*STAR, Singapore

05/2024 - 08/2024

Investigated trustworthiness of LLMs in medical applications, with emphasis on understanding how jailbreak attacks compromise system reliability. Conducted systematic analysis of jailbreak strategies as a foundation for designing future LLM safety and protection mechanisms.

Full Time

Platform and Content Group, Tencent, Beijing

10/2020 - 11/2022

Led algorithm design for time-sensitive search scenarios (e.g., weather, stock, news), serving hundreds of millions of users. Designed methods for query time-sensitivity detection, retrieval pipeline optimization, and time-aware ranking and presentation to enhance freshness and relevance in search results.

Full Time

Digital Media & Entertainment Group, Alibaba, Beijing

06/2019 - 10/2020

Built recommendation systems for long- and short-form video platforms (movies, TV shows, variety shows, and micro-videos). Worked on video content understanding (e.g., tagging, user profiling) and video retrieval, improving large-scale recommendation quality and user engagement.

Service

Teaching Experience

Contact

yingwangyang@gmail.com
Tempe, Arizona, USA
Arizona State University