Hey there πŸ‘‹, I'm

CoorDi

πŸŽ™οΈ Speech Algorithm Engineer

A Speech Algorithm Engineer at NetEase Cloud Music 🎡, passionate about TTS, Voice Conversion, and AI Music Generation. Fudan University M.S. graduate with hands-on experience in speech synthesis, voice cloning, and cutting-edge audio AI research πŸš€.

πŸ™‹ About Me

CoorDi's Homepage profile picture

🎡 Currently a Speech Algorithm Engineer at NetEase Cloud Music, where I work on TTS, Voice Conversion, AI Music Generation & Evaluation, and MV video automation. I hold a Master’s degree in Electronic Information from Fudan University, with deep expertise in speech synthesis, voice cloning, and audio generation.

πŸ’‘ Previously interned at ByteDance and Bilibili, diving into training-free diffusion acceleration, CV/NLP applications, and ComfyUI plugin development. I love pushing the boundaries of AI in speech, music, and audio β€” turning research ideas into real-world products.

πŸƒ Outside of work, I’m a fan of ultimate frisbee and running. Music-wise, I’m into YOASOBI and Jay Chou. Also a big mystery & detective film/TV enthusiast πŸ”.

πŸ› οΈ Here are a few technologies I've been working with recently:
  • TTS
  • AI Music Generation
  • AI-MV
  • Diffusion Model
  • PyTorch
  • LCM & ADD

πŸ’Ό Work Experience

Speech Algorithm Engineer - NetEase Cloud Music
Jun. 2025 - Present

🎀 Leading TTS & Voice Conversion R&D β€” improving naturalness and expressiveness of speech synthesis. 🎢 Building AI music generation algorithms, exploring generative models for creative music production. πŸ“Š Designing AI music evaluation systems that align generated music quality with human aesthetics. 🎬 Developing Agents for automated song MV generation from end to end.

  • TTS & Voice Conversion
  • AI Music Generation & Evaluation
  • MV Video Generation
AIGC Algorithm Optimization Intern - ByteDance
Dec. 2023 - Aug. 2024

⚑ Optimized DDPM inference speed through improved distillation strategies. πŸ”¬ Deep-dived into Classifier-Free Guidance (CFG), tracking RCFG, Limited Interval and other cutting-edge improvements. πŸš€ Independently developed a training-free CFG acceleration algorithm β€” achieving 20% speedup with quality parity. 🧩 Shipped a ComfyUI plugin integrating self-developed CFG acceleration with other train-free methods.

  • Acceleration Algorithm Research
  • CFG Algorithm Research
CV/NLP Algorithm Intern - Bilibili
Aug. 2023 - Nov. 2023

🎯 Trained models for low-quality & traffic-directing video recognition across the platform. πŸ€– Designed prompt-based title analysis and built a BERT-powered video classification system. πŸ“Ή Explored AIGC-driven video production β€” automated news extraction, image/audio pairing, and video generation.

  • Video Understanding
  • AIGC Intelligent Video Production

πŸŽ“ Education

Sept. 2022 - Jun. 2025
Master of Science in Electronic Information
Fudan University
GPA: 3.88/4.0 | Rank: 4/235

πŸ“‘ Researched diffusion model applications on SAR images at the School of Information Science and Engineering. Published 3 first-author papers and received multiple honors during my studies.

  • πŸ… National Scholarship
  • πŸ… Samsung Scholarship
  • πŸŽ–οΈ Shanghai Outstanding Graduate
  • πŸ“œ Outstanding Student Scholarship
Sept. 2018 - Jun. 2022
Bachelor of Science in Cyberspace Security
University of Science and Technology of China
GPA: 3.56/4.3

πŸ” Studied Cyberspace Security with a focus on Image Captioning algorithm optimization & acceleration. Active in student leadership and competitions.

  • πŸ“œ Outstanding Student Scholarship
  • πŸ“œ Wang Xiaomo Cyberspace Security Scholarship
  • πŸŽ–οΈ USTC Outstanding Graduate
  • πŸŽ–οΈ Anhui Province Outstanding Graduate
  • πŸ‘₯ President of School Student Union

πŸ† Achievements

Synergizing Large-Scale Music Representations and Metric-Based Meta-Learning for Few-Shot Song Aesthetics Evaluation
ICASSP 2026 Second Author Music Aesthetics Meta-Learning
πŸ“„ ICASSP 2026 (Poster) | Second Author | 2026
Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation
Arxiv First Author Diffusion Model
πŸ“„ Arxiv | First Author | 2024
Conditional Diffusion for SAR to Optical Image Translation
IEEE GRSL Q2 First Author SAR
πŸ“„ IEEE GRSL (Q2 Journal) | First Author | 2024
SAR to Optical Image Translation with Color Supervised Diffusion Model
IGARSS First Author SAR
πŸ“„ IGARSS Conference | First Author | 2023
πŸ₯ˆ Kaggle Silver Medals Γ—2
Kaggle Silver Computer Vision NLP
β€’ Google Research | CVPR Image Matching Challenge 2023 β€” Top 7%
β€’ The Learning Agency Lab | LLM AI Text Detection β€” Top 4%
πŸ₯‰ Kaggle Bronze Medals Γ—2
Kaggle Bronze Speech Recognition Computer Vision
β€’ Bengali.AI Speech Recognition β€” Top 9%
β€’ Google Research Image Matching Challenge 2024 β€” Top 6%

πŸ“¬ Contact

My inbox is always open βœ‰οΈ β€” whether you have a question, a collaboration idea, or just want to say hi, I’d love to hear from you!