Hey there 👋, I'm

CoorDi

🎙️ Speech Algorithm Engineer

A Speech Algorithm Engineer at NetEase Cloud Music 🎵, passionate about TTS, Voice Conversion, and AI Music Generation. Fudan University M.S. graduate with hands-on experience in speech synthesis, voice cloning, and cutting-edge audio AI research 🚀.

🙋 About Me

🎵 Currently a Speech Algorithm Engineer at NetEase Cloud Music, where I work on TTS, Voice Conversion, AI Music Generation & Evaluation, and MV video automation. I hold a Master’s degree in Electronic Information from Fudan University, with deep expertise in speech synthesis, voice cloning, and audio generation.

💡 Previously interned at ByteDance and Bilibili, diving into training-free diffusion acceleration, CV/NLP applications, and ComfyUI plugin development. I love pushing the boundaries of AI in speech, music, and audio — turning research ideas into real-world products.

🏃 Outside of work, I’m a fan of ultimate frisbee and running. Music-wise, I’m into YOASOBI and Jay Chou. Also a big mystery & detective film/TV enthusiast 🔍.

🛠️ Here are a few technologies I've been working with recently:

TTS
AI Music Generation
AI-MV
Diffusion Model
PyTorch
LCM & ADD

💼 Work Experience

NetEase Cloud Music
ByteDance
Bilibili

Speech Algorithm Engineer - NetEase Cloud Music

Jun. 2025 - Present

🎤 Leading TTS & Voice Conversion R&D — improving naturalness and expressiveness of speech synthesis. 🎶 Building AI music generation algorithms, exploring generative models for creative music production. 📊 Designing AI music evaluation systems that align generated music quality with human aesthetics. 🎬 Developing Agents for automated song MV generation from end to end.

TTS & Voice Conversion
AI Music Generation & Evaluation
MV Video Generation

AIGC Algorithm Optimization Intern - ByteDance

Dec. 2023 - Aug. 2024

⚡ Optimized DDPM inference speed through improved distillation strategies. 🔬 Deep-dived into Classifier-Free Guidance (CFG), tracking RCFG, Limited Interval and other cutting-edge improvements. 🚀 Independently developed a training-free CFG acceleration algorithm — achieving 20% speedup with quality parity. 🧩 Shipped a ComfyUI plugin integrating self-developed CFG acceleration with other train-free methods.

Acceleration Algorithm Research
CFG Algorithm Research

CV/NLP Algorithm Intern - Bilibili

Aug. 2023 - Nov. 2023

🎯 Trained models for low-quality & traffic-directing video recognition across the platform. 🤖 Designed prompt-based title analysis and built a BERT-powered video classification system. 📹 Explored AIGC-driven video production — automated news extraction, image/audio pairing, and video generation.

Video Understanding
AIGC Intelligent Video Production

🎓 Education

Sept. 2022 - Jun. 2025

Master of Science in Electronic Information

Fudan University

GPA: 3.88/4.0 | Rank: 4/235

📡 Researched diffusion model applications on SAR images at the School of Information Science and Engineering. Published 3 first-author papers and received multiple honors during my studies.

🏅 National Scholarship
🏅 Samsung Scholarship
🎖️ Shanghai Outstanding Graduate
📜 Outstanding Student Scholarship

Sept. 2018 - Jun. 2022

Bachelor of Science in Cyberspace Security

University of Science and Technology of China

GPA: 3.56/4.3

🔐 Studied Cyberspace Security with a focus on Image Captioning algorithm optimization & acceleration. Active in student leadership and competitions.

📜 Outstanding Student Scholarship
📜 Wang Xiaomo Cyberspace Security Scholarship
🎖️ USTC Outstanding Graduate
🎖️ Anhui Province Outstanding Graduate
👥 President of School Student Union

🏆 Achievements

Synergizing Large-Scale Music Representations and Metric-Based Meta-Learning for Few-Shot Song Aesthetics Evaluation

ICASSP 2026 Second Author Music Aesthetics Meta-Learning

📄 ICASSP 2026 (Poster) | Second Author | 2026

Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation

Arxiv First Author Diffusion Model

📄 Arxiv | First Author | 2024

Conditional Diffusion for SAR to Optical Image Translation

IEEE GRSL Q2 First Author SAR

📄 IEEE GRSL (Q2 Journal) | First Author | 2024

SAR to Optical Image Translation with Color Supervised Diffusion Model

IGARSS First Author SAR

📄 IGARSS Conference | First Author | 2023

🥈 Kaggle Silver Medals ×2

Kaggle Silver Computer Vision NLP

• Google Research | CVPR Image Matching Challenge 2023 — Top 7%
• The Learning Agency Lab | LLM AI Text Detection — Top 4%

🥉 Kaggle Bronze Medals ×2

Kaggle Bronze Speech Recognition Computer Vision

• Bengali.AI Speech Recognition — Top 9%
• Google Research Image Matching Challenge 2024 — Top 6%

📬 Contact

My inbox is always open ✉️ — whether you have a question, a collaboration idea, or just want to say hi, I’d love to hear from you!

📧 Mail me