  • Ajou University
  • Suwon

sh-lee-prml/readme.md

Sang-Hoon Lee

I have been an Assistant Professor in the Department of Software and Computer Engineering at Ajou University since March 2024, where I lead SAIL, the Speech AI Lab. Before that, I was a postdoctoral researcher at the AI Research Center, Korea University, Seoul, South Korea. I received my Ph.D. from the Department of Brain and Cognitive Engineering, Korea University, in 2023. In March 2016, I began the integrated M.S. & Ph.D. program in the Pattern Recognition & Machine Learning (PRML) Lab at Korea University, under the supervision of Seong-Whan Lee.

👀 Research Interests

  • Speech Synthesis (2019-, HierSpeech++, DDDM-VC, Diff-HierVC)
  • Neural Vocoder (2021-, PeriodWave, PeriodWave-Turbo, Fre-GAN, Fre-GAN2)
  • Neural Audio Codec (2024-)
  • Singing Voice Synthesis (2022-, MIDI-Voice, HiddenSinger)
  • Speech-to-Speech Translation (2023-, TranSentence)
  • Brain-Computer Interface (2019-2020, Brain-to-Speech System)
  • Reinforcement Learning (2017-2018, AI Curling Robot Curly)

🎉 Publications

Arxiv

2025

2024

  • [DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training](https://arxiv.org/abs/2307.16549), H.-S. Oh, **S.-H. Lee**, and S.-W. Lee, **IEEE Trans. on Audio, Speech and Language Processing**, 2024. [[Demo]](https://prml-lab-speech-team.github.io/demo/DiffProsody/) [[Code]](https://github.com/hsoh0306/DiffProsody)
  • [Cross-lingual Text-to-Speech via Hierarchical Style Transfer](https://sites.google.com/view/limmits24/home?authuser=0), **S.-H. Lee**, H.-Y. Choi, and S.-W. Lee, **ICASSPW**, 2024.
  • [Audio Super-resolution with Robust Speech Representation Learning of Masked Autoencoder](https://ieeexplore.ieee.org/document/10381805), S.-B. Kim, **S.-H. Lee**, H.-Y. Choi, and S.-W. Lee, **IEEE Trans. on Audio, Speech and Language Processing**, 2024.
  • [TranSentence: Speech-to-Speech Translation via Language-agnostic Sentence-level Speech Encoding without Language-parallel Data](https://ieeexplore.ieee.org/abstract/document/10447331), S.-B. Kim, **S.-H. Lee**, and S.-W. Lee, **ICASSP**, 2024.
  • [MIDI-Voice: Expressive Zero-shot Singing Voice Synthesis via MIDI-driven Priors](https://ieeexplore.ieee.org/abstract/document/10447981/), D.-M. Byun, **S.-H. Lee**, J.-S. Hwang, and S.-W. Lee, **ICASSP**, 2024.
  • [DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion](https://arxiv.org/abs/2305.15816), H.-Y. Choi*, **S.-H. Lee***, and S.-W. Lee, **AAAI**, 2024. [[Demo]](https://hayeong0.github.io/DDDM-VC-demo/) [[Code]](https://github.com/hayeong0/DDDM-VC) [[Poster]](https://github.com/sh-lee-prml/sh-lee-prml/blob/main/DDDM-VC_poster.pdf)

2023

2022

2021

-2020

✨ Education

2016.03-2023.02: Integrated M.S. & Ph.D., Dept. of Brain and Cognitive Engineering, Korea University

2012.03-2016.02: B.S., Dept. of Life Science, Dongguk University

Awards and Services

Area Chair (AC): NeurIPS

Reviewer: NeurIPS, ICLR, ICML, AAAI, ICASSP, Interspeech, ACL ARR, IEEE/ACM Transactions on Audio, Speech, and Language Processing

2022.02.25: Paper Award (Multi-SpectroGAN: High-Diversity and High-Fidelity Spectrogram Generation with Adversarial Style Combination for Speech Synthesis), Korea University

🎙 Invited Talks


2024.06.25: Fake Audio Detection, Ajou University.

2024.06.07: Speech Synthesis, ์ œ2ํšŒAI์œตํ•ฉ์›Œํฌ์ˆ, Ajou University.

2024.05.24: Speech Language Model for Generative AI, KSCS2024

2023.08.18: Towards Unified Speech Synthesis for Text-to-Speech and Voice Conversion, Deepbrain AI

2023.08.11: Towards Unified Speech Synthesis for Text-to-Speech and Voice Conversion, Workshop on Brain and Artificial Intelligence 2023

2023.06.20: HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis, Top Conference Session in KCC2023

2022.08.19: VoiceMixer: Adversarial Voice Style Mixup, AIGS Symposium 2022

2022.07.01: VoiceMixer: Adversarial Voice Style Mixup, Top Conference Session in KCC2022

2021.12.02: Voice Conversion, Netmarble

2021.07.29: Speech Synthesis and Voice Conversion, Neosapience

Pinned

  1. HierSpeechpp (Public)

    The official implementation of HierSpeech++

    Python, 1.2k stars, 151 forks

  2. PeriodWave (Public)

    The official implementation of PeriodWave and PeriodWave-Turbo

    Python, 220 stars, 17 forks