SooHwan Eom

PhD Student, U-AIM Lab, Electrical Engineering, KAIST

prof_pic.jpg

Daejeon, Korea

sean1105@kaist.ac.kr (academic)

eomsoohwan1105@gmail.com (permanent)

I am a Ph.D. student in Electrical Engineering at KAIST, advised by Prof. Chang D. Yoo. I received my B.S. (2022) and M.S. (2024), both in Electrical Engineering from KAIST.

My research focuses on large language model (LLM) alignment and reasoning, multimodal learning, and self-supervised speech representation learning. Recent projects include parameter-efficient adaptation of foundation models and multimodal language models. More broadly, I study principled methods for adapting and improving foundation models under limited, noisy, or structured supervision — with the long-term goal of building self-improving, agentic AI systems that reason reliably in complex domains.

news

Jun 2026 Token-level Response-visual Attention Guidance for Multimodal LLMs Knowledge Distillation got accepted to ECCV 2026.
Jun 2026 Transcript-Free Flow-Matching Text-to-Speech via Speech Feature Conditioning got accepted to Interspeech 2026.
May 2026 Decomposed On-Policy Distillation for Vision-Language Reasoning: Steering Gradients for Visual Grounding got accepted to ICML 2026 (Spotlight, top 2.2%).
Feb 2026 PDCR: Perception-Decomposed Confidence Reward for Vision-Language Reasoning got accepted to CVPR 2026.
Sep 2025 Our review on wearable blood pressure sensors and machine learning algorithms for blood pressure estimation was published in Nature Reviews Cardiology.

selected publications

  1. Transcript-Free Flow-Matching Text-to-Speech via Speech Feature Conditioning
    SooHwan Eom, Hee Suk Yoon, Eunseop Yoon, Mark Hasegawa-Johnson, and Chang D. Yoo
    In Proceedings of INTERSPEECH, 2026
  2. Decomposed On-Policy Distillation for Vision-Language Reasoning: Steering Gradients for Visual Grounding
    Hee Suk Yoon, Eunseop Yoon, Jaehyun Jang, SooHwan Eom, Ji Woo Hong, Mark Hasegawa-Johnson, Qi Dai, Chong Luo, and Chang D. Yoo
    In Proceedings of the International Conference on Machine Learning (ICML), 2026
  3. SiamCTC: Learning Speech Representations through Monotonic Temporal Alignment
    In Proceedings of INTERSPEECH, 2025
  4. Query-based Cross-Modal Projector Bolstering Mamba Multimodal LLM
    SooHwan Eom, Jay Shim, Gwanhyeong Koo, Haebin Na, Mark A. Hasegawa-Johnson, Sungwoong Kim, and Chang D. Yoo
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  5. ACL
    TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback
    Eunseop Yoon*, Hee Suk Yoon*, SooHwan Eom*, Gunsoo Han, Daniel Nam, Daejin Jo, Kyoung-Woon On, Mark Hasegawa-Johnson, Sungwoong Kim, and Chang D. Yoo
    In Findings of the Association for Computational Linguistics: ACL 2024, 2024
  6. AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition
    SooHwan Eom, Eunseop Yoon, Hee Suk Yoon, Chanwoo Kim, Mark Hasegawa-Johnson, and Chang D. Yoo
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024