| Jun 2026 | Token-level Response-visual Attention Guidance for Multimodal LLMs Knowledge Distillation got accepted to ECCV 2026. |
| Jun 2026 | Transcript-Free Flow-Matching Text-to-Speech via Speech Feature Conditioning got accepted to Interspeech 2026. |
| May 2026 | Decomposed On-Policy Distillation for Vision-Language Reasoning: Steering Gradients for Visual Grounding got accepted to ICML 2026 (Spotlight, top 2.2%). |
| Feb 2026 | PDCR: Perception-Decomposed Confidence Reward for Vision-Language Reasoning got accepted to CVPR 2026. |
| Sep 2025 | Our review on wearable blood pressure sensors and machine learning algorithms for blood pressure estimation was published in Nature Reviews Cardiology. |
| May 2025 | SiamCTC: Learning Speech Representations through Monotonic Temporal Alignment got accepted to Interspeech 2025. |
| Sep 2024 | Query-based Cross-Modal Projector Bolstering Mamba Multimodal LLM got accepted to Findings of EMNLP 2024. |
| May 2024 | TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback got accepted to Findings of ACL 2024. |
| Dec 2023 | AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition got accepted to ICASSP 2024. |
| May 2023 | Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction got accepted to Interspeech 2023. |
| Feb 2023 | Counterfactual Two-Stage Debiasing for Video Corpus Moment Retrieval got accepted to ICASSP 2023. |