news

Jun 2026 Token-level Response-visual Attention Guidance for Multimodal LLMs Knowledge Distillation got accepted to ECCV 2026.
Jun 2026 Transcript-Free Flow-Matching Text-to-Speech via Speech Feature Conditioning got accepted to Interspeech 2026.
May 2026 Decomposed On-Policy Distillation for Vision-Language Reasoning: Steering Gradients for Visual Grounding got accepted to ICML 2026 (Spotlight, top 2.2%).
Feb 2026 PDCR: Perception-Decomposed Confidence Reward for Vision-Language Reasoning got accepted to CVPR 2026.
Sep 2025 Our review on wearable blood pressure sensors and machine learning algorithms for blood pressure estimation was published in Nature Reviews Cardiology.
May 2025 SiamCTC: Learning Speech Representations through Monotonic Temporal Alignment got accepted to Interspeech 2025.
Sep 2024 Query-based Cross-Modal Projector Bolstering Mamba Multimodal LLM got accepted to Findings of EMNLP 2024.
May 2024 TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback got accepted to Findings of ACL 2024.
Dec 2023 AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition got accepted to ICASSP 2024.
May 2023 Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction got accepted to Interspeech 2023.
Feb 2023 Counterfactual Two-Stage Debiasing for Video Corpus Moment Retrieval got accepted to ICASSP 2023.