Trending
Articles related to "Automatic Speech Recognition"
The Interspeech 2025 Speech Accessibility Project Challenge
cs.AI updates on arXiv.org 2025-07-30T04:12:06.000000Z
Koel-TTS: Enhancing LLM based Speech Generation with Preference Alignment and Classifier Free Guidance
cs.AI updates on arXiv.org 2025-07-24T05:31:34.000000Z
Improving Contextual ASR via Multi-grained Fusion with Large Language Models
cs.AI updates on arXiv.org 2025-07-17T04:14:51.000000Z
Pronunciation Deviation Analysis Through Voice Cloning and Acoustic Comparison
cs.AI updates on arXiv.org 2025-07-16T04:29:03.000000Z
Supporting SENĆOTEN Language Documentation Efforts with Automatic Speech Recognition
cs.AI updates on arXiv.org 2025-07-16T04:28:58.000000Z
Omni-Router: Sharing Routing Decisions in Sparse Mixture-of-Experts for Speech Recognition
cs.AI updates on arXiv.org 2025-07-09T04:01:48.000000Z
Audio-3DVG: Unified Audio - Point Cloud Fusion for 3D Visual Grounding
cs.AI updates on arXiv.org 2025-07-02T04:03:46.000000Z
Loading a Whisper Model with the Transformers Pipeline for Automatic Speech Recognition (ASR)
Juejin AI 2025-06-08T07:28:24.000000Z
Learning Automatic Speech Recognition (ASR) with Hugging Face
Juejin AI 2025-06-08T04:03:13.000000Z
3Play Media Releases Annual Study, Finds ASR Technology Showing Signs of Plateau
CDSA 2025-05-28T11:03:53.000000Z
NVIDIA Open Sources Parakeet TDT 0.6B: Achieving a New Standard for Automatic Speech Recognition ASR and Transcribes an Hour of Audio in One Second
MarkTechPost@AI 2025-05-06T05:55:49.000000Z
What’s Next for Automatic Speech Recognition? Challenges and Cutting-Edge Approaches
Unite.AI 2025-02-21T17:27:44.000000Z
Apple Researchers Propose a Novel AI Algorithm to Optimize a Byte-Level Representation for Automatic Speech Recognition ASR and Compare it with UTF-8 Representation
MarkTechPost@AI 2024-09-11T16:50:32.000000Z