热点
关于我们
xx
xx
"
语音处理
" 相关文章
StutterCut: Uncertainty-Guided Normalised Cut for Dysfluency Segmentation
cs.AI updates on arXiv.org
2025-08-05T11:10:22.000000Z
Synthetic Data Generation for Phrase Break Prediction with Large Language Model
cs.AI updates on arXiv.org
2025-07-25T04:28:45.000000Z
字节推出中英同传新模型:模拟音色 延迟近专业同传译员水平
Cnbeta
2025-07-24T08:07:46.000000Z
On the Relationship between Accent Strength and Articulatory Features
cs.AI updates on arXiv.org
2025-07-08T05:54:07.000000Z
K-Function: Joint Pronunciation Transcription and Feedback for Evaluating Kids Language Function
cs.AI updates on arXiv.org
2025-07-08T05:54:00.000000Z
PhonemeFake: Redefining Deepfake Realism with Language-Driven Segmental Manipulation and Adaptive Bilevel Detection
cs.AI updates on arXiv.org
2025-07-01T06:49:15.000000Z
This company is using AI to give people American-sounding accents
The Verge - Artificial Intelligences
2025-03-26T15:39:36.000000Z
Alibaba Speech Lab Releases ClearerVoice-Studio: An Open-Sourced Voice Processing Framework Supporting Speech Enhancement, Separation, and Target Speaker Extraction
MarkTechPost@AI
2024-12-07T20:19:55.000000Z
快速创建 3D 数字人头;开源多功能修图神器;Runway 新增高级运镜功能;通义提示词生成连贯图像;音频版 LoRa 音乐创作
三花AI
2024-11-04T03:00:09.000000Z
大规模、动态「语音增强/分离」新基准!清华发布移动音源仿真平台SonicSim,含950+小时训练数据
新智元
2024-10-31T09:32:02.000000Z
大规模、动态「语音增强/分离」新基准,清华发布移动音源仿真平台SonicSim,含950+小时训练数据
36氪 - 科技频道
2024-10-31T07:29:09.000000Z
SpeechBrain: A PyTorch-based Speech Toolkit
MarkTechPost@AI
2024-10-08T07:21:16.000000Z
This AI Paper by NVIDIA Introduces NEST: A Fast and Efficient Self-Supervised Model for Speech Processing
MarkTechPost@AI
2024-09-13T05:05:44.000000Z
SpeechVerse: A Multimodal AI Framework that Enables LLMs to Follow Natural Language Instructions for Performing Diverse Speech-Processing Tasks
MarkTechPost@AI
2024-05-18T01:00:59.000000Z