Hypothesis

2 Matching Annotations

Mar 2023
github.com

openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision

1
1. polarislee 14 Mar 2023
  
  in Public
  
  Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
  
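  Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio, and it is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.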
  
  ASR Open Source Whisper 음성인식 (speech recognition) openAI Whisper API

Tags

음성인식 (speech recognition)

Whisper API

openAI

Whisper

ASR

Open Source

Annotators

polarislee

URL

github.com/openai/whisper
Sep 2021
arxiv.org

Speech Recognition: Key Word Spotting through Image Recognition

1
1. mshook 17 Sep 2021
  
  in Public
  
  Humans perform a version of this task when interpreting hard-to-understand speech, such as an accent which is particularly fast or slurred, or a sentence in a language we do not know very well; we do not necessarily hear every single word that is said, but we pick up on salient key words and contextualize the rest to understand the sentence.
  
  Boy, don't they
  
  asr speech nn ml human language recognition understanding meaning

Tags

asr

nn

understanding

recognition

human

speech

language

ml

meaning

Annotators

mshook

URL

arxiv.org/pdf/1803.03759.pdf
