Hypothesis

3 Matching Annotations

Last 7 days
arxiv.org arxiv.org

Improving Robustness of LLM-based Speech Synthesis by Learning Monotonic Alignment

2
1. bharat3012 30 Jan 2026
  
  in Public
  
  n encoder-decodertransformers, the TTS alignment is learned in certain cross-attention heads of the decoder; while in decoder-only models,the alignment is learned in the self-attention layers.
  
  Good point of difference between En-De vs De only models.
2. bharat3012 30 Jan 2026
  
  in Public
  
  his issue becomes more promi-nent when the input text is challenging and contains repeatingwords.
  
  Important on Data Usage perspective
Visit annotations in context

Annotators

bharat3012

URL

arxiv.org/pdf/2406.17957
Jun 2024
nvidia.github.io nvidia.github.io

Using the NVIDIA API Catalog — NVIDIA Generative AI Examples 24.6.0 documentation

1
1. bharat3012 24 Jun 2024
  
  in Public
  
  git clone git@github.com:NVIDIA/GenerativeAIExamples.git
  
  Unable to clone
Visit annotations in context

Annotators

bharat3012

URL

nvidia.github.io/GenerativeAIExamples/latest/api-catalog.html