Optimizations
Attention Optimizations
Activation Recomputation
Communication Overlap
CPU Offloading