NeMo
NVIDIA's NeMo framework for speech, TTS, multimodal and LLM training & fine-tuning.
Overview
NeMo is a multi-domain AI toolkit from NVIDIA that covers ASR, TTS, vision, and NLP, with modular tools for training and deploying large language models.
Key features
- Collections for speech, vision and NLP tasks.
- Tooling for training, evaluation and model export.
Use cases
- Multimodal AI research and production deployments.
Technical highlights
- Modular collections and container-friendly tooling with comprehensive docs.