Tuesday, June 23, 2026
Merged timeline of 266 items — blog publish times and listing timestamps, cut at midnight . Page 2 of 6.
Merged timeline of 266 items — blog publish times and listing timestamps, cut at midnight . Page 2 of 6.
Masked Auto-Encoder (MAE) for self-supervised pretraining and fine-tuning. Masks random patches and reconstructs
PyTorch-based TAO image classification. Supports a wide range of backbones (FAN, EfficientNet, ResNet, etc.)
Grounding DINO for open-set object detection. Combines DINO-style detection with a BERT text encoder for
Stereo depth estimation using FoundationStereo. Predicts disparity maps from stereo image pairs for 3D
DINO (DETR with Improved DeNoising Anchor Boxes) for 2D object detection. Transformer-based detector with
Performs gap analysis on NVIDIA TAO VCN Classify (Visual Component Net) experiments by invoking the data-services container (`tao_toolkit.data_services` from `versions.yaml`) directly via `docker run … gap_analysis vcn_…
Deformable DETR for 2D object detection. Uses deformable attention for efficient multi-scale feature processing,
Monocular depth estimation using Metric Depth Anything v2 or Relative Depth Anything architectures. Predicts
CenterPose for keypoint / pose estimation. Detects object centers and regresses keypoint locations for 6-DoF
BEVFusion for multi-sensor 3D object detection. Fuses LiDAR point clouds and camera images in bird's-eye-view
Brev managed GPU instances with Docker support. Use when running TAO training, evaluation, or inference on
TAO Execution SDK for submitting and monitoring GPU training jobs on supported platforms (Brev, SLURM,
Performs deep Root Cause Analysis (RCA) on NVIDIA TAO Visual ChangeNet classification experiments with
Action recognition from video sequences. Supports RGB, optical flow, and joint (multi-stream) input types for
NVIDIA RAG Blueprint — deploy, configure, troubleshoot, and manage. Handles any RAG action: deploy, install, start, enable, disable, toggle, change, configure, troubleshoot, debug, fix, shutdown, stop, or tear down any…
Routes the weakest VCN samples (output of `tao-analyze-gaps-visual-changenet`) into per-augmentation-module
Official NVIDIA-authored guidance for navigating PhysicsNeMo — pick the model, datapipe, or example for a SciML/AI4Science task (surrogates, forecasting, downscaling, physics-informed, inverse, generative). Points at ex…
Run AutoML / hyperparameter optimization (HPO) for NVIDIA TAO networks using AutoMLRunner. Handles algorithm
Kubernetes execution platform — submits TAO container jobs as single-pod k8s Jobs with NVIDIA GPU scheduling.
CLIP vision-language model for image-text retrieval, zero-shot classification, embedding extraction, ONNX
DGX Cloud Lepton managed GPU compute platform with run/status/cancel interface. Use when submitting TAO jobs
Use only to generate or update a governance skill card for a specified existing agent skill directory. Do not use for explaining, listing, comparing, or discussing skill capabilities.
Remote SLURM GPU cluster execution over SSH with sbatch/srun, Pyxis/Enroot containers, and Lustre-backed
Local or remote Docker execution for TAO SDK job containers using a Docker daemon with NVIDIA GPU runtime. Use
Cosmos3-Nano video QA supervised fine-tuning with FSDP parallelism. Use when training or evaluating video
Choose the right MoE token dispatcher (`alltoall`, DeepEP, or HybridEP) for the hardware, EP degree, and optimization stage. Summarizes patterns from DSV3, Qwen3, Qwen3-Next, and VLM bring-up work.
MoE expert-parallel communication overlap in Megatron Bridge. Covers dispatch/combine overlap, flex dispatcher backends, and expert wgrad scheduling.
Long-context MoE training guidance for Megatron Bridge. Covers CP sizing, selective recompute, dispatcher choices, and practical patterns from DSV3, Qwen3, and Qwen3-Next long-context experiments.
Representative MoE training playbooks by hardware platform and model family. Summarizes rounded throughput bands, parallelism patterns, and common tuning stacks.
Router for NVIDIA NuRec/NRE: USDZ rendering, NCore conversion, 3DGS, gRPC sensor sim, PhysicalAI HF datasets. Do NOT use for SimReady or infra setup.
>-
Recommend and customize Megatron Bridge recipes for a user's model, GPU count, and training goal. Indexes library recipes (pretrain/SFT/PEFT) and performance recipes.
Top-level workflow skill for USD performance diagnosis and optimization. Use for slow loading, high memory, low FPS, or 'optimize my scene' requests; delegates auth/runtime setup to Phase 0 owners.
Use as the top-level router for Omniverse Realtime Viewer USD app requests and focused viewer reference documents.
Operational guide for enabling TP, DP, and PP communication overlap in Megatron-Bridge, including config knobs, code anchors, pitfalls, and verification.
Coordinate the end-to-end CAD/source-asset to SimReady workflow. Use for broad requests such as CAD to SimReady, source asset to simulation-ready USD, or prop packaging that require conversion, material/physics assignme…