tao✦ Official

tao-generate-referring-expressions

Four-step image referring-expression pipeline: turns images plus KITTI bounding-box labels into region

nvidia/skills|Updated Jun 23, 2026

Works with

Claude CodeCursorClineWindsurfCodex

Installation Guide

Select your AI agent

How to use tao-generate-referring-expressions on Cursor

AI-first code editor with Composer

Prerequisites

Before installing skills in Cursor, ensure your development environment meets these requirements:

›Cursor installed and configured on your machine
›Node.js 16+ with npm — verify with node --version
›Active project directory where you want to add tao-generate-referring-expressions

Run the install command

Execute the skills CLI command in your project's root directory to begin installation:

$npx skills install nvidia/skills/tao-generate-referring-expressions

Fetches tao-generate-referring-expressions from nvidia/skills and configures it for Cursor.

Select Cursor when prompted

The CLI shows a list of agents. Use arrow keys and space to select Cursor:

◆ Which agents do you want to install to?

│

│ ── Universal (.agents/skills) ────────────────

│ · Cline · Codex · Goose · Windsurf

│ ●Cursor(selected)

│ · Cursor · Aider · Continue

Verify installation

Confirm successful installation by checking the skill directory location:

.cursor/skills/tao-generate-referring-expressions

Restart Cursor to activate tao-generate-referring-expressions. Access via /tao-generate-referring-expressions in your agent's command palette.

⚠

Security Notice

We perform automated surface-level scans (Gen AI Scanner, Socket, Snyk) during installation. These checks detect common vulnerabilities but do not guarantee complete security. Always review skill source code and verify the publisher's reputation before production use.

Skills execute code in your environment. Always review source, verify the publisher, and test in isolation before production.

›View source on GitHub ›Skills CLI docs ›About Cursor ›What are agent skills?

Documentation

List & Monetize Your Skill

Submit your Claude Code skill and start earning

Get started →

Use Cases

Task Automation & Efficiency

Automate repetitive workflows and reduce manual effort

Example

Generate reports, summarize documents, draft communications

✓

Save 3-5 hours per week on routine tasks

Knowledge Enhancement

Learn new skills, understand complex topics, get expert guidance

Example

Explain concepts, provide examples, suggest learning resources

✓

Accelerate learning and skill development by 2x

Quality Improvement

Enhance output quality through reviews, suggestions, and refinements

Example

Review drafts, suggest improvements, catch errors

✓

Improve work quality by 30-40% with less effort

name	tao-generate-referring-expressions
description	"Four-step image referring-expression pipeline: turns images plus KITTI bounding-box labels into region descriptions, scene captions, grounded referring expressions, and (optionally) verified expressions via VLM distillation. Use when the user wants to generate referring-expression annotations from images with KITTI labels, build region descriptions, produce grouped grounding phrases tied to bboxes, run a double-check verification pass on grounding expressions, auto-label traffic / scene images for referring datasets, or run the image_referring_expression pipeline. Triggers include 'referring expression', 'region description', 'KITTI labels', 'spatial relationship annotation', 'auto-label image referring expression', 'image_referring_expression'."
license	Apache-2.0
compatibility	Requires docker + nvidia-container-toolkit + at least one VLM endpoint (Gemini API key or OpenAI-compatible).
metadata	author: NVIDIA Corporation version: "0.1.0"
tags	- image - referring-expression - kitti - bounding-boxes - auto-label - vlm
allowed-tools	Read Bash Write

Field	Default	Description
`workflow.steps`	`["0","1","2","3"]`	Which steps to execute (`0`=region_expr, `1`=image_caption, `2`=grounding_expr, `3`=double_check)
`workflow.max_workers`	`4`	Parallel threads per step (watch API rate limits)
`workflow.force_reprocess`	`false`	Ignore cached per-step outputs and reprocess from scratch
`workflow.output_format`	`"jsonl"` (set to `"both"` in the default spec)	`"jsonl"`, `"legacy"`, or `"both"`
`vlm.backend`	`"gemini"`	`"gemini"` or `"openai"` (OpenAI-compatible endpoint)
`data.image_dir`	required	Directory of input images (`.jpg` / `.jpeg` / `.png`)
`data.kitti_label_dir`	required (unless resuming)	Directory of KITTI-format `.txt` label files
`data.input_annotations_jsonl`	`""`	Optional pre-seeded `annotations.jsonl` (skips KITTI seeding)

tao-generate-referring-expressions

Installation Guide

How to use tao-generate-referring-expressions on Cursor

Prerequisites

Run the install command

Select Cursor when prompted

Verify installation

Security Notice

Documentation

List & Monetize Your Skill

Use Cases

Task Automation & Efficiency

Knowledge Enhancement

Quality Improvement

Install Skill

Image Referring Expression Pipeline

Purpose

Pipeline Architecture

Instructions

Initial setup

Running the pipeline

Recommended pilot workflow

Configuration

Inputs

Outputs

Prerequisites

Implementation Guide

Best Practices

When to Use This

Learning Path

Related Skills

vss-generate-video-calibration

cuopt-install

cuopt-routing-formulation

jetson-print-bsp-info

jetson-memory-audit

jetson-speculative-decoding

Reviews

Discussion