+
+
+
+

The AI Performance Engineer

Your AI for GPU optimization. Profiling, PTX, and detailed hardware docs, all where you already code

>Coming Soon_
Backed by
Fifty Years
Fifty Years
Y Combinator
Y Combinator
Liquid 2
Liquid 2
NVIDIA Inception
NVIDIA Inception
Jeff Dean
Jeff DeanChief Scientist at Google
Woj Zaremba
Woj ZarembaCo-Founder at OpenAI
Charlie Songhurst
Charlie SonghurstMeta Board of Directors
Arash Ferdowsi
Arash FerdowsiCo-Founder at Dropbox
Dan Fu
Dan FuHead of Kernels at Together
Kawal Gandhi
Kawal GandhiOffice of the CTO at Google
Fifty Years
Fifty Years
Y Combinator
Y Combinator
Liquid 2
Liquid 2
NVIDIA Inception
NVIDIA Inception
Jeff Dean
Jeff DeanChief Scientist at Google
Woj Zaremba
Woj ZarembaCo-Founder at OpenAI
Charlie Songhurst
Charlie SonghurstMeta Board of Directors
Arash Ferdowsi
Arash FerdowsiCo-Founder at Dropbox
Dan Fu
Dan FuHead of Kernels at Together
Kawal Gandhi
Kawal GandhiOffice of the CTO at Google
Fifty Years
Fifty Years
Y Combinator
Y Combinator
Liquid 2
Liquid 2
NVIDIA Inception
NVIDIA Inception
Jeff Dean
Jeff DeanChief Scientist at Google
Woj Zaremba
Woj ZarembaCo-Founder at OpenAI
Charlie Songhurst
Charlie SonghurstMeta Board of Directors
Arash Ferdowsi
Arash FerdowsiCo-Founder at Dropbox
Dan Fu
Dan FuHead of Kernels at Together
Kawal Gandhi
Kawal GandhiOffice of the CTO at Google
NCU Profiling

Profile your kernels directly in your editor. Automatically use as agent context

NCU Profile Summary
GPU Documentation

Grounded in NVIDIA's full docs - from CUDA kernels to CUTLASS templates

GPU Documentation Tool Screenshot
Generated Code Analysis

Easily work directly at the PTX & SASS level. Step through assembly, and auto-generate more efficient variants.

PTX and SASS Code View
AI-Powered Optimization

Let our agent find the bottlenecks. It reads profiling data, understands your hardware, and implements optimizations.

10x your GPU engineering productivity

Available as a Cursor, VSCode extension and a powerful CLI. All your GPU development tools in one place.

>Coming Soon_

Everything you need for GPU development. Built for kernel engineers who want to ship faster.

NCU Integration

Run NVIDIA Compute Utility profiles directly from your editor. Get insights without context switching.

Documentation Search

Search CUDA programming guides, API references, and optimization best practices instantly.

Remote Execution

Send your code to run on remote GPU machines. No more expensive local hardware for prototyping.

PTX Visualization

See the generated PTX and SASS from your CUDA code. Like Godbolt, but for GPU kernels.

AI Agent

An agent that reads your profiling data and suggests the next optimization to implement.

Tool Calling

The agent can call NCU, search docs, and run code—same actions you can do, but automated.

Code Diff

Review agent-suggested changes before applying. Accept, reject, or modify the proposed optimizations.

Hyperparameter Tuning

Automatically sweep common kernel hyperparameters like tile sizes, thread counts, and unroll factors.