Many CUDA kernels are bandwidth bound, and the increasing ratio of flops to bandwidth in new hardware results in more bandwidth bound kernels. This makes it…
Category: Misc
If you’ve used the NVIDIA CUDA Compiler (NVCC) for your NVIDIA GPU application recently, you may have encountered a warning message like the following: nvcc… What does this mean exactly, and what actions should you take? In this post, we’ll explain how the NVIDIA CUDA Toolkit and NVIDIA Driver work together to support GPUs. The software stack for programming GPUs is divided into two…
NVIDIA CUDA-Q 0.12 introduces new simulation tools for accelerating how researchers develop quantum applications and design performant quantum hardware. With the new API, users can obtain more detailed statistics on individual runs (or shots) of a simulation, rather than being restricted to aggregated statistical outputs from simulations. Access to raw shot data is important to researchers…
A key challenge for retrieval-augmented generation (RAG) systems is handling user queries that lack explicit clarity or carry implicit intent. Users often phrase questions imprecisely. For instance, consider the user query, “Tell me about the latest update in NVIDIA NeMo model training.” It’s possible that the user is implicitly interested in advancements in NeMo large language model (LLM)…
You’ve been there. You wrote the perfect Python script, tested it on a sample CSV, and everything worked flawlessly. But when you unleashed it on the full 10 million row dataset, your laptop fan started screaming, your console froze, and you had enough time to brew three pots of coffee before seeing a result. What if you could get massive speedups on those exact same workflows with a simple…
Quantization is a core tool for developers aiming to improve inference performance with minimal overhead. It delivers significant gains in latency, throughput, and memory efficiency by reducing model precision in a controlled way, without requiring retraining. Today, most models are trained in FP16 or BF16, with some, like DeepSeek-R1, natively using FP8. Further quantizing to formats like FP4…
Just Released: NVIDIA HPC SDK v25.7
The HPC SDK v25.7 includes support for CUDA 12.9U1, updated library components, bugfixes, and performance improvements.
Just Released: NVIDIA cuPQC v0.4
This update introduces Poseidon2 to cuHash and a Merkle Tree API compatible with all cuHash hash functions.
Prompt injection, where adversaries manipulate inputs to make large language models behave in unintended ways, has posed a threat to AI systems since the earliest days of LLM deployment. While defenders have made progress securing models against text-based attacks, the shift to multimodal and agentic AI is rapidly expanding the attack surface. This is where red teaming plays a vital…
August brings new levels of gaming excitement on GeForce NOW, with 2,300 titles now available to stream in the cloud. Grab a controller and get ready for epic adventures — a dozen new games are coming to the cloud this month. Each week brings fresh titles for members to discover, stream and conquer. Get ready