
NVIDIA A100 PCIe 40GB — Benchmarks & Specs

40 GB VRAM · 250 W TDP · $12,000 MSRP · 2020

The NVIDIA A100 PCIe 40GB is an Ampere-family graphics card that NVIDIA released in 2020. Key on-paper specs include 40 GB of HBM2 VRAM and a 250 W TDP. It launched with a $12,000 MSRP, though street prices typically diverge from launch pricing, so check current listings before buying. Data on this page draws on 5 synthetic benchmark results and 10 community AI inference reports, aggregated from the sources cited inline on each row (PassMark, Puget Systems Labs, Bizon Tech, DatabaseMart, and vLLM GitHub Discussions). Read this page when shopping the NVIDIA A100 PCIe 40GB, comparing it against other graphics cards, or sizing it for a specific workload such as GPU rendering, FP64 compute, or local LLM inference.

Synthetic Benchmarks

Higher is better. Bars are scaled within each benchmark family so you can compare like-with-like at a glance. Scores from different benchmarks use different units (points, Gflops) and are not comparable with each other.

PassMark G3D Mark via PassMark VideoCardBenchmark
13,326 points
HPL (High Performance LINPACK) FP64 via Puget Systems Labs
10,940 Gflops
Blender (Bizon composite) via Bizon Tech GPU Benchmarks
3,788 points
V-Ray GPU via Bizon Tech GPU Benchmarks
1,555 points
OctaneBench via Bizon Tech GPU Benchmarks
498 points

AI Inference Performance

Tokens per second under each model + quantization. Higher = faster generation. Bars compare runs across the same model.

gemma-3-4b-it FP16 via DatabaseMart
3385.7 tok/s
llama2:7b FP16 via vLLM GitHub Discussions
2246.0 tok/s
llama-7b via vLLM GitHub Discussion #275
2246.0 tok/s
DeepSeek-R1-Distill-Llama-8B FP16 via DatabaseMart
2225.3 tok/s
qwen2.5:7b FP16 via DatabaseMart
2091.2 tok/s
qwen3:32b FP16 via DatabaseMart
913.7 tok/s
open_llama:13b FP16 via vLLM GitHub Discussions
745.2 tok/s
DeepSeek-R1-Distill-Qwen-14B FP16 via DatabaseMart
615.6 tok/s
qwen2.5:14b FP16 via DatabaseMart
574.9 tok/s
deepseek-moe-16b-base FP16 via DatabaseMart
420.3 tok/s
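
To put the tokens-per-second figures in practical terms, the sketch below converts a throughput number into the time needed to produce a given number of output tokens. It is a rough illustration only: the 1,000-token response length is a hypothetical choice, and this page does not say whether each figure is single-stream speed or aggregate throughput across concurrent requests, so the result is better read as a throughput budget than as per-request latency.

```python
def seconds_to_generate(num_tokens: int, tokens_per_second: float) -> float:
    """Time to emit num_tokens at a steady generation rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Throughput values copied from the list above (tok/s).
reported = {
    "gemma-3-4b-it FP16": 3385.7,
    "qwen2.5:14b FP16": 574.9,
    "deepseek-moe-16b-base FP16": 420.3,
}

if __name__ == "__main__":
    response_tokens = 1000  # hypothetical output length, not from the sources
    for model, rate in reported.items():
        secs = seconds_to_generate(response_tokens, rate)
        print(f"{model}: {secs:.2f} s for {response_tokens} tokens")
```

The same function can be used to compare any two rows: divide one row's time by another's to see the relative slowdown for a fixed output length.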

Full Specifications

TDP: 250 W
VRAM: 40 GB
CUDA cores: 6,912
Memory type: HBM2

NVIDIA A100 PCIe 40GB — Frequently Asked Questions

What is the NVIDIA A100 PCIe 40GB best used for?
The NVIDIA A100 PCIe 40GB is an Ampere-family data center GPU with 40 GB of HBM2 VRAM. It is best suited to local LLM inference, FP64 compute, and GPU rendering; the card has no display outputs, so it is not a gaming part. See the synthetic and AI benchmark tables above for measured performance.
When was the NVIDIA A100 PCIe 40GB released, and what was its launch MSRP?
The NVIDIA A100 PCIe 40GB launched in 2020 at a $12,000 MSRP. Street prices diverge from launch pricing over a product's lifetime, so check current retailer listings for pricing and availability.
Where do the benchmark numbers on this page come from?
Synthetic benchmarks are collected from public sources (PassMark VideoCardBenchmark, Puget Systems Labs, and Bizon Tech GPU Benchmarks). AI inference numbers come from DatabaseMart and vLLM GitHub Discussions. Every benchmark row carries an inline source citation; click through to verify the original number.
Can the NVIDIA A100 PCIe 40GB run local LLMs?
Yes. The NVIDIA A100 PCIe 40GB has 10 AI inference benchmarks on file (see the AI Inference Performance section above for model + tokens-per-second numbers). With 40 GB of VRAM, it fits popular 32B-parameter open-weight models at Q4 quantization comfortably: at roughly 4 bits per weight, the weights take about 16 GB, which leaves headroom for the KV cache and runtime buffers.
Where can I buy the NVIDIA A100 PCIe 40GB?
Active Amazon listings aren't on file for this exact SKU yet. See the linked benchmark sources and the Compare tool for adjacent parts that may be in stock — and check the /benchmarks index for the latest curated picks in this category.
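
For the local-LLM question above, the "fits in 40 GB" claim can be checked with back-of-the-envelope arithmetic. The sketch below estimates weight memory as parameters times bits per weight, then adds a flat overhead allowance. The 20% overhead figure and the function names are assumptions made for illustration; real usage depends on context length, batch size, and the inference runtime, and practical 4-bit formats often store slightly more than 4 bits per weight.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GB (10^9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         vram_gb: float = 40.0, overhead_fraction: float = 0.2) -> bool:
    """True if weights plus an assumed overhead allowance fit in VRAM.

    overhead_fraction is a hypothetical allowance for KV cache,
    activations, and runtime buffers; it is not a measured value.
    """
    needed = weight_memory_gb(params_billion, bits_per_weight) * (1 + overhead_fraction)
    return needed <= vram_gb


if __name__ == "__main__":
    for label, bits in [("Q4", 4), ("Q8", 8), ("FP16", 16)]:
        gb = weight_memory_gb(32, bits)
        verdict = "fits" if fits(32, bits) else "does not fit"
        print(f"32B at {label}: ~{gb:.0f} GB weights, {verdict} in 40 GB")
```

Under these assumptions a 32B model at 4 bits per weight needs about 16 GB for weights, while the same model at 16 bits per weight needs about 64 GB and exceeds a single 40 GB card.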
