Key Takeaways
The used NVIDIA RTX 3060 12 GB is the best GPU under $300 for local LLMs because 12 GB of VRAM plus zero-setup CUDA support gives you a working LLM box in minutes. At $150-250 in the May 2026 used market, it runs Mistral 7B, Llama 3 8B, and Qwen3 8B at 15-20 tokens per second, and most 13B models at Q4.
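The capacity claim above can be sanity-checked with back-of-the-envelope arithmetic. The sketch below is a rough estimate, not a measurement. It assumes roughly 4.5 bits per weight for a Q4 quantization and a flat 1.5 GB allowance for KV cache and runtime buffers. Both figures are illustrative assumptions, and real usage varies with context length and quantization variant.

```python
def estimate_vram_gb(params_billion: float, bits_per_weight: float,
                     overhead_gb: float = 1.5) -> float:
    """Rough VRAM need in GB: quantized weights plus a flat overhead allowance.

    overhead_gb is an assumed allowance for KV cache and buffers, not a measured value.
    """
    # 1e9 params * bits / 8 bits-per-byte / 1e9 bytes-per-GB simplifies to this:
    weights_gb = params_billion * bits_per_weight / 8
    return weights_gb + overhead_gb


def fits(params_billion: float, bits_per_weight: float, vram_gb: float = 12.0) -> bool:
    """True if the estimate fits within the card's VRAM."""
    return estimate_vram_gb(params_billion, bits_per_weight) <= vram_gb


if __name__ == "__main__":
    # Assumed Q4 density of ~4.5 bits per weight; FP16 is 16 bits per weight.
    for name, size in [("Mistral 7B", 7), ("Llama 3 8B", 8), ("13B model", 13)]:
        q4 = estimate_vram_gb(size, 4.5)
        print(f"{name}: ~{q4:.1f} GB at Q4, fits 12 GB: {fits(size, 4.5)}")
```

Under these assumptions a 13B model at Q4 lands under 12 GB, while the same model at FP16 would not, which is why quantization matters at this price point.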
The RTX 3060 wins on software. Ollama and llama.cpp detect NVIDIA GPUs via CUDA automatically on Windows and Linux, with no driver hunting and no ROCm. The AMD RX 6700 XT ($130-200 used) saves $30-80 and matches the 12 GB capacity, but ROCm setup on Linux typically takes 3-5 hours, and ROCm is unsupported on Windows for fast inference.
Choose the RX 6700 XT only if budget is the single deciding factor and you are comfortable on Linux. For everyone else, the RTX 3060 12 GB is the safer first GPU. Avoid the 6 GB RTX 3060 variant: it looks identical in listings but only fits 3B models.
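Because the smaller-memory variant is easy to mistake in a listing, it can help to confirm the memory size once a card is installed. The following is a minimal sketch that parses `nvidia-smi` output. It assumes the NVIDIA driver is installed and `nvidia-smi` is on the PATH. The helper names and the 11,000 MiB threshold are hypothetical choices for illustration.

```python
import subprocess

# nvidia-smi query flags; with "nounits", memory.total is reported in MiB.
QUERY = ["nvidia-smi", "--query-gpu=name,memory.total",
         "--format=csv,noheader,nounits"]


def parse_gpus(output: str) -> list[tuple[str, int]]:
    """Parse lines like 'NVIDIA GeForce RTX 3060, 12288' into (name, MiB) pairs."""
    gpus = []
    for line in output.strip().splitlines():
        name, mem = line.rsplit(",", 1)  # split on the last comma only
        gpus.append((name.strip(), int(mem.strip())))
    return gpus


def has_12gb(mem_mib: int, min_mib: int = 11_000) -> bool:
    """Hypothetical threshold: treat anything at or above ~11,000 MiB as a 12 GB card."""
    return mem_mib >= min_mib


if __name__ == "__main__":
    try:
        out = subprocess.run(QUERY, capture_output=True, text=True, check=True).stdout
    except (FileNotFoundError, subprocess.CalledProcessError):
        print("nvidia-smi not available; is the NVIDIA driver installed?")
    else:
        for name, mem in parse_gpus(out):
            verdict = "12 GB class" if has_12gb(mem) else "less than 12 GB"
            print(f"{name}: {mem} MiB ({verdict})")
```

Running this right after installing a used card gives a quick check that the seller shipped the 12 GB model, while the return window is still open.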
Both cards carry 12 GB of VRAM, so model capacity is identical; the decision is CUDA versus ROCm. Prices below are a May 2026 US used-market snapshot; the 2026 memory shortage keeps GPU prices volatile, so re-check before buying.
| GPU | VRAM | Price (May 2026) | Setup | Best for |
|---|---|---|---|---|
| RTX 3060 12 GB | 12 GB | $150-250 used | CUDA, instant | Best pick: no setup friction |
| RX 6700 XT | 12 GB | $130-200 used | ROCm, 3-5 hours | Cheapest, if you accept the AMD setup work |