éèŠãªãã€ã³ã
- CPU-only æšè«ã¯ 8â32 GB RAM æèŒã®ã¢ãã³ CPU äžã® 3â13B ã¢ãã«ã§å¹æçã§ãã
- æé«ã® CPU ã¢ãã«: Phi-4 Mini (3.8Bã2.3 GBã12 ããŒã¯ã³/ç§)ãGemma 3 2B (1.5 GBã15 ããŒã¯ã³/ç§)ãLlama 3.2 3B (2 GBã10 ããŒã¯ã³/ç§)ã
- CPU æšè«ã¯ GPU ãã 10â30à é ãã§ãããå°çš VRAM ããŒã䜿çšããŸãã
- Ollama ãŸã㯠llama.cpp ã§ CPU-only ã¢ãŒããæå¹ã«ããŸã â ã·ã³ãã«ãªã³ãã³ãã©ã€ã³ãã©ã°ã
- CPU æšè«ã¯æ¬çª API (GPU ãªãŒããŒãããäžèŠ)ããšããžããã€ã¹ãã³ã¹ãå¶çŽç°å¢ã«æé©ã§ãã
CPU 㯠LLM ãå®è¡ã§ããŸããïŒ
ã¯ããã¢ãã³ CPU (Intel i7-10äžä»£+ãAMD Ryzen 5000+ãApple M ã·ãªãŒãº) 㯠3â13B ã¢ãã«ã 8â15 ããŒã¯ã³/ç§ã§å®è¡ã§ããŸãã ãã㯠GPU ãã 10â30à é ãã§ãããå°çš VRAM ãå¿ èŠãšããŸãããååãªã·ã¹ãã RAM (8â32 GB) ãæèŒãã CPU ã¯ã$300+ ã® GPU ãå¿ èŠãšããã¢ãã«ãå®è¡ã§ããŸãã
CPU æšè«ã¯é床ãã¢ã¯ã»ã·ããªãã£ãšäº€æããŸã: GPU ãªãŒããŒããããŒããå®ç§ãªå®å®æ§ããã©ã€ããŒåé¡ãªããã«ãžã¥ã¢ã«ãªãŠãŒã¹ã±ãŒã¹ (æ¯ç§æ°ãªã¯ãšã¹ãã«å¿çãããã£ãããããããªãã©ã€ã³ããã¥ã¡ã³ãåŠç) ã§ã¯ãCPU-only ã¯å®çšçã§ãã
ã¢ãã³ CPU ã«ã¯ AVX-512 ãŸã㯠NEON/SVE ãã¯ã¿ãŒåœä»€ãããããããªãã¯ã¹æŒç®ãå éããŸããllama.cpp ãš Ollama ãªã©ã®ããŒã«ã¯ããããèªåçã«äœ¿çšããCPU æšè«ããã€ãŒããªå®è£ ããå€§å¹ ã«é«éåããŸãã
æé«ã® CPU-only ã¢ãã« 2026
以äžã®è¡šã¯ãCPU-only ã¢ãŒãæèŒã® Intel i7-12700 (12ã³ã¢ãAVX-512) äžã®ããã©ãŒãã³ã¹ã§ã¢ãã«ãã©ã³ã¯ä»ãããŸã:
| ã¢ãã« | ãã©ã¡ãŒã¿ | GGUF ãµã€ãº | RAM èŠä»¶ | CPU é床 | æé©ãªçšé |
|---|---|---|---|---|---|
| Phi-4 Mini | 3.8B | ~2.3 GB | 4 GB | 12 ããŒã¯ã³/ç§ | äžè¬çãªãã£ãããã³ãŒãæ¯æŽ |
| Gemma 3 2B | 2B | ~1.5 GB | 3 GB | 15 ããŒã¯ã³/ç§ | é«éå¿çãäœ VRAM |
| Llama 3.2 3B | 3B | ~2 GB | 3.5 GB | 10 ããŒã¯ã³/ç§ | ãã©ã³ã¹ã®åããå質/é床 |
| Mistral 7B Q4 | 7B | ~4.5 GB | 6 GB | 5 ããŒã¯ã³/ç§ | ããé«ãå質ã16+ GB RAM |
| Llama 3.1 8B Q4 | 8B | ~5 GB | 7 GB | 4 ããŒã¯ã³/ç§ | ã³ãŒãã£ã³ã°ãããžãã¯ã¿ã¹ã¯ |
é床: CPU vs GPU
é床ã¯ããŒããŠã§ã¢ã«ãã£ãŠç°ãªããŸãããããã®ãã³ãããŒã¯ã¯ Ollama ãŸã㯠llama.cpp ãå®è¡ããæšæº 2026 ããŒããŠã§ã¢äžã®ãã®ã§ã:
| ããŒããŠã§ã¢ | ã¢ãã« | é床 | 泚é |
|---|---|---|---|
| Intel i7-12700 (CPU) | Phi-4 Mini 3.8B | 12 ããŒã¯ã³/ç§ | AVX-512 æå¹ |
| AMD Ryzen 7 5700X (CPU) | Phi-4 Mini 3.8B | 9 ããŒã¯ã³/ç§ | å€ã AVX2 ã®ã¿ |
| Apple M3 (CPU) | Phi-4 Mini 3.8B | 14 ããŒã¯ã³/ç§ | ãŠããã¡ã€ãã¡ã¢ãªã®å©ç¹ |
| RTX 3060 (GPUã12 GB) | Phi-4 Mini 3.8B | 80 ããŒã¯ã³/ç§ | GPU 㯠6.7à é«é |
| RTX 4090 (GPUã24 GB) | Llama 3.1 8B Q4 | 120 ããŒã¯ã³/ç§ | GPU 㯠CPU ãã 30à é«é |
ã¢ãã«å¥ RAM èŠä»¶
çµéšå: GGUF ãµã€ãº + 500 MB ãªãŒããŒããã = å¿ èŠæå°é RAMã 2 GB GGUF ã¢ãã«ã¯ 2.5â3 GB ã®ç¡æã·ã¹ãã RAM ãå¿ èŠã§ã:
| ã¢ãã« | GGUF ãµã€ãº | æå° RAM | å¿«é© | ã³ã³ããã¹ãé· |
|---|---|---|---|---|
| Gemma 3 2B | ~1.5 GB | 2â2.5 GB | 4 GB | 8K |
| Phi-4 Mini 3.8B | ~2.3 GB | 3 GB | 6 GB | 4K |
| Llama 3.2 3B | ~2 GB | 2.5â3 GB | 6 GB | 8K |
| Mistral 7B Q4 | ~4.5 GB | 5 GB | 8 GB | 32K |
| Llama 3.1 8B Q4 | ~5 GB | 6 GB | 12 GB | 128K |
CPU-only ã¢ãŒãã®å®è¡æ¹æ³
Ollama (æãç°¡å): åã« `ollama run phi:mini` ãå®è¡ããŸããOllama 㯠NVIDIA/AMD GPU ã®ãªãã·ã¹ãã ã§ CPU-only ãèªåæ€åºããã·ã¹ãã RAM ã䜿çšããŸããLM Studio: èšå®ãéã â GPU ã®ããªãããéžæã㊠CPU ã¢ãŒãã匷å¶ããŸããLlama.cpp: ãã©ã° `--n-gpu-layers 0` ã䜿çšã㊠GPU ãªãããŒããç¡å¹ã«ããŸãã
ollama run phi:mini
# Ollama 㯠CPU-only ã·ã¹ãã ãèªåæ€åºããŸãCPU æšè«ã®æé©åã®ãã³ã
CPU æšè«ããæå€§ããã©ãŒãã³ã¹ãåŒãåºããŸã:
- Q4_K_M éååãäœ¿çš â GGUF ãµã€ãºã ~70% åæžãæå°å質æå€±ããã£ãã·ã¥åäœã®åäžã«ãã 10â20% é床åäžã
- ã³ã³ããã¹ããŠã£ã³ããŠãåæž â ããé·ãã³ã³ããã¹ã = ããé ãæšè«ã`--context 2048` ã䜿çšããŠã³ã³ããã¹ãã 2K ããŒã¯ã³ã«å¶éããŸãã
- ãã«ãã¹ã¬ãããæå¹å â Ollama ãš llama.cpp 㯠CPU ã³ã¢æ°ãèªåæ€åºããŸãã`nproc` ã§äžèŽã確èªããŸãã
- AVX-512 ãŸã㯠ARM NEON ãäœ¿çš â ã¢ãã³ Intel/AMD/ARM CPU ã«ã¯ãã¯ã¿ãŒåœä»€ããããŸããCPU ãã©ã°ã確èª: `cat /proc/cpuinfo | grep avx512` (Linux) ãŸã㯠Apple æ å ± â ã·ã¹ãã ã¬ããŒã (Mac)ã
- ããããµã€ãº = 1 â CPU ã¯ã·ã³ã°ã«ã·ãŒã±ã³ã¹æšè«ãæé©ã«åŠçããŸããCPU ã§ãã«ããããã詊ã¿ãªãã§ãã ããã
- ã¹ã¬ãããã³ã¢ã«åºå® â Linux ã§ã¯ `numactl --cpunodebind=0 ollama run phi:mini` ã䜿çšããŠã³ã¢åãæ¿ããªãŒããŒããããåé¿ããŸãã
CPU vs GPU ã䜿çšããå Žå
| ãŠãŒã¹ã±ãŒã¹ | CPU | GPU |
|---|---|---|
| ãªã¢ã«ã¿ã€ã ãã£ãã (1ç§æªæºã¬ã€ãã³ã·) | â é ããã (12 ããŒã¯ã³/ç§ = 60 ããŒã¯ã³ã§ 5 ç§) | â 80+ ããŒã¯ã³/ç§ |
| ãããåŠç (ããã¥ã¡ã³ãããã°) | â è¯å¥œ (é床ã¯åé¡ãªã) | â ïž ãªãŒããŒãã« |
| æ¬çª API (ã³ã¹ãå¶çŽ) | â $0 ããŒããŠã§ã¢ã³ã¹ã | â ïž $200+ GPU + é»å |
| ãšããžããã€ã¹ (Raspberry Pi) | â ä»£æ¿æ¡ãªã | â GPU ãªãã·ã§ã³éå® |
| éçº / ããŒã«ã«ãã¹ã | â äœæ¶è²»é»åãéã㪠| â ïž ãªãŒããŒãã« |
| LLM ãã¡ã€ã³ãã¥ãŒãã³ã° | â é ããã (æé â æ¥æ°) | â 10â30à é«éå |
FAQ
CPU-only æšè«ã¯ GPU ãšæ¯ã¹ãŠäœåé ãã§ããïŒ
CPU: ã¢ãã³ããã»ããµäžã§ 8â15 ããŒã¯ã³/ç§ãGPU (RTX 3060): 80 ããŒã¯ã³/ç§ãGPU (RTX 4090): 120+ ããŒã¯ã³/ç§ãCPU 㯠10â30à é ãã§ãã $0 GPU æè³ãå¿ èŠã§ãã
CPU äžã§äžè²«æ§ã®ããåºåãçæããæå°ã¢ãã«ã¯äœã§ããïŒ
Gemma 3 2B (1.5 GB) ã¯åççãªå¿çãçæããŸãããã以äžã§ã¯å質ãäœäžããŸãã8 GB RAM ã§ã®æé«å質ã«ã¯ Phi-4 Mini (3.8B) ãŸã㯠Llama 3.2 3B (2 GB) ã䜿çšããŠãã ããã
13B ã¢ãã«ã CPU äžã§å®è¡ã§ããŸããïŒ
ã¯ããQ4_K_M éååã§ 13B ã¢ãã«ã¯ ~6.5 GB ã§ãã8â12 GB ã·ã¹ãã RAM ãå¿ èŠã§ããé床: ~2â3 ããŒã¯ã³/ç§ãã€ã³ã¿ã©ã¯ãã£ã䜿çšã«ã¯äžå¿«ã§ãããããåŠçã§æ©èœããŸãã
CPU æšè«ã¯ GPU ããŸã£ãã䜿çšããŸããïŒ
ããããOllama/llama.cpp ã® CPU-only ã¢ãŒã㯠GPU 䜿çšãæç€ºçã«ç¡å¹ã«ããã·ã¹ãã RAM ã®ã¿ã䜿çšããŸãã
CPU-only æšè«ã¯å®å®ããŠããŸããïŒ
ã¯ããGPU ããå®å®ããŠããŸãããã©ã€ããŒã¯ã©ãã·ã¥ãªããGPU ã¡ã¢ãªãšã©ãŒãªããå¯äžã®ãªã¹ã¯ã¯ã·ã¹ãã RAM 飜åã§ãã¢ãã«éžæã«ããå¶åŸ¡ããŸãã
Apple Silicon CPU ã®èšå®ã調æŽããå¿ èŠããããŸããïŒ
ããããOllama 㯠M1/M2/M3/M4 ãèªåæ€åºãããŠããã¡ã€ãã¡ã¢ãªãå¹ççã«äœ¿çšããŸããApple Silicon 㯠ã¡ã¢ãªã¢ãŒããã¯ãã£ã«ããåç Intel CPU ãã ~10â20% é«éã§ãã
CPU-only LLM äœ¿çšæã« METI ã¬ã€ãã³ã¹ã«æºæ ããå¿ èŠããããŸããïŒ
ãšã³ã¿ãŒãã©ã€ãºãããã€ã®å ŽåãMETI 2024 AI ã¬ããã³ã¹ãåç §ããŠãã ãããããŒã«ã« CPU æšè«ã¯ããŒã¿ç®¡çã«å¯Ÿããããé«åºŠãªå¶åŸ¡ãæäŸããäŒæ¥ããªã·ãŒã«é©åãããããªããŸãã
10 GB ã®å€ãããŒãããœã³ã³ã§ã CPU-only æšè«ã¯å®çšçã§ããïŒ
ã¯ããGemma 3 2B (1.5 GB) ãŸã㯠Phi-4 Mini (2.3 GB) 㯠10 GB RAM ã§å¹ççã«å®è¡ã§ããŸãã3â5 ããŒã¯ã³/ç§ã®ãããåŠçã軜éãã£ãããããã«æé©ã§ãã
è€æ°ã®ã¢ãã«ãåæã« CPU ã§å®è¡ã§ããŸããïŒ
RAM ãèš±å¯ãããŠããã°æè¡çã«ã¯å¯èœã§ãããéçŸå®çã§ããè€æ°ã¢ãã«ã¯ã¡ã¢ãªç«¶åãåŒãèµ·ãããã©ã¡ããäœéã«ãªããŸããäžåºŠã« 1 ã¢ãã«ã䜿çšããããšããå§ãããŸãã
CPU æšè«ã®å®è£ ã§ã®ã»ãã¥ãªãã£ãªã¹ã¯ã¯äœã§ããïŒ
CPU-only 㯠GPU ããå®å šã§ããã¯ã©ãŠã転éãªã = ããŒã¿ã¯ããŒã«ã«ã«çãŸããŸãããã ãç©ççãã·ã³ã»ãã¥ãªãã£ãš OS ã¢ããããŒãããã£ããããã»ã³ã·ãã£ãããŒã¿ãæªæå·åã§æ®ãå¯èœæ§ããããŸãã
llama.cpp vs Ollama ã§ CPU æšè«é床ã«éãã¯ãããŸããïŒ
ãããã§ããäž¡è ãšãåãã³ã¢ CPU æé©å (AVX-512) ã䜿çšããŸããããããªå·®ç°ã¯ã¹ã¬ãã管çã®å®è£ ã«ãã (~2â5%)ãããã©ã«ãã® Ollama ãã詊ããã ããã