Skip to main content
PromptQuorumPromptQuorum
Home/Local LLMs/Ollama ์„ค์น˜: macOS, Windows & Linux 2๋ถ„ ์„ค์น˜ ๊ฐ€์ด๋“œ
์‹œ์ž‘ํ•˜๊ธฐ

Ollama ์„ค์น˜: macOS, Windows & Linux 2๋ถ„ ์„ค์น˜ ๊ฐ€์ด๋“œ

ยท8๋ถ„ ์ฝ๊ธฐยทBy Hans Kuepper ยท Founder of PromptQuorum, multi-model AI dispatch tool ยท PromptQuorum

Ollama๋Š” macOS, Windows, Linux์—์„œ 2๋ถ„ ์ด๋‚ด์— ์„ค์น˜ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์„ค์น˜ ํ›„ ๋ช…๋ น์–ด ํ•˜๋‚˜๋กœ Ollama ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์˜ ๋ชจ๋“  ๋ชจ๋ธ์„ ๋‹ค์šด๋กœ๋“œํ•˜๊ณ  ์‹คํ–‰ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค -- Python ํ™˜๊ฒฝ, ์„ค์ • ํŒŒ์ผ, ์‹œ์ž‘์„ ์œ„ํ•œ GPU๊ฐ€ ํ•„์š”ํ•˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค.

Ollama๋Š” macOS, Windows, Linux์—์„œ 2๋ถ„ ์ด๋‚ด์— ์„ค์น˜ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์„ค์น˜ ํ›„ ๋ช…๋ น์–ด ํ•˜๋‚˜๋กœ Ollama ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์˜ ๋ชจ๋“  ๋ชจ๋ธ์„ ๋‹ค์šด๋กœ๋“œํ•˜๊ณ  ์‹คํ–‰ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค -- Python ํ™˜๊ฒฝ, ์„ค์ • ํŒŒ์ผ, ์‹œ์ž‘์„ ์œ„ํ•œ GPU๊ฐ€ ํ•„์š”ํ•˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค. 2026๋…„ 4์›” ๊ธฐ์ค€์œผ๋กœ Ollama๋Š” Meta Llama 3.3, Qwen3, Mistral์„ ํฌํ•จํ•œ 200๊ฐœ ์ด์ƒ์˜ ๋ชจ๋ธ์„ ์ง€์›ํ•ฉ๋‹ˆ๋‹ค.

Key Takeaways

  • macOS: ollama.com์—์„œ .dmg๋ฅผ ๋‹ค์šด๋กœ๋“œํ•˜๊ฑฐ๋‚˜ `brew install ollama`๋ฅผ ์‹คํ–‰ํ•œ ํ›„ -- `ollama run llama3.2`๋กœ ๋Œ€ํ™”๋ฅผ ์‹œ์ž‘ํ•˜์‹ญ์‹œ์˜ค.
  • Windows: ollama.com/download์—์„œ ์„ค์น˜ ํ”„๋กœ๊ทธ๋žจ์„ ๋‹ค์šด๋กœ๋“œํ•˜์‹ญ์‹œ์˜ค. Ollama๋Š” ์‹œ์Šคํ…œ ํŠธ๋ ˆ์ด์—์„œ ๋ฐฑ๊ทธ๋ผ์šด๋“œ ์„œ๋น„์Šค๋กœ ์‹คํ–‰๋ฉ๋‹ˆ๋‹ค.
  • Linux: ๋ช…๋ น์–ด ํ•˜๋‚˜๋กœ ๋ชจ๋“  ๊ฒƒ์„ ์„ค์น˜ํ•ฉ๋‹ˆ๋‹ค -- `curl -fsSL https://ollama.com/install.sh | sh`.
  • ์ตœ์†Œ ์š”๊ตฌ ์‚ฌํ•ญ: 3B ๋ชจ๋ธ์—๋Š” 4 GB RAM, 7B ๋ชจ๋ธ์—๋Š” 8 GB RAM. ์‹œ์ž‘ํ•˜๋Š” ๋ฐ GPU๋Š” ํ•„์š”ํ•˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค.
  • Ollama๋Š” `http://localhost:11434`์—์„œ OpenAI ํ˜ธํ™˜ REST API๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค -- OpenAI SDK ์•ฑ์ด๋ผ๋ฉด ์ฝ”๋“œ ๋ณ€๊ฒฝ ์—†์ด ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  • ๐Ÿ‘‰ ์„ค์น˜ ์ „์— ๋กœ์ปฌ ์‹คํ–‰์ด ๊ท€ํ•˜์˜ ์‚ฌ์šฉ ์‚ฌ๋ก€์— ์ ํ•ฉํ•œ์ง€ ํ™•์ธํ•˜์‹ญ์‹œ์˜ค โ€” ํด๋ผ์šฐ๋“œ๊ฐ€ ๋กœ์ปฌ ์ถ”๋ก ๋ณด๋‹ค ๋‚˜์€ ๊ฒฝ์šฐ๋Š” ๋กœ์ปฌ LLM vs ํด๋ผ์šฐ๋“œ API๋ฅผ ์ฐธ์กฐํ•˜์‹ญ์‹œ์˜ค.

์„ค์น˜ ์ „: ๋กœ์ปฌ LLM์ด ๊ท€ํ•˜์˜ ์‚ฌ์šฉ ์‚ฌ๋ก€์— ์ ํ•ฉํ•ฉ๋‹ˆ๊นŒ?

Ollama ์„ค์น˜๋Š” 5๋ถ„์ด ๊ฑธ๋ฆฌ์ง€๋งŒ, GPU ๊ฐ์ง€ ๋ฌธ์ œ, ๋“œ๋ผ์ด๋ฒ„ ๋ถˆ์ผ์น˜ ๋˜๋Š” RAM ์ œ์•ฝ์ด ๋ฐœ์ƒํ•˜๋ฉด ์ฒซ ๋ฒˆ์งธ ๋ชจ๋ธ์„ ์ œ๋Œ€๋กœ ์‹คํ–‰ํ•˜๋Š” ๋ฐ 20~40๋ถ„์ด ๊ฑธ๋ฆด ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

๋กœ์ปฌ ์ถ”๋ก ์ด ์˜ฌ๋ฐ”๋ฅธ ์„ ํƒ์ธ์ง€ ํ™•์‹ ํ•˜์ง€ ๋ชปํ•˜๋Š” ๊ฒฝ์šฐ, **๋กœ์ปฌ vs ํด๋ผ์šฐ๋“œ์˜ ์ „์ฒด ํŠธ๋ ˆ์ด๋“œ์˜คํ”„๋ฅผ ๋จผ์ € ๋น„๊ตํ•˜์‹ญ์‹œ์˜ค** โ€” ํด๋ผ์šฐ๋“œ API(5๋ถ„์ด๋ฉด ์ค€๋น„ ์™„๋ฃŒ, ๋ฌธ์ œ ํ•ด๊ฒฐ ๋ถˆํ•„์š”)๋กœ ์‹œ์ž‘ํ•˜๋Š” ๊ฒƒ์ด ๋” ํ˜„๋ช…ํ•œ ๋ฐฉ๋ฒ•์ž„์„ ์•Œ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ๋งŽ์€ ์‚ฌ์šฉ์ž๊ฐ€ ์„ค์น˜ ํ›„์— ์ด๋ฅผ ๋ฐœ๊ฒฌํ•ฉ๋‹ˆ๋‹ค. ์ง€๊ธˆ ๊ฒฐ์ •ํ•˜๋Š” ๊ฒƒ์ด ์ข‹์Šต๋‹ˆ๋‹ค.

๋กœ์ปฌ์„ ์„ ํƒํ•œ ์‚ฌ์šฉ์ž๋Š” ์•„๋ž˜๋ฅผ ๊ณ„์† ์ฝ์œผ์‹ญ์‹œ์˜ค. ๋จผ์ € ํด๋ผ์šฐ๋“œ๋ฅผ ํ‰๊ฐ€ํ•˜๋ ค๋Š” ์‚ฌ์šฉ์ž๋Š” ์ „์ฒด ๋น„๊ต๋ฅผ ์ฐธ์กฐํ•˜์‹ญ์‹œ์˜ค.

Ollama๋ž€ ๋ฌด์—‡์ด๋ฉฐ ์™œ ์‚ฌ์šฉํ•ฉ๋‹ˆ๊นŒ?

Ollama๋Š” ๋Œ€ํ˜• ์–ธ์–ด ๋ชจ๋ธ์„ ๋กœ์ปฌ์—์„œ ์‹คํ–‰ํ•˜๋Š” ์˜คํ”ˆ ์†Œ์Šค ์ถ”๋ก  ์—”์ง„์ž…๋‹ˆ๋‹ค. ๋ชจ๋ธ ๊ด€๋ฆฌ, llama.cpp ์ถ”๋ก  ๋ฐฑ์—”๋“œ, OpenAI ํ˜ธํ™˜ REST API๋ฅผ ๋‹จ์ผ ๊ฒฝ๋Ÿ‰ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์œผ๋กœ ํŒจํ‚ค์ง•ํ•ฉ๋‹ˆ๋‹ค. Python, conda ํ™˜๊ฒฝ, CUDA ์„ค์ •์ด ํ•„์š”ํ•˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค.

Ollama๋Š” Meta Llama 3.3, Microsoft Phi-3, Google Gemma 2, Mistral, Qwen3 ๋ฐ 100๊ฐœ ์ด์ƒ์˜ ๋‹ค๋ฅธ ๋ชจ๋ธ์„ ์›ํด๋ฆญ์œผ๋กœ ๋‹ค์šด๋กœ๋“œํ•  ์ˆ˜ ์žˆ๋Š” ํ๋ ˆ์ด์…˜๋œ ๋ชจ๋ธ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ(ollama.com/library)๋ฅผ ์œ ์ง€ ๊ด€๋ฆฌํ•ฉ๋‹ˆ๋‹ค. ๋ชจ๋ธ์€ ํ•œ ๋ฒˆ ๋‹ค์šด๋กœ๋“œ๋˜์–ด ๋””์Šคํฌ์— ์บ์‹œ๋ฉ๋‹ˆ๋‹ค -- ์ดํ›„ ์‹คํ–‰์€ 5์ดˆ ์ด๋‚ด์— ์‹œ์ž‘๋ฉ๋‹ˆ๋‹ค.

Ollama์˜ ๋Œ€์•ˆ์€ ๋กœ์ปฌ LLM ์›ํด๋ฆญ ์„ค์น˜ ํ”„๋กœ๊ทธ๋žจ์„ ์ฐธ์กฐํ•˜์‹ญ์‹œ์˜ค. Ollama์™€ LM Studio์˜ ๋น„๊ต๋Š” LM Studio ์„ค์น˜ ๋ฐฉ๋ฒ•์„ ์ฐธ์กฐํ•˜์‹ญ์‹œ์˜ค.

macOS์—์„œ Ollama๋ฅผ ์–ด๋–ป๊ฒŒ ์„ค์น˜ํ•ฉ๋‹ˆ๊นŒ?

๋‘ ๊ฐ€์ง€ ๋ฐฉ๋ฒ•์ด ์žˆ์Šต๋‹ˆ๋‹ค. ์„ค์น˜ ํ”„๋กœ๊ทธ๋žจ ๋‹ค์šด๋กœ๋“œ๊ฐ€ ๋” ๋น ๋ฅด๋ฉฐ, Homebrew๋Š” brew๋กœ ์†Œํ”„ํŠธ์›จ์–ด๋ฅผ ๊ด€๋ฆฌํ•˜๋Š” ๊ฒฝ์šฐ์— ์ ํ•ฉํ•ฉ๋‹ˆ๋‹ค.

  1. 1
    ollama.com/download๋กœ ์ด๋™ํ•˜์—ฌ "Download for macOS"๋ฅผ ํด๋ฆญํ•˜์‹ญ์‹œ์˜ค.
  2. 2
    ๋‹ค์šด๋กœ๋“œํ•œ Ollama.dmg ํŒŒ์ผ์„ ์—ด๊ณ  Ollama๋ฅผ ์‘์šฉ ํ”„๋กœ๊ทธ๋žจ ํด๋”๋กœ ๋“œ๋ž˜๊ทธํ•˜์‹ญ์‹œ์˜ค.
  3. 3
    ์‘์šฉ ํ”„๋กœ๊ทธ๋žจ์—์„œ Ollama๋ฅผ ์‹คํ–‰ํ•˜์‹ญ์‹œ์˜ค. ๋ฉ”๋‰ด ๋ฐ”์— ๋ผ๋งˆ ์•„์ด์ฝ˜์ด ํ‘œ์‹œ๋ฉ๋‹ˆ๋‹ค -- Ollama๊ฐ€ ์ด์ œ ๋ฐฑ๊ทธ๋ผ์šด๋“œ ์„œ๋น„์Šค๋กœ ์‹คํ–‰ ์ค‘์ž…๋‹ˆ๋‹ค.
  4. 4
    ํ„ฐ๋ฏธ๋„์„ ์—ด๊ณ  ์ฒซ ๋ฒˆ์งธ ๋ชจ๋ธ์„ ์‹คํ–‰ํ•˜์‹ญ์‹œ์˜ค: `ollama run llama3.2`
  5. 5
    ๋ชจ๋ธ์ด ๋‹ค์šด๋กœ๋“œ๋ฉ๋‹ˆ๋‹ค(llama3.2:3b์˜ ๊ฒฝ์šฐ ์•ฝ 2 GB). ์ฑ„ํŒ… ํ”„๋กฌํ”„ํŠธ๊ฐ€ ํ‘œ์‹œ๋ฉ๋‹ˆ๋‹ค. ๋ฉ”์‹œ์ง€๋ฅผ ์ž…๋ ฅํ•˜๊ณ  Enter๋ฅผ ๋ˆ„๋ฅด์‹ญ์‹œ์˜ค.

Homebrew๋กœ macOS์— Ollama ์„ค์น˜

bash
brew install ollama

# Start the Ollama service
ollama serve &

# Pull and run a model
ollama run llama3.2

Windows์—์„œ Ollama๋ฅผ ์–ด๋–ป๊ฒŒ ์„ค์น˜ํ•ฉ๋‹ˆ๊นŒ?

  1. 1
    ollama.com/download๋กœ ์ด๋™ํ•˜์—ฌ "Download for Windows"๋ฅผ ํด๋ฆญํ•˜์‹ญ์‹œ์˜ค.
  2. 2
    ๋‹ค์šด๋กœ๋“œํ•œ OllamaSetup.exe ์„ค์น˜ ํ”„๋กœ๊ทธ๋žจ์„ ์‹คํ–‰ํ•˜์‹ญ์‹œ์˜ค. Ollama๋Š” %LOCALAPPDATA%\Programs\Ollama์— ์„ค์น˜๋ฉ๋‹ˆ๋‹ค.
  3. 3
    Ollama๊ฐ€ ์ž๋™์œผ๋กœ ์‹œ์ž‘๋˜์–ด ์‹œ์Šคํ…œ ํŠธ๋ ˆ์ด ์•„์ด์ฝ˜์œผ๋กœ ํ‘œ์‹œ๋ฉ๋‹ˆ๋‹ค.
  4. 4
    PowerShell ๋˜๋Š” ๋ช…๋ น ํ”„๋กฌํ”„ํŠธ๋ฅผ ์—ด๊ณ  ์‹คํ–‰ํ•˜์‹ญ์‹œ์˜ค: `ollama run llama3.2`
  5. 5
    ์ฒซ ๋ฒˆ์งธ ์‹คํ–‰ ์‹œ ๋ชจ๋ธ์ด ๋‹ค์šด๋กœ๋“œ๋ฉ๋‹ˆ๋‹ค. ์ดํ›„ ์‹คํ–‰์€ ์บ์‹œ๋œ ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.

Windows์—์„œ์˜ GPU ์ง€์›

Windows์˜ Ollama๋Š” NVIDIA GPU(CUDA 11.3+)์™€ AMD GPU(ROCm 6+)๋ฅผ ์ž๋™์œผ๋กœ ๊ฐ์ง€ํ•˜์—ฌ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. NVIDIA RTX ์นด๋“œ๊ฐ€ ์žˆ๋Š” ๊ฒฝ์šฐ Ollama๊ฐ€ ์ž๋™์œผ๋กœ ๋ชจ๋ธ ๋ ˆ์ด์–ด๋ฅผ VRAM์— ์˜คํ”„๋กœ๋“œํ•ฉ๋‹ˆ๋‹ค -- ์ˆ˜๋™ ์„ค์ •์ด ํ•„์š”ํ•˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค. GPU๊ฐ€ ์‚ฌ์šฉ๋˜๊ณ  ์žˆ๋Š”์ง€ ํ™•์ธํ•˜๋ ค๋ฉด `ollama run llama3.2`๋ฅผ ์‹คํ–‰ํ•œ ํ›„ ์ž‘์—… ๊ด€๋ฆฌ์ž โ†’ GPU์—์„œ ํ™œ๋™์„ ํ™•์ธํ•˜์‹ญ์‹œ์˜ค.

Linux์—์„œ Ollama๋ฅผ ์–ด๋–ป๊ฒŒ ์„ค์น˜ํ•ฉ๋‹ˆ๊นŒ?

๋‹จ์ผ ๋ช…๋ น์–ด๋กœ ๋ชจ๋“  Linux ๋ฐฐํฌํŒ์— Ollama๋ฅผ ์„ค์น˜ํ•ฉ๋‹ˆ๋‹ค:

bash
curl -fsSL https://ollama.com/install.sh | sh

Linux์—์„œ systemd ์„œ๋น„์Šค๋กœ Ollama ์‹คํ–‰

์„ค์น˜ ์Šคํฌ๋ฆฝํŠธ๊ฐ€ ์ž๋™์œผ๋กœ Ollama๋ฅผ systemd ์„œ๋น„์Šค๋กœ ๋“ฑ๋กํ•ฉ๋‹ˆ๋‹ค. ๊ด€๋ฆฌ ๋ฐฉ๋ฒ•:

bash
# Check service status
systemctl status ollama

# Start / stop / restart
systemctl start ollama
systemctl stop ollama
systemctl restart ollama

# View logs
journalctl -u ollama -f

Ollama์—์„œ ์ฒซ ๋ฒˆ์งธ ๋ชจ๋ธ์„ ์–ด๋–ป๊ฒŒ ๋‹ค์šด๋กœ๋“œํ•˜๊ณ  ์‹คํ–‰ํ•ฉ๋‹ˆ๊นŒ?

Ollama๋ฅผ ์„ค์น˜ํ•œ ํ›„ ์ด ๋ช…๋ น์–ด๋ฅผ ์‹คํ–‰ํ•˜์—ฌ ๋ชจ๋ธ์„ ๋‹ค์šด๋กœ๋“œํ•˜๊ณ  ์‹œ์ž‘ํ•˜์‹ญ์‹œ์˜ค:

bash
# Pull a model (downloads to ~/.ollama/models)
ollama pull llama3.2

# Run it interactively
ollama run llama3.2

# Or pull and run in one step
ollama run llama3.2

์ฒ˜์Œ์— ์–ด๋–ค ๋ชจ๋ธ๋กœ ์‹œ์ž‘ํ•ด์•ผ ํ•ฉ๋‹ˆ๊นŒ?

์ฒซ ๋ฒˆ์งธ ์‹คํ–‰์„ ์œ„ํ•ด ๋‹ค์–‘ํ•œ ํ•˜๋“œ์›จ์–ด ํ”„๋กœํ•„์„ ๋‹ค๋ฃจ๋Š” ์„ธ ๊ฐ€์ง€ ๋ชจ๋ธ์„ ๊ถŒ์žฅํ•ฉ๋‹ˆ๋‹ค:

Model๋‹ค์šด๋กœ๋“œ ํฌ๊ธฐํ•„์š” RAM์ ํ•ฉ ์šฉ๋„
Llama 3.2 3B์•ฝ 2 GB4 GB์ฒซ ํ…Œ์ŠคํŠธ -- ๋ชจ๋“  ๊ธฐ๊ธฐ
Llama 3.3 8B์•ฝ 4.7 GB8 GB๋Œ€๋ถ€๋ถ„์˜ ๋…ธํŠธ๋ถ์—์„œ ์ผ๋ฐ˜ ์‚ฌ์šฉ
phi4-mini์•ฝ 2.3 GB4 GB๋น ๋ฅธ ์‘๋‹ต, ๋‚ฎ์€ RAM

Ollama๊ฐ€ ์ž‘๋™ํ•˜๋Š”์ง€ ์–ด๋–ป๊ฒŒ ํ™•์ธํ•ฉ๋‹ˆ๊นŒ?

REST API๋ฅผ ์ง์ ‘ ํ…Œ์ŠคํŠธํ•˜์—ฌ Ollama๊ฐ€ ์‹คํ–‰ ์ค‘์ด๊ณ  ์ ‘๊ทผ ๊ฐ€๋Šฅํ•œ์ง€ ํ™•์ธํ•˜์‹ญ์‹œ์˜ค:

bash
# Check Ollama is running
curl http://localhost:11434
# Expected: "Ollama is running"

# List downloaded models
ollama list

# Send a prompt via API (OpenAI-compatible)
curl http://localhost:11434/api/generate -d '{
  "model": "llama3.2",
  "prompt": "What is 2+2?",
  "stream": false
}'

์œ ์šฉํ•œ Ollama ๋ช…๋ น์–ด

๋ช…๋ น์–ด๊ธฐ๋Šฅ
ollama list๋‹ค์šด๋กœ๋“œ๋œ ๋ชจ๋“  ๋ชจ๋ธ๊ณผ ํฌ๊ธฐ ํ‘œ์‹œ
ollama pull <model>์‹คํ–‰ํ•˜์ง€ ์•Š๊ณ  ๋ชจ๋ธ ๋‹ค์šด๋กœ๋“œ
ollama rm <model>๋””์Šคํฌ์—์„œ ๋ชจ๋ธ ์‚ญ์ œ
ollama psํ˜„์žฌ ๋ฉ”๋ชจ๋ฆฌ์— ๋กœ๋“œ๋œ ๋ชจ๋ธ ํ‘œ์‹œ
ollama show <model>๋ชจ๋ธ ์„ธ๋ถ€ ์ •๋ณด ํ‘œ์‹œ(ํŒŒ๋ผ๋ฏธํ„ฐ, ํ…œํ”Œ๋ฆฟ, ๋ผ์ด์„ ์Šค)
ollama serveOllama ์„œ๋ฒ„ ์ˆ˜๋™ ์‹œ์ž‘(์„œ๋น„์Šค๋กœ ์‹คํ–‰๋˜์ง€ ์•Š๋Š” ๊ฒฝ์šฐ)

์ผ๋ฐ˜์ ์ธ Ollama ์„ค์น˜ ๋ฌธ์ œ ํ•ด๊ฒฐ

Ollama์—์„œ "could not connect to ollama app, is it running?"์ด๋ผ๊ณ  ํ‘œ์‹œ๋ฉ๋‹ˆ๋‹ค

Ollama๊ฐ€ ๋ฐฑ๊ทธ๋ผ์šด๋“œ ์„œ๋น„์Šค๋กœ ์‹คํ–‰๋˜๊ณ  ์žˆ์ง€ ์•Š์Šต๋‹ˆ๋‹ค. macOS์—์„œ๋Š” ์‘์šฉ ํ”„๋กœ๊ทธ๋žจ์—์„œ Ollama ์•ฑ์„ ์—ฌ์‹ญ์‹œ์˜ค. Linux์—์„œ๋Š” `systemctl start ollama` ๋˜๋Š” ํ„ฐ๋ฏธ๋„์—์„œ `ollama serve`๋ฅผ ์‹คํ–‰ํ•˜์‹ญ์‹œ์˜ค. Windows์—์„œ๋Š” ์‹œ์ž‘ ๋ฉ”๋‰ด์—์„œ Ollama๋ฅผ ์‹คํ–‰ํ•˜์‹ญ์‹œ์˜ค.

๋ชจ๋ธ ๋‹ค์šด๋กœ๋“œ๊ฐ€ ๋งค์šฐ ๋А๋ฆฌ๊ฑฐ๋‚˜ ๋ฉˆ์ถฅ๋‹ˆ๋‹ค

๋ชจ๋ธ ๋‹ค์šด๋กœ๋“œ ํฌ๊ธฐ๊ฐ€ ํฝ๋‹ˆ๋‹ค(2~47 GB). ๋‹ค์šด๋กœ๋“œ๊ฐ€ ๋ฉˆ์ถ”๋ฉด Ctrl+C๋ฅผ ๋ˆ„๋ฅด๊ณ  `ollama pull <model>`์„ ๋‹ค์‹œ ์‹คํ–‰ํ•˜์‹ญ์‹œ์˜ค -- Ollama๊ฐ€ ๋ถ€๋ถ„ ๋‹ค์šด๋กœ๋“œ๋ฅผ ์žฌ๊ฐœํ•ฉ๋‹ˆ๋‹ค. ๋” ๋น ๋ฅธ ๋‹ค์šด๋กœ๋“œ๋ฅผ ์œ„ํ•ด Wi-Fi ๋Œ€์‹  ์œ ์„  ์—ฐ๊ฒฐ์„ ์‚ฌ์šฉํ•˜์‹ญ์‹œ์˜ค.

๋ชจ๋ธ ์‹คํ–‰ ์‹œ "error: model requires more system memory"๊ฐ€ ํ‘œ์‹œ๋ฉ๋‹ˆ๋‹ค

๋ชจ๋ธ์ด ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ RAM๋ณด๋‹ค ํฝ๋‹ˆ๋‹ค. ๋” ์ž‘์€ ์–‘์žํ™”๋ฅผ ์‹œ๋„ํ•˜์‹ญ์‹œ์˜ค: ๊ธฐ๋ณธ Q4_K_M ๋Œ€์‹  `ollama run llama3.2-instruct-q4_0`์„ ์‚ฌ์šฉํ•˜์‹ญ์‹œ์˜ค. ๋˜๋Š” `llama3.2:3b`์™€ ๊ฐ™์€ ๋” ์ž‘์€ ๋ชจ๋ธ๋กœ ์ „ํ™˜ํ•˜์‹ญ์‹œ์˜ค. RAM์— ๋งž๋Š” ๊ถŒ์žฅ ์‚ฌํ•ญ์€ ์ดˆ๋ณด์ž๋ฅผ ์œ„ํ•œ ์ตœ๊ณ ์˜ ๋กœ์ปฌ LLM ๋ชจ๋ธ์„ ์ฐธ์กฐํ•˜์‹ญ์‹œ์˜ค.

Ollama๊ฐ€ ์‹คํ–‰ ์ค‘์ธ๋ฐ GPU๊ฐ€ ์‚ฌ์šฉ๋˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค

Windows์—์„œ๋Š” NVIDIA ๋“œ๋ผ์ด๋ฒ„ ๋ฒ„์ „์ด 452.39 ์ด์ƒ์ธ์ง€ ํ™•์ธํ•˜์‹ญ์‹œ์˜ค. Linux์—์„œ๋Š” NVIDIA ์ปจํ…Œ์ด๋„ˆ ํˆดํ‚ท์ด ์„ค์น˜๋˜์–ด ์žˆ๋Š”์ง€ ํ™•์ธํ•˜์‹ญ์‹œ์˜ค(`nvidia-smi`๊ฐ€ GPU ์ •๋ณด๋ฅผ ๋ฐ˜ํ™˜ํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค). Ollama๋Š” VRAM์ด ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•  ๋•Œ ์ž๋™์œผ๋กœ ๋ ˆ์ด์–ด๋ฅผ GPU์— ์˜คํ”„๋กœ๋“œํ•ฉ๋‹ˆ๋‹ค -- ๋ชจ๋ธ์„ ์‹œ์ž‘ํ•œ ํ›„ `ollama ps`๋ฅผ ์‹คํ–‰ํ•˜์—ฌ GPU ์‚ฌ์šฉ๋ฅ ์„ ํ™•์ธํ•˜์‹ญ์‹œ์˜ค.

Ollama ๋ชจ๋ธ ํŒŒ์ผ์€ ์–ด๋””์— ์ €์žฅ๋ฉ๋‹ˆ๊นŒ?

๋ชจ๋ธ์€ macOS์™€ Linux์—์„œ ~/.ollama/models์— ์ €์žฅ๋ฉ๋‹ˆ๋‹ค. Windows์—์„œ ๊ธฐ๋ณธ ๊ฒฝ๋กœ๋Š” C:\Users\<username>\.ollama\models์ž…๋‹ˆ๋‹ค. ์„œ๋น„์Šค ์‹œ์ž‘ ์ „์— OLLAMA_MODELS ํ™˜๊ฒฝ ๋ณ€์ˆ˜๋ฅผ ์„ค์ •ํ•˜์—ฌ ์ €์žฅ ์œ„์น˜๋ฅผ ๋ณ€๊ฒฝํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

Ollama ์„ค์น˜ ํ›„ ๋ฌด์—‡์„ ํ•ด์•ผ ํ•ฉ๋‹ˆ๊นŒ?

Ollama๊ฐ€ ์‹คํ–‰๋˜๋ฉด ๋‹ค์Œ ๋‹จ๊ณ„๋Š” ์ฒซ ๋ฒˆ์งธ ๋กœ์ปฌ LLM ์‹คํ–‰์œผ๋กœ ํ”„๋กฌํ”„ํŒ…, ์ปจํ…์ŠคํŠธ ๊ธธ์ด, ๋กœ์ปฌ ์ถ”๋ก  ์†๋„์—์„œ ๋ฌด์—‡์„ ๊ธฐ๋Œ€ํ•  ์ˆ˜ ์žˆ๋Š”์ง€ ์ดํ•ดํ•˜๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค. ํ•˜๋“œ์›จ์–ด์— ์ ํ•ฉํ•œ ์ตœ๊ณ ์˜ ๋ชจ๋ธ์„ ์„ ํƒํ•˜๋ ค๋ฉด ์ดˆ๋ณด์ž๋ฅผ ์œ„ํ•œ ์ตœ๊ณ ์˜ ๋กœ์ปฌ LLM ๋ชจ๋ธ์„ ์ฐธ์กฐํ•˜์‹ญ์‹œ์˜ค. ํ„ฐ๋ฏธ๋„ ๋Œ€์‹  ๊ทธ๋ž˜ํ”ฝ ์ฑ„ํŒ… ์ธํ„ฐํŽ˜์ด์Šค๋ฅผ ์„ ํ˜ธํ•˜๋Š” ๊ฒฝ์šฐ LM Studio ์„ค์น˜ ๋ฐฉ๋ฒ•์—์„œ ๋ฐ์Šคํฌํ†ฑ ์•ฑ ๋Œ€์•ˆ์„ ๋‹ค๋ฃจ๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

์ถœ์ฒ˜

  • Ollama ๊ณต์‹ ์›น์‚ฌ์ดํŠธ -- ์„ค์น˜ ๋‹ค์šด๋กœ๋“œ ๋ฐ ๊ณต์‹ ๋ฌธ์„œ
  • Ollama GitHub ์ €์žฅ์†Œ -- ์†Œ์Šค ์ฝ”๋“œ, ์ด์Šˆ ๋ฐ ์ปค๋ฎค๋‹ˆํ‹ฐ ํ† ๋ก 
  • Ollama ๋ชจ๋ธ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ -- ๋‹ค์šด๋กœ๋“œ ๋งํฌ๊ฐ€ ์žˆ๋Š” ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ ๋ชจ๋ธ์˜ ํ๋ ˆ์ด์…˜๋œ ์ปฌ๋ ‰์…˜

Ollama ์„ค์น˜ ์‹œ ์ผ๋ฐ˜์ ์ธ ์‹ค์ˆ˜

  • API๊ฐ€ ์‘๋‹ตํ•  ๊ฒƒ์„ ๊ธฐ๋Œ€ํ•˜๊ธฐ ์ „์— Ollama๊ฐ€ ๋ฐฑ๊ทธ๋ผ์šด๋“œ ์„œ๋น„์Šค๋กœ ์‹คํ–‰ ์ค‘์ธ์ง€ ํ™•์ธํ•˜์ง€ ์•Š๋Š” ๊ฒƒ.
  • ๋จผ์ € ๋ฉ”๋ชจ๋ฆฌ ์š”๊ตฌ ์‚ฌํ•ญ์„ ํ™•์ธํ•˜์ง€ ์•Š๊ณ  ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ RAM๋ณด๋‹ค ํฐ ๋ชจ๋ธ์„ ์‹คํ–‰ํ•˜๋ ค๋Š” ๊ฒƒ.
  • GPU ๊ฐ์ง€๋ฅผ ๋ฌด์‹œํ•˜๋Š” ๊ฒƒ -- Ollama๋Š” NVIDIA์™€ AMD๋ฅผ ์ง€์›ํ•˜์ง€๋งŒ ์ตœ์‹  ๋“œ๋ผ์ด๋ฒ„๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.

A Note on Third-Party Facts

This article references third-party AI models, benchmarks, prices, and licenses. The AI landscape changes rapidly. Benchmark scores, license terms, model names, and API prices can shift between the time of writing and the time you read this. Before making deployment or compliance decisions based on this article, verify current figures on each providerโ€™s official source: Hugging Face model cards for licenses and benchmarks, provider websites for API pricing, and EUR-Lex for current GDPR and EU AI Act text. This article reflects publicly available information as of May 2026.

Run PromptQuorum with a local LLM, your own API keys, or both โ€” you pick the backend.

Join the PromptQuorum Waitlist โ†’

โ† Back to Local LLMs

Ollama ์„ค์น˜: macOS, Windows & Linux 2๋ถ„ ์„ค์น˜ ๊ฐ€์ด๋“œ | PromptQuorum