Ollama Command Guide: Every Command Explained (2026)

11 min read · By Hans Kuepper, Founder of PromptQuorum, a multi-model AI dispatch tool

Ollama is a command-line tool, and knowing its commands lets you get far more out of it. This guide covers the core commands `ollama pull`, `ollama run`, `ollama list`, `ollama rm`, and `ollama serve`, along with advanced options such as model quantization and custom Modelfiles. As of April 2026, these commands cover 95% of real-world use cases.

Key Takeaways

  • `ollama pull <model>` -- downloads a model (e.g. `ollama pull llama3.2:3b`).
  • `ollama run <model>` -- starts a chat with a model.
  • `ollama list` -- shows all downloaded models and their sizes.
  • `ollama rm <model>` -- deletes a downloaded model.
  • `ollama serve` -- starts the Ollama API server (runs automatically on Mac/Windows).
  • `ollama create <name> -f <modelfile>` -- builds a custom model from a Modelfile.
  • As of April 2026, these commands are stable and cover all common use cases.

What Are the Essential Ollama Commands?

  • `ollama list` -- shows downloaded models, their disk usage, and when they were last modified.
  • `ollama pull <model>` -- downloads a model by name (e.g. `ollama pull mistral`).
  • `ollama run <model>` -- starts a chat session with a model, pulling it first if it is not yet downloaded.
  • `ollama rm <model>` -- deletes a model and frees its disk space.
  • `ollama serve` -- starts the REST API server (usually started automatically).
  • `ollama help` -- shows all available commands.

How Do You Manage Models in Ollama?

Model management in Ollama is entirely command-based:

bash
# List all downloaded models
ollama list

# Download a model from the Ollama library
ollama pull llama3.2:3b                 # default 4-bit quantization (~2 GB)
ollama pull llama3.2:3b-instruct-fp16   # unquantized FP16 (~6.5 GB)

# Download a specific quantization (exact tag names are listed on each model's library page)
ollama pull qwen2.5:7b-instruct-q4_K_M  # 4-bit quantization
ollama pull qwen2.5:7b-instruct-q8_0    # 8-bit quantization

# Check disk usage
du -sh ~/.ollama/models

# Delete a model
ollama rm llama3.2:3b

# Pull from a custom registry (advanced)
ollama pull localhost:5000/custom-model
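For scripted cleanup, the output of `ollama list` can be filtered with standard tools. The sketch below is one possible helper, not part of Ollama: `models_over_gb` is a hypothetical function name, and it assumes the listing has the columns NAME, ID, SIZE (a number followed by a unit), and MODIFIED.

```shell
# Print the names of models whose SIZE column is at least MIN_GB gigabytes.
# Reads `ollama list`-style text on stdin. Assumed column layout:
#   NAME  ID  SIZE(number unit)  MODIFIED
models_over_gb() {
  min_gb="$1"
  awk -v min="$min_gb" '
    NR == 1 { next }                          # skip the header row
    {
      size = $3; unit = $4
      if (unit == "MB") size = size / 1024    # normalise MB to GB
      if (size + 0 >= min + 0) print $1       # print the model name
    }'
}

# Usage (needs a working Ollama install):
#   ollama list | models_over_gb 4
```

Feeding the names it prints to `ollama rm` is left as a deliberate manual step, so nothing is deleted by accident.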

๋ชจ๋ธ์„ ์–ด๋–ป๊ฒŒ ์‹คํ–‰ํ•˜๊ณ  ์„œ๋น™ํ•ฉ๋‹ˆ๊นŒ?

Ollama๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์€ ๋‘ ๊ฐ€์ง€์ž…๋‹ˆ๋‹ค:

bash
# 1. ๋Œ€ํ™”ํ˜• ์ฑ„ํŒ… (CLI)
ollama run llama3.2:3b
# ํ”„๋กฌํ”„ํŠธ๋ฅผ ์ž…๋ ฅํ•˜๊ณ  Enter๋ฅผ ๋ˆ„๋ฅด์„ธ์š”

# 2. API ์„œ๋ฒ„ ์‹œ์ž‘ (๋ฐฑ๊ทธ๋ผ์šด๋“œ์—์„œ ์‹คํ–‰)
ollama serve
# API๋Š” http://localhost:11434/v1 ์—์„œ ์ˆ˜์‹  ๋Œ€๊ธฐ

# 3. ๋‹ค๋ฅธ ํ„ฐ๋ฏธ๋„์—์„œ API๋กœ ๋ชจ๋ธ ์‚ฌ์šฉ
curl http://localhost:11434/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama3.2:3b",
    "messages": [{"role": "user", "content": "Hello"}]
  }'
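When the prompt comes from a variable rather than a literal, quotes and backslashes inside it would break a hand-written JSON body like the one above. The following is a minimal sketch of one way to build the body in plain shell. `json_escape` and `chat_payload` are hypothetical helper names, and the escaping covers only double quotes and backslashes, so it assumes the prompt contains no newlines or control characters.

```shell
# Escape backslashes and double quotes so a string can sit inside JSON quotes.
# Limitation: newlines and control characters are NOT handled.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# Build a chat-completions request body for a model name and a user prompt.
chat_payload() {
  printf '{"model":"%s","messages":[{"role":"user","content":"%s"}]}\n' \
    "$(json_escape "$1")" "$(json_escape "$2")"
}

# Usage (needs a running Ollama server):
#   prompt='Explain what "quantization" means'
#   curl -s http://localhost:11434/v1/chat/completions \
#     -H "Content-Type: application/json" \
#     -d "$(chat_payload llama3.2:3b "$prompt")"
```

For arbitrary input, a dedicated JSON tool is the safer choice; this sketch only keeps simple one-line prompts from producing invalid JSON.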

How Do You Create a Custom Model with a Modelfile?

A Modelfile is a configuration file (similar to a Dockerfile) that defines a custom model: it starts from a base model and layers a system prompt, parameters, and optionally adapter weights on top.

bash
# --- Contents of a file named Modelfile ---
FROM llama3.2:3b

# Add a system prompt
SYSTEM """
You are a helpful expert in machine learning.
Always explain complex concepts in simple terms.
"""

# Adjust parameters
PARAMETER temperature 0.7
PARAMETER top_p 0.9

# --- Shell commands, run in the directory that contains Modelfile ---
# Build the custom model
ollama create ml-expert -f Modelfile

# Use it
ollama run ml-expert
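If several custom models differ only in base model, system prompt, or temperature, the Modelfile can be generated from a script instead of edited by hand. This is a sketch under that assumption: `write_modelfile` is a hypothetical helper, and it emits only the `FROM`, `SYSTEM`, and `PARAMETER temperature` instructions used in the example above.

```shell
# Print a Modelfile to stdout.
#   $1 = base model, $2 = system prompt, $3 = temperature
write_modelfile() {
  base="$1"; system_prompt="$2"; temperature="$3"
  cat <<EOF
FROM $base
SYSTEM """
$system_prompt
"""
PARAMETER temperature $temperature
EOF
}

# Usage (the last two commands need a working Ollama install):
#   write_modelfile llama3.2:3b "You are a helpful expert in machine learning." 0.7 > Modelfile
#   ollama create ml-expert -f Modelfile
#   ollama run ml-expert
```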

Which Quantization Options Does Ollama Support?

Quantization stores weights as lower-precision numbers, which shrinks both the model file and the VRAM it needs. Ollama runs models in the GGUF format, which supports several quantization levels:

| Quantization | Size (7B) | VRAM | Quality | Speed |
|---|---|---|---|---|
| FP16 (full precision) | 14 GB | 16 GB | Best | Slowest |
| Q8_0 (8-bit) | 7 GB | 8 GB | Excellent | Fast |
| Q6_K (6-bit) | 5.5 GB | 6 GB | Very good | Fast |
| Q5_K_M (5-bit) | 5 GB | 5.5 GB | Good | Very fast |
| Q4_K_M (4-bit) | 4.7 GB | 5 GB | Good | Very fast |
| Q3_K_M (3-bit) | 3.3 GB | 4 GB | Fair | Fastest |
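The sizes in the table follow from simple arithmetic: parameters times bits per weight, divided by 8 bits per byte. The sketch below computes that nominal figure. `estimate_gb` is a hypothetical helper, and the result is a lower bound, since real K-quant GGUF files keep some tensors at higher precision and store scaling data, which is why a Q4_K_M file is larger than the plain 4-bit estimate.

```shell
# Nominal model size in GB: (parameters in billions) * (bits per weight) / 8.
#   $1 = parameters in billions, $2 = bits per weight
estimate_gb() {
  awk -v p="$1" -v b="$2" 'BEGIN { printf "%.1f\n", p * b / 8 }'
}

estimate_gb 7 16   # prints 14.0 (FP16 row of the table)
estimate_gb 7 8    # prints 7.0  (Q8_0 row)
estimate_gb 7 4    # prints 3.5  (nominal 4-bit, below the real Q4_K_M file size)
```

VRAM needs sit above the file size because the context cache and runtime buffers are added on top of the weights.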

How Do You Generate Embeddings with Ollama?

Embeddings are numerical representations of text, useful for RAG (Retrieval-Augmented Generation) and semantic search.

bash
# Pull an embedding model
ollama pull nomic-embed-text  # best suited to English, 137 million parameters

# Generate an embedding
curl http://localhost:11434/v1/embeddings \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nomic-embed-text",
    "input": "The quick brown fox jumps"
  }'

# The response contains the embedding as a 768-dimensional vector
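Semantic search compares embedding vectors, most commonly by cosine similarity. As an illustration only, the sketch below computes it with `awk` for two vectors supplied as space-separated numbers, one per line. `cosine_similarity` is a hypothetical helper, it assumes both vectors have the same length and are non-zero, and extracting the vector from the JSON response is left to a JSON tool of your choice.

```shell
# Cosine similarity of two vectors read from stdin.
# Line 1 = first vector, line 2 = second vector, values separated by spaces.
cosine_similarity() {
  awk '
    NR == 1 { n = split($0, a, " ") }
    NR == 2 {
      split($0, b, " ")
      for (i = 1; i <= n; i++) {
        dot += a[i] * b[i]      # dot product
        na  += a[i] * a[i]      # squared norm of the first vector
        nb  += b[i] * b[i]      # squared norm of the second vector
      }
      printf "%.4f\n", dot / (sqrt(na) * sqrt(nb))
    }'
}

printf '1 2 3\n2 4 6\n' | cosine_similarity   # parallel vectors: prints 1.0000
printf '1 0\n0 1\n'     | cosine_similarity   # orthogonal vectors: prints 0.0000
```

Values near 1 mean the two texts point in the same direction in embedding space; values near 0 mean they are unrelated.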

Which Environment Variables Control Ollama?

Key environment variables:

  • `OLLAMA_HOST` -- listen address (default: 127.0.0.1:11434). Set it to `0.0.0.0:11434` to allow access from the network.
  • `OLLAMA_MODELS` -- model storage location (default: `~/.ollama/models`).
  • `OLLAMA_DEBUG` -- set to `1` for verbose logs.
  • `CUDA_VISIBLE_DEVICES` (NVIDIA) or `ROCR_VISIBLE_DEVICES` (AMD) -- which GPUs Ollama may use (default: all detected GPUs). Set it to a comma-separated list of device IDs.
  • `OLLAMA_KEEP_ALIVE` -- how long a model stays loaded in memory (default: 5 minutes).
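These variables are read by the server process, so they need to be set in the environment that starts `ollama serve`. A minimal configuration sketch follows. The storage path `/data/ollama/models` and the `30m` keep-alive are illustrative values, not recommendations, and when Ollama runs as a system service the variables belong in the service's own environment rather than in an interactive shell.

```shell
# Listen on all interfaces instead of only localhost.
export OLLAMA_HOST=0.0.0.0:11434

# Store models on a larger disk (hypothetical path; the directory must exist and be writable).
export OLLAMA_MODELS=/data/ollama/models

# Keep models loaded for 30 minutes after the last request instead of the default 5.
export OLLAMA_KEEP_ALIVE=30m

# Then start the server from this same shell:
#   ollama serve
```

Exposing the API on `0.0.0.0` makes it reachable by every machine on the network, so it is worth pairing with a firewall rule.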

Common Mistakes When Using Ollama Commands

  • Forgetting the model tag. `ollama pull llama3.2` pulls whatever the `latest` tag points to, which may not be the size you want; `ollama pull llama3.2:3b` pulls the 3B version explicitly.
  • Not knowing that `ollama serve` already runs automatically. On Mac and Windows, Ollama starts the API when you launch the app, so running `ollama serve` again fails because port 11434 is already in use. On Linux, you may need to start it manually.
  • Pulling the wrong quantization. Always specify the exact model tag (e.g. `qwen2.5:7b-instruct-q4_K_M`) to control VRAM usage.
  • Expecting to pull models while offline. Ollama itself runs fully offline once a model is downloaded, but the download requires an internet connection.

Frequently Asked Questions About Ollama Commands

Where are Ollama models stored?

Default: `~/.ollama/models` on macOS and Linux, `%USERPROFILE%\.ollama\models` on Windows. When Ollama runs as a systemd service on Linux, models are stored under `/usr/share/ollama/.ollama/models` instead. Set `OLLAMA_MODELS` to change the location.

Can I move models between computers?

Yes. Copy the contents of `~/.ollama/models` (both the `blobs` and `manifests` directories) to `~/.ollama/models` on the other computer, and `ollama list` will recognize the models.
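The copy can be scripted. The sketch below is one way to do it, assuming the model store contains a `blobs` directory and a `manifests` directory. `copy_ollama_models` is a hypothetical helper, and the destination could equally be an external drive or a mounted network share.

```shell
# Copy an Ollama model store from one directory to another.
#   $1 = source models directory, $2 = destination models directory
copy_ollama_models() {
  src="$1"; dst="$2"
  # Refuse to run if the source does not look like a model store.
  [ -d "$src/blobs" ] && [ -d "$src/manifests" ] || {
    echo "expected blobs/ and manifests/ under $src" >&2
    return 1
  }
  mkdir -p "$dst"
  cp -R "$src/blobs" "$src/manifests" "$dst/"
}

# Usage (hypothetical destination path):
#   copy_ollama_models ~/.ollama/models /mnt/usb/ollama-models
```

Both directories are needed: the manifests name each model and its tags, and the blobs hold the weight files the manifests point to.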

How do I check the memory usage of active models?

Use `ollama ps` to list the models that are currently loaded, along with how much memory each one occupies. By default, a model is unloaded after 5 minutes of inactivity.

Can I run multiple models at the same time?

Yes, but they share VRAM, and each additional loaded model adds to memory use. Two 8B models need roughly 16 GB of VRAM at 8-bit quantization, and less at 4-bit.

What is the difference between GGUF and other model formats?

GGUF is a single-file format that stores quantized weights efficiently and runs on both CPU and GPU. It is the standard for local LLMs. Other formats (safetensors, PyTorch .bin) usually hold unquantized weights, so they need more VRAM and are not optimized for local inference.

How do I use Ollama models in my own application?

`ollama serve` starts an OpenAI-compatible API at `http://localhost:11434/v1`. Point an OpenAI SDK (Python, Node.js, etc.) at that base URL to send requests and receive responses.

A Note on Third-Party Facts

This article references third-party AI models, benchmarks, prices, and licenses. The AI landscape changes rapidly. Benchmark scores, license terms, model names, and API prices can shift between the time of writing and the time you read this. Before making deployment or compliance decisions based on this article, verify current figures on each provider's official source: Hugging Face model cards for licenses and benchmarks, provider websites for API pricing, and EUR-Lex for current GDPR and EU AI Act text. This article reflects publicly available information as of May 2026.
