éèŠãªãã€ã³ã
- ããŒã«ã«7Bã¢ãã«ã¯ãGPT-4oããæç€ºçãªã¬ã€ãã³ã¹ãå¿ èŠã§ããããé·ãããã³ãããããæç¢ºãªæç€ºã
- æèã®é£éïŒãã¹ããããã€ã¹ãããã§èãããããïŒã¯ãæšè«ç²ŸåºŠã10ïœ20%åäžãããŸãã
- åžžã«åºå圢åŒãæå®ããŠãã ããïŒJSONãMarkdownããã¬ãŒã³ããã¹ãïŒãæ§é åãããŠããªãåºåã¯äºæž¬äžå¯èœã§ãã
- Few-Shotãµã³ãã«ïŒ1ïœ3åïŒã¯ããŒã«ã«ã¢ãã«ã§Zero-ShotããåªããŠããŸãããµã³ãã«æ°ãå€ãã»ã©äžè²«æ§ãåäžããŸãã
- ããŒã«å®çŸ©ïŒãPythonã®ãšãã¹ããŒãã§ããïŒã¯ããã¡ã€ã³åºæã®å¿çãæ¹åããŸãã
éèŠãªäºå®
- CoTã«ãã粟床åäžïŒ æšè«ã¿ã¹ã¯ã§10ïœ20%ã®æ¹å
- Few-ShotèŠä»¶ïŒ ããŒã«ã«7Bã¯3ïœ5åã®äŸãå¿ èŠ vs ã¯ã©ãŠãAPIã¯1ïœ2å
- ã³ã³ããã¹ãæ¶è²»ïŒ åãµã³ãã«ã¯50ïœ200ããŒã¯ã³ãæ¶è²»
- 枩床ã®åœ±é¿ïŒ 0.8ãã0.3ã«äœäžããããšãäºå®ã®ç²ŸåºŠã15ïœ25%åäž
- ã¢ãã«ãµã€ãºã®éãïŒ 7Bã¢ãã«ã¯70Bã¢ãã«ããæç€ºçãªã¬ã€ãã³ã¹ãå¿ èŠ
- åºå圢åŒã®äžè²«æ§ïŒ JSON仿§ã¯ä¿¡é Œæ§ã30ïœ40%åäžãããŸã
ããŒã«ã«ã¢ãã«ã¯ã©ã®ããã«ç°ãªããŸããïŒ
| ã¢ã¹ãã¯ã | GPT-5.2 (ChatGPT Plus) | ããŒã«ã«7B (Llama 3.1 8B) | ããŒã«ã«70B (Llama 3.3) |
|---|---|---|---|
| ã³ã³ããã¹ããŠã£ã³ã㊠| 128KããŒã¯ã³ | 4Kïœ128KããŒã¯ã³ | 128KããŒã¯ã³ |
| æç€ºã®éµå® | åªç§ | æç€ºçãªããã³ããã§è¯å¥œ | éåžžã«è¯å¥œ |
| Few-ShotåŠç¿ | 1ïœ2äŸ | 3ïœ5äŸãå¿ èŠ | 2ïœ3äŸ |
| æšè« | ãã«ãã¹ãããæé»ç | ã¹ããããã€ã¹ãããæç€ºçã«å¿ èŠ | äžçšåºŠã®æé»ç |
| ã·ã¹ãã ããã³ãã | APIã§åŠç | ããŒã«å¥ã«èšå® | ããŒã«å¥ã«èšå® |
| ããã©ã«ã枩床 | 1.0 (API) | 0.8 (Ollamaããã©ã«ã) | 0.8 (Ollamaããã©ã«ã) |
æèã®é£éã«ããããã³ããã£ã³ã°ã¯ç²ŸåºŠãã©ã®ããã«æ¹åããŸããïŒ
æèã®é£éïŒCoTïŒããã³ããã£ã³ã°ã¯ãLLMã«åçããåã«ã¹ããããã€ã¹ãããã§æšè«ã瀺ãããæ±ããŸãã ãã®ãã¯ããã¯ã¯ããã倧ããªã¯ã©ãŠãã¢ãã«ã®æé»çãªæšè«èœåãäžè¶³ããŠããããŒã«ã«ã®7Bïœ13Bã¢ãã«ã«ç¹ã«å¹æçã§ããã17 à 24ãã®ãããªæ°åŠçåé¡ã®å ŽåãCoTã®ãªãããŒã«ã«ã¢ãã«ã¯ãã°ãã°ééã£ãæšæž¬ãããŸããæç€ºçãªã¹ããããã€ã¹ãããã®æšè«ãããã°ãåé¡ãéšåã«åè§£ããŠç²ŸåºŠã10ïœ20%åäžãããŸãã
CoTãªãïŒ ã17 à 24ã¯äœã§ããïŒãâ ã¢ãã«ãçŽæ¥åçãå€ãã®å Žåééãã
CoTããïŒ ãã¹ããããã€ã¹ãããã§è§£ããŠãã ããïŒ17 à 24ãâ ã¢ãã«ã衚瀺ïŒ17 à 20 = 340ã17 à 4 = 68ãåèš= 408ãããæ£ç¢ºã§ãã
ããŒã«ã«AIãšãŒãžã§ã³ããããŒã«éžæã®ããã«å éšçã«æšè«ã䜿çšããæ¹æ³ãåŠã³ãŸãã
ð äžæã§èª¬æ
æèã®é£éããã³ããã£ã³ã°ã¯ãåçããåã«æšè«ãæç€ºçãªã¹ãããã«åè§£ããããã¢ãã«ã«æç€ºããè€éãªã¿ã¹ã¯ã§ç²ŸåºŠã10ïœ20%åäžãããŸãã
# Prompt with CoT
prompt = """
You will answer a question by thinking step-by-step.
Let me think about this:
Question: Why do local LLMs require more explicit prompting than cloud APIs?
Thinking:
1. First, consider the differences in model size...
2. Then, think about training data and fine-tuning...
3. Finally, consider the architecture and inference optimization...
Answer:
"""
# This guides the model to reason through the problemâ¢ð¡: ããã®ãã³ãïŒCoTã¯éšåçãªæšè«ã§åºåããã©ã€ãã³ã°ãããšãã«æã广çã§ããäŸïŒããããã¹ããããã€ã¹ãããã§åè§£ããŸãããããŸããç§ã¯æ³šç®ããŸã...ã
ããŒã«ã«ã¢ãã«ã§ã¯åºå圢åŒã®æå®ãéèŠãªã®ã¯ãªãã§ããïŒ
æ£ç¢ºãªåºå圢åŒïŒJSONãMarkdownããã¬ãŒã³ããã¹ãïŒãæå®ããããšã¯ãããŒã«ã«ã¢ãã«ãæç€ºçãªæç€ºãªãã«äºæž¬äžå¯èœãªåºåãçæãããããããŒã«ã«ã¢ãã«ã«ãšã£ãŠéèŠã§ãã GPT-4oã®ãããªã¯ã©ãŠãã¢ãã«ã¯ããããŸããªãªã¯ãšã¹ãããæå³ãæšæž¬ã§ããŸããããŒã«ã«7Bïœ13Bã¢ãã«ã¯ããã§ããŸãããæ§é åãããããã¥ã¡ã³ãæœåºãå¿ èŠãªããŒã«ã«RAGã·ã¹ãã ã§ã¯ãJSON圢åŒã®ä»æ§ãããŒã¹ãšã©ãŒãé²ããæœåºç²ŸåºŠã30ïœ40%åäžãããŸãã
äŸïŒ ãããã¹ããããšã³ãã£ãã£ãæœåºãããã¯ãªã¹ãã§ã¯ãªããã©ãã£ãããã¹ããè¿ãå¯èœæ§ããããŸãã
ããè¯ãïŒ ããšã³ãã£ãã£ãJSONã§æœåºããŸããããŒïŒpersonãlocationãorganizationãã
# Bad: ambiguous output
prompt = "Summarize this text"
# Good: explicit format
prompt = """
Summarize the text in EXACTLY 3 bullet points.
Format as a JSON list:
{
"summary": [
"- Point 1",
"- Point 2",
"- Point 3"
]
}
"""â¢â ïž: äžè¬çãªåé¡ïŒããŒã«ã«ã¢ãã«ã¯æã çã®JSONã®åºåãæåŠããŸããããããã€ãã¹ããããã«ãããã³ããã«ãåºåã¯JSONã®ã¿ãããŒã¯ããŠã³ãã§ã³ã¹ãªããã远å ããŠãã ããã
ããŒã«å²ãåœãŠã¯ããŒã«ã«ã¢ãã«ã®å¿çãã©ã®ããã«æ¹åããŸããïŒ
ç¹å®ã®ããŒã«ïŒã10幎ã®çµéšãæã€Pythonãšãã¹ããŒããïŒãå²ãåœãŠãããšã¯ãäžè¬çãªããã³ãããšæ¯èŒããŠãã¡ã€ã³åºæã®å¿çãåçã«æ¹åããŸãã ãã«ãœãããã³ããã£ã³ã°ãšåŒã°ãããã®ãã¯ããã¯ã¯ãã¢ãã«ã®å¿ççæãç¹å®ã®å°éç¥èãã¡ã€ã³ã«åºå®ããããšã§æ©èœããŸããããŒã«ã«ã¢ãã«ã¯ã¯ã©ãŠãã¢ãã«ãããããŒã«å®çŸ©ã«15ïœ25%è¯ãå¿çããŸããããã¯ãäžè¬çãªããã³ãããæ©èœããããšãå¯èœã«ããå ç¢ãªRLHFã¢ã©ã€ã³ã¡ã³ããäžè¶³ããŠããããã§ããäŸïŒ
- ãPythonãšãã¹ããŒãã§ããâ ããåªããã³ãŒã説æ
- ãå»åŠç ç©¶è ã§ããâ ãã詳现ãªãã€ãªã¡ãã£ã«ã«å¿ç
- ãæççãªã¢ããªã¹ãã§ããâ ããæ¹å€çæè
ãã¡ã€ã³ã¢ã©ã€ã¡ã³ãã匷åããããã®ãã¡ã€ã³ãã¥ãŒãã³ã°ãšçµã¿åãããŠãå€ãã®ãŠãŒã¹ã±ãŒã¹ããããã€ããŠããå Žåã
ð¬ ç°¡æœã«èª¬æ
ç°¡åã«èšãã°ããã«ãœãããã³ããã£ã³ã°ã¯ã¢ãã«ã«åçãããšãã«ã©ã®ãåžœåãããã¶ãããäŒããŸããPythonãšãã¹ããŒãåžœåã¯äžè¬çãªã¢ã·ã¹ã¿ã³ãåžœåãšã¯ç°ãªãïŒãããŠããè¯ãïŒã³ãŒããçæããŸãã
â¢ð¯: ãã¹ããã©ã¯ãã£ã¹ïŒå ·äœæ§ãéèŠã§ããããšãã¹ããŒãã§ããã¯åŒ±ãïŒã10幎ã®ããã¯ãšã³ãçµéšãæã€Pythonãšãã¹ããŒããAsync/Awaitãã¿ãŒã³ã«çŠç¹ãåœãŠãŠãããã¯åŒ·ãã
OllamaãLM Studioãllama.cppã§ã·ã¹ãã ããã³ãããèšå®ããã«ã¯ã©ãããã°ããã§ããïŒ
ã·ã¹ãã ããã³ããã¯ããŠãŒã¶ãŒã®ã¡ãã»ãŒãžã®åã«ã¢ãã«ã®ããŒã«ãšå¶çŽãå®çŸ©ããåããŒã«ïŒOllamaãLM Studioãllama.cppïŒã¯ãããèšå®ããããã«ç°ãªã圢åŒãå¿ èŠãšããŸãã
# Ollama (Modelfile)
FROM llama3.1:8b
SYSTEM """You are a Python expert with 10 years experience. Answer only Python questions. Provide code examples. Use type hints."""
PARAMETER temperature 0.7
PARAMETER top_p 0.9
PARAMETER repeat_penalty 1.1
# Ollama (API / OpenAI SDK)
response = client.chat.completions.create(
model="llama3.1:8b",
messages=[
{"role": "system", "content": "You are a Python expert..."},
{"role": "user", "content": "Write a FastAPI endpoint"}
],
temperature=0.7
)
# LM Studio (GUI)
# Settings â System Prompt field (paste your prompt)
# Or via API at localhost:1234 â identical format to Ollama
# llama.cpp (CLI)
./main -m llama-3.1-8b.gguf \
--system-prompt "You are a Python expert..." \
--temp 0.7 --top-p 0.9 --repeat-penalty 1.1 \
-p "Write a FastAPI endpoint"枩床ãšãµã³ããªã³ã°ãã©ã¡ãŒã¿ã¯åºåå質ã«ã©ã®ããã«åœ±é¿ããŸããïŒ
枩床ãtop_pãrepeat_penaltyã®ãã¥ãŒãã³ã°ã¯ãããã³ãã衚çŸã ããããããŒã«ã«7Båºåå質ã«å€§ããªåœ±é¿ãäžããããŒã«ã«ã¢ãã«ã¯ã¯ã©ãŠãAPIãšã¯ç°ãªãããã©ã«ããå¿ èŠã§ãã
ããŒã«ã«ã¢ãã«ã®éèŠãªæŽå¯ïŒ Ollamaã®ããã©ã«ã枩床ïŒ0.8ïŒã¯OpenAIã®APIããã©ã«ãïŒãã¯ã¬ãŠã¹ãµã³ããªã³ã°ä»ã1.0ïŒããé«ããæž©åºŠã0.3ïœ0.5ã«äœäžãããããšã¯ãããŒã«ã«7Bã¢ãã«ã®äºå®ç²ŸåºŠãåçã«æ¹åããŸããã³ãŒãã£ã³ã°ã¿ã¹ã¯ã®å Žåãæž©åºŠã0.1ïœ0.2ã«ãrepeat_penaltyã1.0ã«èšå®ããŸãïŒã³ãŒãã¯ã€ã³ããŒãã颿°åŒã³åºãã®ãããªç¹°ãè¿ããã¿ãŒã³ãå¿ èŠïŒã
| ãã©ã¡ãŒã¿ | äœãå¶åŸ¡ããã | ããã©ã«ã (Ollama) | æšå¥š |
|---|---|---|---|
| temperature | ã©ã³ãã æ§ | 0.8 | äºå®ã¯0.3ïœ0.5ãåµé çã¯0.7ïœ0.9 |
| top_p | èªåœã®å€æ§æ§ | 0.9 | äžè²«æ§ã¯0.8ãå€åã¯0.95 |
| repeat_penalty | ç¹°ãè¿ãåé¿ | 1.1 | ãã£ããã¯1.1ïœ1.2ãã³ãŒãã¯1.0 |
â¢ð: éèŠãã€ã³ãïŒæž©åºŠã¯ããžããã®ä¹æ°ã§ãã0.0ã§ã¯åžžã«æãå¯èœæ§ã®é«ãããŒã¯ã³ãéžæããŸãã1.0+ã§ã¯ãã©ã³ãã æ§ãå¢å ããŸããããŒã«ã«ã¢ãã«ã¯1.5枩床以äžã§é£œåããŸãã
ããŒã«ã«ã¢ãã«ã¯ã¯ã©ãŠãAPIããå€ãã®Few-Shotãµã³ãã«ãå¿ èŠãªã®ã¯ãªãã§ããïŒ
3ïœ5åã®ãµã³ãã«ïŒFew-ShotåŠç¿ïŒãããŒã«ã«ã¢ãã«ã«æäŸããããšã¯ãZero-Shotããåºåäžè²«æ§ã15ïœ25%åäžãããã¯ã©ãŠãã¢ãã«ã¯1ïœ2åã®ãµã³ãã«ã®ã¿å¿ èŠã§ãã
ããŒã«ã«ã¢ãã«ã¯ãã©ã¡ãŒã¿ãå°ãªãããã¬ãŒãã³ã°ããŒã¿ã倿§ã§ãªããããããå€ãã®ãµã³ãã«ããæ©æµãåããŸããFew-ShotåŠç¿ã¯ãã¢ãã«ã«å®éã®ã¿ã¹ã¯ãè§£ãããæ±ããåã«ãäºæ³ãããå ¥åºåãã¿ãŒã³ã瀺ãæèå åŠç¿ãã¯ããã¯ã§ãã
# Few-shot prompt
prompt = """
Classify sentiment. Examples:
"I love this product!" â positive
"Worst experience ever" â negative
"It's okay, nothing special" â neutral
Now classify: "This is amazing!"
Answer: """
# Model learns format and style from examplesâ¢ð ïž: å®è£ ã®ãã³ãïŒ3ã€ã®é¡äŒŒäŸããããµã³ãã«ãå€åãããŸãïŒ1ã€ç°¡åã1ã€äžçšåºŠã1ã€é£ããïŒã倿§æ§ã¯äžè¬åãæ¹åããç¹å®ã®ãã¿ãŒã³ãžã®éåŠç¿ã鲿¢ããŸãã
ããã³ãããšã³ãžãã¢ãªã³ã°ã®äžè¬çãªãšã©ãŒ
- æ§é ã®ãªãåé·ãªããã³ããã ãã©ãã©ãšããæç€ºã¯ããŒã«ã«ã¢ãã«ãæ··ä¹±ãããŸããç°¡æœã§æç€ºçã«ããŠãã ããã
- æèã®é£éã䜿çšããªãã CoTã¯ç²ŸåºŠã10ïœ20%åäžãããŸããæšè«ã¿ã¹ã¯ã§ã¯åžžã«å«ããŠãã ããã
- 1ã€ã®ããã³ããããã¹ãŠã«å¯Ÿå¿ãããšä»®å®ããã å埩ããŠãã¹ãããŠãã ãããããããªåèªã®å€æŽã§å€§ããªåºåã®å€åãçºçããŸãã
- åºå圢åŒãç¡èŠããã æç€ºçãªåœ¢åŒæå®ãªãã§ã¯ãåºåã¯äºæž¬äžå¯èœã§ãã
- ãããŸããªããŒã«å®çŸ©ã䜿çšããã ããšãã¹ããŒãã§ããã¯ãããŸããã10幎ã®çµéšãæã€Pythonãšãã¹ããŒãããããåªããŠããŸãã
â¢ð: ãåç¥ã§ããïŒæã广çãªããã³ããã¯3ïœ5ããŒãžã§ã³ãå埩åŠçããŸããããŒã«ã«ã¢ãã«ã®ããã³ããã£ã³ã°ã¯ãã»ããããŠå¿ãããã§ã¯ãããŸããâå°ããªæ¹åã¯é倧ãªç²ŸåºŠåäžã«è€åããŸãã
ããã³ãããšã³ãžãã¢ãªã³ã°ã®å°åå¥ã®èæ ®äºé
æ¥æ¬ïŒMETI AI Governance 2024ïŒïŒ ãšã³ã¿ãŒãã©ã€ãºAIãããã€ã¡ã³ãçšã«ãMETI AI Governance 2024ãã¬ãŒã ã¯ãŒã¯ãéµå®ããŠãã ãããããŒã«ã«LLMã¯ãæ¥æ¬ã®äŒæ¥ãèŠå¶èŠä»¶ãæºãããªããããã³ãããšã³ãžãã¢ãªã³ã°ãå®è¡ããã®ã«åœ¹ç«ã¡ãŸããããã³ããå質ã¯ããšã³ã¿ãŒãã©ã€ãºå±éã«ããã粟床ãšä¿¡é Œæ§ã«çŽçµããŸãã
æ±ã¢ãžã¢/APACïŒããŒã¿å±æåïŒïŒ ããŒã¿å¢çãã¬ãŒã ã¯ãŒã¯ã«æºæ ããæ±åã¢ãžã¢ãšAPACå°åã§ã¯ãããŒã«ã«LLMãæé©ã§ããããã³ãããšã³ãžãã¢ãªã³ã°ã¯ãããŒã«ã«ãµãŒããŒã§å埩åŠçã§ããããŒã¿ãå°åå€ã«ç§»åãããŸããã
ãšã³ã¿ãŒãã©ã€ãºå±éïŒä¿¡é Œæ§ãšç£æ»ïŒïŒ éèæ©é¢ãå»çæ©é¢ãæ³åæ©é¢ã§ã¯ãå®å šãªç£æ»ãã°ãšèª¬æå¯èœæ§ã®ããã«ãããã³ãããšã³ãžãã¢ãªã³ã°ã¯ããŒã«ã«ã§ææžåããå¿ èŠããããŸãããã¹ãŠã®ããã³ããå埩ãšã¢ãã«åºåã¯ãå éšã¢ãŒã«ã€ãã«èšé²ããŠãã ããã
ããŒã«ã«LLMããã³ããã£ã³ã°ã«ã€ããŠãããã質å
ããŒã«ã«LLMãGPT-4oããæç€ºçãªããã³ãããå¿ èŠãšããã®ã¯ãªãã§ããïŒ
ããŒã«ã«7Bïœ13Bã¢ãã«ã¯ãã©ã¡ãŒã¿ãå°ãªããGPT-4oãããã¬ãŒãã³ã°ããŒã¿ã倿§ïŒæšå®1.8Tãã©ã¡ãŒã¿ïŒã§ãã圌ãã¯ãããŸããªæå³ãæšæž¬ã§ããŸãããæç€ºçãªæç€ºïŒåœ¢åŒãããŒã«ãã¹ããããã€ã¹ãããã®æšè«ïŒããã®ã®ã£ãããè£åããŸããæèã®é£éããã³ããã£ã³ã°ã¯ãæšè«ã¿ã¹ã¯ã§ããŒã«ã«ã¢ãã«ã®ç²ŸåºŠã10ïœ20%åäžãããŸãã
ããŒã«ã«LLMã®ããã³ããã«äœåã®Few-Shotãµã³ãã«ãå«ããå¿ èŠããããŸããïŒ
ããŒã«ã«7Bã¢ãã«ã§ã¯3ïœ5åã®ãµã³ãã«ãæé©ã§ããGPT-4oã¯éåžž1ïœ2åã®ãµã³ãã«ã®ã¿ãå¿ èŠã§ããããå€ãã®ãµã³ãã«ã¯äžè²«æ§ãæ¹åããŸãããã³ã³ããã¹ããŠã£ã³ããŠããŒã¯ã³ãæ¶è²»ããŸãïŒã¢ãã«ã«ãã£ãŠ4Kïœ32KããŒã¯ã³ïŒã4Kã³ã³ããã¹ããŠã£ã³ããŠãæã€Llama 3.2 8Bã®å Žåã3ã€ã®ãµã³ãã«+ã¿ã¹ã¯ã«å¶éããŠãã ããã32K+ã³ã³ããã¹ããæã€ã¢ãã«ã®å Žåã5åã®ãµã³ãã«ã¯å®å šã§ãã
ãã¹ãŠã®ããŒã«ã«ã¢ãã«ã§æèã®é£éããã³ããã£ã³ã°ãæ©èœããŸããïŒ
æèã®é£éã¯ãæç€ºèª¿æŽãããã¢ãã«ïŒLlama 3.xãQwen 2.5ãMistral 7BïŒã§æ©èœããŸããããŒã¹ã¢ãã«ïŒæç€ºèª¿æŽãªãïŒã¯ãã¹ããããã€ã¹ãããã§èããããšããæç€ºã«ç¢ºå®ã«åŸããŸãããããŒã«ã«ã¢ãã«ã§ã¯ããã¹ããããã€ã¹ãããã§è§£ãïŒããŸãã¯ãæšè«ïŒãã®ãããªCoTãã¬ãŒãºãæåŸ ãããåºåã®éå§ã«æé©ã«æ©èœããŸãã
ããŒã«ã«LLMã§ã¯ã©ã®åºå圢åŒãæãä¿¡é Œæ§ããããŸããïŒ
JSONã¯ããŒã«ã«LLMã®æãä¿¡é Œæ§ã®é«ãæ§é ååºåãã©ãŒãããã§ããããã³ããã§æ£ç¢ºãªJSONã¹ããŒããæå®ããŠãã ããïŒãåçã¯JSONãªããžã§ã¯ãã®ã¿ã§ãããŒïŒååãã¹ã³ã¢ãæšè«ããMarkdownããããŒïŒ##ïŒã¯ã»ã¯ã·ã§ã³ã«å¯ŸããŠä¿¡é Œæ§ããããŸããXMLãŸãã¯ã«ã¹ã¿ã 圢åŒãèŠæ±ããããšãé¿ããŠãã ããâããŒã«ã«ã¢ãã«ã¯äžè²«æ§ã®ãªãã®ã«å¯Ÿå¿ããŸãã
ããŒã«ã«LLMããããã¯ããå€ããªãããã«ããã«ã¯ã©ãããã°ããã§ããïŒ
ã·ã¹ãã ãŸãã¯ã€ã³ã¹ãã©ã¯ã·ã§ã³ããã³ããã«æç€ºçãªå¶çŽã远å ããŠãã ããïŒã[ãããã¯]ã«ã€ããŠã®ã¿åçããŠãã ãããå¥ã®ããšãèãããå Žåã¯ãã[ãããã¯]ã«ã€ããŠã®ã¿æäŒããŸãããšèšã£ãŠãã ããããOllamaã®å Žåãã·ã¹ãã ããã³ãããã£ãŒã«ãã䜿çšããŠãã ãããllama.cppã®å Žåãã·ã¹ãã ã¡ãã»ãŒãžãšããŠåã«çœ®ããŠãã ããããã®å¢çèšå®ã¯ã匷ãRLHFã¢ã©ã€ã³ã¡ã³ããæã€ã¯ã©ãŠãã¢ãã«ããããŒã«ã«7Bã¢ãã«ã§å€§å¹ ã«åªããŠããŸãã
ããŒã«ã«ã¢ãã«ã®Zero-ShotãšFew-Shotããã³ããã£ã³ã°ã®éãã¯äœã§ããïŒ
Zero-Shotã¯äŸããããŸããïŒããã®ã¡ãŒã«ãã¹ãã ãŸãã¯ã¹ãã 以å€ã«åé¡ããŠãã ããããFew-Shotã¯ã¿ã¹ã¯ã®åã«2ïœ5åã®ã©ãã«ä»ãäŸãæäŸããŸããããŒã«ã«7Bã¢ãã«ã§ã¯ãFew-Shotã¯åé¡ãšãã£ãã·ã³ã°ã»ãã£ãã·ã³ã°ã»ãã£ãã·ã³ã°ã®ã¿ã¹ã¯ã§ã15ïœ25%ã®ç²ŸåºŠã§Zero-Shotãäžè²«ããŠäžåããŸããZero-Shotã¯çæã¿ã¹ã¯ïŒèŠçŽã翻蚳ïŒã«å¯ŸããŠæ©èœããŸããããã§åœ¢åŒã¯éèŠã§ã¯ãããŸããã
ããŒã«ã«ã¢ãã«ã®ããã³ããããã¹ãããŠå埩ããã«ã¯ã©ãããã°ããã§ããïŒ
5ïœ10åã®ç°ãªãäŸã§ãã¹ãããŠãã ãããäžåºŠã«1ã€ã®å€æ°ïŒããŒã«ã圢åŒããŸãã¯CoTæç€ºïŒã倿ŽããŠãã ããã倿Žå/åŸã«ç²ŸåºŠãŸãã¯äžè²«æ§ã枬å®ããŠãã ãããåçŽãªãã¹ãã»ããã䜿çšããŠãã ããïŒ2ïœ3åã®ç°¡åãªäŸã2ïœ3åã®é£ããäŸãã©ã®ããã³ããããŒãžã§ã³ãæé©ã«æ©èœãããã远跡ããŠãã ããã3ïœ5ããã³ããå€åã®ãµã€ã¯ã«ã§å埩ããŠãã ãããããã³ããã©ã€ãã©ãªã§æ©èœããããã³ãããããã¥ã¡ã³ãåããŠãã ããã
ç¹å®ã®ã¿ã¹ã¯ã«å¯ŸããŠããã³ãããšã³ãžãã¢ãªã³ã°ãå®è¡ããå¿ èŠããããŸããããŸãã¯ãã¡ã€ã³ãã¥ãŒãã³ã°ãå®è¡ããå¿ èŠããããŸããïŒ
ãŸãããã³ãããšã³ãžãã¢ãªã³ã°ãå®è¡ããŠãã ããïŒé«éãç¡æãå埩å¯èœïŒã20以äžã®ããã³ããå€åã®åŸã«ç²ŸåºŠããã©ããŒã«éããå Žåããã¡ã€ã³ãã¥ãŒãã³ã°ãå®è¡ããŠãã ããããã¡ã€ã³ãã¥ãŒãã³ã°ã«ã¯500以äžã®ã¿ã¹ã¯åºæã®äŸãš1ïœ4æéã®ãã¬ãŒãã³ã°æéãå¿ èŠã§ããæ±çšã¿ã¹ã¯ã®å Žåãããã³ãããšã³ãžãã¢ãªã³ã°ã¯éåžžååã§ãããã¡ã€ã³åºæã®ã¿ã¹ã¯ïŒå»çãæ³åãã³ãŒãã£ã³ã°ïŒã®å Žåããã¡ã€ã³ãã¥ãŒãã³ã°ã¯æ°žç¶çãªæ¹åããããããŸãã
ããŒã«ã«LLMã§ã¯ã·ã¹ãã ããã³ãããšãŠãŒã¶ãŒæç€ºã¯ã©ã®ããã«ç°ãªããŸããïŒ
ã·ã¹ãã ããã³ããã¯ããŠãŒã¶ãŒã¡ãã»ãŒãžã®åã«ã¢ãã«ã®ããŒã«ãšå¶çŽãå®çŸ©ãããªã¯ãšã¹ãæ§é ã®äžéšã§ãïŒOllamaãLM StudioããŸãã¯APIçµç±ïŒããŠãŒã¶ãŒæç€ºã¯äŒè©±ã®äžéšã§ããã·ã¹ãã ããã³ããã¯åºæ¬çãªåäœãèšå®ãããŠãŒã¶ãŒããã¹ãã«æç€ºãåã蟌ãããä¿¡é Œæ§ããããŸããããŒã«ã«ã¢ãã«ã®å Žåãããæžãããã·ã¹ãã ããã³ããã¯ãã¢ãã«ãã·ã¹ãã ã¬ãã«ã®å¶çŽãåªå ãããããäžè²«æ§ã15ïœ25%åäžãããŸãã
ç°ãªãããŒã«ã«ã¢ãã«éã§åãããã³ããã䜿çšã§ããŸããïŒ
éšåçã«ãåºæ¬çãªCoTæ§é ãšããŒã«å®çŸ©ã¯ã¢ãã«ïŒLlamaãQwenãMistralïŒéã§è»¢éãããŸãããã ããåã¢ãã«ã¯æé©ãªçµæãåŸãããã«ããã³ãã調æŽãå¿ èŠã§ããLlamaã¢ãã«ã¯ãã¹ããããã€ã¹ãããã§èãããããã«å¿çããQwenã¢ãã«ã¯ãæåã«ãç§ã¯...ãã奜ã¿ãŸãããããã€ããæ£ç¢ºãªã¢ãã«ã§ããã³ããããã¹ãããŠãã ããã倧ããªã¢ãã«ïŒ70BïŒã¯ããã³ããå€åã«å¯ŸããŠå°ããªã¢ãã«ïŒ7BïŒãã蚱容æ§ããããŸãã
ãœãŒã¹
- Chain-of-Thought Prompting Paper (Wei et al.) -- ã¹ããããã€ã¹ãããã®æç€ºã«ããæšè«ã«é¢ããåºç€çãªç ç©¶ã
- Prompt Engineering Guide (DAIR-AI) -- ããã³ããã£ã³ã°ãã¯ããã¯ãšãã¹ããã©ã¯ãã£ã¹ã®å æ¬çãªã³ã¬ã¯ã·ã§ã³ã
- Ollama Modelfile Reference -- ã·ã¹ãã ããã³ããããã©ã¡ãŒã¿ïŒæž©åºŠãtop_pãrepeat_penaltyïŒãã«ã¹ã¿ã ã¢ãã«äœæã®å ¬åŒããã¥ã¡ã³ãã