LLMãæ¬åœã«äœã§ããã
LLMïŒå€§èŠæš¡èšèªã¢ãã«ïŒã¯ããã©ã³ã¹ãã©ãŒããŒããŒã¹ã®ãã¥ãŒã©ã«ãããã¯ãŒã¯ã§ãäžããããå ¥åã·ãŒã±ã³ã¹ã®æãå¯èœæ§ã®é«ã次ã®ããŒã¯ã³ãäºæž¬ããããã«èšç·ŽãããŠããŸã â ããŒã¿ããŒã¹ãæ€çŽ¢ãšã³ãžã³ãæšè«ã·ã¹ãã ã§ã¯ãããŸããã ãã®ã¢ãã«ã¯ããã¬ãŒãã³ã°äžã«WebããŒãžãæžç±ãã³ãŒãããã®ä»ã®ããã¹ãããæ°çŸåèªãåŠçããããšã§ãããŒã¯ã³éã®çµ±èšçãªé¢ä¿ãåŠç¿ããŸãã
ããã³ãããå ¥åãããšãã¢ãã«ã¯ããã¹ããæ°å€ããŒã¯ã³IDã®ã·ãŒã±ã³ã¹ã«å€æããæ°åã®ãã©ã³ã¹ãã©ãŒããŒã¬ã€ã€ãŒãéããŠæž¡ãããã®ããã£ãã©ãªãŒå šäœïŒéåžž50,000ã100,000ããŒã¯ã³ïŒäžã®ç¢ºçååžãåºåããŸãããã®ååžããããŒã¯ã³ãéžæããã·ãŒã±ã³ã¹ã«è¿œå ãã忢ããŒã¯ã³ãçæããããåºåå¶éã«éãããŸã§ç¹°ãè¿ããŸãã
ãã®ã¢ãŒããã¯ãã£ã¯ããŠãŒã¶ãŒãæ··ä¹±ãããããã€ãã®åäœã説æããŸãããªãLLMã¯ä¿¡ãåŸããééã£ãäºå®ãå¹»èŠãããã®ãïŒæ€èšŒãããçå®ã§ã¯ãªããå¯èœæ§ã®é«ãããã¹ããäºæž¬ïŒããªãç®è¡ã«å€±æã§ããã®ãïŒããŒã¯ã³ãã¿ãŒã³ãå®éã®èšç®ã§ã¯ãªãïŒããããŠãªãããã³ãããèšãæãããšåºåãå€ããã®ãïŒç°ãªãããŒã¯ã³ã·ãŒã±ã³ã¹ãç°ãªã確çååžãããªã¬ãŒïŒã
| ç¹æ§ | LLM | å€å žçãªãœãããŠã§ã¢ |
|---|---|---|
| åäœæ¹æ³ | åŠç¿ããã確çååžçµç±ã§æ¬¡ã®ããŒã¯ã³ãäºæž¬ | 決å®çãªåœä»€ãå®è¡ |
| åºåã®æ±ºå®æ§ | 確çç â åãå ¥åãç°ãªãåºåãçæã§ããŸã | 決å®ç â åãå ¥åã¯åžžã«åãåºåãçæ |
| ç¥èã®åºæ | ãã¬ãŒãã³ã°äžã«ã¢ãã«ãŠã§ã€ãã«ä¿åããããã¿ãŒã³ | å®è¡æã«ããŒã¿ããŒã¹ãŸãã¯ãã¡ã€ã«ããèªã¿åããŸã |
| ãšã©ãŒã¿ã€ã | èªä¿¡ãæã£ãŠããããééã£ãŠïŒå¹»èŠïŒ | ã¯ã©ãã·ã¥ãŸãã¯ãšã©ãŒã³ãŒã |
| æŽæ°ã¡ã«ããºã | åãã¬ãŒãã³ã°ãŸãã¯ãã¡ã€ã³ãã¥ãŒãã³ã°ãå¿ èŠ | ã³ãŒã倿ŽãŸãã¯ããŒã¿ããŒã¹æŽæ° |
ããŒã¯ã³åïŒããã¹ããæ°åã«ãªãæ¹æ³
**LLMãããã¹ããåŠçããåã«ããããæŽæ°ããŒã¯ã³IDã®ã·ãŒã±ã³ã¹ã«å€æããå¿ èŠããããŸã â ããŒã¯ã³åãšåŒã°ããããã»ã¹ã** GPT-4oã¯ãã€ããã¢ãšã³ã³ãŒãã£ã³ã°ïŒBPEïŒã䜿çšããããã¹ããäžè¬çãªéšååèªãŠãããã«åå²ããŸããClaude Opus 4.7ãšGemini 3.1 Proã¯åæ§ã®ãµãã¯ãŒãããŒã¯ã³åã¹ããŒã ã䜿çšããŸãã
ããŒã¯ã³åã¯èšèªã«äŸåããŠããŸããè±èªã®ããã¹ãã¯å¹³å1ããŒã¯ã³ããã0.75åèªã§ããäžåœèªã𿥿¬èªã¯1ããŒã¯ã³ããã0.5åèªã«ãªããŸã â åãããã¥ã¡ã³ãã¯äžåœèªã§ã¯è±èªã®çŽ2åã®ããŒã¯ã³ãããããAPIã³ã¹ããšã³ã³ããã¹ããŠã£ã³ããŠã®äœ¿çšæ³ã«çŽæ¥åœ±é¿ããŸãã
| å ¥åããã¹ã | ããŒã¯ã³ | ããŒã¯ã³æ° |
|---|---|---|
| "Hello, world!" | "Hello", ",", " world", "!" | 4 |
| "Tokenization" | "Token", "ization" | 2 |
| "GPT-4o" | "G", "PT", "-", "4", "o" | 5 |
| "äœ å¥œäžç"ïŒããã«ã¡ã¯äžçãäžåœèªïŒ | "äœ å¥œ", "äžç" | ã¢ãã«ã«å¿ããŠ2â4 |
ãã©ã³ã¹ãã©ãŒããŒæ³šæã¡ã«ããºã ãã©ã®ããã«æ©èœããã
ãã©ã³ã¹ãã©ãŒããŒã¢ãŒããã¯ãã£ã¯ã»ã«ãã¢ãã³ã·ã§ã³ãšåŒã°ããã¡ã«ããºã ã䜿çšããŠãã·ãŒã±ã³ã¹å ã®ãã¹ãŠã®ä»ã®ããŒã¯ã³ã«ã泚æãæããåããŒã¯ã³ã®çšåºŠã決å®ããŸãã åããŒã¯ã³ã®ããã«ãã¢ãã«ã¯3ã€ã®ãã¯ãã« â ã¯ãšãªïŒQïŒãããŒïŒKïŒãå€ïŒVïŒ â ãèšç®ããQãšKã®ãããç©ãšããŠæ³šæã¹ã³ã¢ã決å®ãããœããããã¯ã¹ã§ã¹ã±ãŒãªã³ã°ãšæ£èŠåããŸãã
ãã«ããããæ³šæã¯è€æ°ã®ãããããã«ããã£ãŠãã®ããã»ã¹ã䞊åã«å®è¡ããŸãïŒGPT-4oã¯æå€§å±€ã§96泚æãããã䜿çšïŒãåãããã¯ç°ãªãé¢ä¿ãã¿ãŒã³ãåŠç¿ããŸããããã€ãã®ãããã¯æ§æé¢ä¿ïŒäž»èª-åè©ïŒã«å°éåããä»ã¯æå³è«çãªé¡äŒŒæ§ã«ãä»ã¯ç §å¿ïŒä»£åè©ãåè©ã«é¢é£ä»ããïŒã
éèŠãªå®éã®çµæïŒãLost in the Middleã广ãStanford Universityã® Liu et al.ïŒ2023ïŒã®ç ç©¶ã¯ãLLMãé·ãã³ã³ããã¹ãã®çãäžã®æ å ±ãäœç³»çã«äžéããããšã瀺ããŠããŸããããã³ããã«ã2,000ãè¶ ããããŒã¯ã³ãããå ŽåãéèŠãªæç€ºãã·ã¹ãã ããã³ããïŒéå§ïŒã«é 眮ããæãéèŠãªå¶çŽããŠãŒã¶ãŒã¡ãã»ãŒãžã®çµããã§ç¹°ãè¿ããŸãã
LLMããã¬ãŒãã³ã°ãããæ¹æ³ïŒäºåãã¬ãŒãã³ã°ãšRLHF
LLMãã¬ãŒãã³ã°ã¯2ã€ã®æç¢ºã«åé¢ãããæ®µéã§è¡ãããŸããäºåãã¬ãŒãã³ã°ïŒçã®ããã¹ãããèšèªãã¿ãŒã³ãåŠç¿ããïŒããã³ãã¹ããã¬ãŒãã³ã°ã¢ã©ã€ã¡ã³ãïŒäººçãã£ãŒãããã¯ãéããŠåäœã調æŽããïŒã ãããã®æ®µéã¯ç°ãªãæ©èœãäœæããç°ãªãã©ãããã®ã¢ãã«ãåæ§ã®ãã³ãããŒã¯çµæã§ãç°ãªãåå¿ãããçç±ã説æããŸãã
äºåãã¬ãŒãã³ã°äžãã¢ãã«ã¯å€§éã®ã³ãŒãã¹ãåŠçããŸã â Llama 3.1ã¯çŽ15å ããŒã¯ã³ã§èšç·ŽãããŸããïŒGPT-4ã¯æšå®1ïœ2å ããŒã¯ã³ãç®æšã¯åçŽã§ããæ¬¡ã®ããŒã¯ã³ãäºæž¬ããŠãã ãããæç€ºçãªç¥èã¯ä¿åãããŸããïŒãã¹ãŠã®æ å ±ãã¢ãã«ãŠã§ã€ãã®çµ±èšçãã¿ãŒã³ãšããŠãšã³ã³ãŒããããŸãã
ãã¹ããã¬ãŒãã³ã°ã¢ã©ã€ã¡ã³ã â éåžžã匷ååŠç¿ãã人çãã£ãŒãããã¯ïŒRLHFïŒãŸãã¯ãã®å€çš®ïŒRLAIFãDPOïŒ â ãã¢ãã«ãæçšãªã¢ã·ã¹ã¿ã³ãã«åœ¢äœããŸãã人éã®è©äŸ¡è ã¯ãæçšæ§ãç¡å®³æ§ãèª å®ãã®åºåãè©äŸ¡ããŸããå ±é ¬ã¢ãã«ã¯ãããã®è©äŸ¡ã§èšç·ŽãããããŒã¹LLMã¯ãã®åŸãå ±é ¬ãæå€§åããããã«åŸ®èª¿æŽãããŸããRLHFã¯æåŠåäœãããŒã³ãã»ãã¥ãªãã£ã¡ã«ããºã ãæ±ºå®ããŸã â ããŒã¹ã¢ãŒããã¯ãã£ã§ã¯ãªãã
- äºåãã¬ãŒãã³ã°ïŒ Webã¹ã±ãŒã«ããŒã¿ã®æåž«ãªã次ããŒã¯ã³äºæž¬ãèšèªãã¿ãŒã³ãäžçç¥èãæšè«ã®ã·ã§ãŒãã«ãããã¢ãã«ãŠã§ã€ãïŒããã³ãã£ã¢ã¢ãã«ã§70Bã405Bãã©ã¡ãŒã¿ïŒã«ãšã³ã³ãŒãããŸãã
- ç£èŠããããã¡ã€ã³ãã¥ãŒãã³ã°ïŒSFTïŒïŒ ã¢ãã«ã¯ãçŽç²ãªããã¹ãäºæž¬åšã§ã¯ãªãã¢ã·ã¹ã¿ã³ããšããŠåäœããããã«ããã¥ãŒããããæç€ºå¿çãã¢ã§èšç·ŽãããŸãã
- RLHF / DPOïŒ äººçå奜ãã¢ãã«ãæçšã§ç¡å®³ã§èª å®ãªåºåã«åãã£ãŠå°ããŸããDPOïŒDirect Preference OptimizationïŒã¯Llamaããã³Mistralã¢ãã«ã§äœ¿çšããããããèšç®å¹çã®é«ãä»£æ¿ææ®µã§ãã
- Constitutional AIïŒAnthropicïŒïŒ Claudeã¯ããšããžã±ãŒã¹ã§äººçãã£ãŒãããã¯ãžã®äŸåæ§ãæžããããã«ãååã®ã»ããïŒãæ²æ³ãïŒã䜿ã£ãŠè¿œå ã§ãã¬ãŒãã³ã°ãããŸã â Claude Opus 4.7ã¯ãã®ã¢ãããŒãã䜿çšããŸãã
æšè«ãã©ã®ããã«æ©èœãããïŒãµã³ããªã³ã°ãšåŸ©å·å
æšè«äžãã¢ãã«ã¯ããŒã¯ã³ããšã«åºåãçæããŸã â èªåœå šäœã«ããã£ãŠç¢ºçååžãèšç®ããå¶åŸ¡ãããã³ãŒãã£ã³ã°ãã©ã¡ãŒã¿ã«åŸã£ãŠããããéžæããŸãã 3ã€ã®äž»ãªãã©ã¡ãŒã¿ã¯æž©åºŠããããpïŒæ žãµã³ããªã³ã°ïŒãæå€§ããŒã¯ã³ã§ãã
| ãã©ã¡ãŒã¿ | ç¯å² | 广 | æšå¥šãããçšé |
|---|---|---|---|
| 枩床 | 0.0 â 2.0 | 確çååžãéãããïŒäœïŒãŸãã¯å¹³åŠåããïŒé«ïŒ | ã³ãŒã/äºå®ã«ã€ããŠã¯0ïŒããã¹ãã«ã€ããŠã¯0.7ïŒåµé çãªã¿ã¹ã¯ã«ã€ããŠã¯1.0 |
| ãããpïŒæ žïŒ | 0.0 â 1.0 | ãµã³ããªã³ã°ãã环ç©ç¢ºçãpã«éããããŒã¯ã³ã«å¶é | ã»ãšãã©ã®ã¿ã¹ã¯0.9â0.95ïŒéåžžã«å¶éãããåºåã«ã€ããŠã¯0.5 |
| ãããk | 1 âããã£ãã©ãªãŒãµã€ãº | ãµã³ããªã³ã°ãæãå¯èœæ§ã®é«ã次ã®kããŒã¯ã³ã«å¶é | ãã£ãã«äœ¿çšãããªãïŒãããpã¯äžè¬çã«å¥œãŸããŸã |
| æå€§ããŒã¯ã³ | 1 âã³ã³ããã¹ãå¶é | åºåé·ã®ããŒãã¹ããã | åæãé¿ããããã«ãäºæ³åºåé·ã®2Ãã«èšå® |
| é »åºŠããã«ã㣠| -2.0 â 2.0 | ãã§ã«çæãããããŒã¯ã³ã®ç¹°ãè¿ããäœæž | é·ãããã¥ã¡ã³ã0.1â0.3ïŒã³ãŒã0 |
ã³ã³ããã¹ããŠã£ã³ããŠïŒã¢ãã«ãèŠãããšãã§ãããã®
ã³ã³ããã¹ããŠã£ã³ããŠã¯ãåäžã®æšè«åŒã³åºãã§ã¢ãã«ãåŠçã§ããæå€§ããŒã¯ã³æ°ã§ã â ã·ã¹ãã ããã³ãããäŒè©±å±¥æŽãããã¥ã¡ã³ããçŸåšã®ãŠãŒã¶ãŒã¡ãã»ãŒãžãçµã¿åããããã®ã** ã»ãã·ã§ã³éã§äœãä¿æãããŸããïŒã¢ãã«ã¯æ¯åæåãããªã»ãããããŸãã
ã³ã³ããã¹ããŠã£ã³ããŠã®ãµã€ãºã¯ã¢ãã«ã«ãã£ãŠå€§ããç°ãªããã©ã®ãŠãŒã¹ã±ãŒã¹ãå®è·µçã§ãããã«çŽæ¥åœ±é¿ããŸãã
| ã¢ãã« | ã³ã³ããã¹ããŠã£ã³ã㊠| æŠç®åèªçžåœ | å®çšçãªããã¥ã¡ã³ãå¶é |
|---|---|---|---|
| GPT-4oïŒOpenAIïŒ | 128,000ããŒã¯ã³ | ã96,000åèª | ã200ããŒãžã®PDF |
| Claude Opus 4.7ïŒAnthropicïŒ | 200,000ããŒã¯ã³ | ã150,000åèª | ã300ããŒãžã®PDF |
| Gemini 3.1 ProïŒGoogle DeepMindïŒ | 2,000,000ããŒã¯ã³ | ã1,500,000åèª | ã3,000ããŒãžã®PDF |
| LLaMA 3.1 70BïŒMetaãOllamaããïŒ | 128,000ããŒã¯ã³ | ã96,000åèª | ã200ããŒãžã®PDF |
ããã³ãããšã³ãžãã¢ãªã³ã°ã«ãšã£ãŠãããæå³ãããã®
LLMã¢ãŒããã¯ãã£ãçè§£ããããšã¯ãããã³ããå質ãçŽæ¥åäžãããŸã â ããŒã¯ã³äœçœ®ã枩床ãã³ã³ããã¹ããŠã£ã³ããŠäœ¿çšæ³ãåºåé·ã¯åºåä¿¡é Œæ§ã«æž¬å®å¯èœãªåœ±é¿ãäžããŸãã
- éèŠãªæç€ºãæåã«é 眮ããŠãã ããã ã·ã¹ãã ããã³ããã¯åãŠãŒã¶ãŒã¡ãã»ãŒãžã®åã«åŠçãããŸããé·ãããã³ããã«æ·±ãåããããæç€ºã¯ããLost in the Middleã广ã®ããäžéãããŸããå¶çŽãšããŒã«å®çŸ©ãã·ã¹ãã ããã³ããã«é 眮ããŸãã
- 枩床ã¯ãªã³ãªãã¹ã€ããã§ã¯ãããŸããã ã³ãŒãçæãšäºå®é¢é£ã¿ã¹ã¯ã«ã€ããŠ0ãã³ã³ãã³ãçæã«ã€ããŠã¯0.5ã0.7ã1.0ãè¶ ãããšã倿§æ§ãå¢å ããŸãããå¹»èŠãªã¹ã¯ã¯å€§å¹ ã«å¢å ããŸãã
- ããŒã¯ã³æ°ã¯ã³ã¹ããšé å»¶ã«ç·åœ¢ã«åœ±é¿ããŸãã APIã®äŸ¡æ Œèšå®ã¯ããŒã¯ã³ããšã«è¡ãããŸãïŒå ¥åãšåºåïŒã100æ¥ã®100æ¥ãŠãŒã¶ãŒãæã€10,000ããŒã¯ã³ã®ã·ã¹ãã ããã³ããã¯ãå ¥åã ãã§100äžããŒã¯ã³/æ¥ãè²»çšããŸã â ææ®µã容赊ãªãå§çž®ããŸãã
- ã¢ãã«ã¯åœŒããééã£ãŠããããšããç¥ããªããã å¹»èŠã¯ããŒã¯ã³äºæž¬ã®æ§é çç¹æ§ã§ã â ã¢ãã«ã¯çµ±èšçã«å¯èœæ§ã®é«ããã®ãæ€èšŒããããã®ãã§ã¯ãªããåºåããŸããéèŠãªã¢ããªã±ãŒã·ã§ã³ã§ã¯ãåžžã«äºå®çãªäž»åŒµãæ€èšŒããŸãã
- ã³ã³ããã¹ããŠã£ã³ããŠâ 泚æå質ã 200,000ããŒã¯ã³ã®ã³ã³ããã¹ããŠã£ã³ããŠã¯ãã¢ãã«ãåãããã«200,000ããŒã¯ã³ãã¹ãŠã«æ³šæãæã£ãŠããããšãæå³ããŸãããã50,000ããŒã¯ã³ãè¶ ããããã¥ã¡ã³ãã®å Žåãå®å šãªã³ã³ããã¹ãè©°ã蟌ã¿ã®ä»£ããã«RAGã䜿çšããŠãã£ã³ãã³ã°ãèæ ®ããŠãã ããã
äžè¬çãªLLM誀解
ãããã®LLMã«é¢ãã誀解ã¯åºãæ®åããŠããããã°ãã°äžååã«èšèšãããããã³ããã«ã€ãªãããŸãã
| 誀解 | å®éã«äœãèµ·ããã | ããã³ãããšã³ãžãã¢ãªã³ã°ãžã®åœ±é¿ |
|---|---|---|
| "ã¢ãã«ãç§ã®ããã¥ã¡ã³ããèªãã§çè§£ããŸã" | ã¢ãã«ã¯ããŒã¯ã³ã·ãŒã±ã³ã¹ãåŠçããç¶ç¶ãäºæž¬ããŸã â èªãçè§£ã¯ãããŸãã | äœãæœåºããããæç€ºçã«è¿°ã¹ãŸãïŒã¢ãã«ãç®çãæšæž¬ããããšãæ³å®ããªãã§ãã ãã |
| "ã¢ãã«ã¯ç§ãã¡ã®æåŸã®äŒè©±ãèŠããŠããŸã" | ãã¹ãŠã®APIåŒã³åºãã¯ã¹ããŒãã¬ã¹ã§ãïŒå±¥æŽã¯ã³ã³ããã¹ããŠã£ã³ããŠã«æç€ºçã«å«ãŸããå¿ èŠããããŸã | ã·ã¹ãã ããã³ãããŸãã¯äŒè©±å±¥æŽã«é¢é£ãã以åã®ã³ã³ããã¹ããå«ããŸã |
| "ã¢ãã«ã¯ä»æ¥ã®æ¥ä»ãç¥ã£ãŠããŸã" | ã¢ãã«ã«ã¯ãã¬ãŒãã³ã°ã«ãããªããããã仿¥ã®æ¥ä»ãäŒããããªãéãç¥ããŸãã | æ¥ä»ã«ææãªã¿ã¹ã¯ã®ã·ã¹ãã ããã³ããã«çŸåšã®æ¥ä»ãæ¿å ¥ããŠãã ãã |
| "ããé«ã枩床=ããè³¢ãåºå" | 枩床ã¯ãµã³ããªã³ã°ã®ã©ã³ãã æ§ãå¶åŸ¡ããèœåãããã©ãŒãã³ã¹ã§ã¯ãããŸãã | ããé«ã枩床ã§ã¯ãªããåæã¿ã¹ã¯ã«ã€ããŠäœæž©åºŠïŒ0.0â0.3ïŒã䜿çšïŒåµé çãªããªãšãŒã·ã§ã³ã«é¢ããŠé«ã |
| "ã¢ãã«ã¯ç¢ºå®ã«æåãæ°ããããšãã§ããŸã" | ããŒã¯ã³å¢çã¯ãµãã¯ãŒããŠãããã§ãïŒæ£ç¢ºãªæåãŸãã¯ã¯ãŒãæ°ã¯ãã€ãã£ãæ©èœã§ã¯ãããŸãã | ã¢ãã«ã«æ£ç¢ºãªã¯ãŒãæ°ãä¿¡é Œããªãã§ãã ããïŒåŸåŠçãŸãã¯ã³ãŒãã䜿çšããŠãã ãã |
PromptQuorumã䜿çšããã¢ãã«å šäœã®æž©åºŠå¹æããã¹ãããŸã
PromptQuorumã§ãã¹ãæžã¿ â æž©åºŠ0察枩床0.9ã®åãåµé çãªããªãŒãã£ã³ã°ãGPT-4oãClaude Opus 4.7ãGemini 3.1 Proã«éä¿¡ãããšãClaude Opus 4.7ã¯æ°æž©ã®éã§åºåã®å€åãæãäœããGemini 3.1 Proã¯æãé«ãã§ãã æž©åºŠ0.9ã§ã¯ãGemini 3.1 Proã¯æž©åºŠ0ã§ã®å¹³ååºåããå¹³å34ïŒ é·ãåºåãçæããŸããã
PromptQuorumã®ãã«ãã¢ãã«ãã£ã¹ãããã䜿çšãããšãç¹å®ã®æž©åºŠã§å©çšå¯èœãªãã¹ãŠã®ã¢ãã«ã«å¯ŸããŠåæã«åããã³ãããå®è¡ããåŽæ¬¡ã«åºåãæ¯èŒã§ããŸã â ããã¯ç¹å®ã®ã¿ã¹ã¯ã®æž©åºŠèšå®ããã£ãªãã¬ãŒãããã¢ãã«ã®ããã©ã«ããä¿¡é Œãã代ããã«ãå®çšçã«ããŸãã
LLMã¢ãŒããã¯ãã£å°åå¥ã®éã
LLMã¢ãŒããã¯ãã£ãšããã©ãŒãã³ã¹ã¯ããã¬ãŒãã³ã°ããŒã¿ã®æ§æãããŒã¯ã³åæŠç¥ãå°åå šäœã®èŠå¶èŠä»¶ã«ãã£ãŠå€§ããç°ãªããŸãã ã°ããŒãã«ã¢ãã«ãå±éããããŒã ã«ãšã£ãŠããããã®éããçè§£ããããšã¯éèŠã§ãã
Qwen 3ã¯CJKã¹ã¯ãªããïŒäžåœèªãæ¥æ¬èªãéåœèªïŒã®åªããããŒã¯ã³åå¹çãéæããŠããŸã** â æšæºäžåœèªã§çŽ0.3ããŒã¯ã³/æå察GPT-4oã®0.5ããŒã¯ã³/æåããã®ããŒã¯ã³ã®40ïŒ åæžã¯ãã¢ãžã¢èšèªã®ã¢ããªã±ãŒã·ã§ã³ã®APIã³ã¹ããšé å»¶ãçŽæ¥åæžããŸããQwenã®ãã¬ãŒãã³ã°ããŒã¿ã«ã¯20ïŒ ã®CJKå«éãå«ãŸããŠãããæå察ã»ãã³ãã£ãã¯å¯åºŠãæãé«ãã¹ã¯ãªããã®ããŒã¯ã³ååšãæé©åããŸãã
Mistral 7Bããã³Mistral Largeã¯EUå±éçšã«æç€ºçã«èšèšãããŠãããGDPRããã©ã³ã¹ã®AIæ³ãããã³ããŒã¿ã¹ãã¬ãŒãžãšã¢ãã«ã®éææ§ã«é¢ããEUèŠå¶ã®ã³ã³ãã©ã€ã¢ã³ã¹ã®ããã«ãã£ã«ã¿ãŒããããã¬ãŒãã³ã°ããŒã¿ããããŸãã äž»ã«ç¡ãã£ã«ã¿WebããŒã¿ã§èšç·Žãããã¢ãã«ãšã¯ç°ãªããMistralã¯ããŒã¿ã®åºæãææžåãããã¬ãŒãã³ã°ããEUåžæ°ã®å人ããŒã¿ãé€å€ããŠããããšãŒãããã®èŠå¶ç£æ¥ïŒéè¡ãå»çãæ³åæè¡ïŒã®æšæºéžæã«ãªããŸãã
DeepSeekã®ã¢ãŒããã¯ãã£ã¯ãã¬ãŒãã³ã°æ§æã«åæ ãããŠããŸãïŒäºåèšç·ŽããŒã¿ã®70ïŒ ã¯äžåœèªãšè±èªã15ïŒ ã¯ã³ãŒãã15ïŒ ã¯ä»ã®èšèªã§ãããã®æ¯çã¯ãäžåœèªã®èšèªæµæ¢æ§ãšã³ãŒãçæé床ãåªå ããã¢ãã«ãäœæãããªãœãŒã¹è²§åŒ±èšèªã§æããã«äœãããã©ãŒãã³ã¹ããããŸããããŒã¯ã³ååžãšæ³šæãã¿ãŒã³ã¯ãè±èªã§ã¯ãªãæšæºäžåœèªã®åšæ³¢æ°ãã¿ãŒã³ã«å¯ŸããŠæé©åãããŠããŸãã
é¢é£ããèªã¿ç©
- åºç€ïŒããã³ãããšã³ãžãã¢ãªã³ã°ãšã¯ïŒ â LLMã¢ãŒããã¯ãã£ã®ç¥èãäœç³»çãªããã³ããèšèšã«é©çšããæ¹æ³
- åºç€ïŒã³ã³ããã¹ããŠã£ã³ããŠã®èª¬æ â AIãå¿ããçç± â ã³ã³ããã¹ããŠã£ã³ããŠã®å¶éãšæ€çŽ¢æŠç¥ãžã®æ·±ãæœåš
- åºç€ïŒããŒã¯ã³ãã³ã¹ãïŒå¶éïŒAIããã³ããã£ã³ã°ã®çµæžåŠ â ããŒã¯ã³äŸ¡æ Œèšå®ãã¬ãŒãå¶éãããã³GPT-4oãClaudeãGeminiå šäœã®ã³ã¹ãæé©å
- åºç€ïŒAIå¹»èŠã説æ â LLMãªãç©ãäœã â ããŒã¯ã³äºæž¬ãšäžè¶³ããäºå®æ€çŽ¢ãã©ã®ããã«ä¿¡é Œãšã©ãŒã«å°ãã
ãããã質å
LLMã¯äººéã®ããã«ããã¹ããçè§£ããŠããŸããïŒ
ããããLLMã¯äººéã®æå³ã§ããã¹ããçè§£ããŸããã圌ãã¯ããã¬ãŒãã³ã°äžã«åŠç¿ãããã¿ãŒã³ã«åºã¥ããŠã以åã®ããŒã¯ã³ã«åºã¥ããŠçµ±èšçã«æãå¯èœæ§ã®é«ã次ã®ããŒã¯ã³ãäºæž¬ããŸããçè§£ãæå³ãæèã¯ãããŸãã â ããã£ãã©ãªãŒãçŽ50,000ã100,000ããŒã¯ã³ã§ããå é確çååžã®ã¿ã
LLMã®ããŒã¯ã³ã¯äœã§ããïŒ
ããŒã¯ã³ã¯LLMãåŠçããæå°åäœã§ã â è±èªã§ã¯çŽ0.75åèªã§ãããäžåœèªãŸãã¯æ¥æ¬èªã§ã¯çŽ0.5åèªã§ããåèªãéšååèªãå¥èªç¹ãã¹ããŒã¹ã¯ãã¹ãŠããŒã¯ã³ã§ããGPT-4oã¯ãã€ããã¢ãšã³ã³ãŒãã£ã³ã°ïŒBPEïŒã䜿çšããŠããã¹ããããŒã¯ã³ã«åå²ããŸãã1,000èªã®ããã¥ã¡ã³ãã¯è±èªã§çŽ1,300ããŒã¯ã³ãçæããŸãã
LLMã®æž©åºŠã¯äœãããŸããïŒ
枩床ã¯ã¢ãã«ã確çååžãããµã³ããªã³ã°ããæ¹æ³ãã©ã³ãã ã«å¶åŸ¡ããŸããæž©åºŠ0ã¯æé«ç¢ºçããŒã¯ã³ãåžžã«éžæããŸãïŒæ±ºå®çïŒã枩床1.0ã¯ååžã«æ¯äŸããŠãµã³ãã«ã1.5ãè¶ ãããšãååžãå¹³åŠåãããå¹»èŠãªã¹ã¯ãå¢å ããŸããã»ãšãã©ã®æ¬çªã¢ããªã±ãŒã·ã§ã³ã¯0.1ãã0.7ã®éã§æé©ã«æ©èœããŸãã
ããã³ããã§æ å ±ã®äœçœ®ãéèŠãªã®ã¯ãªãã§ããïŒ
ãã©ã³ã¹ãã©ãŒããŒæ³šæã¡ã«ããºã ã¯ãã³ã³ããã¹ããŠã£ã³ããŠã®éå§ãšçµäºã§ããŒã¯ã³ã«ããå€ãã®éã¿ãä»ããäžå€®ã®ããŒã¯ã³ãã â Liu et al.ã«ãããLost in the Middleã广ãšããŠææžåããããã¿ãŒã³ïŒ2023ïŒãã2,000ãè¶ ããããŒã¯ã³ã®ããã³ããã®å ŽåãæãéèŠãªæç€ºãéå§æã«é 眮ããããŒã®å¶çŽããŠãŒã¶ãŒã¡ãã»ãŒãžã®çµããã§ç¹°ãè¿ããŸãã
RLHFã¯äœã§ããããããŠããã¯ã¢ãã«åºåã«ã©ã®ããã«åœ±é¿ããŸããïŒ
匷ååŠç¿ãã人çãã£ãŒãããã¯ïŒRLHFïŒã¯ã人éã®è©äŸ¡è ãã¢ãã«åºåãè©äŸ¡ããå ±é ¬ã¢ãã«ããããã®è©äŸ¡ã§èšç·Žããããã¹ããã¬ãŒãã³ã°ã¹ãããã§ããããŒã¹LLMã¯ãã®åŸãå ±é ¬ãæå€§åããããã«åŸ®èª¿æŽãããŸããRLHFã¯æåŠåäœãããŒã³ãæçšæ§ãã»ãã¥ãªã㣠â ããŒã¹ã¢ãŒããã¯ãã£ã«å¯ŸããŠãç°ãªãã©ãããã®ã¢ãã«ãåãããã³ããã§ç°ãªãåå¿ãããçç±ã
ã³ã³ããã¹ããŠã£ã³ããŠãšã¡ã¢ãªã®éãã¯äœã§ããïŒ
ã³ã³ããã¹ããŠã£ã³ããŠã¯ãæšè«åŒã³åºãäžã«ã¢ãã«ãèŠãããšãã§ãããã¹ãŠã®ããã¹ããã«ããŒããŠããŸã â ã·ã¹ãã ããã³ãããå±¥æŽãçŸåšã®ã¡ãã»ãŒãžãæ°žç¶çãªã¡ã¢ãªã§ã¯ãããŸãããäŒè©±ãçµãããšãã¢ãã«ã¯äœãä¿æããŸããGPT-4oïŒ128,000ããŒã¯ã³ãClaude Opus 4.7ïŒ200,000ããŒã¯ã³ãGemini 3.1 ProïŒ2,000,000ããŒã¯ã³ã
ãLost in the Middleã广ã¯äœã§ããããããŠã©ã®ããã«ãããé¿ããŸããïŒ
Stanford Universityã®ããã© Liu et al.ïŒ2023ïŒã«ãã£ãŠææžåããããLost in the Middleã广ã¯ããã©ã³ã¹ãã©ãŒããŒæ³šæãé·ãã³ã³ããã¹ãã®äžå€®ã®æ å ±ãäœç³»çã«äžéããããšã瀺ããŠããŸããåé¿ããã«ã¯ïŒã·ã¹ãã ããã³ããã«éèŠãªæç€ºãé 眮ããå ¥åã®æåã®10ã15ïŒ ã«éèŠãªã³ã³ããã¹ããä¿æãããŠãŒã¶ãŒã¡ãã»ãŒãžã®çµããã§æãéèŠãªå¶çŽãç¹°ãè¿ããŸããã50,000ããŒã¯ã³ä»¥äžã®ããã¥ã¡ã³ãå Žåãå®å šãªã³ã³ããã¹ãè©°ã蟌ã¿ã®ä»£ããã«RAGã䜿çšããŠãã ããã
RLHFãšConstitutional AIã¯ã©ã®ããã«ç°ãªããŸããïŒ
RLHFã¯ã人éã®è©äŸ¡è ãã¢ãã«åºåãè©äŸ¡ããå ±é ¬ã¢ãã«ãèšç·ŽãããLLMããã®å ±é ¬ãæå€§åããããã«åŸ®èª¿æŽããããã¹ããã¬ãŒãã³ã°æè¡ã§ããConstitutional AIïŒClaudeã®Anthropicã«ãã£ãŠïŒã¯ãã¢ãã«ã®åäœãã¬ã€ãããæžã蟌ã¿ã®ååïŒãæ²æ³ãïŒã®ã»ããã§RLHFãæ¡åŒµããŸã â ããã«ããããšããžã±ãŒã¹ã§äººçãã£ãŒãããã¯ãžã®äŸåæ§ãäœäžããŸãã
ã¢ãŒããã¯ãã£ã®GPT-4oãClaudeãGeminã¯ã©ã®ããã«ç°ãªããŸããïŒ
3ã€ã¯ãã¹ãŠãã©ã³ã¹ãã©ãŒããŒããŒã¹ã®LLMã§ãããã¹ã±ãŒãªã³ã°ãã³ã³ããã¹ããŠã£ã³ããŠããã¹ããã¬ãŒãã³ã°ãç°ãªããŸããGPT-4oïŒOpenAIïŒïŒ128,000ããŒã¯ã³ãClaude Opus 4.7ïŒAnthropicïŒïŒ200,000ããŒã¯ã³ãConstitutional AIã䜿çšããŸããGemini 3.1 ProïŒGoogle DeepMindïŒïŒ2,000,000ããŒã¯ã³ããããã®éãã¯ã³ã¹ããé å»¶ãé©åæ§ã«åœ±é¿ãäžããŸã â GPT-4oã¯æšè«ã§èŒããé·ã³ã³ããã¹ãã§ã¯ClaudeãGeminã¯éåžžã«é·ãããã¥ã¡ã³ãåŠçã«é©ããŠããŸãã
1,000æåã®ããã¹ãã«ã¯ããã€ã®ããŒã¯ã³ããããŸããïŒ
è±èªã§ã¯ã1,000èªã¯çŽ1,300â1,350ããŒã¯ã³ã«çžåœããŸããçŽ1ããŒã¯ã³= 0.75åèªãäžåœèªãŸãã¯æ¥æ¬èªïŒ1ããŒã¯ã³â0.5åèª â 1,000ã®äžåœèªåèªâ2,000ããŒã¯ã³ãããŒã¯ã³æ°ã¯APIã³ã¹ããšã³ã³ããã¹ããŠã£ã³ããŠæ¶è²»ã«çŽæ¥åœ±é¿ããŸãã
枩床ãšãããpã®éãã¯äœã§ããïŒ
枩床ã¯å šäœã®ç¢ºçååžãéããŸãã¯å¹³åŠåããŸã â æž©åºŠ0 =決å®çãæž©åºŠ1.0 =æšæºã枩床2.0 =éåžžã«ã©ã³ãã ããããpïŒæ žãµã³ããªã³ã°ïŒã¯ã环ç©ç¢ºçãpã«éããæå°ã® ããŒã¯ã³éåã«ãµã³ããªã³ã°ãå¶éããŸããã»ãšãã©ã®ã¿ã¹ã¯å Žåã¯æž©åºŠã§ã¯ãªããããpã調æŽããããšããå§ãããŸãïŒ0.8â0.95ïŒïŒæž©åºŠã¯åµé æ§ãå¶åŸ¡ããã®ã«æé©ã§ãã
ãœãŒã¹ãšè©³çްèªã¿ç©
- Vaswani et al.ã2017ããæ³šæã¯ãã¹ãŠãå¿ èŠã§ãã â ã»ã«ãã¢ãã³ã·ã§ã³ã¡ã«ããºã ãå°å ¥ããå ã®TransformerããŒããŒããã¹ãŠã®çŸä»£çãªLLMã®åºç€
- Liu et al.ã2023ããLost in the MiddleïŒèšèªã¢ãã«ãé·ãã³ã³ããã¹ããã©ã®ããã«äœ¿çšãããã â ã¹ã¿ã³ãã©ãŒãç ç©¶ã¯ãé·ã³ã³ããã¹ãLLMã®äœçœ®äŸå泚æãã€ã¢ã¹ãææžåããŠããŸã
- Ouyang et al.ã2022ãã人çãã£ãŒãããã¯ã§æç€ºã«åŸãããã«ã¢ãã«ããã¬ãŒãã³ã°ããããšã â GPT-3ã«RLHFãå°å ¥ããInstructGPTããŒããŒãChatGPTãšææ°ã®ã¢ã©ã€ã³æžã¿LLMã®åºç€
- OpenAIãããŒã¯ãã€ã¶ãŒããã¥ã¡ã³ããŒã·ã§ã³ â ããŒã¯ã³èšæ°ãšGPTã¢ãã«ã®ããŒã¯ã³åã®ä»çµã¿ãžã®å¯Ÿè©±çãªã¬ã€ã
- Touvron et al.ã2023ããLlama 2ïŒãªãŒãã³åºç€ãšåŸ®èª¿æŽãã£ããã¢ãã«ã â LLaMA-2ã¢ãŒããã¯ãã£ããã¬ãŒãã³ã°ãã€ãã©ã€ã³ãInstruction-Tuningã®æ¹æ³è«ã«ã€ããŠã®Metaã®å æ¬çãªããŒããŒ
- AnthropicãConstitutional AIïŒAIãã£ãŒãããã¯ããã®ç¡å®³æ§ â çŽç²ãªRLHFã®ä»£æ¿ãšããŠãã¢ãã«åäœãã¬ã€ãããããã®ãæ²æ³ãã䜿çšããããšã«ã€ããŠã®Anthropicã®ç ç©¶
- HuggingFaceãããŒã¯ãã€ã¶ãŒã©ã€ãã©ãªïŒèŠçŽ â BPEãWordPieceãSentencePieceããã®ä»ã®ææ°LLMããŒã¯ã³åã¢ã«ãŽãªãºã ãžã®æè¡çãªæ·±ãæŽå¯
- Google DeepMindãGemini 1.5æè¡ã¬ããŒã â 100äžããŒã¯ã³ã³ã³ããã¹ããŠã£ã³ããŠãæã€ããã³ãã£ã¢ã¢ãã«ã®ã¢ãŒããã¯ãã£ãšããã©ãŒãã³ã¹åæ
- EleutherAIãGPT-NeoX-20BïŒãªãŒãã³ãœãŒã¹ã®èªå·±ååž°èšèªã¢ãã« â ãªãŒãã³ãœãŒã¹ã¢ãã«ãã¬ãŒãã³ã°ããã¥ã¡ã³ããŒã·ã§ã³ããã³LLMéçºã§ã®å»ºç¯æ±ºå®ã®åæ
- OpenAIãæ§é åç¶æ 空éã¢ãã«ã§ã»ã°ã¡ã³ãåãæ³šæãäºæž¬ããããšã§èšèªã¢ãã«ãæ¹åããŸã â å¹ççãªé·ã³ã³ããã¹ãåŠçã®ããã®çŽç²ãªTransformer泚æãžã®å¥æ¡ã«ã€ããŠã®ç ç©¶