RAGãšã¯äœã
ð In One Sentence
RAG ã¯ç¥èããŒã¹ããé¢é£ããã¥ã¡ã³ããååŸãããããã質åãšå ±ã«LLMã«æäŸããŸããã¢ãã«ã¯æšæž¬ã§ã¯ãªããããªãã®ããŒã¿ããå¿çããŸãã
ð¬ In Plain Terms
RAG ãªã = ã¯ããŒãºãããã¯è©ŠéšïŒã¢ãã«ã¯èšæ¶ããçããã詳现ãçºæå¯èœïŒãRAG ãã = ãªãŒãã³ããã¯ïŒã¢ãã«ã¯æåã«ããŒããåç §ïŒãããŒãã誀èªããŠããå°ãªããšãäºå®ãçºæããªãã
RAG ã¯é¢é£æ å ±ãèŠã€ããã¬ããªããŒãšããã®æ å ±ã䜿ã£ãŠæçµåçãå·çãããžã§ãã¬ãŒã¿ãçµã¿åãããŸãã ã¬ããªããŒã¯ãŠãŒã¶ãŒã¯ãšãªã«åºã¥ããŠç¥èããŒã¹ïŒã€ã³ããã¯ã¹ä»ã PDFããŠã§ãããŒãžãå éšããã¥ã¡ã³ãïŒãæ€çŽ¢ããŸãããžã§ãã¬ãŒã¿ã¯ããããååŸãããéè·¯ãèªã¿ããã®ã³ã³ãã³ããåŒçšãŸãã¯åæ ããåçãçæããŸãã
ãªãRAGãéèŠã
**RAG 㯠ãã«ã·ããŒã·ã§ã³ ãåæžããåçãææ°ã«ä¿ã€ããéèŠã§ãã** çŽç²ãªèšèªã¢ãã«ã¯å°éçãŸãã¯ææ°ã®è©±é¡ã§èªä¿¡æºã ã«è©³çްãäœãåºãããšãã§ããŸããRAG ã䜿ãã°ãåçã¯ããªãã管çããããã¥ã¡ã³ãã«åºå®ãããŸãã
ãã©ã€ãã·ãŒãšã¬ããã³ã¹ã«ãããŠãéèŠã§ããæ©å¯ããŒã¿ã§ã¢ãã«ããã¬ãŒãã³ã°ãã代ããã«ããã®ããŒã¿ãèªç€Ÿã¹ãã¢ã«ä¿ç®¡ããã¯ãšãªæã«é¢é£ã¹ããããã®ã¿ãã¢ãã«ã«äŸçµŠã§ããŸããã¢ãã«ã¯ããªãã®ã³ã³ãã³ãã«ã€ããŠæšè«ããŸãããæ°žç¶çã«åžåããããšã¯ãããŸããã
ååŸãããææžãã€ã³ãã©å€ã«åºããªãå ŽåãRAG ãã€ãã©ã€ã³å šäœãèªåã®ããŒããŠã§ã¢ã§åããããšãã§ããŸããGDPR 察å¿ã®ã¢ãŒããã¯ãã£ãç£æ»ãã°ããããã€ãã¿ãŒã³ã«ã€ããŠã¯ãæ¥åããŒã¿ã®ããã®ããŒã«ã« RAGãåç §ããŠãã ããã
RAGã·ã¹ãã ã®ä»çµã¿
å žåç㪠RAG ã·ã¹ãã 㯠4 ã€ã®äž»æ®µéãå®è¡ïŒååŸã玢åŒåãæ€çŽ¢ãçæã åæ®µéã¯ç¬ç«ããŠèª¿æŽå¯èœã§ãã
ãã®ãã€ãã©ã€ã³ãèªåã® PDF ãšããŒã«ã«ã¢ãã«ã§åããæé ã«ã€ããŠã¯ãèªåã® PDF ã§ããŒã«ã« RAG ãã¹ããããã€ã¹ãããã§åãããåç §ããŠãã ããã
- 1ååŸïŒããã¥ã¡ã³ãïŒPDFãç¥èããŒã¹èšäºããã±ãããã³ãŒãïŒãèªã¿èŸŒã¿ããã£ã³ã¯åãã¡ã¿ããŒã¿ïŒã¿ã€ãã«ãæ¥ä»ãäœæè ãã¿ã°ïŒãéå±ãããããšãã§ããŸãã
- 2玢åŒåïŒåãã£ã³ã¯ãåã蟌ã¿ã¢ãã«ã§ãã¯ãã«è¡šçŸã«å€æãããã¯ãã«ããŒã¿ããŒã¹ãŸãã¯æ€çŽ¢ã€ã³ããã¯ã¹ã«ä¿åãæ°ããã¯ãšãªã«å¯ŸããŠã»ãã³ãã£ãã¯ã«é¡äŒŒããã³ã³ãã³ããæ€çŽ¢ã
- 3æ€çŽ¢ïŒãŠãŒã¶ãŒã質åãå ¥åãããšãã·ã¹ãã ã¯ã¯ãšãªããã¯ãã«åããŠæãé¢é£ã®ãããã£ã³ã¯ãã€ã³ããã¯ã¹ããååŸããã£ã«ã¿ïŒæ¥ä»ç¯å²ãããã¥ã¡ã³ãçš®é¡ããŠãŒã¶ãŒããŒããã·ã§ã³ïŒããã®æ®µéã§é©çšå¯èœã
- 4çæïŒã·ã¹ãã ã¯ãŠãŒã¶ãŒã®è³ªåãšååŸããããã£ã³ã¯ãå«ãããã³ãããæ§ç¯ããŠèšèªã¢ãã«ã«éä¿¡ãã¢ãã«ã¯æäŸãããã³ã³ããã¹ããšäžè²«æ§ã®ããåçãçæã
ð æ€çŽ¢ãããã«ããã¯
RAG ã®å質㯠80%ãæ€çŽ¢ã«äŸåãåªããã¬ããªããŒãšåŒ±ãã¢ãã« = 匱ãã¬ããªããŒãš GPT-4o ããè¯ãçµæã玢åŒåãšãã£ã³ã¯åã®ãã¥ãŒãã³ã°ã«æéããããŠãã ããã
RAG vs ãã¡ã€ã³ãã¥ãŒãã³ã°
**RAG ãš ãã¡ã€ã³ãã¥ãŒãã³ã° ã¯ç°ãªãåé¡ã解決ããçµã¿åãããæé©ã§ãã** æåã« RAG ããå§ããŸããããã³ããã§ã¯å®çŸã§ããªãäžè²«ããåäœå€æŽãå¿ èŠãªå Žåã®ã¿ãã¡ã€ã³ãã¥ãŒãã³ã°ã远å ããŠãã ããã
| èŠçŽ | RAG | ãã¡ã€ã³ãã¥ãŒãã³ã° |
|---|---|---|
| ã¯ãšãªæååŸ | ããã¥ã¡ã³ããã | ãã¬ãŒãã³ã°æã«ãã©ã¡ãŒã¿ã«çµã¿èŸŒã¿ |
| ããŒã¿ã®é®®åºŠ | ãªã¢ã«ã¿ã€ã | éç |
| æ©å¯ããŒã¿ | ã€ã³ãã©ã«çãŸã | ã¢ãã«éã¿ã«åžå |
| ãã¬ãŒãµããªã㣠| ããã¥ã¡ã³ãè¿œè·¡å¯ | åºåŠãªã |
| æŽæ°ã³ã¹ã | äœ | é« |
| ã¹ã¿ã€ã«å€æŽ | äžå¯ | å¯èœ |
| æé©çšé | å€åããŒã¿ | å®å®åäœ |
| ãŠãŒã¹ã±ãŒã¹ | Q&A | æ³åŸææž |
ð RAG åªå ããã®åŸãã¡ã€ã³ãã¥ãŒãã³ã°
RAG ã¯ç¥èã远å ïŒå¯éïŒãã¯ãã«ã¹ãã¢æŽæ°ïŒããã¡ã€ã³ãã¥ãŒãã³ã°ã¯åäœã倿ŽïŒæ°žç¶ïŒåèšç·ŽïŒãã³ã³ãã³ãã«ã¯åžžã« RAG ã䜿çšãã¹ã¿ã€ã«/ããŒã³ã«ã®ã¿ãã¡ã€ã³ãã¥ãŒãã³ã°ã
äŸïŒRAG ãªã vs RAG ãã
RAG ã®å©ç¹ã¯èšæ¶ã ãã®åçãšååŸããã¥ã¡ã³ãããŒã¹ã®åçãæ¯èŒãããšæç¢ºã«ãªããŸãã 以äžã¯å éšããªã·ãŒè³ªåã®æŠå¿µçãªäŸã§ãã
æªãäŸ â RAG ãªã
"åœç€Ÿã®åºåŒµæ è²»åéããªã·ãŒã¯ïŒ"
ã¢ãã«ã¯äžè¬çãã¿ãŒã³ã«åºã¥ããŠæšæž¬ããçµç¹ã«ãã£ãŠèª€ãã®å¯èœæ§ã
è¯ãäŸ â RAG ãã
"ããªãã¯åœç€Ÿã®å éšããªã·ãŒè³ªåã«çããã¢ã·ã¹ã¿ã³ãã§ãã以äžãé¢é£ããªã·ãŒæç²ã§ãïŒ ...ååŸãããããªã·ãŒããã¹ããã£ã³ã¯... 次ã®è³ªåã«ãããã®æç²ã«åºã¥ããŠã®ã¿åçããŠãã ããïŒãåœç€Ÿã®åºåŒµæ è²»åéããªã·ãŒã¯ïŒãæç²ã«ãªãå Žåã¯ãæå®ãããŠããŸããããšè¿°ã¹ãŸãã"
ãã®å Žåãã¢ãã«ã¯å®éã®ããªã·ãŒããã¥ã¡ã³ãã«åºå®ãããæ å ±ãæ¬ ããŠããæã®å¯Ÿå¿ãæç¢ºã§ãã
ãã¯ãã«ããŒã¿ããŒã¹æ¯èŒè¡š
æ£ãããã¯ãã«ããŒã¿ããŒã¹ãéžã¶ããšã¯ãã€ã³ãã©ãã¬ã€ãã³ã·å¶çŽãã³ã³ãã©ã€ã¢ã³ã¹èŠä»¶ã«äŸåããŸãã 以äžã¯ 6 ã€ã®äž»èŠéžæè¢ã§ãã
| ããŒã¿ããŒã¹ | ã¿ã€ã | æé©çšé | EU ããŒã¿æ ç¹ | ã»ã«ããã¹ã | æŠç®ã³ã¹ã |
|---|---|---|---|---|---|
| Pinecone | ãããŒãžã ãã¯ãã«ïŒã¯ã©ãŠãïŒ | çŽ æ©ããããã¿ã€ãã³ã°ãMVPãéçšè² è·æå° | ã¯ããeu-west-1 | ããã | æ 100ïœ1,000â¬ïŒäœ¿çšéå¥ïŒ |
| Weaviate | ãã¯ãã« ãªãŒãã³ãœãŒã¹ | ãšã³ã¿ãŒãã©ã€ãºãããã€ã¡ã³ãããã€ããªããæ€çŽ¢ | ã¯ããã»ã«ããã¹ã | ã¯ãïŒKubernetesïŒ | ç¡æ + ã€ã³ãã©ïŒå¹Ž 500ïœ5,000â¬ïŒ |
| Chroma | ãã¯ã㫠軜é | ãããã¿ã€ããããŒã«ã«ã¢ããªã±ãŒã·ã§ã³ãã㢠| ã¯ããããŒã«ã« | ã¯ãïŒPythonïŒ | ç¡æ |
| Milvus | ãã¯ãã« é«æ§èœ | æ°çŸäžãã¯ãã«ã<100ms é å»¶ | ã¯ããã»ã«ããã¹ã | ã¯ãïŒKubernetesãDockerïŒ | ç¡æïŒãªãŒãã³ãœãŒã¹ïŒãŸãã¯æ 500ïœ2,000â¬ïŒãµããŒãïŒ |
| Qdrant | ãã¯ãã« ææ° Rust | é«åºŠãªãã£ã«ã¿ãªã³ã° + ãã¯ãã«ã髿§èœ | ã¯ããã»ã«ããã¹ã | ã¯ã | ç¡æãŸãã¯æ 500ïœ2,000â¬ïŒã¯ã©ãŠãïŒ |
| pgvectorïŒPostgreSQLïŒ | PostgreSQL æ¡åŒµ | ãã¯ãã« + SQL ã¯ãšãªãã€ã³ãã©ç°¡çŽ å | æ¢å PostgreSQL ãäœ¿çš | ã¯ã | ç¡æïŒæ¡åŒµïŒ+ æ¢å PostgreSQL |
ãã«ãã¢ãã«ã¯ãŒã¯ãããŒã§ã®RAG
RAG ã¯è€æ°ã¢ãã«ãšæ§é åããã³ããã£ã³ã°ãšçµã¿åããããšããã«åŒ·åã«ãªããŸãã
- ããã¥ã¡ã³ãåã蟌ã¿ã»æ€çŽ¢ã« 1 ã€ã®ã¢ãã«ãåççæã«å¥ã®ã¢ãã«äœ¿çšã
- chain-of-thought ã TRACE ã®ãããªæšè«éç¹ããã³ãããååŸã³ã³ããã¹ãã®äžã«é©çšã
- è€æ°ã¢ãã«ã§åã RAG ããã³ãããå®è¡ããŠãåã¢ãã«ãããã¥ã¡ã³ããã©ã掻çšããããæ¯èŒã
ð åãããã¥ã¡ã³ããç°ãªãåç
åã RAG ããã³ããã GPT-4oãClaude Opus 4.7ãGemini 2.0 Pro ã§åããã¯ãã«ã¹ãã¢ã«å¯ŸããŠå®è¡ããŠã¿ãŠãã ãããé·ãã»ã¹ã¿ã€ã«ã»ã³ã³ããã¹ã掻çšãç°ãªããŸããPromptQuorum ã¯åãã¯ãšãªãè€æ°ã¢ãã«ã«ã«ãŒãã£ã³ã°ããŠæ¯èŒãå¯èœã«ããŸãã
ãã®ã¢ãžã¥ã©ãªãã£ã RAG ã®æå€§å©ç¹ïŒæ€çŽ¢æ©ãã€ã³ããã¯ã¹ãçææ©ãããã³ãããåå¥ã«ã¢ããã°ã¬ãŒãå¯èœã§ã·ã¹ãã å šäœã®åæ§ç¯äžèŠã
èŠå¶ç°å¢ã«ãããRAG
RAG ã¯æ©å¯ããŒã¿ãæ±ãããå Žé¢ã§æšå¥šããããã¢ãã«ãã©ã¡ãŒã¿ã«æ±ºããŠå ¥ããªãããã
EU / GDPR : å人ããŒã¿åŠçãã EU äŒæ¥åãã®æšå¥šã¢ãŒããã¯ãã£ãããã¥ã¡ã³ãã¯ã€ã³ãã©ã«çãŸããã¯ãšãªæã«é¢é£ã¹ããããã®ã¿ LLM ã«æž¡ããããå人ããŒã¿å€éšã¢ãã«ãããã€ãã«æªéä¿¡ãGDPR 第 46 æ¡èªåæºè¶³ãEU AI æ³ç¬¬ 11 æ¡ã¯ç¥èæºã®èšé²ãèŠæ±âããŒãžã§ã³ç®¡çããã¥ã¡ã³ãä¿åã® RAG ãçŽæ¥æºããããã€ã BSI ã¬ã€ãã©ã€ã³ã¯æ©å¯ããŒã¿åŠçã«ããŒã«ã«ãŸãã¯èªç€Ÿãã¹ããã¯ãã«ããŒã¿ããŒã¹æšå¥šã
æ¥æ¬ïŒMETIïŒïŒ AI çµ±èšå€æã«äœ¿çšããŒã¿ãœãŒã¹ãèšé²ãããšããèŠæ±ããã¥ã¬ãŒãã»ããŒãžã§ã³ç®¡çããã¥ã¡ã³ãæã€ RAG ãæ£ç¢ºã«ãã®ç£æ»èšŒè·¡ãçæâååçãã¯ãšãªæååŸç¹å®ããã¥ã¡ã³ãã«è¿œè·¡å¯èœãæ¥æ¬äŒæ¥ãããã€ã¡ã³ãã¯éåžž RAG ãããŒã«ã«æšè«ïŒOllama çµç± LLaMAïŒãšåããããã¹ãŠããŒã¿ã瀟å ã€ã³ãã©ã«çãŸããã確ä¿ã
äžåœïŒCACïŒïŒ CAC çæ AI ãµãŒãã¹èŠå®ïŒ2023ïŒã¯ãããã¯ã·ã§ã³ AI ã·ã¹ãã 䜿çšåã«æ€çŽ¢ããŒã¿ãœãŒã¹èšé²ã»å¯©æ»ãèŠæ±ãæ¿èªãããåœå ãœãŒã¹æã€ RAG ã¢ãŒããã¯ãã£ãæšæºæºæ ã¢ãŒããã¯ãã£ããã¯ãã«ããŒã¿ããŒã¹ãããã€ããäžåœããŒã¿ã»ãã¥ãªãã£æ³ïŒæ°æ®å®å šæ³ïŒã®ããŒã¿ããŒã«ã©ã€ãŒãŒã·ã§ã³èŠä»¶ãæºããããšã確èªã
ããããééã
â ã¢ãã«ãæ¢ã«ç¥ã£ãŠããç¥èã« RAG ã䜿çš
Why it hurts: ã¢ãã«ãæ£ç¢ºã«ç¥ã£ãŠããã³ã³ãã³ãïŒäŸïŒPython æšæºã©ã€ãã©ãªïŒååŸã¯ãããŒã¯ã³æ¶è²»ãšé å»¶ãå¢ãã ãã§äŸ¡å€ããªã
Fix: RAG ããã¡ã€ã³åºæã»ç¬å çã»ææ°ããŒã¿ã«å¶éããŠãã ãããäžè¬ç¥è㯠LLM ãã©ã¡ãŒã¿ã«ååã
â ãã£ã³ã¯åãå°ããããïŒ<100 åèªïŒ
Why it hurts: åšå²ã³ã³ããã¹ãåªå€±ãããªã·ãŒæã¯é£æ¥æ®µèœãªãã§ã¯ææ§ãã¢ãã«ãæ£ç¢ºã«çè§£ã§ããªã
Fix: 200ïœ500 åèªãããŒã¹ã©ã€ã³ã«ãããªãã®ãã¡ã€ã³ã®ä»£è¡šã¯ãšãªã§å®éã«ãã¹ãããŠãã ããã
â é¢é£æ§ãããå€ããªã
Why it hurts: ãã¹ãŠã®ååŸããã¥ã¡ã³ãã LLM ã«æž¡ãããã®ã§ç¡é¢é£ã³ã³ããã¹ãã§ã¢ãã«ã«äœæ¥ããããã€ãºãšèª€ãå¢å
Fix: æå°ã¹ã³ã¢èšå®ïŒ>0.7 ã³ãµã€ã³é¡äŒŒåºŠãéŸå€ãè¶ ããªãããã¥ã¡ã³ãã¯ãç¥èããŒã¹ã«èŠã€ãããŸããããè¿ãã
â æ€çŽ¢å質ãšçæå質ãåé¢ãã¹ãããªã
Why it hurts: åçãééã£ãŠããŠãåå ïŒæ€çŽ¢ã§ããã¥ã¡ã³ãååŸå€±æ vs çæã§ã¢ãã«æšè«å€±æïŒãäžæ
Fix: ãã«ãã€ãã©ã€ã³è©äŸ¡åã« 20+ 代衚ã¯ãšãªã§æ€çŽ¢æ©ãç¬ç«ãã¹ãããŠãã ããã
â ã¡ã¿ããŒã¿ïŒæ¥ä»ãææè ãæš©éïŒãã£ã«ã¿ã䜿çšããŠããªã
Why it hurts: å€§èŠæš¡ã¹ãã¢ã§ãæ¥ä»/éšé/æš©éãã£ã«ã¿ãªãã§ã¯é³è åã»æ©å¯ã»ã¢ã¯ã»ã¹äžå¯ã³ã³ãã³ããè¿ã
Fix: ååŸæã«ã¡ã¿ããŒã¿ãéå±ãæ€çŽ¢æã«ãã£ã«ã¿ãé©çšãæš©éããŒã¹ã¢ã¯ã»ã¹å¶åŸ¡ïŒRBACïŒãå®è£ ã
RAGã®å®è£ æ¹æ³
- 1AI ãå¿çããå¿ èŠãããç¥èãœãŒã¹ãç¹å®ïŒããã¥ã¡ã³ããPDFãããŒã¿ããŒã¹ãAPIïŒã 2026 幎 4 æçŸåšãæãäžè¬çãªãœãŒã¹ã¯å éš PDFãç¥èããŒã¹èšäºã補åããã¥ã¡ã³ããã«ã¹ã¿ããŒãµããŒãïŒFAQã補åããã¥ã¡ã³ããéå»ãã±ãã解決ãç ç©¶ïŒè«æãªããžããªãå€éšããŒã¿ããŒã¹ã
- 2éçããã¥ã¡ã³ãããã¯ãã«ããŒã¿ããŒã¹ïŒPineconeãWeaviateãChromaãMilvusïŒã§æ€çŽ¢å¯èœãªåã蟌ã¿ã«å€æã ããã»ã¹ã¯ããã¥ã¡ã³ããæ®µèœãŸãã¯æã«ãã£ã³ã¯åãåãã£ã³ã¯ãæå³ã®æ°å€è¡šçŸã«å€æãé«éã»ãã³ãã£ãã¯æ€çŽ¢ã«ä¿åã
- 3ã¯ãšãªæïŒ(1) ãŠãŒã¶ãŒè³ªåããã¯ãã«åã(2) æãé¡äŒŒããã¥ã¡ã³ãååŸã(3) ååŸããã¥ã¡ã³ããšè³ªåã LLM ã«æž¡ãã äŸïŒãŠãŒã¶ãŒãããã¹ã¯ãŒãããªã»ããïŒãâ ã·ã¹ãã ãé¢é£ FAQ/ããã¥ã¡ã³ãæ€åº â LLM ãèšç·ŽããŒã¿ã§ãªãããã¥ã¡ã³ãåºç€ã®åççæã
- 4å€§èŠæš¡ããã¥ã¡ã³ãéåïŒ100+ ããŒãžïŒã«ã€ããŠããã£ã³ã¯åæŠç¥ãå®è£ ïŒ200ïœ500 åèªãã£ã³ã¯ããªãŒããŒã©ããä»ãã çè§£ãšæ€çŽ¢ç²ŸåºŠã®ãã©ã³ã¹ã代衚ã¯ãšãªã§ ãã£ã³ã¯ãµã€ãºããã¹ãã
- 5ååŸããã¥ã¡ã³ããå®éã«åçãå«ãããšãçæåã«æ€èšŒã ååŸãæªããã°ã¢ãã«ãæªããé¢é£æ§ãããå€ã䜿çšïŒé¢é£åºŠã¹ã³ã¢è¶ éæã®ã¿ååŸããã¥ã¡ã³ãã LLM ã«æž¡ãïŒäŸïŒ>0.7 ã³ãµã€ã³é¡äŒŒåºŠïŒã
ð ãã€ããªããæ€çŽ¢ã®å©ç¹
ãã¯ãã«æ€çŽ¢ïŒã»ãã³ãã£ãã¯ïŒ+ BM25ïŒããŒã¯ãŒãïŒãçµã¿åãããŠãã ãããWeaviate ãš Qdrant ã¯ãã€ãã£ããµããŒããã顧客å¥çŽ2024ããšããã¯ãšãªïŒã»ãã³ãã£ãã¯ã段èœããBM25 ã幎å·ãæãããåãã㊠= åªãããªã³ãŒã«ã
é¢é£è³æ
- ããã³ãããšã³ãžãã¢ãªã³ã°ãšã¯ïŒ â RAG ååŸã³ã³ããã¹ãã§åãããã³ããèšèšåºç€
- Chain-of-Thought ããã³ããã£ã³ã° â RAG ã§ååŸããã¥ã¡ã³ãäžã§è€éãªãã«ãã¹ãããæšè«ã« Chain-of-Thought ãçµã¿åãã
- ããã³ããã€ã³ãžã§ã¯ã·ã§ã³ïŒã»ãã¥ãªã㣠â ååŸããã¥ã¡ã³ãçµç±ã®éæ¥ããã³ããã€ã³ãžã§ã¯ã·ã§ã³ã¯ RAG ã®æåã®ã»ãã¥ãªãã£ãªã¹ã¯
- ãªãŒãã³ãœãŒã¹ vs å°æ LLM â RAG ãã€ãã©ã€ã³ã®çææ®µéåãã«ã¯ã©ãŠããšããŒã«ã« LLM ããéžæ
- AI ã®éçâLLM ãã§ããªãããš â LLM ç¥èã«ãããªããšãã«ã·ããŒã·ã§ã³éçãå æããã®ã« RAG ããªãå¿ èŠããçè§£
- å質ãã§ãã¯ãæ§ç¯ â RAG åºåæ€èšŒãã¿ãŒã³ïŒãã«ã·ããŒã·ã§ã³æ€åºããœãŒã¹æ€èšŒãå«ã
ãããã質å
RAGïŒRetrieval-Augmented GenerationïŒãšã¯ïŒ
å¿ççæåã«ç¥èããŒã¹ããé¢é£ããã¥ã¡ã³ããååŸããAIæè¡ãèšç·Žç¥èã«äŸåã§ãªããããªããæäŸã»ç®¡çããããã¥ã¡ã³ãã«åºã¥ãã
RAG ããã«ã·ããŒã·ã§ã³ãæžããä»çµã¿ïŒ
ååŸããã¹ãã«åçãåºå®ãããã³ããã¯ã¢ãã«ã«æäŸæç²ã®ã¿ããå¿çãæ å ±ãªããã°è¡šç€ºãæç€ºãèšç·Žç¥èäžè¶³ãªè©±é¡ã§ã®äœè©±åæ©ãæé€ã
RAG ãšãã¡ã€ã³ãã¥ãŒãã³ã°ã®éãïŒ
RAG ã¯ã¯ãšãªæã«å€éšç¥èååŸã»è¿œå ããã¡ã€ã³ãã¥ãŒãã³ã°ã¯ãã©ã¡ãŒã¿æ°žç¶å€æŽãRAG ã¯å€åããŒã¿åãããã¡ã€ã³ãã¥ãŒãã³ã°ã¯äžè²«åäœåãã
2026幎æè¯ã®ãã¯ãã«ããŒã¿ããŒã¹ïŒ
PineconeïŒãããŒãžãïŒãWeaviateïŒãªãŒãã³ïŒãChromaïŒè»œéïŒãMilvusïŒãšã³ã¿ãŒãã©ã€ãºïŒãEUïŒèªç€Ÿãã¹ãæšå¥šã
æé©ãã£ã³ã¯ãµã€ãºïŒ
200ïœ500åèªã10ïœ20%ãªãŒããŒã©ããã»ãšãã©æå¹ã<100ïŒæèåªå€±ã>1000ïŒç²ŸåºŠäœäžãããªãã®ã¯ãšãªãã¹ãã
OllamaïŒããŒã«ã«ïŒã§ RAG å¯èœïŒ
ã¯ããã¢ãã«éäŸåãLLaMA 3.1ãMistral ããŒã«ã« Ollama/LM Studio ã§ïŒããŒã¿ã¯èªç€ŸããŒãã
GPT-4oãClaudeãGemini ã§ RAGïŒ
ã¯ãã3ã€å šãŠååŸã³ã³ããã¹ãåå ¥ãClaude Opus 4.7ïŒæèäžè¶³ãããå ±åãGPT-4oïŒå¯éæèããç°¡æœåçã
é¢é£æ§ãããå€ãšã¯ïŒ
æå°é¡äŒŒåºŠã¹ã³ã¢ã0.7ã³ãµã€ã³=70%以äžã»ãã³ãã£ãã¯äžèŽå¿ é ã以äžïŒãèŠã€ãããŸãããé¿çã
RAG ã倧ã³ã³ããã¹ããŠã£ã³ããŠããåªãïŒ
å€§èŠæš¡éåïŒã¯ããæ°çŸäžããã¥ã¡ã³ããããªç§åäœæ€çŽ¢ãé¢é£ãã£ã³ã¯ã®ã¿æž¡ãããäœã³ã¹ã/ã¯ãšãªã
RAG çµç±ã®ããã³ããã€ã³ãžã§ã¯ã·ã§ã³é²æ¢ïŒ
ååŸå 容ãæç€ºãšããŠä¿¡é Œããªããããã³ããå ã«æç¢ºåºåããååŸå 容ã¯ãã©ãŒãããïŒåºåŠæ€èšŒããŠããå«å ¥ã
ãããã¯ã·ã§ã³ RAG ãã€ãã©ã€ã³ïŒ
ååŸã现ååãåã蟌ã¿ãæ€çŽ¢ããã£ã«ã¿ãããã³ããæ§ç¯ãçæãåŒçšå ä»ãåçãåæ®µéç¬ç«ãã¹ãå¯ã
ãã¯ãã«DB ãªã RAGïŒ
å°èŠæš¡ãªãå¯ãBM25 ããŒã¯ãŒãæ€çŽ¢ã§ <10,000 ãã£ã³ã¯æ©èœãå€§èŠæš¡ã»ãã³ãã£ãã¯é¡äŒŒåºŠã¯ DB å¿ é ã
ãœãŒã¹
- Lewis, P., et al. (2020). "Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks." NeurIPS 2020. https://arxiv.org/abs/2005.11401
- Gao, Y., et al. (2023). "Retrieval-Augmented Generation for Large Language Models: A Survey." arXiv:2312.10997. https://arxiv.org/abs/2312.10997
- Guu, K., et al. (2020). "REALM: Retrieval-Augmented Language Model Pre-Training." ICML 2020. arXiv:2002.08909. https://arxiv.org/abs/2002.08909
- OpenAI. (2024). "Retrieval and Augmentation in Language Models." Platform documentation. https://platform.openai.com/docs/guides/prompt-engineering
ãããã質å
RAGãšã¯äœã§ããïŒ
RAGïŒæ€çŽ¢å¢åŒ·çæïŒã¯ãã¢ãã«ã®ãã¬ãŒãã³ã°ããŒã¿ã ãã«é Œãã®ã§ã¯ãªããå ã«é¢é£ããã¥ã¡ã³ããååŸããŠããåçãçæããŸããåçã¯ããªãã®ããã¥ã¡ã³ãã«æ ¹æ ã¥ããããçºæãããŠããŸããã
RAGã¯ã©ã®ããã«å¹»èŠãæžãããŸããïŒ
RAGã¯åçãååŸããããã¹ãã«åºã¥ãããŸããããã³ããã¯ã¢ãã«ã«æäŸãããæç²ããã®ã¿çããããã«æç€ºããäžè¶³ããŠããæ å ±ã瀺ããŸããããã«ããã¢ãã«ã詳现ãäœãäžããåŸåãæé€ããŸãã
RAGãšãã¡ã€ã³ãã¥ãŒãã³ã°ã®éãã¯ïŒ
RAGã¯ã¯ãšãªæã«ç¥èãååŸããããã³ããã«è¿œå ããŸãããã¡ã€ã³ãã¥ãŒãã³ã°ã¯ã¢ãã«ãã©ã¡ãŒã¿ãæ°žç¶çã«å€æŽããŸããRAGã¯å€åããããŒã¿ã«é©ããŠããããã¡ã€ã³ãã¥ãŒãã³ã°ã¯å®å®ããåäœã«é©ããŠããŸãã
RAGã¯ãã¹ãŠã®èšèªã¢ãã«ã§æ©èœããŸããïŒ
ã¯ããRAGã¯ã¢ãã«ã«äŸåããŸãããã³ã³ããã¹ãä»ãããã³ãããåãå ¥ãããã¹ãŠã®LLMããååŸãããããã¥ã¡ã³ãã䜿çšã§ããŸããããã¯GPT-4oãClaude OpusãGeminiãLlamaãªã©ã®ãªãŒãã³ãœãŒã¹ã¢ãã«ãOllamaçµç±ã®ããŒã«ã«ã¢ãã«ãå«ã¿ãŸãã
RAGã®æé©ãªãã£ã³ã¯ãµã€ãºã¯ïŒ
ã»ãšãã©ã®å ŽåïŒ1ãã£ã³ã¯200ïœ500åèªã飿¥ãããã£ã³ã¯éã«10ïœ20%ã®ãªãŒããŒã©ãããå°ãããã£ã³ã¯ïŒ50ïœ100åèªïŒã¯ç²ŸåºŠãæ¹åãã倧ãããã£ã³ã¯ïŒ500+åèªïŒã¯ã³ã³ããã¹ããæäŸããŸãããç¡é¢ä¿ãªæ®µèœãå«ããªã¹ã¯ããããŸãã
RAGã®é¢é£æ§éŸå€ãšã¯ïŒ
é¡äŒŒåºŠã¹ã³ã¢ã®æå°å€ã§ããååŸãããããã¥ã¡ã³ãã®é¡äŒŒåºŠãéŸå€ïŒäŸïŒ0.7ã³ãµã€ã³é¡äŒŒåºŠïŒæªæºã®å ŽåãLLMã«æž¡ãããŸãããããã«ãããäœå質ãªã³ã³ããã¹ããã¢ãã«ãæ··ä¹±ãããã®ãé²ããŸãã
RAGã¯å€§ããªã³ã³ããã¹ããŠã£ã³ããŠããåªããŠããŸããïŒ
å€§èŠæš¡ãªããã¥ã¡ã³ãéåã®å Žåã¯ããã§ããRAGã¯ã»ãã³ãã£ãã¯é¡äŒŒåºŠã䜿çšããŠæ°çŸäžã®ããã¥ã¡ã³ããããªç§ã§å¹ççã«æ€çŽ¢ããŸãã倧ããªã³ã³ããã¹ããŠã£ã³ããŠã¯ããè²»çšãããããã©ã®ããã¥ã¡ã³ããå«ããããäºåã«ç¥ãå¿ èŠããããŸãã
RAGãšãã¡ã€ã³ãã¥ãŒãã³ã°ãçµã¿åãããããšã¯ã§ããŸããïŒ
ã¯ããã¢ãã«ããã¡ã€ã³ãã¥ãŒãã³ã°ããŠã¹ã¿ã€ã«ãããŒã³ããŸãã¯ãã¡ã€ã³åäœãæ¹åããŸãããã®åŸRAGã䜿çšããŠããããçŸåšã®äºå®ã«æ ¹æ ã¥ããŸããããã«ããäž¡æ¹ã®é·æãåŸãããŸãïŒäžè²«ããåäœ+äºå®ã®æ ¹æ ã
RAGã§ããã³ããã»ã€ã³ãžã§ã¯ã·ã§ã³æ»æãé²ãã«ã¯ã©ãããŸããïŒ
ããã³ããã«å«ããåã«ååŸãããã³ã³ãã³ããæ€èšŒããŸããã·ã¹ãã æç€ºãšååŸãããããã¹ãéã«æç¢ºãªåºåãæåã䜿çšããŸããååŸãããã³ã³ãã³ããå®è¡å¯èœãªåœä»€ãšããŠæ±ããªãã§ãã ãããäžå¯©ãªãã¿ãŒã³ãç£èŠããŸãã
RAGã¯ãã¯ãã«ããŒã¿ããŒã¹ãå¿ èŠã§ããïŒ
å°ããªéåã«ã¯å¿ èŠãããŸãããBM25ããŒã¯ãŒãæ€çŽ¢ã¯10,000ããã¥ã¡ã³ãæªæºã§ãã¯ãã«ãªãã§åäœããŸãããã倧ããªéåã§ã®ã»ãã³ãã£ãã¯é¡äŒŒåºŠã«ã€ããŠã¯ããã¯ãã«ããŒã¿ããŒã¹ïŒWeaviateãPineconeãChromaãMilvusïŒãéèŠã§ãã