Sehyeog Kim
← Back to Agentic_AI_Theory

Contents

  • Frontend framework: ์‚ฌ์šฉ์ž๊ฐ€ ๋Œ€ํ™”/์ž‘์—…์„ ์š”์ฒญํ•˜๋Š” UI (part1)
  • Agent development framework: ์—์ด์ „ํŠธ ๋กœ์ง(๋ฃจํ”„, ์ƒํƒœ, ๋„๊ตฌ ์—ฐ๊ฒฐ)์„ ๋งŒ๋“œ๋Š” ํ”„๋ ˆ์ž„์›Œํฌ (part1)
  • Agent memory: ๋Œ€ํ™”/์„ธ์…˜ ์ƒํƒœ์™€ ์žฅ๊ธฐ ๊ธฐ์–ต ์ €์žฅ (part1)
  • Agent tools: ๊ฒ€์ƒ‰, DB, ์‚ฌ๋‚ด API ๋“ฑ โ€œํ–‰๋™โ€์„ ์ˆ˜ํ–‰ํ•˜๋Š” ๋„๊ตฌ ๋ฌถ์Œ (part1)
  • Agent design patterns: ์‹ฑ๊ธ€ ์—์ด์ „ํŠธ vs ๋ฉ€ํ‹ฐ ์—์ด์ „ํŠธ ๋“ฑ ๊ตฌ์กฐ ํŒจํ„ด (part3)
  • Agent runtime: ์—์ด์ „ํŠธ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์ด ์‹ค์ œ๋กœ ๋Œ์•„๊ฐ€๋Š” ์‹คํ–‰ ํ™˜๊ฒฝ (part3)
  • AI models: ์ถ”๋ก /์˜์‚ฌ๊ฒฐ์ • ์—”์ง„(part 3)
  • Model runtime: ๋ชจ๋ธ์„ ์„œ๋น™ํ•˜๋Š” ์ธํ”„๋ผ(๊ด€๋ฆฌํ˜• API/์ปจํ…Œ์ด๋„ˆ/GKE ๋“ฑ) (part 3)

Introduction

๐ŸŽ Agentic AI - runtime, model
Agentic AI๋ฅผ ํ•œ ๋ฌธ์žฅ์œผ๋กœ ์ •๋ฆฌํ•˜๋ฉด,ย ์‚ฌ์šฉ์ž ์˜๋„๋ฅผ ์ดํ•ดํ•˜๊ณ  โ†’ ์—ฌ๋Ÿฌ ๋‹จ๊ณ„ ๊ณ„ํš์„ ์„ธ์šฐ๊ณ  โ†’ ๋„๊ตฌ๋ฅผ ํ˜ธ์ถœํ•ด ์‹คํ–‰๊นŒ์ง€ ๋๋‚ด๋Š”ย ์ž์œจ ์‹œ์Šคํ…œ์ด๋‹ค. ๋‹จ์ˆœํžˆ โ€œ๋‹ต๋ณ€์„ ์ƒ์„ฑํ•˜๋Š” ๋ชจ๋ธโ€์ด ์•„๋‹ˆ๋ผ,ย ๊ณ„ํš(Planning)ย ๊ณผย ๋„๊ตฌ(Tools)ย ๋ฅผ ํ†ตํ•ด ์‹ค์ œ ์—…๋ฌด๋ฅผ ์™„๋ฃŒํ•˜๋„๋ก ์„ค๊ณ„๋œ ์•„ํ‚คํ…์ฒ˜์ธ ๊ฒƒ์ด๋‹ค.

Agent AI structure ๋งˆ์ง€๋ง‰ ์‹œ๊ฐ„์ด๋‹ค. ์ง€๊ธˆ๊นŒ์ง€๋Š” agent์˜ ํŒ”๋‹ค๋ฆฌ์— ๋Œ€ํ•ด์„œ ์ด์•ผ๊ธฐ๋ฅผ ํ•˜์˜€๋‹ค. ๊ฒฐ๊ตญ ์ด ๋ชจ๋“  ๊ฒƒ๋“ค์ด ์ปดํ“จํŒ…(๋ฉ”๋ชจ๋ฆฌ, CPU, network โ€ฆ) ์‹œ์Šคํ…œ์—์„œ ์ž‘๋™ํ•˜๋Š” ๊ฒƒ์„ ์žŠ์œผ๋ฉด ์•ˆ๋œ๋‹ค.
์ฆ‰, ์‹ฌ์žฅ๊ณผ ์ด ์—์ด์ „ํŠธ(๋ชธ์ฒด)๋ฅผ ์–ด๋–ป๊ฒŒ ์—ฐ๊ฒฐํ•˜๋Š”์ง€ ๊ทธ ํ™˜๊ฒฝ/๊ณต๊ฐ„์ธ runtime์„ ์ดํ•ดํ•ด์•ผํ•œ๋‹ค.

๐ŸŽ Agentic AI - runtime, model
(google cloud์—์„œ ๋ณด์—ฌ์ฃผ๋Š” run time, agent architecture)

What is runtime?


runtime : โ€œ์ฝ”๋“œ๊ฐ€ ์‹ค์ œ๋กœ ์‹คํ–‰๋œ๋Š” ํ™˜๊ฒฝ/๊ณต๊ฐ„โ€

์šฐ๋ฆฌ๊ฐ€ ์ž‘์„ฑํ•œ python ์ฝ”๋“œ, agent ๋ฃจํ”„ ๋กœ์ง, ๋ชจ๋ธ ์ถ”๋ก  ์š”์ฒญ, ๋„๊ตฌ ํ˜ธ์ถœ, ์™ธ๋ถ€ ๋ฉ”๋ชจ๋ฆฌ ์—ฐ๊ฒฐ ๋“ฑ๋“ฑ ์ง€๊ธˆ๊นŒ์ง€ ๋ฐฐ์› ๋˜ ๊ฒƒ๋“ค์ด ์ปดํ“จํŒ…์ด ํ•„์š”ํ•œ ์ž‘์—…์ด๋‹ค.
๋”ฐ๋ผ์„œ, Computing(๋ฉ”๋ชจ๋ฆฌ + CPU + Network + OS) ์œ„์—์„œ ์‹ค์ œ๋กœ ๋Œ์•„๊ฐ€๋Š” ๊ณต๊ฐ„์ด ๋ฐ”๋กœ runtime.

runtime์ด ์ค‘์š”ํ•œ ์ด์œ ๋Š” ์šฐ๋ฆฌ๊ฐ€ ํ•˜๋Š” ๋‹ค์Œ ์ž‘์—…๋“ค์„ ํšจ์œจ์ ์œผ๋กœ ๋งŒ๋“ค์–ด์ค€๋‹ค.
- ๋ฉ”๋ชจ๋ฆฌ ์‹คํ–‰ ํ™˜๊ฒฝ ๊ด€๋ฆฌ, ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ, ๋„คํŠธ์›Œํฌ ํ†ต์‹ , ์Šค์ผ€์ผ๋ง
- ๋ณด์•ˆ, ๋กœ๊น…, ๋ชจํ‹ฐ๋„ˆ๋ง

*์ฆ‰, ์„ค๊ณ„๋„๋Š” ์ฝ”๋“œ์ด๊ณ ,*
*runtime์€ ๊ทธ ์„ค๊ณ„๋„๊ฐ€ ์‹ค์ œ๋กœ ์ž‘๋™ํ•˜๋Š” โ€œํ˜„์‹ค ์„ธ๊ณ„โ€๋‹ค.*

๐ŸŽ Agentic AI - runtime, model
(์ด๋Ÿฌํ•œ runtime์„ ์ œ๊ณตํ•˜๋Š” platform์€ ๋Œ€ํ‘œ์ ์œผ๋กœ, Docker, AWS ECS, Cloud Run โ€ฆ ์ด ์žˆ๋‹ค)

Agent runtime


agent runtime์€ ๊ทธ๋ ‡๋‹ค๋ฉด ๋‹จ์ˆœํžˆ, ์šฐ๋ฆฌ๊ฐ€ ์„ค๊ณ„ํ•ด๋†“์€ agent system, framework๊ฐ€ ๊ตฌํ˜„๋˜๋Š” ํ™˜๊ฒฝ/๊ณต๊ฐ„์ด๋‹ค. ์•„๋ž˜์˜ google cloud์—์„œ ์˜ˆ์‹œ๋กœ ๊ฐ€์ ธ์˜จ multi agent system์„ ์‚ดํŽด๋ณด๋ฉด, Cloud run/Agent engine/GKE ํ”Œ๋žซํผ์ด agent ์˜ ์ž‘์—…๊ณต๊ฐ„์„ ์ œ๊ณตํ•œ๋‹ค.
๐ŸŽ Agentic AI - runtime, model

Agent runtime์€ ์—์ด์ „ํŠธ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์ด ์‹คํ–‰๋˜๋Š” ์ธํ”„๋ผ ํ™˜๊ฒฝ์ด๋‹ค. Google Cloud์—์„œ๋Š” ๋Œ€ํ‘œ์ ์œผ๋กœ Cloud Run๊ณผ GKE๊ฐ€ ์ด๋ฅผ ๋‹ด๋‹นํ•œ๋‹ค.
- Cloud Run์€ ์„œ๋ฒ„๋ฆฌ์Šค ์ปจํ…Œ์ด๋„ˆ ์‹คํ–‰ ํ”Œ๋žซํผ์œผ๋กœ, ์ธํ”„๋ผ ๊ด€๋ฆฌ ์—†์ด Agent๋ฅผ ๋น ๋ฅด๊ฒŒ ๋ฐฐํฌํ•  ์ˆ˜ ์žˆ๋‹ค. โ†’ ๊ฐ„ํŽธ ๋ฐฐํฌ.
- GKE๋Š” Kubernetes ๊ธฐ๋ฐ˜์˜ ํด๋Ÿฌ์Šคํ„ฐ ํ™˜๊ฒฝ์œผ๋กœ, ๋Œ€๊ทœ๋ชจ ํŠธ๋ž˜ํ”ฝ๊ณผ ๋ณต์žกํ•œ ์„œ๋น„์Šค ๊ตฌ์„ฑ์ด ํ•„์š”ํ•œ ๊ฒฝ์šฐ ์ ํ•ฉํ•˜๋‹ค โ†’ ๋Œ€๊ทœ๋ชจ ์ •๋ฐ€ ์ œ์–ด.

Google Kuernetes Engine


์กฐ๊ธˆ ๋” ๊นŠ๊ฒŒ ๋“ค์–ด๊ฐ€๋ณด๋ก ํ•˜์ž. ์ผ๋‹จ GKE๋ฅผ ๋” ๊นŠ๊ฒŒ ์ดํ•ดํ•˜๊ธฐ ์œ„ํ•ด์„œ container ์˜ ๊ฐœ๋…์„ ์ž ๊น ์งš๊ณ  ๋„˜์–ด๊ฐ€์ž.
๐ŸŽ Agentic AI - runtime, model
container์€ ์šฐ๋ฆฌ๊ฐ€ ์‹คํ–‰ ์ฝ”๋“œ + ๊ด€๋ จ ํŒจํ‚ค์ง€(dependencies)๋“ค์„ ํ•˜๋‚˜๋กœ ๋ฌถ์–ด์„œ ์ €์žฅํ•˜๋Š” ๊ณต๊ฐ„์ด๋‹ค. ์ฆ‰, ์šฐ๋ฆฌ๋Š” ์ด๋ ‡๊ฒŒ ๋ฌถ์–ด์„œ image(ํ…œํ”Œ๋ ›)ํ˜•ํƒœ๋กœ ์ €์žฅ์„ ํ•˜๊ณ , docker ๋ฅผ ์ด์šฉํ•˜์—ฌ image๋ฅผ ์–ด๋А ์ปดํ“จํ„ฐ์—์„œ๋‚˜ ํ˜ธ์ถœํ•˜๋ฉด ์ €๋ ‡๊ฒŒ container๊ฐ€ ์ƒ๊ธฐ๋Š” ๊ฒƒ์ด๋‹ค.(๊ต‰์žฅํžˆ portable, sharableํ•จ)
๐ŸŽ Agentic AI - runtime, model

์•ž์—์„œ ์šฐ๋ฆฌ๋Š” ๋‹ค์Œ์„ ์ดํ•ดํ–ˆ๋‹ค.
- Agent๋Š” ์ฝ”๋“œ + ๋ชจ๋ธ ํ˜ธ์ถœ + ๋„๊ตฌ ์‹คํ–‰์œผ๋กœ ๊ตฌ์„ฑ๋œ๋‹ค.
- ์ด ์ฝ”๋“œ๋Š” container๋กœ ๋ฌถ์„ ์ˆ˜ ์žˆ๋‹ค.
- Docker๋ฅผ ์‚ฌ์šฉํ•˜๋ฉด ์–ด๋””์„œ๋“  ๋™์ผํ•˜๊ฒŒ ์‹คํ–‰ํ•  ์ˆ˜ ์žˆ๋‹ค.

๊ทธ๋Ÿฐ๋ฐ ๋ฌธ์ œ๊ฐ€ ์ƒ๊ธด๋‹ค.

์ปจํ…Œ์ด๋„ˆ๊ฐ€ ํ•˜๋‚˜๊ฐ€ ์•„๋‹ˆ๋ผ, ์ˆ˜์‹ญ ๊ฐœ, ์ˆ˜๋ฐฑ ๊ฐœ๋ผ๋ฉด?
๐ŸŽ Agentic AI - runtime, model

์˜ˆ๋ฅผ ๋“ค์–ด,
- Agent A (Planning), Agent B (Tool execution), Agent C (Memory service)
- Agent D (Model wrapper), Vector DB, Logging service, Monitoring service
์ด ๋ชจ๋“  ๊ฒƒ์ด ๊ฐ๊ฐ ์ปจํ…Œ์ด๋„ˆ๋ผ๋ฉด?, ์ด๊ฑธ ๋ˆ„๊ฐ€ ๊ด€๋ฆฌํ• ๊นŒ?

โ†’ ์ฃฝ์œผ๋ฉด ๋‹ค์‹œ ์‚ด๋ ค์•ผ ํ•˜๊ณ , ํŠธ๋ž˜ํ”ฝ ๋งŽ์œผ๋ฉด ์ž๋™์œผ๋กœ ๋Š˜๋ ค์•ผ ํ•˜๊ณ , ์„œ๋ฒ„ ์—ฌ๋Ÿฌ ๋Œ€์— ๋‚˜๋ˆ ์„œ ๋ฐฐ์น˜ํ•ด์•ผ ํ•˜๊ณ , ๋„คํŠธ์›Œํฌ ์—ฐ๊ฒฐ๋„ ๊ด€๋ฆฌํ•ด์•ผ ํ•˜๊ณ 

Kurbernetes

์ด๋•Œ ๋“ฑ์žฅํ•˜๋Š” ๊ฒƒ์ด Kubernetes๋‹ค.

โ€œ์ปจํ…Œ์ด๋„ˆ๋ฅผ ์ž๋™์œผ๋กœ ๋ฐฐ์น˜ํ•˜๊ณ , ๊ด€๋ฆฌํ•˜๊ณ , ํ™•์žฅํ•ด์ฃผ๋Š” ์‹œ์Šคํ…œโ€
- ์—ฌ๋Ÿฌ ๋Œ€์˜ ์ปดํ“จํ„ฐ๋ฅผ ํ•˜๋‚˜์˜ Cluster๋กœ ๋ฌถ๊ณ , ๊ทธ ์œ„์—์„œ ์ปจํ…Œ์ด๋„ˆ๋ฅผ ์ž๋™์œผ๋กœ ๋ฐฐํฌํ•˜๊ณ 
- ํŠธ๋ž˜ํ”ฝ์— ๋”ฐ๋ผ ์ž๋™์œผ๋กœ ํ™•์žฅํ•˜๊ณ , ์žฅ์• ๊ฐ€ ๋‚˜๋ฉด ์ž๋™ ๋ณต๊ตฌํ•˜๋Š” ์ปจํ…Œ์ด๋„ˆ ์˜ค์ผ€์ŠคํŠธ๋ ˆ์ด์…˜ ํ”Œ๋žซํผ์ด๋‹ค.

โ€œgoogle์—์„œ ์ œ๊ณตํ•˜๊ณ  ๊ด€๋ฆฌํ•ด์ฃผ๋Š” kubernetes ์ด ๋‹จ์ˆœํžˆ Google Kubernets Engine(GKE)โ€
๐ŸŽ Agentic AI - runtime, model

Kubernetes๊ตฌ์กฐ์— ๋Œ€ํ•ด์„œ ์ž์„ธํ•˜๊ฒŒ ์‚ดํŽด๋ณด์ž.
๐ŸŽ Agentic AI - runtime, model

  • Node โ†’ ์‹ค์ œ ์ปดํ“จํ„ฐ
  • Pod โ†’ ์ปจํ…Œ์ด๋„ˆ ์‹คํ–‰ ๋‹จ์œ„
  • Cluster โ†’ ์—ฌ๋Ÿฌ ๋…ธ๋“œ์˜ ์ง‘ํ•ฉ
  • Control Plane โ†’ ์ „์ฒด๋ฅผ ๊ด€๋ฆฌํ•˜๋Š” ๊ด€๋ฆฌ์ž

๋ฉ€ํ‹ฐ ์—์ด์ „ํŠธ ์‹œ์Šคํ…œ์ด๋ผ๋ฉด:
- Coordinator Agent
- Worker Agent
- Tool Service
- Memory DB
- Model Adapter
- API Gateway

์ด ๋ชจ๋“  ๊ฒƒ์ด ๊ฐ๊ฐ ์ปจํ…Œ์ด๋„ˆ๋กœ ๊ตฌ์„ฑ๋  ์ˆ˜ ์žˆ๋‹ค.
GKE๋Š” ์ด๊ฑธ:
- ์„œ๋กœ ๋‹ค๋ฅธ ๋…ธ๋“œ์— ๋ฐฐ์น˜ํ•˜๊ณ 
- ํŠธ๋ž˜ํ”ฝ์— ๋”ฐ๋ผ ์ž๋™ ํ™•์žฅํ•˜๊ณ 
- ์žฅ์•  ๋ฐœ์ƒ ์‹œ ์ž๋™ ์žฌ์‹œ์ž‘ํ•˜๊ณ 
- ๋กœ๊น…๊ณผ ๋ชจ๋‹ˆํ„ฐ๋ง์„ ํ†ตํ•ฉํ•œ๋‹ค.

ํ•ญ๋ชฉ Cloud Run GKE
๋‚œ์ด๋„ ๋งค์šฐ ์‰ฌ์›€ ๋น„๊ต์  ๋ณต์žก
์„œ๋ฒ„ ๊ด€๋ฆฌ ์—†์Œ (์™„์ „ ์„œ๋ฒ„๋ฆฌ์Šค) ํด๋Ÿฌ์Šคํ„ฐ ๊ด€๋ฆฌ ํ•„์š”
์Šค์ผ€์ผ๋ง ์ž๋™ ์„ธ๋ฐ€ํ•œ ์ œ์–ด ๊ฐ€๋Šฅ
์‚ฌ์šฉ ๊ฒฝ์šฐ ๋‹จ์ผ Agent / ๊ฐ„๋‹จํ•œ API ๋ฉ€ํ‹ฐ ์—์ด์ „ํŠธ / ๋Œ€๊ทœ๋ชจ ์‹œ์Šคํ…œ
์ œ์–ด๊ถŒ ๋‚ฎ์Œ ๋†’์Œ

Model


๐ŸŽ Agentic AI - runtime, model
์ˆ˜๋งŽ์€ LLM ๋ชจ๋ธ๋“ค์ด ํ˜„์žฌ ์กด์žฌํ•˜๋Š” ์ƒํ™ฉ์ด๋‹ค. agent ์„ค๊ณ„์—์„œ LLM์„ ์‚ฌ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์€ ๋‹จ์ˆœํžˆ python code๋กœ ๋‹ค์Œ ์˜ˆ์‹œ(opanAI)๋ฅผ ํ˜ธ์ถœํ•  ์ˆ˜ ์žˆ๋‹ค. ์—ฌ๊ธฐ์„œ ์ค‘์š”ํ•œ์ ์€ ํ•ด๋‹น ํšŒ์‚ฌ์—์„œ ๋ฐœ๊ธ‰์„ ๋ฐ›์€ API key๊ฐ€ ์žˆ์–ด์•ผ ํ•œ๋‹ค.

from openai import OpenAI
client = OpenAI(api_key="...")

#OpenAI API key ๋ฐœ๊ธ‰ ๋ฐฉ๋ฒ•.

  1. API Platform ์— ์ ‘์†
    ๐ŸŽ Agentic AI - runtime, model
  2. ์šฐ์ธก ์ƒ๋‹จ์˜ API platform ํด๋ฆญ
  3. ์ขŒ์ธก Side bar โ†’ API keys

๐ŸŽ Agentic AI - runtime, model

  1. ์šฐ์ธก ์ƒ๋‹จ Create new secret key โ†’ Name ์ž‘์„ฑ , project ์„ ํƒ โ†’ Create secret key.
    ๐ŸŽ Agentic AI - runtime, model

  2. ๋ฐœ๊ธ‰๋œ API key copy โ†’ paste to the python code
    ๐ŸŽ Agentic AI - runtime, model

from openai import OpenAI
client = OpenAI(api_key="sk-proj-eivw_9cNtm5Gh7flhiQszio8ppdT4UrVcD------")

์ดํ›„ agent model์ด ์žฅ์ฐฉ ๋œ ๊ฒƒ์ด๊ณ , ์‚ฌ์šฉ๋ชจ๋ธ์ด openai ํšŒ์‚ฌ์˜ ๋ชจ๋ธ๋“ค์„ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค.
(์•„๋ž˜์˜ pricing ์„ ์‚ดํŽด๋ณด๋ฉด Token๋‹น ๋‹ฌ๋Ÿฌ๋กœ ๊ฐ€๊ฒฉ์ด ์ธก์ •๋œ๋‹ค)
Pricing

Model runtime


Model runtime์€ โ€œAI ๋ชจ๋ธ์ด ์‹ค์ œ๋กœ ์ถ”๋ก (inference)์„ ์ˆ˜ํ–‰ํ•˜๋Š” ์‹คํ–‰ ํ™˜๊ฒฝโ€์ด๋‹ค

์—ฌ๊ธฐ์„œ ๋ชจ๋ธ์€ ๋‹จ์ˆœํ•œ python code๊ฐ€ ์•„๋‹ˆ๋ผ, ์ˆ˜์‹ญ\~์ˆ˜๋ฐฑ GB์˜ ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ๊ฐ€์ง„ ๊ฑฐ๋Œ€ํ•œ ์‹ ๊ฒฝ๋ง์ด๋‹ค. ๋”ฐ๋ผ์„œ,
- ๊ณ ์„ฑ๋Šฅ GPU ํ•„์š”
- ๋ณ‘๋ ฌ ์ฒ˜๋ฆฌ
- ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ
- ๋ชจ๋ธ ๋กœ๋”ฉ ์ตœ์ ํ™”
- ํŠธ๋ž˜ํ”ฝ ์Šค์ผ€์ผ๋ง
์ด ๋ชจ๋“  ๊ฒƒ์„ ์ฒ˜๋ฆฌํ•˜๋Š” ์ธํ”„๋ผ๊ฐ€ ํ•„์š”ํ•˜๋‹ค.

์—ฌ๊ธฐ์„œ ๋ชจ๋ธ๊ณ„๋ฐœ์„ ๋ฉ”์ธ์œผ๋กœ ํ•˜๋Š” ํšŒ์‚ฌ์ธ openai, antropic ๊ฐ™์€ ํšŒ์‚ฌ๋“ค์€ ๋ชจ๋ธ ์„œ๋น„์Šค๋ฅผ ์ œ๊ณต๋งŒํ•˜๊ณ , ์ž์ฒด์ ์œผ๋กœ model runtime์„ ์ž์‹ ๋“ค๋งŒ์˜ server์—์„œ ์ง„ํ–‰ํ•œ๋‹ค.

ํ•ญ๋ชฉ OpenAI Anthropic Vertex AI
๋ชจ๋ธ ์ œ๊ณต O O O
์ž์ฒด ๋ชจ๋ธ ๋ฐฐํฌ X X O
์ปค์Šคํ…€ ์„œ๋น™ X X O
์ธํ”„๋ผ ์ œ์–ด ๋‚ฎ์Œ ๋‚ฎ์Œ ๋†’์Œ
๊ธฐ์—… ํ†ตํ•ฉ ์ œํ•œ์  ์ œํ•œ์  ๋งค์šฐ ๊ฐ•ํ•จ

์ด์™€ ๋ฐ˜๋Œ€๋กœ Vertex AI๋Š” Google Cloud๊ฐ€ ์ œ๊ณตํ•˜๋Š” ๋‹จ์ˆœํžˆ ๋ชจ๋ธ์„ โ€œ์‚ฌ์šฉโ€ํ•˜๋Š” ์„œ๋น„์Šค๊ฐ€ ์•„๋‹ˆ๋ผ,

๋ชจ๋ธ์„ ํ•™์Šตํ•˜๊ณ , ๋ฐฐํฌํ•˜๊ณ , ์šด์˜ํ•˜๊ณ , ๋ชจ๋‹ˆํ„ฐ๋งํ•˜๋Š”
์ „์ฒด AI lifecycle์„ ๊ด€๋ฆฌํ•˜๋Š” ํ”Œ๋žซํผ

๐ŸŽ Agentic AI - runtime, model

Model runtime์€ ๋‹ค์Œ์„ ๋‹ด๋‹นํ•œ๋‹ค:
- ๋ชจ๋ธ weight ๋กœ๋”ฉ, GPU ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ, ๋ณ‘๋ ฌ ์ถ”๋ก  ์ฒ˜๋ฆฌ
- ์š”์ฒญ ํ ๊ด€๋ฆฌ, ํŠธ๋ž˜ํ”ฝ ์Šค์ผ€์ผ๋ง์žฅ์•  ๋ณต๊ตฌ

Vertex AI๋Š” ์ด๊ฒƒ์„:
- Managed endpoint ํ˜•ํƒœ๋กœ ์ œ๊ณตํ•œ๋‹ค.
- ์‚ฌ์šฉ์ž๋Š” endpoint URL๋งŒ ํ˜ธ์ถœํ•˜๋ฉด ๋œ๋‹ค.(url์ ‘์†ํ•˜๋ฉด, model ๊ด€๋ฆฌ ์ฝ˜์†”์ฐฝ์ด ๋œฌ๋‹ค)
- ๋‚ด๋ถ€ GPU, TPU, ์Šค์ผ€์ผ๋ง์€ Google์ด ๊ด€๋ฆฌํ•œ๋‹ค.

์‹ค์ œ google cloud โ†’ vertex AI console์— ์ ‘์†ํ•ด๋ณด๋ฉด,(url console์ฐฝ)
Google Cloud console

๐ŸŽ Agentic AI - runtime, model
์ง์ ‘ model setting์„ UI๋ฅผ ํ†ตํ•ด ํ•  ์ˆ˜ ์žˆ๊ณ , input, output๋„ ๊ด€๋ฆฌ๊ฐ€ ๊ฐ€๋Šฅํ•˜๋‹ค.

Conclusion

์ด๋ฒˆ ์‹œ๊ฐ„์—๋Š” agent ์‹œ์Šคํ…œ์ด ์‹ค์ œ๋กœ ๋™์ž‘ํ•˜๊ธฐ ์œ„ํ•œ ํ•ต์‹ฌ ์š”์†Œ์ธ computing ํ™˜๊ฒฝ runtime(์‹ฌ์žฅ)๊ณผ model(๋‡Œ)์— ๋Œ€ํ•ด ์‚ดํŽด๋ณด์•˜๋‹ค. ์ง€๊ธˆ๊นŒ์ง€ agent์˜ ๊ตฌ์กฐ์™€ ํŒ”๋‹ค๋ฆฌ๋ฅผ ์ดํ•ดํ–ˆ๋‹ค๋ฉด, ์ด์ œ ๊ทธ๊ฒƒ์ด ์‹ค์ œ๋กœ ์ž‘๋™ํ•˜๋Š” ์‹คํ–‰ ํ™˜๊ฒฝ๊นŒ์ง€ ์—ฐ๊ฒฐ๋œ ๊ฒƒ์ด๋‹ค.
ํŠนํžˆ ์ธ์ƒ ๊นŠ์—ˆ๋˜ ์ ์€, Google์ด agent ์ƒํƒœ๊ณ„๋ฅผ ๋งค์šฐ ๋น ๋ฅด๊ฒŒ ์ธํ”„๋ผ ์ˆ˜์ค€๊นŒ์ง€ ์ •๋ฆฌํ•ด๋‘์—ˆ๋‹ค๋Š” ์ ์ด๋‹ค. ๋ณด์•ˆ๊ณผ ํ†ตํ•ฉ ๋ฌธ์ œ๋กœ ์ธํ•ด ๊ฐ ๊ธฐ์—…์€ ์ž์‹ ๋“ค๋งŒ์˜ agent๋ฅผ ์ง์ ‘ ์„ค๊ณ„ํ•  ๊ฐ€๋Šฅ์„ฑ์ด ๋†’๋‹ค. ๊ทธ๋ฆฌ๊ณ  ๊ทธ ์„ค๊ณ„๋ฅผ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•˜๋Š” ํŒ”๋ ˆํŠธ์™€ ์‹คํ–‰ ํ™˜๊ฒฝ์„ Google์€ ์ด๋ฏธ ์ค€๋น„ํ•ด๋‘๊ณ  ์žˆ๋‹ค.
์•ž์œผ๋กœ๋Š” ๊ธฐ์—…๋ฟ ์•„๋‹ˆ๋ผ ๊ฐœ์ธ ๋‹จ์œ„์—์„œ๋„ agent๋ฅผ ๊ตฌ์ถ•ํ•˜๋Š” ์‹œ๋Œ€๊ฐ€ ์˜ฌ ๊ฒƒ์ด๋‹ค. ๊ทธ๋ ‡๋‹ค๋ฉด ํ•ต์‹ฌ ์งˆ๋ฌธ์€ ์ด๊ฒƒ์ด๋‹ค:

๋ˆ„๊ฐ€ ๋” ์•ˆ์ „ํ•˜๊ณ , ํšจ์œจ์ ์ด๋ฉฐ, ์ง€๋Šฅ์ ์ธ agent ์‹œ์Šคํ…œ์„ ์„ค๊ณ„ํ•  ์ˆ˜ ์žˆ๋Š”๊ฐ€?
Anthropic, OpenAI, Google์€ ์ด๋ฏธ ๋น ๋ฅด๊ฒŒ agent๋ฅผ ๊ตฌ์ถ•ํ•˜๊ณ  ์žˆ์œผ๋ฉฐ, ๋‹จ์ˆœํ•œ ์ฝ”๋“œ ์ž‘์„ฑ ๋‹จ๊ณ„๋Š” ๋„˜์–ด์„  ์ƒํƒœ์—์„œ ๋‹ค์–‘ํ•œ ์‚ฐ์—… ๋ถ„์•ผ๋กœ ํ™•์žฅ๋˜๊ณ  ์žˆ๋‹ค.

๋‚˜๋Š” ์—ฌ๊ธฐ์„œ ํ•œ ๊ฐ€์ง€ ์งˆ๋ฌธ์„ ๋˜์ง€๊ณ  ์‹ถ๋‹ค.

๋‚˜์˜ ์ „๋ฌธ ๋ถ„์•ผ์ธ Computational Mechanics + agent๋Š” ์–ด๋–ป๊ฒŒ ์—ฐ๊ฒฐ๋  ์ˆ˜ ์žˆ์„๊นŒ?
๋‹จ์ˆœํžˆAgent๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ,
Agent ๊ตฌ์กฐ๋ฅผ ์ดํ•ดํ•˜๊ณ  โ†’ ์ง์ ‘ ์„ค๊ณ„ํ•˜๊ณ  โ†’ ์ˆ˜์น˜ํ•ด์„ ๋ฐ ์‹œ๋ฎฌ๋ ˆ์ด์…˜๊ณผ ๊ฒฐํ•ฉํ•˜๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ํ™•์žฅํ•ด๋ณด๊ณ ์ž ํ•œ๋‹ค.

์ด์ œ Theory ํŒŒํŠธ๋Š” ๋งˆ๋ฌด๋ฆฌํ•˜๊ณ , ์‹ค์ „์œผ๋กœ ๋„˜์–ด๊ฐ€์ž.
์‹ค์ „์€ ๋‘ ๊ฐ€์ง€ ํŠธ๋ž™์œผ๋กœ ์ง„ํ–‰๋œ๋‹ค.
1. Claude, Codex, Gemini, OpenClaw์™€ ๊ฐ™์€ ์ด๋ฏธ ๊ตฌ์ถ•๋œ agent system์„ ์ง์ ‘ ์‚ฌ์šฉํ•ด๋ณด๋ฉฐ ๊ธฐ๋Šฅ์„ ๋ถ„์„ํ•œ๋‹ค.
2. LangChain๊ณผ Google ADK๋ฅผ ํ™œ์šฉํ•ด ๋‚˜๋งŒ์˜ agent๋ฅผ ์ง์ ‘ ์„ค๊ณ„ํ•ด๋ณธ๋‹ค.
์ด์ œ ์„ค๊ณ„์ž์˜ ๊ด€์ ์œผ๋กœ ๋„˜์–ด๊ฐˆ ์‹œ๊ฐ„์ด๋‹ค.

์ง€๊ธˆ๊นŒ์ง€ Agent theory ๋ถ€๋ถ„์„ ๋งˆ๋ฌด๋ฆฌํ•˜๊ณ , ์ด์ œ ์‹ค์ „์œผ๋กœ ๋„˜์–ด๊ฐ€์ž. ์‹ค์ „์€ ๋‘๊ฐ€์ง€ ๋ถ„๋ฅ˜๋กœ ์ง„ํ–‰๋œ๋‹ค.
1. Claude, Codex, Gemini, Openclaw โ†’ ์ด๋ ‡๊ฒŒ ๋ฏธ๋ฆฌ ๋งŒ๋“ค์–ด์ง„ agent system์„ ์ง์ ‘ ์‚ฌ์šฉํ•ด๋ณด๋ฉฐ ์–ด๋–ค ๊ธฐ๋Šฅ๋“ค์ด ์žˆ๊ณ , ์–ด๋–ค ์ž‘์—…๋“ค์ด ๊ฐ€๋Šฅํ•œ์ง€๋ฅผ ์‚ดํŽด๋ณด์ž.
2. Langchain, Google ADK ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ๋‚˜๋งŒ์˜ agent๋ฅผ ์ง์ ‘ ์„ค๊ณ„ํ•ด๋ณด์ž.