HuggingFace Inference Embeddings

HuggingFace Inference Embeddings ๋…ธ๋“œ๋Š” Hugging Face Inference API ๋˜๋Š” AWS SageMaker Endpoint๋ฅผ ํ†ตํ•ด Sentence Transformers ๊ธฐ๋ฐ˜ ๋ชจ๋ธ์„ ํ˜ธ์ถœํ•˜์—ฌ ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ ๋ฒกํ„ฐ๋ฅผ ์ƒ์„ฑํ•˜๋Š” ๋…ธ๋“œ์ž…๋‹ˆ๋‹ค. ์‚ฌ์ „ ํ•™์Šต๋œ ๋‹ค์–‘ํ•œ ์˜คํ”ˆ์†Œ์Šค ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ์„ ํ™œ์šฉํ•  ์ˆ˜ ์žˆ์–ด ์œ ์—ฐํ•˜๊ณ  ๊ฐ•๋ ฅํ•œ ์ž„๋ฒ ๋”ฉ ๊ตฌ์ถ•์ด ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.


์ฃผ์š” ๊ธฐ๋Šฅ

  • Hugging Face์—์„œ ์ œ๊ณตํ•˜๋Š” Sentence Transformers ๊ณ„์—ด ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ ํ˜ธ์ถœ

  • ์‚ฌ์šฉ์ž ์ •์˜ Endpoint๋ฅผ ํ™œ์šฉํ•ด ์ž์ฒด ๋ฐฐํฌ ๋ชจ๋ธ์—๋„ ์—ฐ๊ฒฐ ๊ฐ€๋Šฅ

  • RAG, ์œ ์‚ฌ๋„ ๋ถ„์„, ๋ถ„๋ฅ˜, ํด๋Ÿฌ์Šคํ„ฐ๋ง ๋“ฑ ๋‹ค์–‘ํ•œ ํŒŒ์ดํ”„๋ผ์ธ์— ํ™œ์šฉ ๊ฐ€๋Šฅํ•œ ๋ฒกํ„ฐ ์ƒ์„ฑ

  • ๋†’์€ ํ™•์žฅ์„ฑ๊ณผ ์˜คํ”ˆ์†Œ์Šค ๊ธฐ๋ฐ˜ ์œ ์—ฐ์„ฑ ํ™•๋ณด

WindyFlo HuggingFace Inference Embeddings

์ž…๋ ฅ๊ฐ’ (Inputs)

ํ•ญ๋ชฉ
์„ค๋ช…
ํ•„์ˆ˜ ์—ฌ๋ถ€

Connect Credential

Hugging Face Inference API ๋˜๋Š” AWS Endpoint ํ˜ธ์ถœ์šฉ ์ธ์ฆ ์ •๋ณด (Credential์— ๋“ฑ๋ก)

ํ•„์ˆ˜

Model

์‚ฌ์šฉํ•  ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ์˜ ์ด๋ฆ„ (์˜ˆ: sentence-transformers/distilbert-base-nli-stsb-mean-tokens)

ํ•„์ˆ˜

Endpoint

Inference API ๋˜๋Š” SageMaker ๋ฐฐํฌ ์—”๋“œํฌ์ธํŠธ ์ฃผ์†Œ (์˜ˆ: https://xyz.eu-west-1.aws.endpoints)

ํ•„์ˆ˜


ํŒŒ๋ผ๋ฏธํ„ฐ (Parameters)

โ€ป ์ด ๋…ธ๋“œ๋Š” ๋ณ„๋„ ํŒŒ๋ผ๋ฏธํ„ฐ ํ•ญ๋ชฉ ์—†์Œ


์ถœ๋ ฅ๊ฐ’ (Outputs)

์ถœ๋ ฅ ํ•ญ๋ชฉ
์„ค๋ช…

HuggingFaceInferenceEmbeddings

์ž…๋ ฅ ํ…์ŠคํŠธ์— ๋Œ€ํ•œ ์ž„๋ฒ ๋”ฉ ๋ฒกํ„ฐ ๋ฐฐ์—ด ๊ฒฐ๊ณผ


ํ™œ์šฉ ์˜ˆ์‹œ

  • Hugging Face์—์„œ ์ œ๊ณตํ•˜๋Š” ๋‹ค์–‘ํ•œ ๊ณต๊ฐœ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ์„ ํ™œ์šฉํ•ด ์œ ์‚ฌ๋„ ๊ธฐ๋ฐ˜ ๊ฒ€์ƒ‰ ์‹œ์Šคํ…œ ๊ตฌ์„ฑ

  • ์‚ฌ๋‚ด์— ๋ฐฐํฌ๋œ AWS SageMaker ๊ธฐ๋ฐ˜ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ์„ Endpoint๋กœ ์ง€์ •ํ•˜์—ฌ ํ˜ธ์ถœ

  • ๊ธฐ์กด OpenAI, Cohere ๋“ฑ๊ณผ ๋น„๊ตํ•˜์—ฌ ์„ฑ๋Šฅ ๋˜๋Š” ๋น„์šฉ ๋ฉด์—์„œ ์ตœ์ ํ™”๋œ ๋Œ€์•ˆ์œผ๋กœ ์‚ฌ์šฉ

  • ๋ฒกํ„ฐ DB(Faiss, Pinecone ๋“ฑ)์™€ ํ•จ๊ป˜ ์—ฐ๊ณ„ํ•˜์—ฌ ๋ฌธ์„œ ๊ฒ€์ƒ‰, ์œ ์‚ฌ๋„ ๋ถ„์„, ์ถ”์ฒœ ์‹œ์Šคํ…œ ๊ตฌ์ถ•


์‚ฌ์šฉ ํŒ

  • Model ํ•„๋“œ๋Š” Hugging Face Hub์— ๋“ฑ๋ก๋œ ๋ชจ๋ธ ์ด๋ฆ„์„ ์ •ํ™•ํžˆ ์ž…๋ ฅํ•ด์•ผ ํ•˜๋ฉฐ, ์˜ˆ: sentence-transformers/all-MiniLM-L6-v2

  • Endpoint ํ•„๋“œ๋Š” Hugging Face Inference Endpoint ๋˜๋Š” SageMaker ์—”๋“œํฌ์ธํŠธ๋กœ ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•˜๋ฉฐ, ์ ‘๋‘์‚ฌ(https://)๊นŒ์ง€ ํฌํ•จํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.

  • Hugging Face์˜ Inference Endpoint๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒฝ์šฐ, ์œ ๋ฃŒ ๊ตฌ๋… ํ”Œ๋žœ์ด ํ•„์š”ํ•  ์ˆ˜ ์žˆ์œผ๋‹ˆ ์‚ฌ์ „์— ํ™•์ธํ•˜์„ธ์š”.

  • ๋‹ค์–‘ํ•œ ์–ธ์–ด๋ฅผ ์ง€์›ํ•˜๋Š” ๋ชจ๋ธ์„ ์„ ํƒํ•˜๋ฉด ๋‹ค๊ตญ์–ด ๋ฌธ์„œ ์ฒ˜๋ฆฌ์— ์œ ๋ฆฌํ•ฉ๋‹ˆ๋‹ค.


์ฃผ์˜์‚ฌํ•ญ

  • Connect Credential์— ๋“ฑ๋ก๋œ ์ธ์ฆ ์ •๋ณด๊ฐ€ ์—†๊ฑฐ๋‚˜ ์ž˜๋ชป๋œ ๊ฒฝ์šฐ API ํ˜ธ์ถœ์ด ์‹คํŒจํ•ฉ๋‹ˆ๋‹ค.

  • Model ์ด๋ฆ„๊ณผ Endpoint ์ฃผ์†Œ๊ฐ€ ์ผ์น˜ํ•˜์ง€ ์•Š๊ฑฐ๋‚˜ ํ˜•์‹์ด ์ž˜๋ชป๋˜๋ฉด ์ž„๋ฒ ๋”ฉ ์ƒ์„ฑ์ด ๋˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค.

  • ํ˜ธ์ถœ ๋Œ€์ƒ์ด Hugging Face๊ฐ€ ์•„๋‹Œ SageMaker์ผ ๊ฒฝ์šฐ, IAM ๊ถŒํ•œ ๋ฐ ๋„คํŠธ์›Œํฌ ์„ค์ •์„ ๋ณ„๋„๋กœ ๊ตฌ์„ฑํ•ด์•ผ ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

  • Inference Endpoint ์‚ฌ์šฉ๋Ÿ‰์— ๋”ฐ๋ผ ๋น„์šฉ์ด ๋ฐœ์ƒํ•˜๋ฉฐ, ์‹ค์‹œ๊ฐ„ ์ฒ˜๋ฆฌ ์‹œ์—๋Š” ์‘๋‹ต ์†๋„ ํ™•์ธ์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.

Last updated