text-embedding-3試してみる
使い方
import os
from openai import OpenAI
os.environ["OPENAI_API_KEY"] = "sk-..."
client = OpenAI()
def get_embedding(text: str, model: str = "text-embedding-3-large"):
text = text.replace("\n", " ")
return client.embeddings.create(input=[text], model=model).data[0].embedding
text = "こんにちは。今日はいい天気ですね。"
embedding = get_embedding(text)
print(len(embedding))
print(embedding[:10])
出力
3072
[0.014592253603041172, -0.006042507942765951, -0.006200418807566166, 0.010370554402470589, 0.006480791140347719, -0.021849708631634712, -0.0100160613656044, 0.03805329650640488, 0.025600897148251534, -0.008514296263456345]
text-embedding-3-large だと 3072 次元のベクトルになる