ChatPremAI

PremAI 是一个一体化平台，简化了由生成性人工智能驱动的强大、生产就绪应用程序的创建。通过简化开发过程，PremAI 使您能够专注于增强用户体验并推动应用程序的整体增长。您可以快速开始使用我们的平台这里。

此示例介绍了如何使用 LangChain 与不同聊天模型交互，使用 ChatPremAI

安装与设置

我们首先安装 langchain 和 premai-sdk。您可以输入以下命令进行安装：

pip install premai langchain

在继续之前，请确保您已在 PremAI 上注册了账户并创建了项目。如果没有，请参考快速入门指南以开始使用 PremAI 平台。创建您的第一个项目并获取您的 API 密钥。

from langchain_community.chat_models import ChatPremAI
from langchain_core.messages import HumanMessage, SystemMessage

在LangChain中设置PremAI客户端

一旦我们导入了所需的模块，让我们设置我们的客户端。现在假设我们的 project_id 是 8。但请确保使用您的项目ID，否则会抛出错误。

要将langchain与prem一起使用，您不需要传递任何模型名称或设置任何与我们的聊天客户端相关的参数。默认情况下，它将使用在LaunchPad中使用的模型名称和参数。

注意：如果您在设置客户端时更改了 model 或任何其他参数，如 temperature 或 max_tokens，它将覆盖在LaunchPad中使用的现有默认配置。

import getpass
import os

# 第一步是设置环境变量。
# 您也可以在实例化模型时传递API密钥，但这
# 属于最佳实践，建议将其设置为环境变量。

if os.environ.get("PREMAI_API_KEY") is None:
    os.environ["PREMAI_API_KEY"] = getpass.getpass("PremAI API Key:")

# 默认情况下，它将使用通过平台部署的模型
# 在我的例子中，它是 "gpt-4o"

chat = ChatPremAI(project_id=1234, model_name="gpt-4o")

聊天补全

ChatPremAI 支持两种方法：invoke（与 generate 相同）和 stream。

第一种方法将给我们一个静态结果，而第二种方法将逐个流式传输令牌。以下是如何生成类似聊天的补全。

human_message = HumanMessage(content="Who are you?")

response = chat.invoke([human_message])
print(response.content)

I am an AI language model created by OpenAI, designed to assist with answering questions and providing information based on the context provided. How can I help you today?

以上看起来很有趣，对吧？我将我的默认启动板系统提示设置为：Always sound like a pirate 如果需要，您也可以覆盖默认系统提示。以下是您可以这样做的方法。

system_message = SystemMessage(content="You are a friendly assistant.")
human_message = HumanMessage(content="Who are you?")

chat.invoke([system_message, human_message])

AIMessage(content="I'm your friendly assistant! How can I help you today?", response_metadata={'document_chunks': [{'repository_id': 1985, 'document_id': 1306, 'chunk_id': 173899, 'document_name': '[D] Difference between sparse and dense informati…', 'similarity_score': 0.3209080100059509, 'content': "with the difference or anywhere\nwhere I can read about it?\n\n\n      17                  9\n\n\n      u/ScotiabankCanada        •  Promoted\n\n\n                       Accelerate your study permit process\n                       with Scotiabank's Student GIC\n                       Program. We're here to help you tur…\n\n\n                       startright.scotiabank.com         Learn More\n\n\n                            Add a Comment\n\n\nSort by:   Best\n\n\n      DinosParkour      • 1y ago\n\n\n     Dense Retrieval (DR) m"}]}, id='run-510bbd0e-3f8f-4095-9b1f-c2d29fd89719-0')

您可以在这里提供系统提示，如下所示：

chat.invoke([system_message, human_message], temperature=0.7, max_tokens=10, top_p=0.95)

/home/anindya/prem/langchain/libs/community/langchain_community/chat_models/premai.py:355: UserWarning: WARNING: Parameter top_p is not supported in kwargs.
  warnings.warn(f"WARNING: Parameter {key} is not supported in kwargs.")

AIMessage(content="Hello! I'm your friendly assistant. How can I", response_metadata={'document_chunks': [{'repository_id': 1985, 'document_id': 1306, 'chunk_id': 173899, 'document_name': '[D] Difference between sparse and dense informati…', 'similarity_score': 0.3209080100059509, 'content': "with the difference or anywhere\nwhere I can read about it?\n\n\n      17                  9\n\n\n      u/ScotiabankCanada        •  Promoted\n\n\n                       Accelerate your study permit process\n                       with Scotiabank's Student GIC\n                       Program. We're here to help you tur…\n\n\n                       startright.scotiabank.com         Learn More\n\n\n                            Add a Comment\n\n\nSort by:   Best\n\n\n      DinosParkour      • 1y ago\n\n\n     Dense Retrieval (DR) m"}]}, id='run-c4b06b98-4161-4cca-8495-fd2fc98fa8f8-0')

如果您在这里放置系统提示，它将覆盖您在从平台部署应用程序时固定的系统提示。

原生 RAG 支持与 Prem 仓库

Prem 仓库允许用户上传文档（.txt、.pdf 等）并将这些仓库连接到 LLMs。您可以将 Prem 仓库视为原生 RAG，其中每个仓库可以被视为一个向量数据库。您可以连接多个仓库。您可以在这里了解更多关于仓库的信息。

在 langchain premai 中也支持仓库。以下是您可以如何做到这一点。

query = "Which models are used for dense retrieval"
repository_ids = [
    1985,
]
repositories = dict(ids=repository_ids, similarity_threshold=0.3, limit=3)

首先，我们通过一些仓库 ID 来定义我们的仓库。确保这些 ID 是有效的仓库 ID。您可以在这里了解更多关于如何获取仓库 ID 的信息。

请注意：与 model_name 类似，当您调用参数 repositories 时，您可能会覆盖在启动平台中连接的仓库。

现在，我们将仓库与我们的聊天对象连接，以调用基于 RAG 的生成。

import json

response = chat.invoke(query, max_tokens=100, repositories=repositories)

print(response.content)
print(json.dumps(response.response_metadata, indent=4))

Dense retrieval models typically include:

1. **BERT-based Models**: Such as DPR (Dense Passage Retrieval) which uses BERT for encoding queries and passages.
2. **ColBERT**: A model that combines BERT with late interaction mechanisms.
3. **ANCE (Approximate Nearest Neighbor Negative Contrastive Estimation)**: Uses BERT and focuses on efficient retrieval.
4. **TCT-ColBERT**: A variant of ColBERT that uses a two-tower
{
    "document_chunks": [
        {
            "repository_id": 1985,
            "document_id": 1306,
            "chunk_id": 173899,
            "document_name": "[D] Difference between sparse and dense informati\u2026",
            "similarity_score": 0.3209080100059509,
            "content": "with the difference or anywhere\nwhere I can read about it?\n\n\n      17                  9\n\n\n      u/ScotiabankCanada        \u2022  Promoted\n\n\n                       Accelerate your study permit process\n                       with Scotiabank's Student GIC\n                       Program. We're here to help you tur\u2026\n\n\n                       startright.scotiabank.com         Learn More\n\n\n                            Add a Comment\n\n\nSort by:   Best\n\n\n      DinosParkour      \u2022 1y ago\n\n\n     Dense Retrieval (DR) m"
        }
    ]
}

理想情况下，您无需在此处连接仓库 ID 以获取增强检索生成。如果您在 prem 平台中连接了仓库，您仍然可以获得相同的结果。

Prem 模板

写作提示模板可能会非常混乱。提示模板通常很长，难以管理，并且必须不断调整以改进并在整个应用程序中保持一致。

使用 Prem，编写和管理提示变得非常简单。Templates 选项卡在 launchpad 内部帮助您编写所需的多个提示，并在 SDK 中使用这些提示使您的应用程序运行。您可以在这里阅读更多关于提示模板的信息。

要在 LangChain 中原生使用 Prem 模板，您需要将一个 id 传递给 HumanMessage。这个 id 应该是您提示模板变量的名称。HumanMessage 中的 content 应该是该变量的值。

假设例如，如果您的提示模板是这样的：

Say hello to my name and say a feel-good quote
from my age. My name is: {name} and age is {age}

那么您的 human_messages 应该如下所示：

human_messages = [
    HumanMessage(content="Shawn", id="name"),
    HumanMessage(content="22", id="age"),
]

将这个 human_messages 传递给 ChatPremAI 客户端。请注意：不要忘记传递额外的 template_id 以调用 Prem 模板生成。如果您不了解 template_id，可以在我们的文档中了解更多信息在这里。以下是一个示例：

template_id = "78069ce8-xxxxx-xxxxx-xxxx-xxx"
response = chat.invoke([human_messages], template_id=template_id)
print(response.content)

Prem 模板功能在流式传输中也可用。

流式处理

在本节中，让我们看看如何使用 langchain 和 PremAI 流式传输令牌。以下是操作方法。

import sys

for chunk in chat.stream("hello how are you"):
    sys.stdout.write(chunk.content)
    sys.stdout.flush()

It looks like your message got cut off. If you need information about Dense Retrieval (DR) or any other topic, please provide more details or clarify your question.

类似于上面的情况，如果您想覆盖系统提示和生成参数，您需要添加以下内容：

import sys

# For some experimental reasons if you want to override the system prompt then you
# can pass that here too. However it is not recommended to override system prompt
# of an already deployed model.

for chunk in chat.stream(
    "hello how are you",
    system_prompt="act like a dog",
    temperature=0.7,
    max_tokens=200,
):
    sys.stdout.write(chunk.content)
    sys.stdout.flush()

Woof! 🐾 How can I help you today? Want to play fetch or maybe go for a walk 🐶🦴

工具/函数调用

LangChain PremAI 支持工具/函数调用。工具/函数调用允许模型通过生成与用户定义的模式相匹配的输出，来响应给定的提示。

您可以在我们的文档中详细了解工具调用。
您可以在文档的这一部分中了解更多关于 langchain 工具调用的信息。

注意： 当前版本的 LangChain ChatPremAI 不支持带有流式支持的函数/工具调用。流式支持和函数调用将很快推出。

将工具传递给模型

为了传递工具并让 LLM 选择它需要调用的工具，我们需要传递一个工具模式。工具模式是函数定义以及关于函数的作用、函数每个参数是什么等的适当文档字符串。下面是一些简单的算术函数及其模式。

注意： 在定义函数/工具模式时，不要忘记添加有关函数参数的信息，否则会抛出错误。

from langchain_core.pydantic_v1 import BaseModel, Field
from langchain_core.tools import tool


# 定义函数参数的模式
class OperationInput(BaseModel):
    a: int = Field(description="第一个数字")
    b: int = Field(description="第二个数字")


# 现在定义函数，参数的模式将是 OperationInput
@tool("add", args_schema=OperationInput, return_direct=True)
def add(a: int, b: int) -> int:
    """将 a 和 b 相加。

    参数：
        a: 第一个整数
        b: 第二个整数
    """
    return a + b


@tool("multiply", args_schema=OperationInput, return_direct=True)
def multiply(a: int, b: int) -> int:
    """将 a 和 b 相乘。

    参数：
        a: 第一个整数
        b: 第二个整数
    """
    return a * b

将工具模式绑定到我们的 LLM

我们将使用 bind_tools 方法将上述函数转换为“工具”，并将其与模型绑定。这意味着每次调用模型时，我们都将传递这些工具信息。

tools = [add, multiply]
llm_with_tools = chat.bind_tools(tools)

之后，我们从现在与工具绑定的模型中获取响应。

query = "3 * 12 等于多少？另外，11 + 49 等于多少？"

messages = [HumanMessage(query)]
ai_msg = llm_with_tools.invoke(messages)

正如我们所看到的，当我们的聊天模型与工具绑定时，根据给定的提示，它会调用正确的一组工具，并按顺序进行调用。

ai_msg.tool_calls

[{'name': 'multiply',
  'args': {'a': 3, 'b': 12},
  'id': 'call_A9FL20u12lz6TpOLaiS6rFa8'},
 {'name': 'add',
  'args': {'a': 11, 'b': 49},
  'id': 'call_MPKYGLHbf39csJIyb5BZ9xIk'}]

我们将上述消息附加到 LLM，这作为上下文，使 LLM 知道它调用了哪些函数。

messages.append(ai_msg)

由于工具调用分为两个阶段，其中：

在第一次调用中，我们收集了 LLM 决定使用的所有工具，以便它可以将结果作为附加上下文，从而提供更准确且无幻觉的结果。
在第二次调用中，我们将解析 LLM 决定的那组工具并运行它们（在我们的例子中，这将是我们定义的函数，使用 LLM 提取的参数），并将此结果传递给 LLM。

from langchain_core.messages import ToolMessage

for tool_call in ai_msg.tool_calls:
    selected_tool = {"add": add, "multiply": multiply}[tool_call["name"].lower()]
    tool_output = selected_tool.invoke(tool_call["args"])
    messages.append(ToolMessage(tool_output, tool_call_id=tool_call["id"]))

最后，我们调用与工具绑定的 LLM，并将函数响应添加到其上下文中。

response = llm_with_tools.invoke(messages)
print(response.content)

最终答案是：

- 3 * 12 = 36
- 11 + 49 = 60

定义工具模式：Pydantic 类

上面我们展示了如何使用 tool 装饰器定义模式，但我们也可以使用 Pydantic 等效地定义模式。当你的工具输入更复杂时，Pydantic 非常有用：

from langchain_core.output_parsers.openai_tools import PydanticToolsParser

class add(BaseModel):
    """将两个整数相加。"""

    a: int = Field(..., description="第一个整数")
    b: int = Field(..., description="第二个整数")

class multiply(BaseModel):
    """将两个整数相乘。"""

    a: int = Field(..., description="第一个整数")
    b: int = Field(..., description="第二个整数")

tools = [add, multiply]

现在，我们可以将它们绑定到聊天模型，并直接获得结果：

chain = llm_with_tools | PydanticToolsParser(tools=[multiply, add])
chain.invoke(query)

[multiply(a=3, b=12), add(a=11, b=49)]

现在，像上面那样，我们解析这个并运行这些函数，再次调用 LLM 以获得结果。

ChatPremAI

安装与设置

在LangChain中设置PremAI客户端

聊天补全

原生 RAG 支持与 Prem 仓库

Prem 模板

流式处理

工具/函数调用

将工具传递给模型

将工具模式绑定到我们的 LLM

定义工具模式：Pydantic 类

相关

此页面是否有帮助？

您还可以留下详细的反馈在 GitHub 上

ChatPremAI

安装与设置​

在LangChain中设置PremAI客户端​

聊天补全​

原生 RAG 支持与 Prem 仓库​

Prem 模板​

流式处理​

工具/函数调用​

将工具传递给模型​

将工具模式绑定到我们的 LLM​

定义工具模式：Pydantic 类​

相关​

此页面是否有帮助？

您还可以留下详细的反馈 在 GitHub 上

安装与设置

在LangChain中设置PremAI客户端

聊天补全

原生 RAG 支持与 Prem 仓库

Prem 模板

流式处理

工具/函数调用

将工具传递给模型

将工具模式绑定到我们的 LLM

定义工具模式：Pydantic 类

相关

您还可以留下详细的反馈在 GitHub 上