[Feature]support bad_words #3055

Sunny-bot1 · 2025-07-28T13:01:41Z

功能描述

支持 bad_words，用于禁止模型生成某些特定词。通过将bad_words中的词语的采样概率设置为1e-10来避免生成该词。

使用方法

online inference

请求中加入bad_words参数

使用 curl 命令发送用户请求示例如下：

curl -X POST "http://0.0.0.0:9222/v1/chat/completions" \
-H "Content-Type: application/json" \
-d '{
  "messages": [
    {"role": "user", "content": "How old are you"}
  ],
  "bad_words": ["age", "I"]
}'

使用 python 脚本发送用户请求示例如下：

import openai
host = "0.0.0.0"
port = "8170"
client = openai.Client(base_url=f"http://{host}:{port}/v1", api_key="null")

response = client.chat.completions.create(
    model="null",
    messages=[
        {"role": "system", "content": "I'm a helpful AI assistant."},
    ],
    extra_body={"bad_words": ["you", "me"]},
    stream=True,
)
for chunk in response:
    if chunk.choices[0].delta:
        print(chunk.choices[0].delta.content, end='')
print('\n')

offline inference

sampling_params = SamplingParams(bad_words=["you", "me"])
llm = LLM(model= MODEL_DIR)
outputs = llm.generate(PROMPTS, sampling_params)

paddle-bot · 2025-07-28T13:01:47Z

Thanks for your contribution!

yuanlehome · 2025-07-29T17:04:20Z

fastdeploy/worker/iluvatar_model_runner.py

+        # Update bad tokens len
+        max_bad_tokens_len = (
+            paddle.max(self.share_inputs["bad_tokens_len"])
+            if paddle.max(self.share_inputs["bad_tokens_len"]) > 0


这里的写法可以优化一下，可能会执行两次max算子，可以在初始化时把bad_tokens_len初始值填充为1

yuanlehome · 2025-07-29T17:05:28Z

fastdeploy/engine/sampling_params.py

+                    continue
+
+                if prompt_token_ids not in self._bad_words_token_ids:
+                    self._bad_words_token_ids.extend(prompt_token_ids)


这里需要去重吗？

需要吧，如果给的bad_words有重复的话

…into bad_words

support bad_words

0cd9be9

Sunny-bot1 added 5 commits July 29, 2025 11:02

support online infer bad_words

988fa37

update

c151bca

add CI test

8d35dac

update

bab54fe

update

e3d7092

yuanlehome reviewed Jul 29, 2025

View reviewed changes

Sunny-bot1 added 2 commits July 30, 2025 01:59

update

0591e40

Merge branch 'develop' of https://github.com/PaddlePaddle/FastDeploy …

4d117ea

…into bad_words

yuanlehome approved these changes Jul 29, 2025

View reviewed changes

Merge branch 'develop' into bad_words

c263976

yuanlehome merged commit 74aa31d into PaddlePaddle:develop Jul 30, 2025
10 of 13 checks passed

ckl117 mentioned this pull request Aug 1, 2025

[BugFix] fix request_output sampling_params in PD #3154

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]support bad_words #3055

[Feature]support bad_words #3055

Uh oh!

Sunny-bot1 commented Jul 28, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Jul 28, 2025

Uh oh!

yuanlehome Jul 29, 2025

Uh oh!

yuanlehome Jul 29, 2025

Uh oh!

Sunny-bot1 Jul 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Feature]support bad_words #3055

[Feature]support bad_words #3055

Uh oh!

Conversation

Sunny-bot1 commented Jul 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

功能描述

使用方法

Uh oh!

paddle-bot bot commented Jul 28, 2025

Uh oh!

yuanlehome Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

yuanlehome Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

Sunny-bot1 Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Sunny-bot1 commented Jul 28, 2025 •

edited

Loading