Implementation of prompt techniques by chakravarthik27 · Pull Request #1018 · PacificAI/langtest

chakravarthik27 · 2024-05-01T14:32:20Z

Few-Shot Prompting:

Few-shot prompting is an advanced technique used to enhance the performance of a large language model (LLM) by utilizing a small number of targeted examples (known as "shots"). These examples comprise specific prompts and are designed to direct the LLM toward desired responses for particular tasks.

The Langtest framework assists in evaluating the LLM model by utilizing multiple datasets with few-shot prompts. The evaluation employs distinct prompt configurations for two datasets, "BoolQ" and "NQ-open". Each dataset uses tailored instructions and designated prompt types to shape the model’s responses, whether for instructional completions or conversational engagements.

BoolQ Configuration:
The BoolQ (Boolean Questions) configuration tests the model’s capability to provide a straightforward 'true' or 'false' response based on the context. The guidelines emphasize the importance of conciseness and accuracy. This configuration includes sample interactions to instruct the model on handling context-dependent questions efficiently.

NQ-open Configuration:
The NQ-open (Natural Questions - open book) setup assesses the model's ability to furnish concise answers to open-ended questions demanding specific information. Similar to BoolQ, this configuration uses an "instruct" prompt type aimed at eliciting direct and relevant responses without superfluous details.

Both configurations use the few-shot prompting approach to teach the model the anticipated response format and depth, enabling it to generalize from limited examples to new, unexplored queries, thereby testing its accuracy and contextual appropriateness with minimal guidance.

Configuration Methods:

Configuration in the Harness class can be done in two ways: using a YAML file or directly passing arguments in dictionary format to the Harness config.

YAML Configuration (saved as config.yaml):

prompt_config:
  "BoolQ":
    instructions: "Provide a concise response. The answer should be either `true` or `false`."
    prompt_type: "instruct"
    examples:
      - user:
          context: "The Good Fight -- A second 13-episode season premiered on March 4, 2018. On May 2, 2018, the series was renewed for a third season."
          question: "Is there a third series of The Good Fight?"
        ai:
          answer: "True"
      - user:
          context: "Lost in Space -- The fate of the castaways is never resolved, as the series was unexpectedly canceled at the end of season 3."
          question: "Did the Robinsons ever get back to Earth?"
        ai:
          answer: "False"
  "NQ-open":
    instructions: "Provide a brief and precise answer."
    prompt_type: "instruct"
    examples:
      - user:
          question: "Where does the electron come from in beta decay?"
        ai:
          answer: "An atomic nucleus"
      - user:
          question: "Who wrote 'You're a Grand Old Flag'?"
        ai:
          answer: "George M. Cohan"

tests:
  defaults:
    min_pass_rate: 0.8
  robustness:
    uppercase:
      min_pass_rate: 0.8
    add_typo:
      min_pass_rate: 0.8

Using the Harness Class:

harness = Harness(
                  task="question-answering",
                  model={"model": "gpt-3.5-turbo-instruct", "hub": "openai"},
                  data=[{"data_source": "BoolQ", "split": "test-tiny"},
                        {"data_source": "NQ-open", "split": "test-tiny"}],
                  config="config.yaml"
                 )

Execute the following commands to generate, run, and report:

harness.generate().run().report()

…ement-the-prompt-techniques

chakravarthik27 added 3 commits May 1, 2024 19:31

Basic Implementation of prompt techniques handles from config.

71f2183

Refactor field order in MessageType and Conversion classes in prompts.py

4160fea

lint fix

38c4749

chakravarthik27 self-assigned this May 2, 2024

chakravarthik27 added 12 commits May 3, 2024 20:19

Refactor field order in MessageType and Conversion classes in prompts.py

06c2ef6

fixed lint

6e6216c

improved to get prompt based on the style like chat or instruct

21ae26b

Handle the prompts for mulitple datasets.

649a5dc

Refactor prompt manager to handle default prompt configuration

73f16e6

Integrated with model_handler and prompt manager.

db35333

error handling, when prompt_config is not available in config

ca0a4c1

improve the prompt handling for instruct models

e1cb02a

Refactor prompt manager to handle default prompt configuration

cb0c2ed

improved in the lm studio

f5d2c91

support orderless field in MessageType object

5e536ee

fix the prompt issues and nb update

3b0712c

chakravarthik27 changed the title ~~Basic Implementation of prompt techniques handles from config.~~ Implementation of prompt techniques May 9, 2024

chakravarthik27 linked an issue May 9, 2024 that may be closed by this pull request

Implement the prompt techniques like 5-shot, 10-shot, few-shot, and CoT. #1011

Closed

chakravarthik27 added the ⭐ Feature Indicates new feature requests label May 9, 2024

chakravarthik27 added this to the 2.2.0 milestone May 9, 2024

chakravarthik27 requested a review from ArshaanNazir May 10, 2024 15:49

chakravarthik27 added 2 commits May 11, 2024 12:56

Merge remote-tracking branch 'origin/release/2.2.0' into feature/impl…

cd10c70

…ement-the-prompt-techniques

fix lint and format issue

e7e94a2

ArshaanNazir approved these changes May 11, 2024

View reviewed changes

chakravarthik27 merged commit e2d08fc into release/2.2.0 May 11, 2024

chakravarthik27 deleted the feature/implement-the-prompt-techniques branch August 30, 2024 16:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementation of prompt techniques#1018

Implementation of prompt techniques#1018
chakravarthik27 merged 17 commits intorelease/2.2.0from
feature/implement-the-prompt-techniques

chakravarthik27 commented May 1, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

chakravarthik27 commented May 1, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Few-Shot Prompting:

Configuration Methods:

Using the Harness Class:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

chakravarthik27 commented May 1, 2024 •

edited

Loading