Skip to content

[Chat]: Fix llama critic model type#3486

Closed
JThh wants to merge 1 commit intohpcaitech:mainfrom
JThh:main
Closed

[Chat]: Fix llama critic model type#3486
JThh wants to merge 1 commit intohpcaitech:mainfrom
JThh:main

Conversation

@JThh
Copy link
Copy Markdown
Contributor

@JThh JThh commented Apr 6, 2023

📌 Checklist before creating the PR

  • I have created an issue for this PR for traceability
  • The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

Resolves #3475, resolves #3428

📝 What does this PR do?

A hotfix to replace the original llama for the llama causal model with a mere llama model to ensure the field of last_hidden_state presents during forward.

💥 Checklist before requesting a review

  • I have linked my PR to an issue (instruction)
  • My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • I have performed a self-review of my code
  • I have added thorough tests.
  • I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

  • 🌝 Yes, I do.
  • 🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

@JThh JThh closed this Apr 6, 2023
@JThh
Copy link
Copy Markdown
Contributor Author

JThh commented Apr 6, 2023

Duplicate PRs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

maybe LlamaModel?

1 participant