Conversation
Signed-off-by: Sami Jaghouar <sami.jaghouar@gmail.com>
ModelName: TypeAlias = Literal["debugmodel", "150M", "1B", "Qwen32B", "Qwen1.5B", "Qwen7B"]
ModelType: TypeAlias = LlamaForCausalLM | Qwen2ForCausalLM
Hrmm, actually, would transformers.modeling_utils.PreTrainedModel have worked? It could mean we don't need to keep adding to this in the future.
Oh, but they're removing the GenerationMixin soon. Hrmm.
> Hrmm, actually, would transformers.modeling_utils.PreTrainedModel have worked? It could mean we don't need to keep adding to this in the future.
Hmm, but my goal is to be able to control-click into the Llama and Qwen code easily. Moreover, I don't think PreTrainedModel is precise enough. For example, our apply_fsdp code relies on the fact that model.model.layers exists. This is true for both LlamaForCausalLM and Qwen2ForCausalLM, but probably not for other pretrained models.
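For context, a minimal sketch of the kind of code this is about, assuming a hypothetical apply_fsdp shape (the wrapping logic here is a placeholder, not the actual implementation). The point is the attribute chain: both LlamaForCausalLM and Qwen2ForCausalLM expose model.model.layers, so a type checker accepts it under the narrow union, whereas a bare PreTrainedModel annotation carries no such attribute.

```python
from typing import Literal, TypeAlias

from transformers import LlamaForCausalLM, Qwen2ForCausalLM

ModelName: TypeAlias = Literal["debugmodel", "150M", "1B", "Qwen32B", "Qwen1.5B", "Qwen7B"]
ModelType: TypeAlias = LlamaForCausalLM | Qwen2ForCausalLM


def apply_fsdp(model: ModelType) -> None:
    # Hypothetical sketch: the real apply_fsdp would wrap each block with FSDP.
    # Under the ModelType union the type checker knows model.model.layers
    # exists; annotating `model: PreTrainedModel` instead would flag this
    # attribute access as unknown.
    for layer in model.model.layers:
        ...  # wrap `layer` with FSDP here
```

The union also preserves the editor ergonomics mentioned above: control-clicking ModelType jumps straight to the concrete Llama/Qwen classes rather than the generic base class.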
* fix wandb
* add robust eval
* add eval to orch
* fix nccl ready
* deepdive: separate, explicitly named caches for train and online eval (#18)
* delete cache deepdive
* add 105 ckpt interval
* update deepdive cache
* fix eval
* fix eval

Co-authored-by: sami jaghouar <sami@primeintellect.ai>
Co-authored-by: Sebastian Müller <sebastian@primeintellect.ai>
Co-authored-by: Mika Senghaas <mail@mikasenghaas.de>
No description provided.