In SS ViT, you observe and reset the "pathologies of MLP weights". In long training runs or large-scale downstream transfer (transfer learning / fine-tuning), could this reset mechanism negatively affect the model's stability or transferability (generalization)? #1

@imshinuohu

Description

No description provided.
