Skip to content

Release 1.21 docs#718

Merged
quic-hemagnih merged 14 commits intoquic:mainfrom
tv-karthikeya:release-1.21-docs
Jan 19, 2026
Merged

Release 1.21 docs#718
quic-hemagnih merged 14 commits intoquic:mainfrom
tv-karthikeya:release-1.21-docs

Conversation

@tv-karthikeya
Copy link
Copy Markdown
Contributor

No description provided.

abukhoy and others added 13 commits January 12, 2026 14:29
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
  - Removing  mosaicml/mpt-7b model as it is no longer available in HF
  - Removing  e5-mistral-7b-instruct and stella_en_1.5B_v5 since they are no longer part of release1.21

Signed-off-by: vtirumal <vtirumal@qti.qualcomm.com>
@quic-hemagnih
Copy link
Copy Markdown
Contributor

Comments on documentation -

I see below as repeated twice-
Efficient Transformer Library - 1.21.0 Release Notes
Newly Supported Models
Key Features & Enhancements
Embedding Model Upgrades
Fine-Tuning Support
Efficient Transformer Library - 1.20.0 Release Notes
Newly Supported Models
Key Features & Enhancements
Embedding Model Upgrades
Fine-Tuning Support

We should remove decode only from GPT-OSS and add Disaggregated serving ready via vLLM
GPT-OSS (Decode-Only)

Add Gemma3 model also in the list

Add a line in the Release notes that if running GPT-OSS models natively via vLLM then we need a PR#685 of the qefficient for python 3.12

Mention the VLM model names for which CB is enabled. Continuous Batching (VLMs): Extended to Vision Language Models with multi-image handling

mention "Users can enable the feature by passing “use_onnx_subfunctions=True “ call during export " for onnx subfunctions.

Add below also
Onboarding Guide for adding new Causal models (#574)
Onboarding Guide for adding new Custom ops in QEff (#638)
Organized examples into domain-specific subdirectories (#615)

Copy link
Copy Markdown
Contributor

@quic-hemagnih quic-hemagnih left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see below as repeated twice-
Efficient Transformer Library - 1.21.0 Release Notes
Newly Supported Models
Key Features & Enhancements
Embedding Model Upgrades
Fine-Tuning Support
Efficient Transformer Library - 1.20.0 Release Notes
Newly Supported Models
Key Features & Enhancements
Embedding Model Upgrades
Fine-Tuning Support

We should remove decode only from GPT-OSS and add Disaggregated serving ready via vLLM
GPT-OSS (Decode-Only)

Add Gemma3 model also in the list

Add a line in the Release notes that if running GPT-OSS models natively via vLLM then we need a PR#685 of the qefficient for python 3.12

Mention the VLM model names for which CB is enabled. Continuous Batching (VLMs): Extended to Vision Language Models with multi-image handling

mention "Users can enable the feature by passing “use_onnx_subfunctions=True “ call during export " for onnx subfunctions.

Add below also
Onboarding Guide for adding new Causal models (#574)
Onboarding Guide for adding new Custom ops in QEff (#638)
Organized examples into domain-specific subdirectories (#615)

Signed-off-by: Amit Raj <amitraj@qti.qualcomm.com>
@quic-hemagnih quic-hemagnih merged commit dcbb7be into quic:main Jan 19, 2026
4 checks passed
tchawada pushed a commit to tchawada/QEff_tanisha that referenced this pull request Feb 4, 2026
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: vtirumal <vtirumal@qti.qualcomm.com>
Signed-off-by: Amit Raj <amitraj@qti.qualcomm.com>
Co-authored-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Co-authored-by: Amit Raj <amitraj@qti.qualcomm.com>
tchawada pushed a commit to tchawada/QEff_tanisha that referenced this pull request Feb 4, 2026
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: vtirumal <vtirumal@qti.qualcomm.com>
Signed-off-by: Amit Raj <amitraj@qti.qualcomm.com>
Co-authored-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Co-authored-by: Amit Raj <amitraj@qti.qualcomm.com>
tchawada pushed a commit to tchawada/QEff_tanisha that referenced this pull request Feb 4, 2026
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: vtirumal <vtirumal@qti.qualcomm.com>
Signed-off-by: Amit Raj <amitraj@qti.qualcomm.com>
Co-authored-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Co-authored-by: Amit Raj <amitraj@qti.qualcomm.com>
qcdipankar pushed a commit to qcdipankar/efficient-transformers that referenced this pull request Feb 8, 2026
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: vtirumal <vtirumal@qti.qualcomm.com>
Signed-off-by: Amit Raj <amitraj@qti.qualcomm.com>
Co-authored-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Co-authored-by: Amit Raj <amitraj@qti.qualcomm.com>
Signed-off-by: Dipankar Sarkar <dipankar@qti.qualcomm.com>
smedhe pushed a commit to smedhe/QEff_Sharvari that referenced this pull request Mar 8, 2026
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: vtirumal <vtirumal@qti.qualcomm.com>
Signed-off-by: Amit Raj <amitraj@qti.qualcomm.com>
Co-authored-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Co-authored-by: Amit Raj <amitraj@qti.qualcomm.com>
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
smedhe pushed a commit to smedhe/QEff_Sharvari that referenced this pull request Mar 8, 2026
Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: vtirumal <vtirumal@qti.qualcomm.com>
Signed-off-by: Amit Raj <amitraj@qti.qualcomm.com>
Co-authored-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Co-authored-by: Amit Raj <amitraj@qti.qualcomm.com>
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>

Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants