Conversation
|
Confirmed the WebNN accepted Float16Array for float16 operand type from Chromium build 136.0.7051.0 via source.chromium.org. Please take a look the doc and code changes for KNOWN_COMPATIBLE_CHROMIUM_VERSION @ |
|
@fdwr Improved the code in f6f3bf8 for your review comments, thanks a lot! Updated the ORT dist to dev version which includes microsoft/onnxruntime#24437 PTAL |
demos/text-generation/models/onnxruntime/DeepSeek-R1-Distill-ONNX/README.md
Outdated
Show resolved
Hide resolved
fdwr
left a comment
There was a problem hiding this comment.
👍😎 demo. Thank you for adding it 👨💻.
Can we merge it now, or is there anything to do first (like upload any corresponding models or ORT distributions)? |
The development version of ORT distributions has been updated in the codebase and verified to be functioning correctly. No additional ORT distributions are required. Please help to merge this PR. Thanks much @fdwr ! |


This PR added text generation demo with following models:
Currently, the demo uses the test version of ONNX Runtime Web. We will switch to the dev version once the fix from microsoft/onnxruntime#24437 is included in the NPM packages.
Test URL: https://ibelem.github.io/webnn-developer-preview/demos/text-generation/ , will keep optimizing the performance.
Tested pass on WebNN DirectML GPU backend.
@fdwr PTAL
CC @Honry @huningxin