Add Segment Any Text (SaT) model#7

Merged
BrandonWeng merged 5 commits into FluidInference:main from smdesai:main
Oct 23, 2025

Conversation

@smdesai
Contributor

@smdesai smdesai commented Oct 23, 2025

This model segments text into sentences and also works when the text lacks punctuation.

@@ -0,0 +1,64 @@
# Segment Any Text CoreML
Segment Any Text (SaT) is a state-of-the-art sentence segmentation model with 3 Transformer layers. A PyTorch version of the model is used in [wtpsplit](https://github.com/segment-any-text/wtpsplit), and additional details can be found in this [paper](https://arxiv.org/abs/2406.16678).
Member

Did you want to link your HF repo as well, in case folks want to use it directly?

Contributor Author

@smdesai smdesai Oct 23, 2025

Good point. Just updated the README.

Member

@BrandonWeng BrandonWeng left a comment

nit but could you place this under models/segment-text/coreml

@smdesai
Contributor Author

smdesai commented Oct 23, 2025

> nit but could you place this under models/segment-text/coreml

No problem, I wasn't sure of the structure. It's done.

Member

@BrandonWeng BrandonWeng left a comment

thank you! once your project is open sourced, do feel free to add it here and to showcase it as well (same with FluidAudio repo)

@BrandonWeng BrandonWeng merged commit f7519ee into FluidInference:main Oct 23, 2025
@BrandonWeng BrandonWeng added the `new model` and `coreml` labels Oct 23, 2025
@smdesai
Contributor Author

smdesai commented Oct 23, 2025

> thank you! once your project is open sourced, do feel free to add it here and to showcase it as well (same with FluidAudio repo)

Will do.

Alex-Wengg added a commit that referenced this pull request Jan 29, 2026
Add issue #7 documenting the Mimi decoder beeping artifact caused by
ANE float16 precision loss in streaming state feedback. Update issue #4
to reflect the per-model compute unit configuration.
Alex-Wengg added a commit that referenced this pull request Jan 30, 2026
- Add issue #8: voice-dependent duration differences between MLX and CoreML
  (azelma -2.00s, javert +1.92s — model-level behavior, not fixable in Swift)
- Update issue #4: all models now use .cpuAndGPU (not per-model config)
- Update issue #7: reflect all-models .cpuAndGPU fix
Alex-Wengg added a commit that referenced this pull request Jan 31, 2026
* docs: add ANE float16 beeping issue to iOS CoreML issues

Add issue #7 documenting the Mimi decoder beeping artifact caused by
ANE float16 precision loss in streaming state feedback. Update issue #4
to reflect the per-model compute unit configuration.

* docs: add voice duration differences issue, update compute unit docs

- Add issue #8: voice-dependent duration differences between MLX and CoreML
  (azelma -2.00s, javert +1.92s — model-level behavior, not fixable in Swift)
- Update issue #4: all models now use .cpuAndGPU (not per-model config)
- Update issue #7: reflect all-models .cpuAndGPU fix