Overview
Explore integrating Nvidia's Parakeet model via MLX as an additional offline transcription option alongside WhisperKit.
Background
Parakeet is Nvidia's ASR (Automatic Speech Recognition) model that has been ported to MLX for Apple Silicon. This could provide an alternative local transcription option with potentially different accuracy/performance characteristics compared to WhisperKit.
Investigation Tasks
1. Technical Feasibility
2. Architecture Integration
3. User Experience
4. Testing Requirements
Success Criteria
- Parakeet model integrates seamlessly as a third transcription option
- Users can switch between WhisperKit and Parakeet models easily
- Performance metrics documented for both models
- Clear guidance provided for users on which model to choose
Out of Scope
- Replacing WhisperKit entirely (keep both options available)
- Supporting Intel Macs (Parakeet requires MLX/Apple Silicon like current MLX service)
References
- Current WhisperKit implementation:
Services/WhisperKitService.swift
- TranscriptionService protocol:
Core/Domain/TranscriptionService.swift
- MLX service pattern:
Services/MLXService.swift
Priority: Medium
Effort: Medium-Large (requires research, prototyping, and integration)
Type: Enhancement
Overview
Explore integrating Nvidia's Parakeet model via MLX as an additional offline transcription option alongside WhisperKit.
Background
Parakeet is Nvidia's ASR (Automatic Speech Recognition) model that has been ported to MLX for Apple Silicon. This could provide an alternative local transcription option with potentially different accuracy/performance characteristics compared to WhisperKit.
Investigation Tasks
1. Technical Feasibility
2. Architecture Integration
TranscriptionServiceprotocol for Parakeet3. User Experience
4. Testing Requirements
Success Criteria
Out of Scope
References
Services/WhisperKitService.swiftCore/Domain/TranscriptionService.swiftServices/MLXService.swiftPriority: Medium
Effort: Medium-Large (requires research, prototyping, and integration)
Type: Enhancement