blob: 2c0e887b33d57156225d3c99155f80746f05b9fa (
plain) (
blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
|
High-performance inference of OpenAI's Whisper automatic speech
recognition (ASR) model:
-Plain C/C++ implementation without dependencies
-Apple Silicon first-class citizen - optimized via ARM NEON,
Accelerate framework, Metal and Core ML
-AVX intrinsics support for x86 architectures
-VSX intrinsics support for POWER architectures
-Mixed F16 / F32 precision
-4-bit and 5-bit integer quantization support
-Zero memory allocations at runtime
-Support for CPU-only inference
-Efficient GPU support for NVIDIA
-Partial OpenCL GPU support via CLBlast
-OpenVINO Support
-C-style API
|