High-performance inference of OpenAI's Whisper automatic speech recognition (ASR) model.

Features:

* Plain C/C++ implementation without dependencies
* Apple Silicon first-class citizen - optimized via ARM NEON, Accelerate framework, Metal and Core ML
* AVX intrinsics support for x86 architectures
* VSX intrinsics support for POWER architectures
* Mixed F16 / F32 precision
* Integer quantization support
* Zero memory allocations at runtime
* Vulkan support
* Support for CPU-only inference
* Efficient GPU support for NVIDIA
* OpenVINO Support
* Ascend NPU Support
* Moore Threads GPU Support
* C-style API