High-performance inference of OpenAI's Whisper automatic speech recognition
(ASR) model.

Features:
* Plain C/C++ implementation without dependencies
* Apple Silicon is a first-class citizen: optimized via ARM NEON, the
  Accelerate framework, Metal, and Core ML
* AVX intrinsics support for x86 architectures
* VSX intrinsics support for POWER architectures
* Mixed F16 / F32 precision
* Integer quantization support
* Zero memory allocations at runtime
* Vulkan support
* Support for CPU-only inference
* Efficient GPU support for NVIDIA hardware
* OpenVINO support
* Ascend NPU support
* Moore Threads GPU support
* C-style API
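
To give a rough idea of what integer quantization means in practice, here is a minimal, self-contained sketch of symmetric 8-bit block quantization in plain C. It is an illustration only and is not the on-disk format or the code used by this project: the names `BLOCK_SIZE`, `block_q8`, `quantize_block` and `dequantize_block` are hypothetical and chosen for this example. Each block of floats is stored as one F32 scale plus signed 8-bit integers, which cuts storage to roughly a quarter at the cost of a small rounding error.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical block length chosen for this sketch. */
#define BLOCK_SIZE 32

/* One quantized block: a shared F32 scale plus BLOCK_SIZE signed 8-bit values. */
typedef struct {
    float  scale;
    int8_t q[BLOCK_SIZE];
} block_q8;

/* Quantize BLOCK_SIZE floats so that x[i] is approximately scale * q[i]. */
static void quantize_block(const float *x, block_q8 *out) {
    /* Find the largest magnitude in the block. */
    float amax = 0.0f;
    for (size_t i = 0; i < BLOCK_SIZE; ++i) {
        float a = x[i] < 0.0f ? -x[i] : x[i];
        if (a > amax) amax = a;
    }

    /* Map [-amax, amax] onto [-127, 127]; an all-zero block gets scale 0. */
    out->scale = amax / 127.0f;
    float inv = out->scale > 0.0f ? 1.0f / out->scale : 0.0f;

    for (size_t i = 0; i < BLOCK_SIZE; ++i) {
        float v = x[i] * inv;
        /* Round half away from zero, then narrow to 8 bits. */
        out->q[i] = (int8_t)(int)(v + (v >= 0.0f ? 0.5f : -0.5f));
    }
}

/* Reconstruct approximate floats from a quantized block. */
static void dequantize_block(const block_q8 *in, float *y) {
    for (size_t i = 0; i < BLOCK_SIZE; ++i) {
        y[i] = in->scale * (float)in->q[i];
    }
}
```

Calling `quantize_block` followed by `dequantize_block` on the same 32 values returns each value to within about half of the block's `scale`, which is the precision trade-off behind this style of quantization.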