Skip to feed
ModDojo

DFlash

@bstnxbt

DFlash brings speculative decoding to Apple Silicon using stock MLX without a fork. It is positioned as a performance-focused inference enhancement for local model execution on Apple hardware.

Categories

Tools

Tags

#apple silicon#mlx#inference#speculative decoding#llm performance