Naive NF4 dequantize op for CPU #1602

matthewdouglas · 2025-04-17T14:49:33Z

Introduces a naive PyTorch-native implementation of dequantize_4bit for CPU. Currently has limitations on shape and does not support the FP4 type.

github-actions · 2025-04-17T14:54:21Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

matthewdouglas added cross-platform x64 CPU aarch64 labels Apr 17, 2025

matthewdouglas added this to the v0.46.0 milestone Apr 17, 2025

Additional 4bit CPU ops

78595b4

matthewdouglas force-pushed the cpu-ops branch from 9198900 to 78595b4 Compare April 17, 2025 14:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Naive NF4 dequantize op for CPU #1602

Naive NF4 dequantize op for CPU #1602

matthewdouglas commented Apr 17, 2025

github-actions bot commented Apr 17, 2025

Naive NF4 dequantize op for CPU #1602

Are you sure you want to change the base?

Naive NF4 dequantize op for CPU #1602

Conversation

matthewdouglas commented Apr 17, 2025

github-actions bot commented Apr 17, 2025