Load safetensors with i8 by chadvoegele · Pull Request #3042 · huggingface/candle

chadvoegele · 2025-08-01T11:32:55Z

Hi,

I have a small ML-enabled app that uses candle with wasm under the hood. Downloading the model takes a long time, so I quantized it to 8-bit integers. However, candle doesn't have an I8 DType, and the safetensors loader doesn't support I8.

This PR adds the ability to load a safetensors file with I8 by upcasting it to I64, similar to the existing I32 to I64 conversion.
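
As a rough illustration of what upcasting I8 data to I64 involves, here is a minimal standard-library sketch. It is not the code from this PR or candle's API: the helper name `upcast_i8_to_i64` is hypothetical, and it assumes the tensor's payload is available as a raw byte slice with one byte per element.

```rust
/// Reinterpret each raw byte as a signed 8-bit integer and widen it to i64.
///
/// Hypothetical helper for illustration only; it is not part of candle.
/// Assumes one byte per element, as in an I8 tensor payload.
fn upcast_i8_to_i64(raw: &[u8]) -> Vec<i64> {
    raw.iter()
        // `as i8` reinterprets the bit pattern, so 0xFF becomes -1.
        // `i64::from` then sign-extends, which preserves the value.
        .map(|&byte| i64::from(byte as i8))
        .collect()
}
```

For example, the bytes `[0x00, 0x7F, 0x80, 0xFF]` come out as `[0, 127, -128, -1]`. Widened values take eight bytes each instead of one, so under this approach the saving is in file and download size, not in memory after loading.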

Based on this PR for adding a DType::I32, I skipped adding a DType::I8 for now, but would consider that if requested.

Thanks!

greenrazer · 2025-08-05T23:17:45Z

Thanks!

Load safetensors i8

23f6c08

chadvoegele marked this pull request as ready for review August 1, 2025 11:34

chadvoegele changed the title ~~Load safetensors i8~~ Load safetensors with i8 Aug 1, 2025

greenrazer self-assigned this Aug 5, 2025

greenrazer merged commit 86bcf1e into huggingface:main Aug 5, 2025
9 checks passed

john-sharratt pushed a commit to john-sharratt/candle that referenced this pull request May 7, 2026

Load safetensors i8 (huggingface#3042)

2c2e0ae

greenrazer merged 1 commit into huggingface:main from chadvoegele:load_safetensors_i8