fixing hardcoded cuda() for cpu inference
#21
opened by alexgambashidze
Fixed the hardcoded cuda() calls so the model can run inference on CPU.
alexgambashidze changed pull request title from "Update modeling_deepseekocr.py" to "fixing hardcoded cuda() for cpu inference"
And it works on CPU. Is there a way to make it work on MPS?
These file changes still need to be merged, but for anyone looking for a quick workaround, you can replace the following lines in modeling_deepseekocr.py to get the model running on CPU:
Line 505:
 # inputs_embeds[idx].masked_scatter_(images_seq_mask[idx].unsqueeze(-1).cuda(), images_in_this_batch)
 inputs_embeds[idx].masked_scatter_(images_seq_mask[idx].unsqueeze(-1), images_in_this_batch)
Line 917:
output_ids = self.generate(
    # input_ids.unsqueeze(0).cuda(),
    # images=[(images_crop.cuda(), images_ori.cuda())],
    # images_seq_mask = images_seq_mask.unsqueeze(0).cuda(),
    input_ids.unsqueeze(0),
    images=[(images_crop, images_ori)],
    images_seq_mask = images_seq_mask.unsqueeze(0),
    ...
)
Line 960:
# outputs = tokenizer.decode(output_ids[0, input_ids.unsqueeze(0).cuda().shape[1]:])
outputs = tokenizer.decode(output_ids[0, input_ids.unsqueeze(0).shape[1]:])
Line 971:
# outputs = tokenizer.decode(output_ids[0, input_ids.unsqueeze(0).cuda().shape[1]:])
outputs = tokenizer.decode(output_ids[0, input_ids.unsqueeze(0).shape[1]:])
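On the MPS question: the same edits can be made device-agnostic instead of simply dropping .cuda(). A minimal, untested sketch (pick_device is a hypothetical helper, not part of the original file; it assumes a PyTorch build with MPS support) would resolve the device once and use .to(device) everywhere the file currently hardcodes .cuda():

import torch

# Sketch: choose the best available backend instead of hardcoding .cuda().
def pick_device() -> torch.device:
    if torch.cuda.is_available():
        return torch.device("cuda")
    if torch.backends.mps.is_available():
        return torch.device("mps")
    return torch.device("cpu")

device = pick_device()

# Inside modeling_deepseekocr.py the same idea would turn
#   input_ids.unsqueeze(0).cuda()  and  images_crop.cuda()
# into
#   input_ids.unsqueeze(0).to(device)  and  images_crop.to(device)
x = torch.randn(2, 3).to(device)  # any tensor then follows the chosen device
print(x.device)

Resolving the device once keeps the rest of the edits mechanical: every .cuda() in the file becomes .to(device).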
Did you make these corrections?
